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*KFFFKKKFPPPSKKTP 


1167 


15068 


A 


1174 


410 


0 


FSYYPPPTRGGCPSSPPQIFLPPPLLGV 
FFPFSPLKIFFFPRGFKFFGGWPFFSP 
PKKKVFFKNPRSVYKNPPKKEKK*SLQP 
PG*VWAPPGIFKR/PPPPFFFYIFS 


1168 


15069 


A 


1176 


319 


3 


KEARSVFRVEGRG*KNFSASGPPGGGNP 
GGGRPGRGNF VFLI KKGGPPPGPKG/S P 
FFDF/GGPPPPPPPSGGGSGVNPPPPPF 
FFFFEMESRSVSQSGVPDAWADAW 


1169 


15070 


A 


1177 


385 


1 


SAS FGLPKCWDYRHBPTRPAS VS VT VTS 
SRLSGA*ARSGKGTLVFWAQMVFK\RIP 
LTKYSDHSREPSSLQLCMQSTPSKAEFT 
VAKADKRLCCTAKS STAKS IAQ I KCQND . 
AGRSPQESLKVSGRV 


1170 


15071 


A 


1178 


147 


808 


KLKEIKKLLEENAGINLYDLRLGSGFLD 
MTPKAKQQKKENLKWDVIRMKNSCASKD 
TITFYN\ YRSDKGLV/ SKKYKELNSITE 
RQPW F * KDLNKDFS K/EQMAKS T* KTTE 
RLFIREMKTKTT/ME/YHFLSTRMAKI * 
KDSNR/ RLCRKTGTLIHCW * GTAPANLL 
KNCQPLFPF*GTWQFLKRLNIiELPDDP 
AI P PLVT CPKEMKT / C /C YTE I CTQMFT 
AA*III 


1171 


15072 


A 


1179 


1 


398 


SRSRHCTPAWVRVriiVLKKKEKEKEKEE 
KKRCPCYIVSHQSL*KPAWQFli/RNVK/ 
LELP YDPAI / PLLGKCLKE I * KYAYTKT 
CM*MFIALPI IAKKYKQLKRLSTDEW IN 
KMWYIHAI 


1172 


15073 


A 


1180 


3 


389 


GYDRVIPNP* PLTGA\LRALLLTCGLAM 
*FYLQSMWVLILGLIjTDTLTIYHRWSDV 
ARKS TYQGHHS PPAQKGLRYG IML YMTS 
EVLLLGGLF*AFYHS\SLCPTPQLGGHW 
APTGMTPLDPVEVPLLNT 


1173 


15074 


A 


1181 


354 


2 


ANSSDACILDGHRALPLQLPPCGYSISP 
AAQSSTVPMPLLLIPPPHCNRTPSLWHY 
SPASNPTNPSPY\*TAPSPPPPIPRNNP 
FS* KFP YYVWVYTSLTTCSQP P VS SPNF 
TQPSVL 


1174 


15075 


A 


1182 


3 


384 


GANVSG/DL/KLKPVLIYYSSNPR/ALK 
NYARS ILPMLCKQEKKEKKKK\AWMTAH 
LFTAWFT* YFKSTGET/CAKEKI PFKIL 
VLiIDSACSHPRALMEMCKEINWFMPVN 
* HS I / LFCMQPMDQGALSTYKS 


1175 


15076 


A 


1183 


1 


410 


PPLP*KFFSPGGVQS*KGGRFWPPP*VG 
VLPSSSP/IIFFTPGNWGCFSPFSP*KF 
FFSPKGFI FCGGVGPFFPPPKKRFFSKI 
P PLVF FS P? FKKKI FFFPPPVN FGP PRV 
FFKRPPSIFFF 


1176 


15077 


A 


1184 


406 


62 


PPIINMLCSLAPPFFSPPPLRGVLPPFP 
LXNFFFP*GVYFWWGGGPHF/SPPPKKG 
FFPKFPPWVFNPPPLRKK/YENFPPPGK 
FGPPRVFFKAPP 


1177 


15078 


A 


1185 


337 


383 


RSYISFQK*VKDLNKHF3KDDTQMAN/K 
HIKRYSTLVIREMKIKPMTRYHFTPTRK 
AVPF 


1178 


15079 


A 


1186 


3 


471 


LiAPSDK* ENRLPGNRPL* EVRS PSARQP 
PHLRSEEPLRPAATPSGK*GASPPSSHA 
VREGGGGQPPPGQPPRPGGEGRLCPAAP 
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TGK* GAPLP / EPP PGLGG 


1179 


15080 


A 


1187 


2 


406 


FLVETEFCYVGQAGLELLTSRDPPASAS 
KGAGMTGVSHQVQPQ**S*LWT*/PSSV 
EAGTSFGLSFLSSSWALSAQEGCLAVPS 
/SGSRGLLVGALLLWTKPSPQLSPVPAS 
QRLSSLSLMPPLPQPQHLTHTSIET 


1180 


15081 


A 


1188 


160 


459 


NFMTIDMLCSAWIHFCSSGLDFQL* KL 
FKSQ * ENNL INKWAKDLNS FT I FSYG* K 
LAHE KMLN 1 1 RE I Y FTAI MS * YTLTRT V 
KVKKTDT KCW * GCTATGNL I HCCKNV* P 
LRNTAWQ\FHKILNIYLPYHPAIPLSDM 
NSRE*KNI/CHAKMCAWIFI 


1181 


15082 


A 


1189 


232 


2 


KAPPPFFFFFFFFFFFFC* SGSHSVSV7A 
GI * W/ PGV* WHHHGSLQPQS PRLK* SSC 
LSLSSSWDYRYVPPHLANLKK 


1182 


15083 


A 


1190 


2 


402 


PRVRHASGSPSPPPPPPGIASHTSPSQ* 
VFSWPSY*TPCLSALTLASVLSLL*QRS 
PRTL F I TNKC /D F/ PASKS S CR I PAGL * 
ALGRQGLFSCFFCFFETI ESHS FAQAEVQ 
WYNLSSAQPSSPEFK*FS/CLSLPSS 


1183 


15084 


A 


1191 


19 


390 


WCVPAVPATWEAEPRRSR\RSKPLTGRQ 
SKSLSY/NKKKKKKKKKKKKKKKTGGGP 
* KKLLGGPKYKGGKKKKFFFFKGEKKKS 
LGGILKKKTFFWGGKKWPTPPKKNKALK 
GKKKFLGGRGGNPP 


1184 


15085 


A 


1192 


485 


112 


QRPDPR \ SAEAAI KYFLTQATAS 1 1 LL I 
AILFNNILSGQ* TITNTTNQYSSLI I IM 
AIAI KLGIAPFHF* VPEVTQGTPLTSGL 
LLLT*QKLAPISIIYQISPSLNV 


1185 


15086 


A 


1193 


55 


385 


THAFADAWADAWGLFKG I LTENFPNLQK 
YINIQVQGG*RTSSRFNTKKTNSRDLII 
ILPKVKDKLSKKKKKN\AENKKILKYRG 
GPF**KPGGAQFYGGGRKSFFFFFGG 


1186 


35087 


A 


1194 


408 


3 


CQSA\LLGGASQLG\SRGSGVRDPLEEA 
VCPFSDLQLHAERTTALLKAVRQGHLSL 
PRLLLS F\VCLCPAPRGGACRGRQASLS 
CX5GLHPVRASRLLCLRKRAWAF4ASVPPP 
ASLPPCSLISDCCVSNQ*DSTGRV 


1187 


15088 


A 


1195 


5 


371 


LFSTNHGDFGPLYLFP\A*AGVLGTDLS 
LLIRAELGQPGNLLGNDHMYNGIVTAHA 
FVIIFFIVIPIIIGGFGN*LVPLIIGAP 
DVAI PRISNISF*LLPPSLLLLLACAIT 
EAGAGTG * TVY 


1188 


15089 


A 


1196 


76 


404 


PTPLRTHDQS S KVSRYKVN 1 Q KVGAFLY 
MLSTRTTGI+ \IKSTSFPFASPKVKYLT 
INLTKCVQ/DLWKEKKL/NEIKEDQNK* 
/NVPFSWIKGPNIVN 


1189 


15090 


A 


1197 


50 


334 


ILHMVS I PS I S I Y * HLLPAG * AGTH IG* 
LPPA* FFWVMG / RDRVLLCH PGWNAWQ 
S*LWASNSW\VK*SSHLGLSKY*D*RH 
EPSYPASGTKLN 


1190 


15091 


A 


1198 


250 


1 


QYYLVSSNT* S I IDFLQLPQKCI FTVGF 
SK* * CICGFI FKNI * LFF\ FCLFESES C 
S VAQAGVQWQDLSS *QPLPPVFKQFS 


1191 


15092 


A 


1199 


2 


402 


TDARHHTWLI FVFLLEMGFHHVGRAGLG 
UjTSSDPPRPAKLNFL*R*GHAVM\CPG 
WPKE \TCLGLPKCWDYRCE /KTAPKPEH 
FLPFFFFSEKNFIPVPRRGGG/VJKNLN* 
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MEPPPPG * KGFFCPGPPK YWN* KAP 


1192 


15093 


A 


1200 


50 


437 


STLYLEREKEQISLSVIVSSPLPSTSTG 
PTAAAQS S I SG P S L PLP PKNGAGDAP AG 
LG/YGQG?SGPPWGPS/SGPSPR*ALVC 
PTDPSGAARGGRGG\SRGSCCAPAGPAG 
LGDHRPGGMGEGPAAPPKSS 


1193 


15094 


A 


1201 


86 


313 


PPPPGGY^HFSFXXXXXXXXXXXXKSLL 
PPGKGPNPPKVGLPPFXXXXVSPKXXXX 
XXXXX *NP PHS PPRAPGGPP 


1194 


15095 


A 


1202 


105 


485 


VQQTTMAR I YVCN KPARSTPR FTKQILL 
DVLK/YIDTQTIILRDFNTP/LNSVWSW 
RQKTNKDNLDLNLMLGQLDLIDISRILH 
VSTR/YIFFSSLHEIYSKSGHMLSHKAC 
HNNF* KIEI ILI ILLYHC 


1195 


15096 


A 


1203 


1 


476 


PHFGPPPPPQDPRF*G\GPSTRKGFFPQ 
PPGFFPQAWPRAGQPPPGGT/EPVFPPP 
KHP*RRNPYPKFGSPPLPGKIHQGSPGP 
SGQFR\PPRWGKKRAQPTFWAHPPPIFF 
FF*GVLLCHPDWGTVARSR\PPQPPPPG 


1196 


15097 


A 


1204 


1 


269 


WEFETYNLGIK/ WSKDMKRNFTKEDLM 
MADEHPRRC\ * HHLTP PRTAQVKLELE P 
NE/WQGCGEAGPATPCWGDAAPVQPLLQ 
IGRRFLKK 


1197 

i 


15098 


A 


1205 


459 


42 


KLKPMLIYHSENPRAFQDYAKSMLSVLY 
K* KNEAGLRAHLFTAGFTENFKPAVET Y 
CSE*EISLQI*LLI/GHPR/SLMEMYKE 
MMWFMPANTTS ILQSMDQGVALTYEYH 
YISNILYEAIDNTENPLIDLGKANAWVG 
PG 


1198 


15099 


A 


1206 


408 


1 


EMKTLTEMSSPGMPTEKVSELED/DIN/ 
EMHR KLQ CREKR / 1 KRN /EI NVQEL* YK 
YKRCNHHVMRMPEKEQRKEEIFKVIMAE 
NFPNLATNNKSKVKEAQRTPERI KTIYI 
YIYIS*LQXAKEKESILKETCQKTSKPK 


1199 


15100 


A 


1207 


7 


383 


LDIGCFWWSDKYSSCYFSLATSGKLKYF 
SFSCLIHCFLVCVCVRAHK\CI * SRAS * 
CVCVHK\CI * SSAS * CVCVCARTNGI * / 
CQCFXjVCVCAQV/ CI * S S AS * CVCVCTS 
GI * /CQCFLVCVCVCVC 


1200 


15101 


A 


1208 


405 


214 


AEAGRSLCPGGGGCNEL* SPPCPSAWVT 
\ SETLSQKQNPHHHKKKTGQVRGQSSYL 
PLWEVNR 


1201 


I5I02 


A 


1209 


384 


2 


GVTHH/ ARARFFC/ LLDTGFKS VFQAAV 
Q*A\NHGSLQPQPPGLXPSSCLSLPSSR 
DNK 


1202 


15103 


A 


1210 


403 


2 


VPS IQTNGKSLTYFDFFFKQLFKTKLKL 
DASFVNVCVNVEP FRTFALSKTQSLIiCL 
SLFLLIF* TINCNFCFLRQN/LRSVAQA 
GVQWRDVGLLQPLPPEFKQFC 


1203 


15104 


A 


1211 


156 


405 


ESKQMLIQCKSQMLYC*RI*TV/CLCVC 
VCVCVCVCVCLTCGTKRVCIVFLFIWL 
NCK/PLCEPCC*CNRQGWAPWLLPVIST 
RW 


1204 


15105 


A 


1212 


70 


408 


KAFSLLPPGVSPGPLCNPRRPNFIGGGP 
KKKPPPPPQRWTQGFFNPPPFFFFLWG 
G*GQD/PPPVAQG*MDPPPRPPK\GRGP 
RHEPPPPAF 


1205 


15106 


A 


1213 


410 


3 


KKSMPG FKAS KDRMTLLLEAQAAGDF\N 
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LKPVFI \YHSENPRII»KNYAKYT/PVLY 
KWSNKAWMIANLFIA*FIKYFKPTVETC 
YSDKKIPFKILLLIENAPDHLRALMEMY 
KE IHAVFMPANTTS I LQPMDQGVT LT 


J 206 


15107 


A 


1214 


3 


340 


KIRVEVNKMYNRKLLEEIKKIKSLFFEN 
INKIDQFLPRLRKTRQKMQ INKI RNEQG 
GISINIMEIKYSYKEML*AI ICKKLNNL 
HEMNKFL/EHRVPKLTQVEIENLNYIYK 
K 


1207 


15108 


A 


1215 


1 


57 


RGLKIQNVNEIKS*CFEKIKIDK?LARQ 
NFKRKRFKFRD\ * KGDIMADI TE I *GI I 
RAYY*QL*GNYLENLEEMGKFLVTYNLP 
KLNH/DIENLKKPVTT/REFKSVIKSLP 
LKKSPLHDSFTAQFYQTFEBE*VSVLKR 


1208 


15109 


A 


1216 


350 


1 


QS S FFS I YPNFHLLS FLFCFKDFLFF YA 
SVLATNSLPLFFPQISRYFP\PFERERE 
RERENWL\ CCLG WSAVMQS * LTPALNY 
W\ VRQSShhShQAS *S YRCGPPHPANIF 
HFIFCR 


1209 


15110 


A 


1217 


139 


358 


KNTFFVWEAKRLFVPQAGGQGGNLSFK 
FPAPP/NLRG*SASSPSSGDYRPTPPCP 
ANFFFFKKNRISPWGSGW 


1210 


15111 


A 


1218 


361 


50 


WGDHGSLKPQPPLRSKMPPPLASLVTGT 
Q* LG/HPVAPFP IQLIFFFFFLRRSL/D 
S I AQAGAQWCDLSSLQAPPPRFTPLSCL 
SLKKKGNPVFVITWMNLEDIMLS 


1211 . 


15112 


A 


1219 


1 


361 


LKQHVSNVE KTAF Y WKTQSKTFVARE * K 
GEILSSNRGECLLQSFKGQPLLLGANAA 
GDFKVK\PMLLDHSEN/PKALKNYTTS 


1212 


15113 


A 


1220 


292 


362 


HIPWPAIQQAEVGGSLEPRRSRP*PPR 
LK*SSHLSLLNSWDYRNVLP/RLANFCI 
FLCPVR\ FKLLGSNHQNASVSQSARITG 
VSHHAWPILKLKKLFVCLFEMGSHYVAR 
AGLK/PPGLK*SSRLSLQNCWESRREPP 
HPDTIH 


1213 


15114 


A 


1221 


372 


3 


RLFFLLPPRPKGDFFPTLLIW*RHGFSP 
PHVFKPPPLNLILGALKKKFFLPS/LPY 
VKFYFFKRAPLFFF/ 1 FFFRDRVSLCWS 
* PPGLKQYTHLGLPKHWDYRHESACLAN 
NHNNNNTN F F FE TE 


1214 


15115 


A 


1222 


3 


291 


RSGDQDCPGQHGETLSLLSLTFVNLSLI 
CNL* TLS L I FVKI QKLAWHGGVRL* SQL 
/LRRLRQENHLNQEG\DCIPAWAT3KDS 
VSKKKKKKKGGPF 1 


1215 


15116 


A 


1223 


379 


19 


HMQILTIMRYYTPIRMAK^KIGYTK/C 
W*GCAATGYI \WECKMVQSL/WQNTWAD 
S * KLNTHLS YDI AIQFL/GFYSKE I KAY 
IHTKPCT*MFTATLI IKSQI /MKQYKCL 
STDKWIKQNRGIYI 


1216 


15117 


A 


1224 


387 


40 


KKTIiSTP\PEKHPTPPL?HPPPQKGGKA 
PPSFKPPPPKHPPPPKGISPPPP* IPPP 
PIYPPPPPAKPPPDFFL*SPPPPPPFSP 
PPPP*TPPPFFSPPLFFFFFFFFFLLIR 
LLV 


1217 


15118 


A 


1225 


21 


477 


IVSRLITVRLQKPRLDPRVRPRVRRKEN 
YSSV/SFINIDRNVLNKIL/SSQIQQYI 
KKLIRHE * FWF/IPGIQGWFNI LKSVI V 
TYV5JK\QKWKKHT 1 1 SVDTEKAFDKI H Y 
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LALILI KKKKKKKRAIiPQS F I PGDLFKK. 
PNGGFPGLKMMGRAPGE 


1218 


15119 


A 


1226 


1 


398 


ERINHTLFFLAEAQKRLPTGISGRGRPA 
I PHNTPQRAPDHAYLPAALAAQHRAGGQ 
QAPPPG\SSPSSPYDEVKDREGDVTASH 
GLRGNGWGS P * ATSLVLNNLMYMTAKYG 
DEVPGPEMENAWNALANNEKW 


1219 


15120 


A 


1227 


3 


238 


DAVJVAGHDG/ RTP*SQLLRKLRWEHRFS 
SGDRVSDP*SCLCTLAWAE*DSSSKKK 
KGGPFKGTKFNSRGGGRNYFFYGA 


1220 


15121 


A 


1228 


207 


2 


NRVSPCCPVQWHDHSSLQ/ PRTFGLKGP 
\SASAF*VAGTTGVHHHAQLIFHFFFFY 
* DRVI QAGVQWRNL 


1221 


15122 


A 


1229 


15 


413 


RKS VNVI IHHS E I LKKKN YMI I S I DTEK 
SFDKI * YPFMTKLSEN*DSLNIiINNI Y/ 
AKP / VANT I LHS KRLNAF P I KQGCLQGC 
\PPFLFNIILEVLASTKGRKEIKSLQIR 
VEE I / KPLSLFADNM / IVY IENPKES 


1222 


15123 


A 


1230 


363 


1 


AADPFACRPVSPHPPLYLVTSSQSLLTS 
VSPKKQPPSAGMQLGVGDSLSGWGWGRT 
KKTRLFFFKTRSL.SAAQTRRQWCEHRSQ 
P*PPGLIKRSSHLSLPSS*NHRHMPPHL 
AN\LCLFV 


1223 


15124 


A 


1231 


3 


402 


QANSCI FSRDFGSVGQAGLEPLTPGDPP 
AS ASPRCRDYRC\GPHAQLLLTFC* I S I 
LILV/CIS*MRDYFITCIYFFNYS*QII 
FYRRASDFFPFLRQGLALSPMHDHGSLQ 
PHPPRLN\H PPTSAKG VTGT 


1224 


15125 


A 


1232 


3 


396 


FLSQHGFLFLFFAGIDKLILKFIWKDN* 
NNYEKED*RGVI TLP/QYKAYSVATVIK 
MVWHWQSDKHVDQWNRBPRNRPNYYMSQ 
* FPL * YVSQRNENLPLEKNP YMNVHNGF 
ICNSRKQSRYPSTGEWVNKL 


1225 


15126 


A 


1233 


3 


395 


LPKCWDYRR*PPHPALF/IiFF*KHPKFP 
KMQVK/WRKH*TENRLVMYFSEKFGD*L 
KDQHKL YYLDMAFQFI FTFTYVMR YS IL 
FKFNIR*LTLSSVKMAVFLVETGF\TML 
ARMVLIS*LRDLPTSASQNAGI 


1226 


15127 


A 


1234 


416 


2 


KIDKLI LTF I WKCKRPLLAKI VLKKKNK 
I*RLPLPSFKTYGKGMV/TK/ TKWGWPK 
NRP I KGTGF * VQKKKTPHI YGQLVFDSG 
ARTITN/WY*DN*IST*KRMKLDLYLTI 
YKINSKWIKDLNVRAKTMTFLEENIWVI 
LD 


1227 


15128 


A 


1235 


384 


22 


FFQEI *NAIPVNTQMVRKQNSHPANIEK 
VL/VWIEGQTSNNIPLSQSLTQSKALT 
LFNSMKA/E/RGDKAAEEKMETCRGWFM 
RFRERRHVHNI KV* GEAARS CGS LAAGS 
PLKHLLLHNTLIC 


1228 


15129 


A 


1236 


379 


2 


SPYTKIHSKCLKGLNVRPLTFKPLEENP 
G/VMVPDLGPGKKFI S *APKAWATKTKI 
IQWGYIGLK/AFCPAKEAIPRVKTWPSE 
SEGIPAGHAFYEGFIFQIFKGLQPFHSK 
KKKDL I LKLDLL S ADI S 


1229 


15130 


A 


1237 


1 


407 


YRVGQAGLKLLTL * SAHLGLPKCWDCGR 
EPPCLTI IAL* SVFSLVLPAVLI KLINF 
F\CRIX3GQTMLPRLVSNSWPQMIC/LPW 
PPKVLGL.Q 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Pbenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Mcthionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y^Tyrosine, 
X=Un known, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


1230 


15131 


A 


1238 


3 


380 


ELSQRTCCMDIMSYYKAIVITMAWY* FK 
DKQRVQQNEVE S PKTLLY I YRLW I Y YEG 
DTTDKG*TF\NKWYWDSWI /ST/CKNM* 
FDYHFTQCIKIN/ SQWM 


1231 


15132 


A 


1239 


1 


396 


FADDMIVYLENLKDSSKKLLELVNQFSK 
VSGYKSKV/NVHKS VALLYANSNQAE / Q 
I KNPTPFTIAAKXQK/ YKQKNNLG I Y * T 
KEVKDLYKENYKTSLKEI IDDANKWKYI 
FCSWMGRIDIVKMTILPKAI*RFS 


1232 


15133 


A 


1240 


392 


58 


SFSMIiARLVSNSR/PSRVLPASASQSAE 
IIGVSHYAWPSKLSF*LTIDQTHLSCNL 
FIYVFFERRSCSVTQAGGQWYGHSSLQP 
*TPGLKQSSCFGLPKCWDYRLEALPRLM 


1233 


15134 


A 


1241 


500 


204 


SLS LS FS * DG VLLCLGWS S TPGLKR FS C 
LSLRSSWDYRCVPSSQTOFWFLVEMGF 
HKVGQAGLE LLTS SDNARLGLPKCWDYR 
RVPPRPAAFFLFFKG 


1234 


15135 


A 


1242 


2 


397 


N FM I NNLPKKKALCP WFTGE FYL P FKE 
ETIP*VIRISLSL/IFSIIFQKIETEGI 
LPNSFYEGCI ILISKPYKDI * \ENYRLT 
SVMHIDARFLDSILANPIQQCIKVIHCS 
HVAFV/SGTQDWFNIQKSM 


1235 


15136 


A 


.1243 


12 


362 


AGFYHVGQDGLKLLTS SD P PASASQS AG 
I KA* ATVPGLS PLNFCEVRFMESWS WKR 
P*RTS\VCSALAKCWDRNASQWDKLPS 
FPT/RM*SPCLHPAVSAFCGSASVLPGI 
IVTTSV 


1236 


15137 


A 


1244 


183 


383 


KQAGRGGSCL * / LPRCWD YRR ELLHLAF 

MPG* lkk/ffckdggltmwprlvlnfwp 

PVILLPRPPKVLG 


1237 


15138 


A 


1245 


410 


2 


FSRDRVS SCWPGWS / PNS * PQSDQLALK 
CWDYRCEPLHPAFC*MNFLKVSC*HGTS 
/DSKY/CQHVSPKNKDISLCNYNIIIIP 
KKFNIL*YIWFLDFFFFETEFCSCCW*D 
LGSLQPPPPGFK* FSCPSLPSSWVHRHV 


1238 


15139 


A 


1246 


440 


46 


KTEGANINKNTTYQ/NLWDADKAVFRRK 
FRALNAYTRK+ERAQLNLSSTLKTRKKE 
QNKLKADRR/QIMQI* *KLGKVENKQTK 
TIQQIH*TKSWFFEKISKIYEL*QA* *R 
KKGEKTT I TNVRUKRYSGRVG 


1239 


15140 


A 


1247 


266 


340 


RKCWPGAVAHTCNLXTLGGQGGRIT 


1240 


15141 


A 


1248 


3 


391 


DAWADAWGSRSRAVALFFFF WG FLGGGG 
LKTGFYF I PQVERRGLNFD * WNPPPPGL 
RGSSPPT/LPKQWEPGGGPPAPSNFWFF 
FEKRGFPQVTQAGFKLWN * GDP PAGPS K 
GVGITGGTPSPHPLFLKKR 


1241 


15142 


A 


1249 


3 


323 


MHRHA*LVLVFFCGDKVSLCCPG*S*TP 
DLKRLSHHGLPKHWDYRC/RATTPGFFS 
LFFFFFFNPPEFLLPWAPPSYSLKQNKI 
FLKRFLEPLAPPKGLPLKRAKGRI 


1242 


15143 


A 


1250 


3 


405 


QPDSYSPQPGRLSAPPEQEGGPWLILPH 
ACAPSQI CVGPHGAS I LSYQERKGTKVL 
SCEGHCKIiSSPVGLVGQSFCWQQPDAVQ 
WVPFRRRTQTPGTVAHACNPSTLGG*GR 
RITS/RPGVRDQPEQH 


1243 


15144 


A 


1251 


407 


2 


P FLKKTKKELP * GPAI PALGVP S F FP-F F 
FLPYKRKEIKGIRTPPMRIAALFTIAKI 
LKQPKYPSTOI *LKKRHTSNTTLLSRMK 
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Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I=Iso!eucine, 
K=Lysine, L=Leucine ? M=Methionine, 
N^Asparagine, P=Proline, 
Q=Glutamine, R=Arginine» S=Serine, 
T=Threonine, V=Valine, 
W^Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 














H* WIKKT* /HTDMMEYYSAIEKNGILS F 
AATGMSPKDIMLSE I CQAQKNRY 


1244 


15145 


A 


1252 


1 


253 


VADVRES LDPGD* GCS ELRLCHCT PAWA 
T\ SRTTSQQQLKKQNENTT * KTYGIQLM 
NCQIANYCSKLLYFRKMSNLQSNLKKKK 
K 


1245 


15146 


A 


1253 


3 


372 


KKKAQKSGTIVLPCNPSYSGG*VGENA\ 
WARSQPGLIS*KQQ*QQQNQQQTTSKTD 
PYIYG/YLIHDRGGK\NTQGWDRLUSIRW 
GWGIWLMTQRKIKLELYLTPYTRTNSTW 
IKDLNVAGCGGSCL 


1246 


15147 


A 


1254 


2 


410 


KNLTPIVGLKAATN*ESIPPQT/SRKKK 
KKKKKKKKRGGALKKKKFKGGGGGKKNF 
. F KGG KKKKLGGGVKKRGEG KKPGG KKKK 
RFGKKS FFSRGGKKKKNRS SSSSSSSSS 
SSSS 


1247 


15348 


A 


1255 


61 


487 


GRPGPTHAFWIDSFFMGNEARFINHSC 
DPNCEMQKWS VNGVYR IGLYALKDMPAG 
TELTYDYNFHSFNVEKQQLCKCGFEKCR 
GIIGGKSQRVNGLTSSKNSQPMATHKKS 
GRSY/RE*XKNKKKRGGRFKGSQFSSPG 
MQG 


1248 


15149 


A 


1256 


118 


5 


MFIAELFI IA/RRWK* PKCPLTDEWINK 
M*YSHTM*HY 


1249 


15150 


A 


1257 


390 


1 


TQKNTPTPKPKPTSQQHHQKNKPTPPGF 
FFFSPPGK\GGFFPSPLFWVPPGFFPPP 
VFKTRPPEFIFGAP*KKFFFSPPRSLNF 
FFLRGP PS FFFFFFFFS * VGKEGS S PSH 
ENPLFVPTEGW 


1250 


15151 


A 


1258 


378 


3 


GAFFFFFPPPKKGFFFTPFFFFGPGFFF 
SPPFFIPPPQIFFFGPKKKKKFLPPPPQ 
KIFFFLRPPPLFFFFFFFFFFFFFFFFF 
FFFL\DLD*NVLSGTSAPEKNQELLGMV 
AYACNPSTLGGRGG 


1251 


15152 


A 


1259 


462 


287 


RDG \ FH HVGQDGLHLLTS * S TRLS L P EC 
WDYRCDLPCPAIPAVTLYQIQYRPLGLE 
SKA 


1252 


15153 


A 


1260 


446 


34 


NVRAET/ 1 KTLEVNTGVNLHDLS * /GKA 
FLDARPKAQVAKEKSRQPSLHRRLR/FC 
ASKDPIKEVKRQPTE/WSKIKHVSDKGL 
EFKIHKELLEKLSNNPI *KWDKDLNGHF 
SK/ELQIASKHMRRCSASLVIRKCNEIV 
DPD 


1253 


15154 


A 


1261 


2 


383 


GSQRKWFLEMESTPGDNAID I VEVTTKG 
FNY YMNLiVDXAVAGFER IDSKFERSSSV 
SKILSNSIACYR* IFCKKKSQLMQ/QTS 
LWFYFKKLPQPPQPSAATTLISQQLYTL 
RQDSPPAKRL* FTDGL 


1254 


15155 


A 


1262 


476 


50 


FFFFFSDTGSHSW*AAVQWHDHDSLQP 
*P?GP\SDPPTKSSTREF 


1255 


15156 


A 


1263 


143 


472 


TGAVP I RPS WN/RP PAM I FF*NAQG I LF 
AEFLASQRAILWEFFEKAYHESVLRKSA 
KGIAEK CPGKLHQR VL VQ Y DNAIiAHFSH 
QTRTTKSSTRQFR 


1256 


15157 


A 


1264 


179 


489 


YIFFFLSLFFFLWPRPEYRGAITVHCSL 
NLPGSSDPTASASSVAGTKPH* YI FFFL 
SLF/CFSVAQA*VQGGNHSSLQPQPPGL 
K\ Y PTAS AS 3 VAGT KPK P Li I FF F F * K.KK 
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Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIaIanine, 
G=Glycine, H=Histidine, J=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R^Arginine, S=Serine, 
T—Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /-possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














FL\CFVPQAGGQGHDFG*L*PPPPSLKQ 
GGGLTLSRIGDYRGPP 


1257 


15158 


A 


1265 


1 


536 


FRGGWGS VRRAPGTASC YLALAGG P PGQ 
CPPAAISCPSPLPWGS*TE/PYVPRRPG 
DPSAAP/PSW*VPRPGVPRNVPGRGRVG 
QDGCGQAQDI QAAASLMADTAPDSPGSA 
■ GSVRALPACVPEISGS/SGLPPGAALP* 
VAA/RPTPGRHVDTHPRQHSFCGGQBGD 
IRAFPTLYLEVYPGPP 


1258 


15159 


A 


1266 


403 


2 


TSLHPRGYMRLLRQGFISAPCGYMRLPR 
GSSbHPRGYMRLLKTG\7HRCTPWVHAPS 
QTGVHLCTPWEHTPS*RFISAPLGAASH 
VT/GSLSIQHIYIFVFCLLRQGL/NSVT 
QAGVQWCNLGSLQPKLPQAQVT LTK 


1259 


15160 


A 


1267 


2 


401 


FVLNPGGRSCSEPRSLHCTPAWATRAYL 
QLGKKKKEKKKERGEENKNQGPPRPLKR 
EVGTPGQKKPL/WGGSNGAGQKQPA/QK 
KGGKKKADHKGQRGKN* KRKREGGGSTS 
KNNSRGTGAKA*NPTIWGGGGKKI 


1260 


15161 


A 


1268 


49 


416 


LRGRALDPRLLRE CGDLGAP PA PEVALR 
AGTCWTR/CTL*APPRGA/DRSPWPPRS 
PMCKAG/DECQDGI PG \M KAW S CGLRTR 
QCPWP*PKLPCGPGHPAWRT*PLPQTA/ 
CGPGPAAPCAG 


126! 


15162 


A 


1269 


420 


47 


GPPGWASFRLNFPKARR*GEWKTPGESG 
GAFFS PPG KNF\AGQN*GRPPNPPPPGP 
GKGG I QTRGGAGLG KNPFRFWGGFPNPG 
NK*GGGTKKKEGPPPPLFFFFLNTDFCN 
L I KRDRLG VGAHL 


1262 


15163 


A 


1270 


404 


1 


AEILELKNVIDILKNVSESLWSRIDQAE 
ERISELEDRLFENTQSEESK* KR I * KNE 
ARQQDLENSFKKAWLRVIGLKEEVEREM 
GVESLFKW III ENS PNLEKH IHI VQEGY 
RAP\ FNSNKKTSRHS I INSHTK 


1263 


15164 


A 


1271 


387 


2 


KKEKKKEVI 1 1 FVLLLPLK* FQNIQVWL 
VRELB KKFTGKYW\ FA* RK I LPKPTQK 
RCTKNKQKRPRIHAPTAAHVAILEDSVF 
PGEIVGRR IRVKWTQQLTRVHSDKAQQN 
NVERKVQTFSGIDKKLR 


1264 


15J65 


A 


1272 


1 


393 


FRMGRLPGGGAPHFSDGVAGQRRSSPKR 
RSRGRAEALLTSQTGRLGRGAPHI SDDG 
RPGRDAP\PS*TGWQ?GRGAPHFPD\GQ 
PGRGAPHI PDDGRPGRDAPHFP YGVAAG 
QRLQSRHFGRPRQAAGRWRL 


1265 


15166 


A 


1273 


3 


317 


S S YETKGVMIAS FS SREADNHTAFI RI K 
TNASDSTEFIILPVEVEVTTG*WKTDEI 
ESCFVFSIAYSGKVKNNC*LFFF?\APG 
I YSSTEMLDFGTLRTQGKKI F 


1266 


15167 


A 


1274 


92 


368 


LCPPGGRAGQGQNLN * GVPS P PGPKGFW 
GPTYGGGGGKRTTRPGGGKPEKSFD/GP 
PGPPPGGTKGNPAPKNFFAQKERGPFFQ 
NGGPGNKLF 


1267 


15168 


A 


1275 


401 


3 


DMCDWFKKEFSDTTPKA* SIKEKQLTSF 
I K I KSS CS PS \ KNT I KGLTRQEKLWGK I 
FANHMSDKGLVSRI YEELSKLN\K* KLN 
F*KWAKQ*DHLLKKTH* *QICKERYSLG 
KGKITMRYHSTSIIMCIIKTIHS 


1268 


15169 


A 


1276 


298 


426 


GATM^VLLF* LAFDLRQS L/NCFPQAGV 
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K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine T P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














QWRDLGSLQPSPPWFK 


1269 


15170 


A 


1277 


421 


1 


VSFKDQKL* INKENHPIKKWKKICNQPL 
PQIRYMDANKHIKIFSTSLAARETNMKI 
TT/RILWKTWJQFFKKVSIYLPLEPAIS 
SYLFTQEK*KHIDGSCSPSRVATAKLLA 
LQWRRPSWNCMHQRAGSWKQIGAPPFSK 
L 


1270 


15171 


A 


1278 


1 


218 


TRSGVQDQPNQHGETLSLLKIQKLASYG 
GVCV* SQLLRELRQKNClANGPRSHHCT 
PAWAT\EQNS I * KKKKG 


1271 


15172 


A 


1279 


88 


396 


TFLFS S S FFF FGGGVP PCPPGLRPG \ PN 
FASLHPPPPGFGGFPSLTSPRA*NWGPR 
PPPQATLGAFGLEGE* IFPPGRPPEP*L 
WGPPPPPPQRGGMAWPTPL 


1272 


15173 


A 


1280 


94 


399 


TDFLFL* TDFLFLCLCSLKNKIWVNEFR 
YGGFSLGVSNTQALPPSQEVNDAIKQMK 
KHLKLAK/ DKI S I VRC I TKMGM* LLGYR 
SS WQVKRI TWQL I VLNWLAYRAVFG I 


1273 


15174 


A 


1281 


236 


2 


TQSRLVFFYMTGPAVYLNHHALRTHQGS 
HLCFFFCFEMESHSVTQAGW\VWWRDLS 
SSQPPPPRLKRFSCLSFPSS *DY 


1274 


15175 


A 


1282 


37 


410 


KKKTLfCKIKKLCPPGVKGEFNPPFYVLS 
KVFP KKERGLFFKVTFVS LTPFFKKKNF 
KIGGGSGFPPPLWFFFFFY/RDRVLLCH 
PGWNAVTRS * LTTS VDSS DPS / CLRLPS 
SLDY 


1275 


15176 


A 


1283 


404 


232 


LSSWDYRCLPPHPANPLYF* *RRGFTVL 
ARMV/GPRDS PASS SQSAGI TGMSHCAQ 
PG 


1276 


15177 


A 


1284 


3 


402 


mqi kitmrnyctpvrmp* kkk\ srtkpr 
c*ggggttkilipcwgdykige/plw/k 
svwqflik*nnhllydpailplifytre 
mktyvhtitkycnhnek/ cmqgfteaiif 
/tlpkte/ sqpkhsst 


1277 


15178 


A 


1285 


1 


253 


LRGKFMTLHS / S ILKRG*SQINN* TATL 
IK*KKKGKIKPK/RSRRKVIIMNRNK*D 
* KQNNNKFNETRS W FFKNMKKI YKSLVR 
VT 


1278 


15179 


A 


1286 


400 


105 


FNFKKKKKKKNREVSE*QY* QAKYLKRN 
FTKNI * VAEKHMKNAQYVIKEMLI *LTM 
RYYYT P I K\MAKI KLTDNHKCQQLKPS Y 
MVGKEFGKFL I KLNM 


1279 


15180 


A 


1287 


400 


119 


TVFHHIGQAGLELLTSSDPPVSASQSAG 
ITGLSHW\PASSSFHSNHSCPVPLEPHQ 
VAQLDSFGYEKVCSFSN*VPGAQDKNDL 
SEN 


1280 


15181 


A 


1288 


157 


386 


NDLQFHP FYCLF/ DLFCQSLKKAGMQ WH 
DLGSLQPLLPRFRRLLCLSLPSSWDCGH 
VS/PMPG*FCIYTHIYFLVEMGF 


1281 i 


15182 


A 


1289 


400 


124 


FDAPAKGQGGDFGSPPPPPPG/AQKIFP 
PHFSQ* IG * KEG APMGQP I FVS FS KTGA 
PPPGQGGFQIPTPCGGPPQNFGIQGGAP 
APGPGGGFF 


1282 


15183 


A 


1290 


419 


3 


KNFFFLERGFFFFPPGGRAGGGF* FPPP 
FFFQGKKISGPPPPKKRGPPGAPQKL/R 
EIFLVLKKKGGPPLWPGGFLNPAPKNFA 
RANFPKRGGSRVGPPSGGPNLFNLGGGW 
FFRFFFFFFFFFLVFETEFRFCCPGWSA 
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W=Tryptpphan, V=Tyrosine, 
X^Un known, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
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M 


L283 


15184 


A 


1291 


397 


39 


WGQVWAKDWIHGLGPKTQ6CPPPLGVFG 
CP/SLIPPNLFFYGIPGGLKIFVHKNPP 
FPIKKEPANWIATFFPPGVFLFFL/CFF 
ETRSHS VTQAGVQ* RDLVSL * PTSGFKR 
FSCLSLPSSR 


1284 


15185 


A 


1292 


399 


1 


LNFF FLHRQ KLCN I LGS DD KVPA FS I Y P 
EPRI CYVFSLLFEI IFSFDFNKKIRI YK 
KHVSR I YKDLLKFNNKDNPVKKWAKDLN 
RHFSKDGIQKVKWHIRNC*TLAIK/EMQ 
NTTTMR * HLMPNRMAKI RKT I 


1285 


15186 


A 


1293 


412 


) 


ARMVSISRPLGTPIWRSQKGGNKRVSPP 
TRPNPLFLKNPKKCPKSRD*NKPPLLKK 
KKKWSGPCPAQ/SCPGQNPHP/SPPKQL 
PPSPSTPPSRPSPTLAFPMAS*LAISPA 
HTHCCWRELSGTHTLS I P CLKPLRGP 


1286 


15187 


A 


1294 


229 


1 


FFFETESHSVTQAGVQWCNPGFKRFSCF 
GLSSSWDYRYAPPRP\ANF\* FLVETGF 
YYVAQAGLKLLS PGDLPALAS 


1287 


15188 


A 


1295 


401 


2 


STLP VH * KWTTKAWMTAHL / FTAWFTE Y 
FKPTVQNYCSEEKIPFKILVLIDNAPGH 
PQALMGMNKEMS WFIPV/ NTTS I LQP \ 
ADQGVI FTLKSYCIRNLFQHQCTPAWVT 
QLDSVSKK* INI FQPGMVAHACN 


1288 


15189 


A 


1296 


1 


336 


KKT PRR KTHKEHHNGEK \ LRA FP LR SG I 
KQ/GCAF*P/YILNIVVA/VLAKEYGKE 
IKASFRKEELKLCLFIDMIIFVEIPENS 
PQKL\ IN* FSKVAG* * VSTQNS VAFLFF 
FFFF 


1289 


15190 


A 


1297 • 


3 


384 


HTNMQKVLV/VKTEDQPSHNIPLNQILI 
QNKSLTLFNS IKAERSEEAA* EKCEDSR 
GWFMRFKKKK/RI S ITKMQGEAASAWQ 
AGTSYPKDLAK/DEGSYTKQQiraSTVNET 
VLY*KMI/PTEDFWRKDKSM 


1290 


15191 


A 


1298 


3 


395 


SRSVTRAGVQWHDLGSLQSPPPGFKQFC 
LSLLRSWDYRLSPPHQQMGGI FL/VFLF 
FFFPGKGVFLGG PQAGGKGPNLG * WNP P 
PRGLGEFFGLNPPRGWGFQ/HPPPHPQL 
F/CCFFRGKGGSPRRAGRAPNP 


1291 


15192 


A 


1299 


260 


4 


GFIVSSAVFPLLKCLLDILVSSSVLLYC 
LWNFKIQKSFRFYNLSPLVSSFKNSH*N 
SNI/WPGAVAHACNPSTLGGRGGRITRS 
GD 


1292 


15193 


A 


1300 


91 


219 


LSRL»S AWDY *HVPTHVFFVEMG FHRWQ 
ASLELLS * GDPPALA 


1293 


15194 


A 


1301 


277 


441 


TVEF*GVQSMKEEI \ELCLFEDDMSVYE 
ENLKELTKN\LLK* I SNYS KVAGYKVN I 
*KAIAVLHTSNEQ*NFEIGNTILF/TIS 
TPKYLGINLTKYAQGLYEENYSNLMNEI 
KELNTW/RDILCS 


1294 


15195 


A 


1302 


256 


482 


YNVYFKICIGPGTVAHACNPSTLG\G*G 
GRl/TLRSGVRDQQVQHG 


1295 


15196 


A 


1303 


182 


460 


VGREFLDMTGKEFIYKFLYIRKIN*SSS 
KLKTFVLPKTLLRD* KAKLQTGRKYL* I 
TYPVKGLVSRLCKKLSKI^NSKKTTQLKK 
WAKDMNRHF 


1296 


15J97 


A 


1304 


1 


468 


FKQFSSLSLLSSWDYRHPTGNFCELAPY 
KQNAPS CSNVFTDNV P VI ST * QGLQALS 
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SEQ IB 
NO: of 
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SEQID 
NO: of 
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eth 
od 
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NO: in 
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09/515,1 
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nucleotide 

location 
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ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=A)anine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K^Lysine, L=Leucine, MNVIethionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possibJe 
nucleotide deletion, \-posstble 
nucleotide insertion 














PNVNEQSPWRETKKTAVDLCGVIWKPCQ 
VFLRKKPKTKTGQ/HEQQIFGF/CFFSB 
TESHSVTQAGVQ*HHLGSL* PLPPPFKR 
NSTTRTSDYDGP/HVPP/HPA 


1297 


15198 


A 


1305 


880 


1026 


EEHGAGLSGSQDAAGGVPAG*GGWAQLW 
VTRRASLFLDKTHWPVDEQNLGSLYTIE 
ATAYGLMQKLELGRYNETHAIAKWLLEK 
QELGGGFRSTQ/SDGDPRETTWALEAL 
TRFREAVPFSCG IQDLHVQIRAPKTALNV 
NWY I DHSNAYQQRS AKFLAQDDLE I KAS 
GNGRGT I S I LTMYHKS PESREDNCNLYH 
LNATLHSALEENKKGGETFRLRMETRFQ 
NN* EATMTI MEVSLLTGFYPNQDDLKQL 
TSDVERYAFQYKTKTSTSDSTWLYLEK 
LSHEKNTELGFRVHRMLQAEFLQAALVT 
I YDY YEPSRRCSTF YNLPTEQS SL 


1298 


15199 


A 


1306 . 


3 


726 


RTDHYQFQSLKHCLTGGEALNPDVREKW 
KRQTGVELYEGYGQSETVVTYKWELSYE 
DANTY I VK* KTL * TQKEEG \ 1 1 CANPKG 
MKIKSVSMVKESLPYYVHIVDDEGNVLP 
PVEERNVTVRIKPTQLLCILNCYLDKCE 
KTAVS*QGDFYITGDRARMDKDGYFWFM 
GRNDDVINSSSYRIGPVEEESALVEHTA 
VLES T WS S P Y P I MGE VGKA Y I VX*TRA Y 
SSHDT*ALTRVLQEHVIK 


1299 


15200 . 


A 


1307 


230 


486 


NAI CPNGS KGS LGLGS CSVTQTGVRW* D 
HSSLQPRPPGLK\ YPPTSAS * VA\GPQA 
GVQWHGLDSLQTPPPG/FKRFSCLSLPS 
SWDY 


1300 


15201 


A 


1308 


463 


440 


SAI PVHT * RRRKQTS LNADMEKV* WW I 
EDQISHNI PLNKSLIQGKALTLFNGMKA 
KRSEEAGEGKFEASRGQVMKFKERSHLY 
NVKVQGEAAGADGEAAASYPEDLAQSTD 
EGGYTKQQNFSIDV/TTFYWKKMSSRIF 
I AE FHHTD * GDNG S VP 


1301 


15202 


A 


1309 


463 


59 


EVISTLTKMGKILKTKNAK/CWQDCGTK 
RTFIHCWWGYKIiVQPLWKRV\YTFPYNL 
AI PL IGI YP * /NMKI YSHKRTCTKMF/ 1 
NS LF 1 1 S KNWKHLR YTFTRE W I K/ M * YS 
HTKDYYSAI KRNKLDMFSNMKESQKHYG 


1302 


15203 


A 


1310 


438 


29 


PGGKGGGGPTPAPFPFPPPPPKVGVFW* 
GFPRPFLFKPPPPNKAPFPGAPHFF/SL 
PPFPRVFKKTFPTKKKFFPPGGGGKKKP 
PPPLQGGKKMGSALKFPSGGKKPWGISP 
QI 


1303 


15204 


A 


1311 


2 


310 


HNQKNEFGP YVTS YVKVKEAK* IAKTRK 
LLEENIGVNIHDIELGSGFLAMMP\EA* 
AIRLKIEKLDFIKIKNSCASKDTINKVK 
KTGRPGMVAHAYNPSTLGG 


1304 


15205 


A 


1312 


19 


348 


RGQQDVLPG*RGQQDVliPGRGTYSEYGL 
I FSARNPSMEV*SVNHR?FHHHGVP/HQ 
NCSDSGSYFISKECGN/WVSACGIH*CC 
LVPCLPKAADLGE * VIWGDFSLSGKEKKK 
EFYLIMTFSQ 


1305 


15206 


A 


1313 


2 


424 


ENKLTNHGKTGNGGAQSQ/ PPECEPRTH 
LQRGLEGRGGGEP WGQGQPD\ PPSNS S L 
KNPQAGVPPFSSbKGKVKRDRSVSVDSG 
EQREAGTQ*TFVNGKKKKKKKKKKAKKA 
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SEQ JO 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=A)anine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X^Unknown, *=Stop codon, /-possible 
nucleotide deletion, \=possible 
nucleotide insertion 














AYSPDSKA/PSSKSSPKSS 


1306 


15207 


A 


1314 


375 


3 


KKRTLLI PLMNMDAI /MNKI LAN* IQQH 
IKRI I HNDQVQ FVPGMQG WFNIQKNINV 
I PYY* LFY*CYINRI KDQTLI 1 I SIDTE 
KNDKIQNSPIVKTLRK\IKENFLTLIKR 
IY*KSSNNKKNID 


1307 


15208 


A 


1315 


403 


3 


LEGVSFFFPRVASQGSILGSCNPPLPRF 
HHFS/CPHLLSKWGYRFPPSPPA\FFFF 
FLKTG F F F FCKN FALCS F P * KQ AS P P P P 
LTVFFFS /HHPFFFFFFPLFFFLRWS FA 
LVAQAGVQWHDLGSLQPPS LGFKRF 


1308 . 


15209 


A 


1316 


38 


427 


PEXPARPTRPRPSAWQPPRLRSEEPLRP 
AAAPS£K*GASPPGSHPIWEVRSVSARQ 
PPRPGGRWWGVNPPPGQPPRPGGEGRLC 
PAAPTGK*GAPLPGQPPRPGGRWGGYPP 
ARSAASYGR*GAPLSSRP 


1309 


15210 


A 


1317 


43 


399 


LTFFFFLGEGARPPPPGWGPGAHQGITA 
PLFWGGQGNPPP*PPGEGEPS/QGPPPP 
GGNVTFWKKKGSPRAPGGPWTGGPKGLP 
RPPPPKGCE*RGNPPPPTPKFLGFTTFQ 
KKGKNSGP 


1310 


15211 


A 


1318 


437 


54 


MNEQKGERGLSSLGLPAR*D*GKGWVKE 
GR/EGVTFSPREPKEESLRWSTPQKEIV 
GVINHPRGQRVDRQGAGSGCEGFALRAQ 
TGG P PA * E ITRAGEQKAGGGS KGAQTLQ 
RETRP PRGQRGGGRSASL 


1311 


15212 


A 


1319 


351 


133 


GGGWAEVPLTS QVGRS GRGTPQLPDGAA 
RQRRPPPPRRGSWAEAPTSQTGRPGRGA 
PHLPDDGRPGRDAPHLP 


1312 


15213 


A 


1320 


474 


11 


KINSPSQKKKKKKQCSSIRKIA* /DQNR 
HFIEEETQIPNKHMKRSSIS/LAIKEMQ 
IQITKSYCIIIRLCK*LIKNSDSIKCWQ 
GCREI /GSLIHCWRKLKMNSTALQMVYK 


1313 


15214 


A 


1321 


485 


33 


C PAS RVAG I TG AHHHAWL I FVFLVEMGF 
HHVGQAGLE/LPVSWGNGGHL/RLCLLF 
GYCGQCCYKPS *RRFFCVSFYIRA*K*G 
FVCLFV*MVSCCVTRLEFWTHRCNHSK 
LQP* I PGLKCS\PASGF*VARTTGLYHG 
VWHVSNSTALTSGLQ 


1314 


15215 


A 


1322 


451 


145 


THPFGRPRGGVFKVRKLNPPWLKKENPL 
FIKKKKKKPGPGGGPFIPTPSEG* /GKK 
KGFNPEKEASNKPKFAPSNPQTGLGKKK 
KPPS LKKKKKKKKVYS LI F 


1315 


15216 


A 


1323 


470 


151 


GKAEERHDPGSRACSEPRSCHWTPAWAT 
\SETPFKKKKKKLCISWPGTVESCFSVR 
T * * I * LRQYVLYLFLFQPKNVI YLFICL 
HDKANMVMFLDFKYYFCFLFLD 


1316 


15217 


A 


1324 


64 


367 


TWEGEGSWLTSQDRTTAPLHPSLDNRVK 
LRLKKKKS VGFLS PS I LLAKNQ I KKPAP 
FTMA/S K/R I KYLG INLTKDVKDLYNEN 
Y * TLMKNLKGGAMCS FT3 L 


1317 


15218 


A 


1325 


193 


381 


AQLFKTSMGNSETLSLQKKKKEKS * LTL 
PS SWDYRHPPPRPVNF\ * FLGEMGFRRV 
AQAGLEL 


1318 


15219 


A 


1326 


410 


7 


GFFPYP*LPK/SLRC*GKDFYNQGSGQA 
RWLPPVIPPPLEAQAGGSP*GRSLRPTW 
PAWSNPS FLKKPTPLFLKFS * GRR IALT 
P KAKVS VN WDS P P ALQPGGPSKT F FP KK 
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beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 
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Predict- 
ed end 
nucle- 
otide 
location 
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ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, I^=Leucine, M=Methionine, 
N^Asparagine, PHProline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KKKLR 1 1 GPGWAHACNPNTLGD 


1319 


15220 


A 


1327 


295 


3 


VCHFGIYFCVCSLYFTCLYFPFLIFL*V 
TLTFLVFYFDFTVLFI ISM YFNDCSRD 
CNIHM*LLSLPVLIFHCCC\FVFRDRVS 
LCHPGWSALASSLL 


1320 


15221 


A 


1328 


2 


392 


RQASRPP * SAPPAPAGKEGSGE * PPS PK 
/PPPPPPKWPPRGFPFPPAGPPPPPPP 
KKSPPPKTNPPPPPP/IKPPPPPPPPPP 
PPFFFSPPFP 


1321 


15222 


A 


1329 


378 


7 


TDDLILKFI*ERYKPNQF*KRTKLKDSL 
PDFENYYKAKWKTRW* * YKE\KHIDQW 
NRI ESR*QI FN IAS I I /HIEKEYLCNKW 
CYRTTKMEEKNMNLKPLP*TIKFNSK* I 
TDLNIKTITIWLKL 


1322 


15223 


A 


1330 


403 


2 


VETGFRHVGQAGLELIiTSGDLPTLASQC 
WDYKHE PLRLGELRS LTAAWAMQQDP VS 
PNK* I KI K* I / P * S EQFSG I * S IHNWQ 
NRHFYPDPTHFHHSKVKPLTHYWSPHS 
FLSPTPGNHQCLLSVSMDFSIL 


1323 


15224 


A 


1331 




389 


KMK S Q A I DLE K I FT KH I S DKGLG YGL Y K 
ELL*FN/R/RCQTTQLKMGER*TNSSTM 
GMSFEWMVSKHMKRCSCAFPSLVTREMQ 
VH/TTSLFVERYHYTPTRVAVIKQSDQV 
\W*ECGIRTLIHC*WECKMIAT 


1324 


15225 


A 


1332 


392 


3 


NNFMPSSA/PPPPPPFFFGGPRVFSPPP 
FFKPPPPFFFFGPQKKKIFSPPPPLKFF 
FFLRPPPPFFFFFFFFFFFQKNLGKGFF 
F* PRPEKKKKPPPGGVFFFSLFFLRCS V 
ALSPRLECSGAISAHCNLCI 


1325 


15226 


A* 


1333 


394 


1 


SQVVGPTAIHYCAWLVFKFLGHFSKTTI 
SAPLLK*TSTRAETSKSFS\CPRKMERA 
EGRRMFLGKSLLKQIVLLF/LISGSCVT 
AS*FL/CFFQTESHSVAQARVQ*HDLGS 
LQPLSSRLKRFSCFSLPSRTRG 


1326 


15227 


A 


1334 


416 


1 


LTLLPKPTPYNTRKENYRPISLINIDAK 
IL/NILAGKIPQYIKVHN\NMGLTPEMQ 
GLFNI * K/RKSVNVI CHINRKEEKKILL 
INA*IVFDKNPTMI*KNSWQ\GEYKEIY 
SNILLNGKMLKACHL\KTRIN*GCQLSP 
DAW 


1327 


15228 


A 


1335 


391 


57 


WQRVCNWDKNLYQKEKRQDNVREK/WAT 
DLIINFTHEEKQSVNNLQKDTHPHSLVI 
REMQIKTTSY/HLLAKILNSDNSSYW*G 
CGGMGTL+ CCVJWVSKEIHALWEIVWFIY 
SK 


1328 


15229 


A 


1336 


285 


2 


I FFFPWRGS 1 1 * HLSLINKGSRQNERAH 
\KDNGDFSQIISKLKLMYKYKILNQTTI 
XYMFFPKKHRTSIKISHILGSKGSLNNC 
QRISVLQTGRV 


1329 


15230 


A 


1337 


34 


391 


AKIASLYSSLGNRANSVSKQNKKTKNQN 
LKN/IKTAGGLGWAHSCSPSTLGGLH\ 
DPGV*GCRSL*WCRCTPAWASG\ETLSQ 
K*INKGKERKKKMKNCWWMNNSLIQLTS 
FSFTKTDWSS 


1330 


15231 


A 


1338 


292 


1 


GGGISKTPGGEGAINPKKPPALPPGGKK 
EAFSPPQKXKKGERA*RDISSEDI *MAN 
KHMKS CP PLLMI REMQ I KSTMR YR F IP I 
RME\ILKKQKN3 
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nucleotide 
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correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=JLysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, Impossible 
nucleotide insertion 


1331 


15232 


A 


1339 


369 


16 


DPWQNEQLSRVTCPGPRRASMFRIT*YK 
VI IAALMAYSVGPRAVSCIRALWTTYG\ 
IMDNYKRHERRFLKPL*WPGMVAHACNP 
STLGGHGRHERVGRVRS I PDCWRVAAG 
VKASI 


1332 


15233 


A 


1340 


394 


1 


TQSGVQRHDAGSLQPQPPRLKQSPHPSL 
PSjSQDHRRVPPCPARPCSSHDSYVLREG 
PCGRW/HESRGRVFLVLFS * * * LSLMRS 
GGFKKGEFPCTSSL/SCLPPST*DVTCS 
SLPSAMNVGPLQPHIFMHGF 


1333 


15234 


A 


1341 


400 


161 


KIGPNLPCFSTKRPWKIDKIQFWPPGKV 
F*/SPPPPKKILFFPPPPSGPFF\PPLP 
PLPKSFFPPKNPNLPPSFGFDIPPP 


1334 


15235 


A 


.1342 


377 


3 


NRHFPKAETLKASKHLKRHLPLLVIREV 
Q I KTTN I CHN I ATRLAKM * KADNTKCWE 
SCEPVLTFY\C* WEYKLVQPL * KMVWQK 
P KVCVP Y YLAI LLLQHI PEGTCAS/CY * 
DIHKRLLTAMLFAM 


1335 


15236 


A 


1343 


170 


369 


RCNENK I TKE VYFGE I FVRLDNKEKNTS 
FFYFYFLFY/ CWR*GLALLPRLVLNS*A 
QWLLPWPPKML 


1336 


15237 


A 


1344 


385 


2 


SDLRRSTHLGLPKCWDYRC/RATVPGRL 
FIFLMVSLKHKSFKF* *RPI YLFFLSSL 
MLLVS YLRNH* L I QGHKDLLPMFS SKSF 
M I LVLTSRS ILS LLF C FLFFVLRQGL/N 
SVTQAGVQRHNHGSLQPQP 


1337 


15238 


A 


1345 


1 


281 


HMATKHMKRC* /SEMCTbKPQLNTTTPD 
HQNG * N * K * LVNFKG WQ * FGAVWNL I H C 
W *DCKLAQPFWKIDLS IKAKISMS YSHV 
VRKIFKLKNK 


1338 


15239 


A 


1346 


176 


1 


SVIWNIKEP* I AKI I LKKKT I VGGLTVP 
DFKTYYKAMVI /KTV* S WLKDRQRNQWY 
RED 


1339 


15240 


A 


1347 


398 


2 


PRPPGPVRRRCPS/LTATSGSSSPPSSP 
FYLGYPQGFPLPAVLNRGPGILFWGPHK 
KNITLPARGR * IGSS *TAPPFFF/SLFS 
FLLFSFLLFDTGSYSVPQAAVQWFNHGS 
LKPQPPGLKLSSQLLRRLGW 


1340 


15241 


A 


1348 


9 


395 


GLQNPCVGFLVSGFFFFFFFPPKKGLGV 
PPKKK/RGPNPNGPVGSFLATGPFFWTG 
PLKKNPAPPRALFWGPPPPPPWGG*NPP 
PFLARGS PFFKKFFGGPDPFFYRTNPRG 
PNKRGPPWPLKWGNPPPK 


1341 


15242 


A 


1349 


119 


1 


ENVFRNM/WPGMMAHTCNTSTLGGQGEW 
I TRSGV * DQPGQ 


1342 


15243 


A 


1350 


58 


396 


GIRVGKVCFI IFFFGFFFFFGKGVSLPP 
GRETGGPPlALIKPPPFGVKGIFLPPPP 
GGGG\CGPPPPPRVIFVF*GKGFPP*GP 
GGF*PPAPKGSAPPPPPLWKNWGNRIFG 
GR 


1343 


15244 


A 


1351 


396 


1 


GPTKGPGPFLiDGQGFPPPTLKQNFFFPA 
LFFPPLGKRLILGGF\PKPGGLLQP?T* 
KPLRFKPGGESDCFPKFF*GRIVCFQQS 
LCFFPPPKKKKTAFGKRFKKKTLLFFFF 
SDGVSLCHPGWSTVAQCRL 


1344 


15245 


A 


1352 


1 


250 


RRL* SQLLGRLRRQNGVNLGSGACSERS 
\ CS ERRS RHCTP AW AT KRD S VS KKT KCR 
RKQHVG SLTVRLRPG Y WI RKCGCHw WP 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Iso!eucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possib!e 
nucleotide deletion, \=possible 
nucleotide insertion 


1345 


15246 


A 


1353 


249 


3 


CGATLCPKKYMKITPSFFVEIDN/AILK 
FIWKFKGPRIAKTTLK*KQG+RTHTFHF 
HNILQKAI IVKTMWY* YKHRQSPGRVAH 


1346 


15247 


A 


1354 


15 


416 


LLTLSSSLKITIHTNKGRT+SFIRENII 
IFIATTNLLGLLPH* FTPTTQLFINLAM 
AIPL* AGAVI IGFRSKIKNALAHSLPQG 
TPTPLIPILVIIETISLLIQPIALA\AR 
LTAN I TAGHLLMPL I GSATLT I 


1347 


15248 


A 


1355 


2 


416 


IKYLNVRPETLKLli*/ENIE*NPHNIGL 
GSDFFNLTSNSQGIKEKIDESDYFKLKS 
CCTESDTTNRIRQLKNERKCLQITCDKG 
LIF*KKLK*LY/KQXTNNH*K/WSKRLK 
YFSRKDIHMAKR Y/MKKCSTS I IRENKP 
K 


1348 


15249 


A 


1356 


426 


85 


HARLVL* FVF/LFETAYLSVAQA/GGAM 
AQSAHFSLELPGSSDTATSTSQVCYHRN 
T/RLLF* IFCGYG/RLCCLGWSRTPRLK 
QSSHLSLPKCWDYRWEPLYARPYLSCFP 
SN7VRLG 


1349 


15250 


A 


1357 


2 


301 


GGLLEVKVQDQLGQHNKTPSL/ IKIYIL 
P I YTQKN * KNLTRHGGMCL * S QLLARLR 
*ED*SSPGI*GCREP*WRKWTPVWTIQQ 
DSKSRGEKKQVI FTHYT 


1350 


15251 


A 


1358 


2 


389 


FLH VGQSG CE LP TS VDLP AS AS QNAG I T 
GVSHRSWLKFSLLDVPSFLTP/VSSAVP 
LISYLATGWRQAAI AASPI FLHQLARPA 
QPAREAAADS*LPADSAFPPKQWTC**M 
FTGALFKKKKKPCFVAKK 


1351 


15252 


A 


1359 


309 


3 


KWDHI KLKNFCTAK* TINKVQRQPMEW * 
K/I FVNYPVDKGLITRIYKELKQLYRKK 
KSNNLIFXMSKS\SSLAIREMQIKTTMR 
YHLTPVRILVYVLPKRARS 


1352 


15253 


A 


1360 


300 


2 


KRAFH * KKMP S RT FTAREG KS M P C FK / A 
SGWATPVI PAL * EAKVGES LE PRS SRPA 
WATRRDS CLF * K* * INK* KLTLLLGANA 
AGNKTLKLMLI chsxs 


1353 


15254 


A 


1361 


396 


60 


H KVTRKS DGML CHS G FCS INQLAFCCLF 
FQPGGPRGTPLP/ PYKPPPQKTPKKTGP 
GGGGLYSPPFGGGGRGNPFFPGAQGFF* 
PPS P PPPPPPGGRKKI FLPKKKKKSLLF 
FF 


1354 


15255 


A 


1362 


37 


385 


ALFSFSFFFGFFWGKKIFFFGQGGRAGG 
HSNLPEPPPPGGGAFPRPNLSGGGE*RG 
PPIS\GEIFGFLKKTGVPPGGRGWFKPP 
APKEPPPPAPPRGGISGQDPLPPPVLPG 
WGKKN 


1355 


15256 


A 


1363 


12 


421 


EPWEPQTLGICTHLQTLFHDYQVLMKML 
WTVR YHLTPVKM P FI RKT FDTAGM * KK 
KKGCYQGS KKGGML I /HPPWGAL * MS IV 
XKKTWGTFFKKIQKELPWDPAIPWLGMY 
PKEGNSVFQRGMGLPSVIGTLFP I AG 


1356 


15257 


A 


1364 


278 


3 


CCTGEKLETFLLR * ATRQGCPHLFDNVM 
EVLANAVRYEKEIKGIQNGKKKVKT\SL 
FPGDE IV/ YAENPHKS * PKNSL / KLKS D 
CSKVNIQKPIAF 


1357 


15258 


A 


1365 


265 


330 


WPGAXAHACNPSTLGGRGERIT 


1358 


15259 


A 


1366 


379 


1 


KRKLMYCWWECK* IQPGWKTVWHFLKKL 
KIiELFYOSVILLLYTFLKES/PVYARDI 
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E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, LHLeucine, M=Methionine, 
N=Asparagine, P=Proline, 
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X=Un known, *=Stop cod on, /—possible 
nucleotide deletion, \=possible 
nucleotide insertion 














CTPKFI ATLF I IA\RCSLTHD* TKKMWY 
LSIMEYYSAIKNE/YLPFATTQMNLEDN 
I FS EESQA* KHGVF I FP 


1359 


15260 


A 


1367 


1 


376 


HLPGAESQPPPV/DNSWDRPAGRTQLLW 
TPA/DPHSYG*GGAGPHPCPSQPGCCAP 
VQSCS * APSEAQSLGAADS\GPAATLPA 
RQLITKKKKKKKKKKKKKKKKKKKKKKK 
KKKKKKKKDDSGKG 


1360 


15261 


A 


1368 


2 


409 


EEAMPKAKMG*RPFAPNSQILNAKKKLL 
KEIKSVTPVNI*MIRK*NSLIADMKEV* 
VI WIEDQTSHNI PLSQSQIQSKVRMLFS 
SMKAERREEASEEKIiEVSR/GWKSAGFM 
RFKERSNLHNI KVQGEATSTDGEGA 


1361 


15262 


A 


1369 


220 


416 


PQPLFDWQMYT YTLHNDI LVNDGRLS PG 
GQGCSELSLCHCTPI*MT\SETLSQKKK 
KKMGQARGLS 


1362 


15263 


A 


1370 


310 


289 


AFR*AFHSKGSK**H*EKKW/WNFTKT 
*NFCTAKD 1 1 KRMKRQPTKWEVI FANHI 
YLTGILIS\KIYKEIiRTTQ* PETQSLK 


1363 


15264 


A 


1371 


3 


322 


HASERTHRRGKSTETTTPAWATERGSVS 
KQKTNKKTNK/RNTGSSIHNMVSDFEKQ 
VTQTF * S SMQMSNKPLKR YLTS VI I EMN 
IKPEYHFTPIRMAI IKKTDNTKC 


1364 


15265 


A 


1372 


3 


332 


GKEVS * EDIMMLHVYAPNNRASKHMKQN 
LIEMQGEI/DE/HPVIIGDFS/TPLSII 
DRSGQKIIKDIV/ELS/STVKQLDLI/D 
IYRTLYLKRVEYVF\FTSLSKTFATINC 
ILGHKV 


1365 


15266 


A 


1373 


381 


2 


CSVAQAGVKWCNHSSLQP*TPGVK*SSC 
FSLPSHSDYRHE\PALFFKFFVQMGS\T 
ML*FS*ESCIKGSRQPCSYTYLHSPPIF 
SSLFPSLAPES I KRQE PS FF FEMES CSV 
TQAGVQWRHLSSLQA 


1366 


15267 


A 


1374 


1 


385 


YKKGINAKIFNKI* *V\ESAISHDQVGI 
VLVLQGFLSI/RNKSISVTHHVNRLKN/ 
HMLI*IDA3KAFDKIQQSSKGKIGIKGN 
FFDLLKS I YRKPMANV I LNS E KM KCLP C 
KC\KTSQGCQLS PLFFLFFF 


1367 


15268 


A 


1375 


1 


357 


CSGVI S PHCGLKFLGSNDLPALtASRVAG 
I IGMTP \HAQL I KNFCCCWWR* CLAFGG 
*RFKTMCRPIVK*NKSIMK*NRTVCMSG 
CGDS /RQLLGRLRWS PGLRNVQGCSEP * 
WHHCTSACL 


1368 


15269 


A 


1376 


1 


375 


HRPKFKS I S LLEENMGES FHDLGb / GSD 
LLDMAPIVQSIRDKTSDFLEIKKSCSSK 
GTV* IMKKQATDWE /RTFVKHTSNKGS I 
MYEEL*NL*KL*NNPI*KWAREFSRHL/ 
EDTQMATKHMK/ KCSTAL 






A 


1377 


AW 


400 


KRGLGKTVFPWPKKKNGPFFFFPKTTSK 
GRKTLMVNPE* PKYYSVKKKKKKKGGGL 
PFPCKIFFPNLTIQTTFFWPKTNPLTLK 
KRKKPPKKTPTFWGHFFFERGTKKTQGG 
KKNPFKKGGWEKLFFLGQKKKMAPFFFF 
QKPPQR/WRKTLMVNPEP*KLLKKNM 


1370 


15271 


A 


1378 


403 


1 


TGVSQALSLFFFKYFKPRMVEFLVEPSR 
*KGPSVYGLQIFFSIPKVPFPCPFWAF/ 
SLV*SSPPCLSLHLLPAFGVLPKRSLPR 
PMLQSFSPMFFARIG* FQILLLNFQSLL 
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E=Glutamic Acid, FHPhenylalanine, 
G=Glycine, H=Histidine, l=Iso)eucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
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X^Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














S * FFFFFGDRVSLCRPGWSAVAP 


1371 


15272 


A 


1379 


195 


381 


AERYQTSRSTADRISCLFI * SAEI / VFP 
DSGKKTGGNK1WNNNNNNTNNKDNNNNK 
ISLSLKP 


1372 


15273 


A 


1380 


361 


39 


PPVKETAKDVNRRFPKCPMTNST*SDA* 
VDEWCL*FQLIGRLK*SRRIA*AQEFK 
AS/HGQHSETLSLKNYLKKKRFSTLLFL 
REMQI KTTTRYHFTPIRMAKK/SHHTKC 
*QGLAMLPRLALNSWAQAILLIiHFSLPI 
SWNYRHTTSSTQASLHVLFVIGCLGNRL 
LTSFAVSFTGG 


1373 


15274 


A 


1381 


400 


90 


LPSSWDYRCAPPLLANFCVF/M* SQGFT 
RL VSNS *PQDPPTSASQSAGITGVSKHA 
WPAFFIMWFSSSYSR/TMH**QHPHV*K 
LHICPN**KS\LSCPQVPTI IW 


1374 


15275 


A 


1382 


1 


431 


DNI PKKS AVY Y W I TLFKKG* DNVEDEAY 
SGTLAISIC/EENIHLVCSLSDEEQEST 
AQTIANTIDITVGLAYTILTAKSKLNKL 
CT * * MPKLS YPLF* KIL * KNXTKNI V\ P 
NQLQIKAELPVEILHNCDQDPETWLGGV 
AHTYN 


1375 


15276 


A 


1383 


2 


432 


ELSADVSFFFITTPLSGVL*QNWGIAAF 
IPIELRSPTEVTFSFDVGNGPFEISLQS 
PTHFSDHQWHKVRVERNMKEASL/ QVDQ 
LTPNTQPAPADGHVLLQLNSHLFVGGTA 
TTQRGFLGCIRAb/RMNGMTLDLEERAQ 
VTPEVQ 


1376 


15277 


A 


1384 


1 


421 


NPPALASQRAGIADICHCAWPLLRLSKP 
QFSNIi*SKKFALGDV*SSNIL*YIYIFF 
EMESRSVTQAGVQRHDLGSLQP\LPPKL 
SLLPPKLSLLPPK\SASCLRLAGSWNYM 
HVPPRPANFCIFSGDGVSPCWPGRTRTP 
G 


1377 


15278 


A 


1385 


2 


416 


I FSVDETALYWKMMPSRTWYL/RKEKSM 
SGFRDSEERLTLFL\GLNAPGELELEIL 
LI*HPEILGPLNYVKFTIjSVFY*WINEA 
LITAHMFTAWLTECFKPSVETYCLGEEM 
PFNILLLIGNAPGYPRAPMEIQEINI IV 


1378 


15279 


A 


1386 


170 


1 


DGVFLI FGGQNEKLNKNKDG/DLTKLPR 
IiA*NSWARQSSCLAFSKCKDYQREPPCL 
A 


1379 


15280 


A 


1387 


47 


418 


FWGFFFFFLKKKKAALGPWTPPPPGQGG 
PPPPPPRGPGKGGPNPPRQTI YGFWGQR 
GPPPGGGGEPGPPPPGEP\PAGPPQTGG 
PQNWPPAPGP*KGF* GASKGAP * KGVDP 
GGERSQVPNRGA 


1380 


15281 


A 


1388 


58 


503 


RPTRPGNYIMIKEFI FQKDRTI *NVYAP 
KNIALKYIKQKLINLKGKRDKLTIiTVAN 
I STCP* VTDITSR * KI CKOTEELF* QPA 
\DLIDIYRTLYPTAA*YTFFSSAYNYSK 

IGNSIGHKTFSN/ ckrndniqwlfsdhn 

GIK*BINIII 


1381 


15282 


A 


1389 


2 


401 


livsqvwsknrilkkiknstpvht/rqm 
irkqnnliadmetvlwwiedqtshni \ 
sqsmtqs kai/tlfnsvkaergeeaa\ea 
grgwftrfkerrql*nlqvesepps\ad 
veavasy pedvaki i dkdgytkqr 


1382 


15283 


A 


1390 


294 


3 


KI VI FDFNDVKKCS S * KI VI FDFNDVKN 
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&=G!utamic Acid, F=Phenylalanine, 
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K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine. 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 














CSSKTVIEGKVSDCKKIFSKHISDKYLI 
SIX YKELSNLNNNQQPN* KWAKDL/NNM 
SSKHRKICSISFGKMQIKTQIRNHYIPT 
DAW 


1383 


15284 


A 


1391 


3 


422 


PLFKKEEGTNFPFPPFWGRFFPGGKSFS 
PFFFPPPPFIKGGPKVSQKGPRFFFFFF 
F*DGVSLCRPGWIIiFPPQVTEALLRGFL 
ALiSSLCFRLDKFFFFF*R*GHLLSRWDY 
RHEPPHLRIS*RKTKE\KGLIMIiTRLVL 


1384 


15285 


A 


1392 


3 


400 


FLYIYKDIYR*YKTFQINNKLKIIYFK* 
EKDLS KLFTKDVQMTNMH I *MGSTSLI I 
KQMQIKTTMKHHSSLPECLHFNAANITC 
WKRLGPAGTLI LRW * ECKS VKPL/WETF 
* QCLI QLSMQENYH/DPATPLIG 


1385 


15286 


A 


1393 


409 


2 


IVRHFSKEDIHSTNEHMKKGFLSLVKEV 
Q I ETTMGHHH I TVRRVKI KTD I FAS A\N 
TKC* *GYGKTRTLINS/YWECKI VQPL/ 
WKNILAVAYKIKHTFFPPRIHIiSGSQKS 
HFS I YSKEMKTLRQHKDI FMATLSQN 


1386 


15287 


A 


1394 


409 


24 


KSPFFFFEKGFPSPPPGGAQWGGFPPPE 
PPPPWK* FSPPTPPNKWGNRPPPSW/P 
G*LFFFCSPGGFPPLPNWFLTPPLR*SY 
PPGLPKRWGFKQKPMGRG * RAFFCPPK I 
SKKLWSGKKKKKGRPLAI 


1387 


15288 


A 


1395 


83 


411 


QRDSVSKTKQKKINNKSRTDLNRHFTKE 
DRWM/ANKHLKRYST*LVIREVQIKTTM 
R YY Y I S I KKSKINKPDHTNCWQGCGRPG 
TLIYC*WE/CKMVPSIiK*TFV7*FLKRLN 


1388 


15289 


A 


1396 


308 


14 


NFFFFYSWSNI IYCQKGTAFTPISDKID 
FN*KIISRQK*QYMLIKGSIKN/DITII 
NTYTSNNSSPKYIKQTLTGLKGEIAPST 
I WGVFNTTLS I IK 


1389 


15290 


A 


1397 


321 


1 


KQ*CSIEYS/FKYTR***SKL/WPGAVA 
HAYSPSTLGGRGGRIT*GQEFKTSIA 


1390 


15291 


A 


1398 


183 


48 


KWPGAVNHACNPSTLGGQGGRI/TLRSG 
V* DQHGQHGES PSLVGRVR 


1391 


15292 


A 


1399 


18 


422 


KAGMAILIDKVDFRTKNIITE*/EKHFI 
ITKGSVREENISVLNV/WPHYRASKHMK 
Q/ ILIELHVELGEHTI I VG/DFNI SFS V 
LNR I DKE S ARRR * N I * NNTNHQIiNLVD I 
Y* KTKNTTNGRTTFFPSAHKLFTI I IH I 


1392 


15293 


A 


1400 


28 


339 


YSCDHNSVQPQTPGLKQSLCQPPE*IAG 
I TDARHH I WPI FF \LKRSFVF IAQAGTQ 
WRGLL* KKKKVGGFKKGSCLTLPGRGD Y 
RRFREEKILIPGRGFCNELI 


1393 


15294 


A 


1401 


390 


I 


FGLGKHVLHMTSKAQSIKEKTD* LDFLK 
I KNFNFS KDTI KGMKRQARNGEK I FPRH 
QS/DDFF*WAIDLTFPKEDKVMANKHIK 
RCLRLLLL \ SMRYH YH *NKKTDHTKC*R 
ERG\ELELLCCW*EYKMVQ 


1394 


15295 


A 


1402 


149 


402 


RFWIHLYRKRVFPPLLNPVITWKEKRGF 
VFVFVF * DRVLLCHPGWS AVP W S * LTAA 
STSWA\K*SSHLSLPSY*DHQHASPCLC 
N 


1395 


15296 


A 


1403 


I 


403 


KRKSVPELIC/APRC*APCSYRPNLPSS 
PGPHLDPVGPTPTPSPPRPGPARGPPAA 
RPKEPAEPGKEERRGL\ PAPGG 


1396 


15297 


A 


3404 


342 




KEDXL I DXGGSPKQQI FKVN * TALY/ YT 
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X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














ALYSKKIPPRTLIAGKEKLMPGFKASKN 
R/TLLLGANAAG\DFKLRPMLIDHSEL* 
/ RPKSTLPVL* KKA* LRAHLF I ALFTKH 
FKPTVE 


1397 


15298 


A 


1405 


498 


66 


KN YAKSTLPML * KWNS KALMTAHLFTAW 1 

ftk\ffmptvetycsekkrf/sfkmlll 
i dnahshpralme i ykeini afmsakta 
stlkpmdq\kvilaykssylrni\fhkt 
taalssdssdasgqsxfk* i wkgftrtr 
grtrgsar 


1398 


15299 


A 


1406 


3 


411 


VSPLCLGRSQTPE I K* SSPLGLPKCVDY 
RHEPLYLASMGVLNCPISNLGQCIPP*A 
AWWS LLLG C\P I LMLNLVQTPDLPSVLQ 
PRAPGPR\HPPVSAS * VAGTLGTHHYAW 
LSMLLGSTL*DVFPFLSPSLIPSEE 


1399 


15300 


A 


3407 


361 


2 


IRVFQKKKIKSPGRGHKIFSF*RAGPPF 
FFSFFFLRQKESHSVTQAREQWHDHGSL 
QSRLPGPK*FSK\PPPPHPPVTGTTPLF 
IYYYYYFLRQSL/DSVALAGVQWHDLGS 
LHPRVRPRV 


1400 


15301 


A 


1408 


393 


2 


PGFNISVLKKASGGLFFFSPLGKKGFFS 
QPFFFGSPRVFPPPPFFNPPPFFFFCCP 
L*KIFFSPPPGLKFFFFKRAPPFFFFFF 
FFFPDRVSIACCPGWSAMVQSRFSATST 
LRAQAIL 


1401 


15302 


A 


1409 


393 


60 


LNVNPNL*SYLDKHRGMLYHILQSKEFL 
S KTS KAQAKE VTL CKWD YNKL KM FCTAK 
KTL/IKVKR*STE*KKIFVKY/ sttk/g 
* TYR I FNKLNNKNWTQFKNRAETSRSKM 

AE 


1402 


15303 


A 


1410 


422 


2 


VNDRKMDGWMNGWKEGR+TQKKKGGRRK 
REKERSMFLEGRKEGREKEREEREKESQ 
KERQKRGKARKEQRQEARKQGGR/EREK 
ERERERGREREGERGRKSSS*TGSKWK 
I ILLAVKMLVHLKSLHLQKVCVFSFSRS 


1403 


15304 


A 


1411 


398 


64 


PGFFFKKIFVENFFGPHFQFFFPPPPKG 
KNPL* PPQRGPG* \RFPGVFPPLKKGSK 
KKFPPPPRGTQPRWGKKPLLPKRPGFPF 
GPRVS FFCL I E IGWFFWAQVPKKKKKK 


1404 


15305 


A 


1412 


391 


125 


EPAPFFFFPPPGKRGFFP\PP* FGYPPG 
FSPPPFLKTPPGNFFLGPKKKKFFFPPP 
GEKICFL*RAPPHFFFFFFFFFWPPRQA 
ICFIIE 


1405 


15306 


A 


1413 


3 


387 


TPDLK*LTRLSLPK*WDYRRATTPG\LF 
FFKPFASLGSISM+YHSPVCLASKLRIL 
FDSSFSPSATAKKPLTSAFAAAWLSPFA 
LIQPHSSESSLYSSYCPVPILLF/rYLF 
I FLRQSL /DS VAQAEVQWRD 


1406 


15307 


A 


1414 


409 


1 


KRSGPFFYF/ SPPFKKG3FAKPFFLGSP 
VFFPPPSSPP\GFFFFSPPPKRGFFPNP 
FFWGAPGFFPPPLF*NPPPDFFFWGPKK 
KKNFPPPRGKKFFFFKGPPPLFFFFFFF 
FFFFFFFFFYKI I S I KGSIGRSS WR 


1407 


15308 


A 


1415 


3 


385 


HRPPPVMGLS PRPAAAEGVPGPPAVLAH 
QHHAQFLARP*LPPHGAGLRTCSPPCLS 
LPNPPPWAPVWPEPP**ALPPAP/AVPG 
PIN 


1408 


15309 


A 


3416 


2 


238 


PGGRGCSELLPRHCTPAWVT \SKrPSQK 
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OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Meth ion ine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine T S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Ty rosin e, 
X=Unknbwn, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














RKKREKPGSL*EKTLFFCQTLVFGPKKK 
NGPPLLGRVWAKRKGGFHPPLKN 


1409 


15310 


A 


1417 


3 


394 


NKHLKRCSTSSVIREMQIKTTNRCHFSE 
*KHW/IINMCW*ECGQTELSALLIRC*\ 
WDSEMVQPLWERKEYLKEITAESSWDPA 
I PLLD I \ YPKRNEASC 


1410 


15311 


A 


1418 


307 


357 


LETRKTA2KIKRRAGFEK*NLETNKNGN 
TI SQNLRD VAKEVRREKF I AMNA\ * S KK 
KEKP* INNLMIiSLKELEKDEQTKPKVRR 
KKKI LN I RAE 1 1 RD * KNSGKD * KKS WF 


1411 


15312 


A 


1419 


341 


3 


I FVFLVGRGFPPLTRMVLI FWPHDPPPL 
P P * V I EGLFH/ LPAFRPGAFLRKiMVP P Y 
*KNKIPPPRKNTFFFLKPPPFFFFF*DR 
VXjLCHPGWRAVAQSWLATASTSQAPAIL 
PPR 


1412 


15313 


A 


1420 


405 


1 


WGEELVPSGRVHELFSSALRSERDTDSV 
CSCGGQCHCPGAI PGAALKAACRGDPAS 
RVGVETGSTMGNNGFFSFFIFKPYFFRD 
RVLP CHA\ AMAQS * LTAASTS * \VKSSF 
CLSLRSSWDYRCVS PHLANFKTF 


1413 


15314 


A 


1421 


1 


399 


QPTLLTELGLRPVPPHPATMNILY* * IN 
TI LHYS YKNYFLFLFLRQGLQPLQAGVQ 
WQNHGSLQL*TPGVSFLSSWDHPNPANF 
FI I FFLERQGLTLL PRQVLNS *AQ/CNT 
PASAFQSAGI IGMSHCSWP YKN 


1414 


15315 


A 


1422 


168 


414 


EWREKAGESLEPG\GGGCSELRLCHCTP 
AWVTEQDSSYKKKKKTPGGGKGVF/C* K 
LG3GINF* KRVFCFGKGPTKKKTGGEA 


1415 


15316 


A 


1423 


369 


I 


EHFCKSQDRDS VPKKK YI Y I YPMS VFM P 
INLTNW I KFLERQKLS *Q I EEAI ENLNS 
I Y* / PNES2FI I * SLPITKAPGPYGFTE 
FYYSFKGEAMQILHKLFQKRG*RHSFSN 
VI KTLKS KPEK 


1416 


15317 


A 


1424 


403 


1 


KNTEHSKRYSIPQRYCGFSSREFFHFLV 
HMKVMFPLFKCAIALCLKQVYTVFFETE 
SCSVTR* / EVQQQDHGSQQPQPPRFQ\H 
P PTSASQ I AETTAI LENFRKI CI YFLRQ 
SL/HSATQA*VQWRNHGSPQAPPPG 


1417 


15318 


A 


1425 


401 


3 


FFFFFKKNFPFFFQVEGKGRKLG* QPFP 
PG\LKNFSAPPLPGGGN*R\RPPFARLF 
FFFL*KRGFSPLNQGGLGSPPFFSPP\R 
APKNVG FQG I TP P PGNFFFFFFFSEAES 
RSVTQAGVQWREPGS \QPLPPGF 


1418 


15319 


A 


1426 


386 


27 


CRINGVL I LC* CECKMAQTL* I * FGNFL 
QKKLDLTCDSTIPL3jGIYPRDMKTYVH\ 
KKTCT*MFTAALLI ITKSWEKLLRPSEG 
EWINSRHPYDRIVLSNYETHNVWKIQRH 
YAEAKKTV 


1419 


15320 


A 


1427 


398 


2 


GLQVHATTKFFKFFVETGSCYIAQAGLE 
LLVSSNPPG/SSFPKCWDYRCETRRWPF 
LFKFSLPIFVF*LNHLFCLKLLPLFILK 
LNLPCFMYYLFIPPALYSLSLLALFRND 
CDLS P I HLE VKKYFS KKDRL 


1420 


15321 


A 


1428 


401 


70 


HHAQLRCFFNSFVEVGV3LCCPGWSQTP 
GLKRSSHLNFSKCWDYKR/AAITLPSFF 
* IALF*ENPPNCKSLPTPLSPPKLAALG 
NSCLHPWLPQPPVKGLSDSFVCMHPGS 


1421 


15322 


A 


1429 


388 


131 


UGGGCLQRK\RGFKQENWFwPGGRGSS* 
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ing to first 
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peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
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correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=G1ycine, H=Histidine, I-lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P^Proline, 
Q^GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














PRSMYCPPAWGAQGGSLSQKKKKKKKKS 
KNYQKILSTQKTQKSRVFIVWSKHCTRH 
CEK 


1422 


15323 


A 


1430 


184 


413 


VSFLFDFYFLFF*RW/RSHSVTQAAVQ* 
CDCGSLQPFILFYFLRQSL/NSVAQAG/ 
VGSLQPLPPGFKQFSCLSLLSIWD 


1423 


15324 


A 


1431 


76 


533 


SCKRTTGRQFPDLPTRPPTRPQGFALVA 
HAGG*WRDLRSLQPPPPGFKRFSCISSV 
LWCTKAFNFDFQEI YSFSS* I*GFDSFE 
VNFFFIiEMGVSL\FARAGLKLLGSGKLP 
ASASE/SGGI IGVCHWAQLL 


1424 


15325 


A 


1432 


373 


29 


RQGFPPMGRWLDSLKKKKFQIPNSKI F 
PGGGAPLVIPPSREGEAGKFF* P\GGKG 
A I KQNYC/ HCP PTWGKEGN FVS KKKKKK 
EKKMHQK*RCVKDQQ * T *MLLFPSLKS I 
YYFK 


1425 


15326 


A 


1433 


389 


73 


TDC P AI GRNHRD P VRPLLS S PHRK I / YA 
NLYYY 1 1 VRI LTLIQSTDLM* ISPWLV 
LTRVCVCVCVC VCVC I ILPS /CYPVCRF 
MYPTTTVKVQNI S IGQEKQKTKT 


1426 


15327 


A 


1434 


402 


39 


QAGLQLLASSDLPALASQSAGITGVSHC 
ASPSISL*APLGIiDTFSDFPCFNDLDSF 
EEHWSGML/SECP*TGI*DLSFFS*VYW 
GYGFLEDHRGKVPFFITSYQSYTPPTQL 
1 1 ADVKFEQLV 


1427 


15328 


A 


1435 


105 


387 


E FS KVAGYKINTYKS I VI LYNSNE * FEN 
\KKI IPFIIASKRIKYLKI /QFAKAVQD 
FYAEN YKTW* R 1 1 KEDLNK* KNVLC* V? I 
GRL/NI /LKMVLLP 


1428 


15329 


A 


1436 


242 


2 


FGGGFFWVGTPNKMGPFPQI IKNLF/TR 
PPENF*KPP/LPFIAFFSLGF;LGVFFFF 
* D I VSLCCPGWS A* AQS LLAAALTS PG 


1429 


15330 


A 


1437 


411 


0 


LLFVAQECLKWFSALGGPGVPCS \LPAN 
QQPS VWVP*QS \ EHVTVLQRFCLWPVLG 
PAYGQIWEPVPSTLK/PPPPGFK*FS*V 
SLLSAWDCRCTP/PTDPANFCIFSRDAV 


1430 


15331 


A 


1438 


41 


412 


FFCTD+GSLCCSSWTQTSGLKRSFHLSF 
PNCVJNYGH/RASVLGLIFFFFIiRKKT/C 
P*KKFL*NEDI»LCCQLGLELLAPRDSPA 
1jGPQSVGTAKVTPRPW/LLFL*TQNYIaN 
SSRVRFSPAPLKYKEV 


1431 


15332 


A 


1439 


1 


406 


HAYN I V \NPS PVSLTGALS SLLMTYGLT 
MGCHPLSITLLILGLLTNTLTIYH+WRD 
VTRESTYQGHHTPPGQ*GPRYGIILFIT 
SQVI FFTRFF *AFYHSSLSPTPQLRRHW 
PPTGITPLNPLEVPLLNTCVLLR 


1432 


15333 


A 


1440 


3 


422 


MEKVSWWIDQTSHNIPLSQSLIQNSAL 
SLFSSMKTE*GEEAAEEKFEASRAWLMR 
/FEERCHHNVKVQG \ EAASGDREATG S Y 
PDDLAKI IDDSG YPKIiHI FNIC* KPTDT 
DIVDETALC*KKPSTYFLFFFLKQSLAL 
PP 


1433 


15334 


A 


1441 


375 


2 


KVSS PRAENVI LYNCPPLFFFL.CQD/ CH 
SVTQAARQWRKHSSLQP*TGL\ASQVAG 
TAGAHLYPSQNLIGRGIRSLA/SVAQTG 
VQWRNIGSWQPLPVRIKGFSSLSLXSSW 
EYRCPPPCPD\LFIFL 


1434 


15335 


A 


1442 


397 


122 


GQASLELPTSGuPFASASQSVRITGMSH 
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Predicted 

beginning 
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ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, ]=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, PHProline, 
Q=Glutamine, R=Arginine, S=Serine, 
T-Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *~Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














CARPILGSCISLFAYPFCFLKCFLF/CC 
REVI LL * CSGWS * TPGLKQS SGLSLPKC 
WDYRCEPP 


1435 


15336 


A 


1443 


3 


299 


WEKMFAM YLSD * AT VSRP YR * LLQLNKT 
KYKLKLSQS/MNRHFTRGDVAMANKKTK 
RKSTSLVIREI KTMRYKYLLKRMAEVQK 
/SVKCWQECG*GGGKCG 


1436 


15337 


A 


1444 


10 


410 


CTLFLS I SPDSAGG I CTMLLFQNTLNFK 
ISLALGVSDKCFSNFLIMTIVRNVFYIH 
GKKCILYCSPGCAHTHMHTLMHTHIHLK 
QFFLRRGLTLLPRLECSGAITAHCSLDL 
LGSDS PP**EMYFIYMV\EMYFILFPRV 
CTHT YAHTDAHTHTPET I F FETGSHS VA 
QAGVQWRNYCSLQPRPPGLRFSSYLSLP 
GS*NHRHVPP 


1437 


15338 


A 


1445 


431 


9 


GPAQSLDSSVSPLGTISSTVKWRHNAYL 
AGLSGGI /PCDCFLGPRVPWGSGICQWH 
WAVS PTLS \ PTVRPPLFRETLYFT* V* V 
PPLETCPQ*HISHCRCVINWCEDTHTPK 
KHLVCVHACVCVCVCVCVCVCLGRSRQS 
EH 


1438 


15339 


A 


1446 


85 


416 


PGCLSLPKIWDYRCEPPRPPHFFIFSS I 
QGPLTDFFLTPLEQVQISPTAKGFLKKI 
PFFFF* * RRGVP I LARWFKS * PHDLAA 
SASQKFGITGVSPHVHPIFLY/CSSIQG 
PLTDFFLTPLEQVQISFTAKGFLKKIPF 
FFFLRQGLT\RQWSTHLGLPKCWD+GCE 
P/PMPGLHCEFFKG 


1439 


15340 


A 


1447 


376 


3 


IKSEI IETESRKPIEKIS/RAQSCFCGM 
VYE I D KtiLARLDR KKMKTQLNN I RN * RD 
EINTDFTDIKRI I RENYKQD YGTKFNNL 
DETDI FLERHQLPKLTQEEKDNLNSPI T 
IKDIEIIV*NLSTG 


1440 


15341 


A 


1448 


2 


192 


SKWIKNLNKELKP*DS*KKPEGNLHDIR 
FGNDFLDVTPKN/MATKGKIDN* TLPKF 
KMLMRIWRN 


1441 


15342 


A 


1449 


411 


3 


VFLPPLPPFCFLNFF*GKGGFFFKIFF* 
KKKGVEGPQLGPPPPFGPFKKFFFF*IF 
LKAPLFVPPTLKPLF/CPPKKKNWENNP 
PPFFFCFFVKKTQFYFFFFFFIxRNRVLL 
CHPRWSVMV* SCLAVASTSWAQAI CP 


1442 


15343 


A 


1450 


413 


1 


EA*TGEWCNPGGGACHEPR*PPCPPPWA 
TERGFVSKfCKRKQEK/ SRIMECFQDNLP 
GFFQIFSVMKNKQEGR*FWTK/VKMTKY 
NA*NTOGS * YWKKRENEEECYDWKKLF 
QSIhS * YVKKAR*KPGCWLMAAIPALW 


1443 


15344 


A 


1451 


4 


384 


DP AI P LLGT YPREMKTYVH I KTCTQMFA 
AVLFTVAETSKR PKCPTDE / VNKI * CTY 
I / MTYYS AMKRDE 


1444 


15345 


A 


1452 


3 


390 


LPDHPGSSVSTPRGVI ITGRGFWLLFPW 
VSFFFFFFFLKRI SLLPPNWRGGGQNLL 
NKSPPPRGF*NFLA*PPQGGRKKGPPHP 
PGAPGDKNP /Q PFFFFGEKKKI FNPPTG 
GE KKKPPS P FKWGGGGPN 


1445 


15346 


A 


1453 


37 

i 


381 


LILYINVCVCVCVYIYTHTQMGSHIVAE 
AGVQWCNLGMW*P*TPGLK* /FLLSSGD 
YRLTLPHLANLYI FFFFFEQRG \ FFFLL 
KLV*NSGPKATLFRVGITGLTKTPRPYR 
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ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Aianine 
OCysteine, D^Aspartic Acid, 
E=Glutamic Acid> ^Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine,M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, V=Tyrosine, 
X=Unknown, *=Stop codon, /= possible 
nucleotide deletion, V=possible 
nucleotide insertion 














GDLFF 


1446 


15347 


A 


1454 


3 


400 


TEMVLHTEMVLHI S VWDPLLDRPGRGVT 
WLMLTAELFLYGGTEAFSSS FDV*GPCS 
ETVECFSD/L*A*GPP*NT*SPIYRFAS 
FLLAFCIYLLETRSP\SVTQSGGK/WQD 
LTSLQPLPPRLK+SSCLSLPSSWE 


1447 


15348 


A 


1455 


77 


398 


RAEIVPLYSSLGNRVRPPSQKMYIYYEQ 
LHAHT FGNLAEMD KL VERY KVL S KFT * E 
E I \LNSPVS I KEREF\AVI SLPQKETLG 
PDSFTGGFYIFKEELMPILQRIiFW 


1448 


15349 


A 


1456 


397 


69 


CPPGLSGLPWVA*AVPP/RRPAPPPPPP 
AGIDAGLGDDPLQQTHHA/ ppaaagsap 
AGCAAGPARGAPRGRS PPRRGSAGAAPP 
PWPPAAAAASSAGGSSAPCASSPAAPPT 


1449 


15350 


A 


1457 


400 


2 


ALFFWAPKKKKNFS PPPGKKFFFFKGPP 
PLFFFFFFFFFFFFFFFFFFFFFFFFFF 
FF*RRPIFILFYFLNQKNIPRDFFFFFF 
FLRDRVS FCQPGWTAMV*SQLTAA\ *TP 
GLKQSSHLS LPS S * DYRWVPPC 


1450 


15351 


A 


1458 


343 


11 


QGVHSRKYKEISKLNS* KKNPTRKWAKD 
TNRHFTKDNIQMENKNMKDVQARCSGS * 
F* KN*NFKS YAL * PKCSEI / KKS VTGTN 
WRNSQIY0KLSTLQNN*WIFSRSLYNFE 


1451 


15352 


A 


1459 


109 


401 


QNYRQLGQWDRIESTVIDPCKYGKLIFD 
QS AKATEWRKDS * NHQTS / W WKES FTRV 
KALEEITKRKVNSLAS IS VKTD VHVIKV 
* LKKKKRKRGGRFKE 


1452 


15353 


A 


1460 


42 


424 


CPAN*NSFSRDR/SLPMLPRLILNSWPQ 
WLL/PWPPKVMGLQ 


1453 


15354 


A 


1461 


419 


69 


I F PLKKKKRGGGVG P PL YPP PLGAKRGG 
S P* KQNLKPPRPHKENS PFFYKKKKKGG 
RGGAPFFP PPLGGKTKKFFLPPK/ 1 KVS 
FNPNLFP P P PPPGGKKKFFFPKKKKKKK 
SRNTVWF 


1454 


15355 


A 


1462 


1 


277 


CHTNFLCMSFPPAISHHPLMPPHPVSRS 
VAQAGV* WCDLGRG/CSELRSCHCTPAW 
MTERDS I SQLIRKKKKL I KIKKKKNKNK 
SVIFLGGGS 


1455 


15356 


A 


1463 


382 


3 


KAGGS INQNSPPPPPPGEKKKKPPPKKK 
KKKKNPFWGGPRILPHNPPPFER*R*KI 
FLGPKFLTPLGPKI KPLFFFFLKKKKKK 
RMTY/HSVAQGPLLNKDTLQAG\LSKA* 
RSPPKSKSMGQSFHRN 


1456 


15357 


A 


1464 


561 


86 


NDPILSLKAEKTFGKIQDSFLIVSSSL\ 
NKPGIEGNLLS**KASTKQNPIMNLRV\ 
LNVFPLRS * TRQVCLLS PLLFNI VLE I L 
AHIISQEKEIKKIQI*YKEEKLPLFTSS 
SLFI * VKNLMEFAKKLLELINEYWKVER 
YKINIKNILLAKNTWTLKF 


1457 


15358 


A 


1465 


3 


221 


RFHRVSQDGLDLLTS*STRLGLPKC/WD 
YRLEPPCPAKTCLILNGSC*VFLCYLSC 
FKAQERPSQNSWGAFVTL 


1458 


15359 


A 


1466 


2 


396 


WW PAWTHTPGLKQTS R FS L PECWD YRRE 
PP \ PGLVKFLL/ 1 QYRWTQHVGLGDWV 
QDHHPHI E I CLHKS V I *NG I VFAYNLRT 
SSLTLFYLRRGLF/SAPQAGVQRHKLSL 
LEPLPPSFKRSS\CSSLLSSWDYR 


^J459 


15360 


A 


1467 


349 


413 


RLGL- PRKDITD/HvSLRKJDTG/ WPGAV 
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Predict- 
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nucle- 
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ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, P=Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Metbionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possib!e 
nucleotide deletion, V=possib!e 
nucleotide insertion 














AHACNHS TMAGHRRQI PRSGV* GQPGQ 


1460 


15361 


A 


1468 


403 


2 


LPVEWKGGI FGSFHPPPLRIiKKFFSPTP 
PKS * D / YRGGPPPPG YFFFFFLKRGFS P 
FGRVIFKFPPPGVPPPPPPQNFGFKGRS 
/HPSRALFFFFKRVFF* IYPFFFFFEME 
PRSVPQAGVQWHDLGSLQALPPRFM 


1461 


15362 


A 


1469 


423 


3 


IEFSFFAPSKDKGAILGPSTLPLPRFNN 
FFCPPFPINKDIRRGPPPRPNFFFFRKR 
GGF PHWAGMFLS FNLQEGP PPSPPQ/SV 
GF*GRN/HPPRAQFFFFKKFFF*CYPFF 
FFFEMEPRSVPQAGVQWHDLGSLQALPP 
RFM 


1462 


15363 


A 


1470 


13 


427 


RTRGLVFDKTEFKPPKIKKKKKA\GPFL 
RGNRS I LKKKATFPKNLFAPNPGP PKF I 
KKVLSDLQKNLNPPPRMGGNFTPSIiLK* 
NKSSKQQITRDFRDLTSPLDQGDL*KIY 
KTFYPKTTE YTFS S APHGFY / S * FDHKI 


1463 


15364 


A 


1471 


378 


1 


FVRPPFFFFSSSRPFFKWGGPLPPAPQ 
FFFKTPRGNPLLRG* KPPPQTPVGGAKG 
SPPW/VPPGFPKKGGGVFFFQQGL*KIS 
PPGPAPP *LVWGGESPPFKKKKSRPGW 
AHACNLALWEAKAG 


1464 


15365 


A 


1472 


412 


53 


SRLSFFIiSSEPHGPPSPLGPFATNKVAF 
C* /PPPPF* PSPPPKFFFF*GPKSVIYF 
YQR*PPTSRIFPPKGGAGPP/PPFWGFV 
NPKKK\LKPFSKPSPPAAKFKNPEGTKL 
GFFKKKKKKL 


1465 


15366 


A 


1473 


116 


34 


DWNIKPVLISNVMTGIiMSMI ITLLLQLF 
LL*LFFFK/WEIiTFFGGKFHQRVDPSFQ 
GSIiGRGRQFFLLTKTEYHIFFLIKVFNS 
KTEGAGLAT* I K 


1466 


15367 


A 


1474 


42 


428 


E I IME S INRFDVI T F * NFCL * CYLMSKI 
NRQASSW/ENV*NQYVGEKILIFLIYKE 
CIQINKK\KLRPKIDTQVKDKQISEEGM 
QMANQHMIQC\QPSLVLNKMQIEIAEGK 
HLPYQINKDSKMMVEERRQ 


1467 


15368 


A 


1475 


87 


433 


PQSSPFSDHCSYQRI/FLHLVKVSTYRLQ 
KIRKIHKSPGNNEYFGGFFLFFSFLFFF 
ETGFNFVPQAGVQGQDLG* LQPLPLGFK 
GFS/CPQPPRTFFFFFFLRVWGGFIQKK 
KFLFV 


1468 


15369 


A 


1476 


212 


430 


SLILMTSNGIHFIYFLFLLLLFFFF*DR 
VSACHPGWRAMS* SYFT* ALTSQVK\QS 
YLSLSSS *NYRHTAIMP 


1469 


15370 


A 


1477 


493 


2 


PGAAAHACNPSTLGSQGGWITRSKIPEH 
PG\TLWNPRSY*KTKMCGLEAP 


1470 


15371 


A 


1478 


454 


492 


HRVGEERF/CLFETESHSLTQDRVQWHD 
LGSLQPPPPRFKQFSWYHI*PHAW 


1471 


15372 


A 


1479 


2 


213 


IDQERERLMEGDRERDTETDABKDMGRE 
/RNRYRERERLRG\RRERRKRRDRMT*M 
PRERERENLNSLYRETYRDFETEWVMDR 
ERQRLKRRLL*AVIVPSHSSLGSRRKTP 
FQKKR* REGERKSEFLIQRDI 


1472 


15373 


A 


1480 


77 


453 


SFGDSLTLSPRLAVQWVYLGSL*PPPPE 
LK\YSPTSASQVIHYLLFFFFFFGKKVS 
FCPQGGGEGPPFGLLEIFAPGLMPFFCL 
NPPKGWVWR/RPPTMPKLFFVFFIKRGF 
SPGEPRGVSFPEPGT 
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nucleotide 

location 

correspond 
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sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A^Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenylaIanine, 
G=Glycine, rl=Histidine, ]=Isoleucine, 
K=Lysine, L=Leucine, M^Methionine, 
N~Asparagine, P=Proline, 
Q=Glutamine> R=Arginine, S=Serine, 
T=Threonine, V-Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon 7 /=possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 


1473 


15374 


A 


1481 


398 


96 


KRPWCFFSPPPKKGFFPPLIFVGPRFF 
SPPPVFK/SPPPFLFFFPP*KKKLFSPP 
PRKI I FF*KPPPPFFFFFFFFFFFFFFF 
FFFFFFFFFFFFVFILLLS 


1474 


15375 


A 


1482 


484 


64 


QKRQMANKYMKI CSPLFVIREMQTNTT I 
RYDDI PTRVAKNKTNK* KSNSTKYCQEC 
ESARP*IYCWV?KYKMVQPLWK/S/VWQY 
LLKLNI*LPYDPVIPLLSMYPSQIKTCI 
PTKTCAQMYLVTLFI ITKNQKLAHSARV 
G 


1475 


15376 


A 


1483 


2 


400 


YNKKGQGVAPRYDVDTAANFPKELAKI I 
DEGGYTKQQSFNIDYTALYWKKTPCKSY 
SWR* IH/SLASKDRLT\LLVGNWGDFK 
LKPVLIHCSKNPRVFENYTKSTLPVLCK 
WNSKAWITAHLFTAWVAEYFEPT 


1476 


15377 


A 


1484 


459 


65 


GGPPPPHKTIDFFFFFFFRGESPPPPKK 
KKKKKKG* DNAEDE AQ S * RP PPSI*EKI 
MKL I CALTE E Y * * LTAE 1 1 ASN I H I STG 
SAYI I LTEMLELS KLST* WVPKLFYPNQ 
LRT/RAEL*MVILNK*DQQF 


1477 


15378 


A 


1485 


2 


518 


P P P P P QRFGLRGG APP P P KKKG * DN AED 
EAQS * RPHPS I * EKIMHLI CALTEEY* * 
LTAEI IANN IH I STGSAYI I LTEMLELS 
KLST*WVPKLFYPNQLRT/RAELSMVIL 
NK*TSDP 


1478 


15379 


A 


1486 


122 


501 


PRDPPALALQNAGITGLKQSYFLSLLSS 
WDYKHKPPCAPGYLK\VFFLEN*YLALY 
VF* *SYQGTSVLTQPPSLIPRHSSPGVS 
VQLFQKK/DLQHGCLLTPS I SG YSVTWD 
GVQWRDHGSL* P* PPRF 


1479 


15380 


A 


1487 


491 


425 


IPKNDI*AEP*RINKR/CAKRKMVRKGF 
EAESTICKALRYEEIVCSRNTEFVHGWL 
*ENRGQIMKG FGNS VEKLEHN* KFYEG* 
LLYRFE*LAAGFIVFQQ*VSKEIVKAWP 
GTVAHACNPS TLGGRGGW ITRSGDRDHP 
G*HA 


1480 


15.381 


A 


1488 


441 


17 


KKTOI YDQL I FNKGAKSTKLRKNSLFNK 
WCQDKIS / 1 IQKMKVDPYLI SNI \NLKW 
LKDLNVTAKTI KLLKENTGAI LHDLGFC 
NAFLDGKPKAQTTKKKQVK* TSSKFKSR 
CQWLIPIISTLWETEAGGSPELRSSKPA 
WPT 


1481 


15382 


A 


1489 


413 


1 


LFWKE WFLFWPGRVGNPGPK \GWLRLR 
PPPF*FSFQPFWGPPVGLLFPKNFYY*P 
PGGAHGPNPPPPGGKG*GPPRVGGGKFN 
* PKFGPCPPGWATKQKPVFQKKKKPNKI 
KNRTVKVPKVTKFI Y YVP YLNGVSN 


1482 


15383 


A 


1490 


359 


407 


RHRMITFFT*IVTQNFPYL*KKMDIQIQ 
ED* RTPN/RDQHKHTPRHI I I KLSNSQ\ 
NSKRILKTRKK* LWY*GTHIRLSTYFW 
PKS 


1483 


15384 


A 


1491 


397 


39 


RKFRGRS FAKTLFSGPGQG/ PKGNPGL I 
PEGPPPFFFWPAEDGGFFFPPGQLPPQG 
PIRGPGPFLGFPVPPPEGEGPPPTGP* K 
PGQKR PKKRGWS PQKKGPKKKTLFFFFF 
CQMESRSVT 


1484 


15385 


A 


1492 


378 


1 


FG VFW FLAPRRKGG FFPSKL I WV P PG FS 
PPPGV/SNPGPGIICFGGPIKKIFPCPPR 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, IMProline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Th reonine, V=Valine, 
W=Tryptophan, Y-Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 














GGKFGPFKGAPPFFFFFFPQDGVCSVAQ 
AGVLQ * AEIVP\ FPPT* ATEQDPVSKKK 
PTRPPTRPPTRPPTRP 


1485 


15386 


A 


1493 


3 


428 


IKSP* PDRFIAEFYQTFKELVPILLKRF 
HKI /EEGThPS * FYE/ACVTLI PKPGKD 
TTK 


1486 


15387 


A 


1494 


3 


417 


I LNNARLKPFLLR * TTRQG ILLLSLLFN 
KI*VLEFLAGAISQEKEIKVIQVRNEEM 
SKTI>FSQTT*S*GRNPFIKIPLPKMSW 
N/NFIKVTEYKVSIH 


1487 


15388 


A 


1495 


429 


4 


PFCLGSKRFPPFFNPPRGGKF*KKKKIF 
GPPRGGPPLFPPLWAPKGGGPPRAGGSG 
PPPPKGGNPLFINPKKN/PPPPGGAIFQ 
SRFLGGVNPKI FLFPGGKFLTDPGFPPS 
LP PGG KKKKPR FQKKKKRKKRKVKEKKY 
FL 


1488 


15389 


A 


1496 


3 


433 


FSKEDTPMVNKHMKD/CCTSLVIREVQI 
KTTTR YHLMPTRMAMCI Y I FN Y I LFL K / 
3KNNKC* *RYREIGTLIH/AQWKYKMLQ 
LLW / KT I WQLLRMLNTKLS KDQE \ I PLL 
GI * KKKKKKKKKRGGRFKGINFTDPGVE 
RINFYNSAPK 


1489 


15390 


A 


1497 


3 


326 


WPACL * AVAAVALLVPEATRLTMGNNliT 
VCT PH S I AELLS S KG / DL WLTDNR/ L L K 
YQALLLE\ DLQLRTFTCLNPATF / VPE E 
TGEPEHDCEWVWQTGKRNTtfKDHCLYSIi 


1490 


15391 


A 


1498 


345 


22 


KSKWLIDLHGKCKTLKR* YRKPTK* \LG 
H3NDFLDTTSNAWFI /RKIDKLDFIINK 
NVCSGKVTVKRMKR*TTDRDKISSKGIS 
DKRLLCKIYIKILTTQQ 


1491 


15392 


A 


1499 


194 


432 


P WCVCVCVYVCMCVFETQS / HVARAGM 
Q*HNHSSLQL*TPGLKQVSCLNLiSSSWD 
YRHTPN * FFFF/ CNF YLERGGVS 


1492 


15393 


A 


1500 


2 


417 


RD*FMRFX\EKSYSCNIKIQREAASANV 
ETMAS Y * EDLAKI I NEGS YTK\ SQI FNV 
DEVAL\KKMPTRTFIVRERKSIPGFKAS 
KDRLILFLG/ANAPGNIKWKPVLIYHST 
NPRAFKNDAKS I LLVL YKLNS KAWRIAH 
LF 


1493 


15394 


A 


1501 


414 


0 


SSSSSSSSSSSPKWTTGALQP/LLSRTP 
LKDSSEEESSQ*AE]j*AVHLWHFAWKE 
KWPDM *LYTDSLAVASGLAGWSGT\WKK 
HDWKICDK/DWGRGMWMNLS 


1494 


15395 


A - 


1502 


3 


125 


RLGL PKC * D YR / RAATTPGLH * F * S WKE 
QRPQ+LALGRKPVNRNKNTQKYIHIYTY 
TH/PIPTNVYICIYTHIHTPV*YTHIIY 
I IYLFKVLVFGFLRRSLT/ SVAQAGVPW 
RDHSSMQPRPPYKQLTCH* *LASQSARI 
TGGSHHTWPALILKLERTEAPVISFRTE 
T3K 


1495 


15396 


A 


1503 


406 


172 


DI ILDR+RQKQRLRLRQKQRETETE/RR 
DRGRERQRYRQRQVQRQGRRQRQRLRHR 
Q* QRQ IDRLRQRQ / RAERDRGRGRDRDR 
GRDRGRDRERQRLRQRQR*RERQRHRQR 
/RERDRQRQRQRDRGRRRDRETDRGLDR 
G*DRGIDRGRCSDRGGDRDRGLDIGSDR 
DR 


1496 


15397 


A 


1504 


3 


420 


I TGVSHRAWLPS * FLKFr \ATv EMeaHSV 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methiontne, 
N=Asparagine,P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaHne, 
W=Tryptophan, Y=Tyrosine, 
X=Unkno>vn, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














AQAGLK/ PSGLKRS SHLDLPKCWD YRHE 
P\PKLAFFQFF 


1497 


15398 


A 


1505 


407 


1 


PISIYHSKDPGSLKSYTKSTLPVF*KCN 
NKTWLTAYL FTAWFP KFFKPT VET YC/ S 
RKKSPFKMLLLIANAPSHPRALMEMHKE 
I P WFMSAVTS I LQPMDQGWLTFKS YY 
LRNTFCKAIAAIDNDSSGGSGQIQ 


J 498 


15399 


A 


1506 


1 


408 


PPGFKPFCLSLPSSWNYRHPPP/RPG*F 
FVFLGGTRFHHVG Y T AS QH LT S RE THAY 
ALQ 


1499 


15400 


A 


1507 


2 


416 


EPRSHHCTPAWQLSKTLSQKKKKKKRKR 
MVFTGRKKGLFFGN/LKNLGTLSREYPL 
GPGIKNRLAQKRKPLFYICKRF*NriSrPGG 
GAH PGGP KS WERGGGRKI * TPVGKYASN 
PEYHICIPPGKRNQNPFLQKKKKRRAD 


1500 


15401 


A 


1508 


276 


14 


SPPPYFLLIR*GRKGRGQF* *LRLFFII 
NLRQCL/DSVSQAGFQWHNKSSLQPRTP 
GPK*PSFLSLPSSWDYSHAPQLLAFYGA 
NCFN 


1501 


15402 


A 


1509 


1 


391 


NIFKEIMSENFPSLMSSNFPSLGKEIES 
QIQE AQRTPNKMNPKRSTPKQVI I KF* V 
RE/MLKTAREK* IVI CKGTTLRPAVDFT 
AETXQIRRE* DDTFKTLKGKKKKKKTLP 
AKLTFPS I LVFKKTRGG3R 


1502 


15403 


A 


1510 


2 


419 


PRVRSRATNVISKYKQHKKKNTSK* IKD 
LNVKPE P I Khh \ EKTTGE KLLD I E LGND 
FLDS/TPKTRAPKANLTP*NYFKLKSFL 
TAKETFN\KTPTKRGANHISDRGLISKI 
YKELTIQ*QKNNLISK*AKDLKRHFSKE 
DV 


1503 


15404 


A 


1511 


392 


122 


SVSLCVSLQSSGMFRC/LPTMPLRR*RQ 
ENPLTSAGGGCSEMRSHPCTPAWVTQQD 
SVSKK* INK* IK* /Q* LRQNVKIKPSPI 
FCEQGKRRK 


1504 


15405 


A 


1512 


2 


281 


GGCSELRSCHCTPAWTT\SETLSQKKKK 
RKKICI ECNTLKSH/ I hS FVCGGKYHSP 
VSCTCL* WSLGRSLKRSLERHQKNLNDP 
YPIFQNETNIH 


1505 


15406 


A 


1513 


242 


382 


QGPNLG*LHPPPGGLKGFSPLTLLRSWK 
NRLPPQHPFYFCFFRKNKV 


1506 


15407 


A 


1514 


1 


388 


RTRGTERDT/RFRERRRERDRGREGYRW 
RYRERERHRERES * V * RDRVRDGDRVID 
RERERQSERERLR/RERDWERDTYIVRE 
TETERETEGERDIERERDIVRDRYM/RD 
RDRDI LRE * ERDRETALDSEK 


1507 


15408 


A 


1515 


4 


285 


TRXQI OTGDKTAL Y*KEMPSRMFPARVE 
LMPGFKASKDTLTLLLRPNAVGDVNLKP 
MMIYYSENPRALKNYDKTQLCLYSTNGN 
KAWMTDYRFT 


1508 


15409 


A 


1516 


415 


2 


MGDRESLPPSRVFLFFLAPCPKGILFHP 
V* FGS VRS FS I ERC YR * GQRNE F WGP V I 
RVKASSSRAG*WFSSTAPPLFF/CFFE 
TESCSVAQAGGQ*CDFSSLRPLPPKFKG 
FLCPSLPDAW 


1509 


15410 


A 


1517 


1 


324 


P TR PEMG FRHAAP AG L E LLS SSDLPTSA 
S QSGG I TGVSHRTW P I LASNNY / SMDKL 
CAI /CERFILFFLKNI LLWILILFI FI Y 
FCIiIK*CITIKFSQriSL»lj?HSlHLYI 
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nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenyhlanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


1510 


15411 


A 


1518 


2 


163 


ACRYP WLNFVLLVEMGL \ TMLASDLERS 
G*PQ/CDLPASASQSAGITAESHHGWP 


1511 


15412 


A 


1519 


425 


52 


VLRGAQFLFQEARLKWDFGPGGMAPPCN 
PPPLGV*KGPLPKG\GGPNPPGPPRGNP 
VFSKKPKITPPGGGAPEVPPPWGGGAEK 
FFYPGGPRVQGTE /RKTPPPPPGRQKGT 
PLPQKQKKKKPWSL 


1512 


15413 


A 


1520 


3 


404 


THASGKSNIRGLTLPVFKTYYKAAVIR/ 
TVWP / WLRANTNRQNRL /DGPEVDP YKC 
SQM I FDKGAKA IQ * RTDNNLLNKW * / E S 
*TSTCQKKKKKNQFR/DPAFLIYTKFNS 
NGR KNLR V I G KT * KVLKKNI KQNWGDLG 


1513 


15414 


A 


1521 


332 


39 


TSRETGNFQLDLTAFLNEVGVLFLWRKG 
KGKIDIGIGTKEMP+FPI*LLMFSYIVI 
ERERKPE /LSLSPS LECSGMI LAHKKNS 
LPGSSDSHDPASRVAG 


1514 


15415 


A 


1522 


482 


0 


RARTSGVLLCS PG WS * TPDLR * S LCLS F 
PKRWDYRT*ATVPGLLYSLYSRFSP/DE 
LKGCEKSRTSPA 


1515 


15416 


A 


1523 


1 


417 


NKC* *RCEEKGMPVYYWYECK*GQPFWK 
TKERFFKKNL /N I ELPYNTAIALLNMR P 
Q*I*SQ*RKVCSCMIiCAT\PTPNK*IKT 
MW*VCLVEYSSPLKKN\LLFSTTWINLE 
D I S SNK/ LRHRKS S I SRFHLLL 


1516 


15417 


A 


1524 


1 


397 


RDSTYQGHHTPPVQ * GLRYGVI LFITSE 
VFFFAGFL/WSAFTRSSLAPTPQLGGHW 
PRTGITRLNRLEVPLLNTFVL1ASGVS I 
T * AKHSL I ESNRNR I IQALLITILLGLY 
FTLLQASEYFESPCTISDGIY 


1517 


15418 


A 


1525 


2 


376 


LKAKTGQ KLGLLHQT VS KF VNAKE KF W K 
ELL\KSATPVDI*MIRKRTSLITDMETV 
*W*KEDQTSKPIPLS*SLTQSKALiNLF 
KAMKTDRGKGAVE * KS EANRGWFMQFKE 
RSC/RFCNIKVQG 


1518 


15419 


A 


1526 


3 


386 


ESMLKAKTGQKLGLLHQPVSKFVNAKEK 
FWKELL\KSATPVDI *MIRKRTSLITDM 
ETV* W* KSDQTSHHI PLS * SLTQSKAL 
NLFKAMKTDRGKGAVE *KS EANRGWFMQ 
FKERSC/RFCNIKVQG 


1519 


15420 


A 


1527 


127 


388 


KRKSQINNLLLQFKELEK/QEETKPKAS 
RRNKKKKIRVDLLKIKKGKPLEGVKKKG 
GFF*RTNKRDKPLLKLPKKKGGGRIKTF 
HKTS 


1520 


15421 


A 


1528 


3 


402 


HENHMKICSTSYVIRELQIKTTMX*YYT 
PVRTTAIQNTDETKCWQG*REQ/GSLIY 
CW * Q CKMVQ / PLWKRVWQFPTKLKHSLN 
I * SAWLLGI YP \KSGKFNVHTKTCT * M 
FLAK* PQCSSVDEERKKM/WYMYGFE 


1521 


15422 


A 


1529 


2 


365 


IEKLYRSGAKFFCRDRVSLCCPGWSRTI 
GLKQSSCFGFPKCWDYR/R*AAIiSGRLL 
LTISSH/REQIEQEFTHYPEKSAKLFKQ 
DGPPRRK/HPPTRPQASNIQNKFSFFLR 
QSL/NSLAQAG 


1522 


15423 


A 


1530 


I 


418 


GTDTE *ACDKIQKPFPDKRLNKLG/ IRK 
KHLQLDKYL/WKNPIASIILKSRRLKSV 
RLRPGDGRQGCSFPAWLFNI ILEGLARA 
IR*EKQKQXQGQ\QIGKKEVNLSLFIED 
IKLYIENL.KESTKKPIIIINEFSRKBDF 
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C=Cysteine, D=Aspartic Acid, 
E=G)utamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, i=Iso)eucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
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T=Threonine, V=Valine, 
W=Tryptophan, Y=Ty rosin e, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possib!e 
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RP 


1523 


15424 


A 


1531 


381 


3 


ILQGFVIRKKSKI FQKACHTSDLWTLTN 
L S CLHM KGLP PHRL YS S CTVLL FLR KP V 
MRKTTLSCCFLT*DFFLPLLPPCFFR*F 
ILLIF*LYFCKDKVSLCCPSWSKLLP\Q 
QSSCLSLPKYQDYKA 


1524 


15425 


A 


1532 


38 


479 


DEACGPQDPYLTP YVKTQWI KD* TRNKG 
IQFLEENGKN\ FDIGFSSDLLDMTPKTR 
ATKVKL\NDIRLRNFCASKDTINLSLLC 
R\KR* PVEWEKISANHISDKGLI SGI YR 
QPPLNS KTSHL I * K*ARDFNRHFSEEDI 
QSALYRWVL 


1525 


15426 


A 


1533 


105 


447 


LIFCRVFBYLHSLHLPQEICLSLALFSR 
FTFCVI ICEVDVWSVIFKVPFCSKRNKV 
AVHTMLYIQIFVSLFI *PQNWKQPKCPA 
TVER INKMWY I H I V/ E YYS ANKR 


1526 


15427 


A 


1534 


76 


471 


VWVCLLS LEGS QS KFGNS I EFGVLLS SG 
GFSAWRLFFFVYFLRQSL/NSVAQAGVQ 
QWRDLSSLQPLPPGFK*VLKQRGVCLFV 
CFETESHSIAQAGTQWCDLGSLQPLSPE 
FKRFSCL\SLSPPWPSG 


1527 


15428 


A 


3535 


45 


338 


SNNEPFI*DWIVM*RKVDFIRQPAMTSSV 
VGPRRRSKALPKAKLAPKKVMVTIWWSS . 
ARLIHCSFLNASETIASEECTQQIDDMH 
* KLQRLQAAL I NRK 


1528 


15429 


A 


1536 


425 


I 


FFNITFHSVSFSPLSQKCILTL* ILLKS 
IVKNNMRNFQRSLVRK*AKDMNRHFTDD 
HVQMAS/KHMKRCSLVIGEMQIKN\TVS 
YHYPPIRMIKVRNSSNTKCV?*GCGQTGS 
LMRCWW/NK*NQLLWKTGI PPHGTjVSTR 
SRVF 


1529 


15430 


A 


1537 


4 


443 


ETFVDHYQCGGIRPFSDLQLHAGRTTAL 
FKAVRQGHLSLQRLLLSF\VCLCPAPRG 
GAYRGRQVSLS CGGLHPVRAS WLLCLP K 
*AWTMEGTSTPASLPPCSLISDCCASNQ 
RDSVGIGPSEPGAEYNLLVPRFLjSPSEK 
RSIWVGV 


1530 


15431 


A 


1538 


487 


3 


TQNGG VLLS APRS VFS PTTLR / CTLQAQ 
C*AFWWGGTQQAASSTAAMAAMKPLGIW 
AGGAAG ILPKLGFQD/ LPLS AEADPAGK 
ELSIGRQRAWREQPDQSAEPPFSQAPRP 
GYPPSPQPLSMRRGPGANPRLARPLRGP 
VRVRLRRASSERQKRSR3GSGPLG 


1531 


15432 


A 


1539 


394 


489 


IYLFIFETESRSVAHAGMQWRDLDSLQP 
SPTG 


1532 


15433 


A 


1540 


475 


202 


PGGGWFSPPDNFSLKENSQGGPPVSHPP 
PPPGNGEGGKTPGAGHSGI*NPPPPLKI 
N/liEKGGRGGPPGPIiLNPPFPKEKPRGG 
KKKKKKKR 


1533 


15434 


A 


1541 


14 


468 


LSMWWNS RLKARRLVLVSQ WNA\ KERF 
LKEIKSVIPMNTIiMVRERNSLIVAVEKV 
LVA* VKEQTSHNI PLCQSLVQS KALTLL 
HSVKAKRG* EAAEEKLEAGKHWFLRFKG 
KSRLHNI KVQGEATSTDGEIiQQVT YPDL 
VKIVGEGDYPKQ 


1534 


15435 


A 


1542 


479 


141 


RKTDSWDLIK\SFSTAK*AINEADGQTT 
EWEKTFANYASDKGLISRKELKQINKKK 
ANSPI KK*AKDTKKQPTT^3KKCRTSLI I 
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N=Asparagine, P=Proline, 
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T=Threonine, V=Valine, 
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nucleotide deletion, \=possible 
nucleotide insertion 














REMQ I KTTMRYHLTTESE* LLI KSQGRA 
L 


1535 


15436 


A 


1543 


17 


474 


NPKKKKKKKIDLLDTYYTLYPK/ AEYTF 
SSEIHKNWLYIN*LRANLNKFIN*KIKT 
I LSDHNRRQPE INKCNKTKR/RVT/TQK 
FKNL I LSN YWVKGE LQTQ I AE FLKRNVN 
KNT I YLNT *NT I KVLNSGGS T / ALNT YI 
DKNFKKRVPS 


1536 


15437 


A 


1544 


89 


481 


ICLKVISLRQENATSVACIVSCLSEGSR 
ASENLVLHNAS PNSCGLGI AFTCLCNW 
SSGNKI TDCYD * LS S S/WLGFFFFFFLK 
KS FL FVAQAGGQGGNLG* PKPLP PGLKQ 
FLPPPRANFCFLEKTGFFLF 


1537 


15438 


A 


1545 


1 


300 


PPPPAPXXCRPY*XPWXXPVYHSSWRHG 
SGAAQGAVLAGFGGVGRQGPGAAS VS I P 
LCPENQGCRE PG P SHAVPAPS AL P SLR S 
LTGGQIGGTRAAQAVG 


1538 


15439 


A 


1546 


2 


436 


GAPEIKSIIKGYND/RLCTTKFYNLDEM 
D KFLVRHKLPKL I * E * IDNLNRW I TSQE 
TDW*I*QQSSSSSSSSSS/PSSSSSRPN 
GFTTESYQSFEDKLIPIICKLLKKIDKE 
\GHFPLQL*GITQIPKPDIYH/IENYRP 
ISLM 


1539 


15440 


A 


1547 


54 


419 


PATWEDHLSQGGRDCSEPRLHHCTPAWV 
RE*DPTSKKKKEK/NDQ*LLIP*S*RRL 
*RSSHQTSHLMDWNSEAGR*EATFLRSY 
SKL\KEELELLLILDNNDDHNNSD*YLL 
SIYSIPGTVLS 


1540 


15441 


A 


1548 


37 


339 


KRWKCLRA* LLMRPRHAD * LN PDGVGY S 
EL*SRHCTPAWVTEQDLVSISTNRKNER 
HTLEYSHQHYSG* P * TG/ EESEYPSAL * 
QRE 1 1 DYS FIQGMTDQL 


1541 


15442 


A 


1549 


477 


1 


PGSKDLGSYT*PQ/WSSPEVTSRDAPS 
HPSAPKFCSNPCRGRWLTSSKQPKLRLT 
SAVPGAPGARGTFHALGAGAAEAGGHSA 
SRPEAALCRPLP PLPMTLTSHPLLSGPG 
RLAWGCNGRR* IKGGG 


1542 


15443 


A 


1550 


430 


8 


CWPGSSGTPDLK* STRPGLPTCWDYRHE 
PLCPASKTFLSPQIETPYPLNNNAHSPR 
RPALVNYSLLSVSMDLPIL\AFHTNGIT 
GYVASPAPPVRWGSHSVAQAGVPWPNLG 
SL*PPPPRLK*SSRLSLLSSWDYSHMTP 
LN 


1543 


15444 


A 


1551 


2 


419 


ETSPSLQGWLGVLFPKRGAKTSRFLIIR 
PQGGS FKDGDFFNP PGEIKTPPAKKKKK 
KDSARS P PARLQA* GAGLWDARAS FRRP 
FQAPVPLSAQPRAQPLAVGTSRDGGSSG 
PTPGQESAV\PWREKHPQQPPPQPG 


1544 


15445 


A 


1552 


2 


387 


FRHVAQAGLELLG \ S SDPS VS A SQSTGI 
RGMTYRAQPGL*LFNSKNSILSGPKV/L 
QDYMW 


1545 


15446 


A 


1553 


393 


2 


KNPIFFFLKQGFPFFP*LEGRGKFLGPC 
HLCFPGSKKG\LPHPPQLIGAPGPPPSP 
GFFFLKRGFS/LFCPGGSFSLRRKGPPP 
PALPKF\GFLRVTPLAGPGNPSFFFFFF 
RDGV S LCH PGN SAVAAS RLTA 


1546 


15447 


A 


1554 


1 


427 


LLLL I LYAI VEAVT * TEGAG YP PL PGS Y 
SKRGASGHRAIFALHLTGGCCI ** GAMHC 
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nucleotide deletion, \-possible 
nucleotide insertion 














ITTMIDIEPPAITQYHTPLIiA* SGLIT* 
GL\LVLSLPVLDAGITILLTDRNLITTIi 
FNPDGGGDPILYQHLL*VFGHPEVYIRI 
LPR 


1547 


15448 


A 


1555 


3 


382 


TLYLLFAA*AGGL/ GSALSLLIQAELGQ 
PGNLLGNDHIYNVIGTAHALVI ILFIVT 
P 1 1 IGGFGN* LDTL I FAAPDMALSRLNN 
I SL * LLHTS VLLLLAS AI VEAVA* TG * T 
G YLCLAGNYSHP * AS V 


1548 


15449 


A 


1556 


380 


3 


EVSPCCPGWS*TPGFKQFACLGLPRCWS 
YSREPAPPAKACFLDI*VK*WFCK*GSP 
EVNSRPGLYLFI YLIKQKKGVK* *PARY 
LVLFFCFFVFVFVFVFF*HRI S \ SVAQG 
GVQWCDLGSLQPPP 


1549 


15450 


A 


1557 


100 


254 


IPTVPTY*TPIKSFHRARIESSGPGYSW 
PVDSAK*VPLAWSLDSR\RDSGNLVKP 
LMRVTN* MKRHLVTLTQS \CYS 


1550 


15451 


A 


1558 


2 


289 


APGVS STMEDEMGGGLEPQRRGCSKQRS 
HHC I PAWAT E * DCLKFNNNKKN YLLL I S 
IiRTQSYLYF/ C*WSY*LKLSIKIAGGT 
GEKEH* SQKRKSK 


1551 


15452 


A 


1559 


24 


354 


PLPSASPGPEGATPVPTS/ACPNKIKLY 
HLKKKKKKKKKKKKKKKKKKKKKKKKKK 
KTKKKNKKRKQQLNE * ETG 


1552 


15453 


A 


1560 


376 


2 


AARGSGVRDPLEEAVCLFSDLQLRAGRT 
TALLKALFK\RQGHLSLQRLLLSF\VCIi 
CPAPRGGAYRRRQASLSCGGLHPVRASR 
LLCLPKQAWAMAGAPPPASLPPCSWISD 
CCASNQ*DSVGVG 


1553 


15454 


A 


1561 


3 


408 


AASTVFLPFLERKGIDFGFLFFFFGEKK 
FPFLAPGGAPGGLFS FPEASS PGLNPFF 
WPNPPEK * KKGGP P PPPGFFFFF * KKRG 
FPGG\PGGAPFPDPKIGPPGPPKGGEFR 
GGPPPPGPNFFFFFLKGKGGGGPP 


1554 


15455 


A 


1562 


355 


161 


FKPGDGGCSEPRWCHCTPIWVIMRDSVS 
KPKQSKTKKRNVFT/C*D*VF*NNNINY 
INCFS*RALYILYEKQFM 


1555 


15456 


A 


1563 


410 


1 


TPPPPPKNFLGTPLFPKNKAGKGLFPPP 
LGFPKGGQWPIPPQRFPPLFSPKKRAD* 
KPKPPPALKIRGPNPGVFPKGFCGFSFF 
PPPPFPRGGGLIFFLPP/TK*SGG*KKP 
KKKKKEKCPKKQRGGIDQLTSNLGV 


1556 


15457 


A 


1564 


2 


374 


ADRNLNTTFFDPAG /GVGDP I LYQHLF * 
FFGHPEGYIXjILPGFGIISHIVTYYSGK 
KEPFG YIGMV*AMI S IGFLGFIV*AHHI 
FTVGIDVDTRAYFTSATIIIAIPTGVKV 
FS*LATLHGSNMK 


1557 


15458 


A 


1565 


396 


0 


1 1 FL I FLRHGFAWAQAGMQWCGLGSLQ 
PLFPGFRLFS/CLLSSWDYR*RQGFTMI* 
ARLVLNS*PQAl\SASPSQSPGITDVSH 
CA 


1558 


15459 


A 


1566 


186 


452 


KQKCNFKTLNTKYQPLPFFFLERNFCFC 
PPGGGEGADFTFLEPLPSGAKGFF\CLT 
LQRM/WE*RVSPPTPLNFGFLVKKGFSL 
CGSTGF 


1559 


15460 


A 


1567 


453 


37 


KTALYWKKTPSSTS IAREEKS VPGFKGQ 
AHFLFRGKHKFKLVSKLIYHFENP/R/A 
LKNYAKS I L / PYK T ^KAWMTAKLFS P \ W 
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X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














FIKYFRPTVEM* CSYNKILVLIDTGLDH 
PRPLIETYK\FINLHVCVLFNTTPIVQP 
LPR 


1560 


15461 


A 


1568 


453 


39 


TALYWKKTPSRTS I AREEKSVPGFKGQA 
HFLFRGKHKFKLVSKLIDKFENP/R/AL 
KNYAKS / TLAYKWNKAWMTAHLFS P\WF 
IKYFRPTVEM*CSYNKILVLIDTGLDHP 
RTLIETYK\FINLHVGVLFNTTPILQPM 
PRAQ 


1561 


15462 


A 


1569 


1 


352 


FGTRRERERERERERERERERERGRGGA 
GHQIPSIEGQHKIVGAIIY\CREC\GEK 
ISGSTSYIKVWDIRDSAKCIRTLTSSGQ 
G I SGDACAATFTRAITSAQGEH* INQIA 
LRPSGT 


1562 


15463 


A 


1570 


394 


3 


TIiDWPQSRGGSTGKPVYPSVCCCCCCCC 
CFNFCQEAEKNNAEGLIjHNQAGRTKDGS 
FAPSHDH*A/PRGTEV/DLLESTLQTS I 
KQVESKPR\EQARTGAGGQKEKATQNPE 
KSVLTSMYTKSQGSEGRLPGNR 


1563 


15464 


A 


1571 


399 


2 


KHQLPVFWQYNKKAWTTRTLFLDWLHCC 
FVFE VR KYLAS KGL P FKWL I LDNAPGH 
PPRTP * VQY\KGI E VI YLP PNTMSLI * F 
LDQGVTRTYR/ SHYTQYSMQRTISAMQE 
NSNKENI I KVWKDSTTDDAI VA 


1564 


15465 


A 


1572 


37 


400 


RGT I TGEAASADQE I ADKLS DA I NKI ME 
EKG\Y*LPEQVFNLDESTLFWG\KKKPQ 
RTLLSKEKKRAPGFKTGKDRLTLLFC/A 
NAVKLI IRTALTYKAENPQALKENVKHQ 
LPVFCLTTRGL 


1565 


15466 


A 


1573 


84 


485 


AGHKDSPRPHQTQEPSWLHLWDPAPGLQ 
VE LP AS P \GRALALLS PWWDGTGRPGA 
GGGTRRGGSGPTGAHGAGGRLMHGGLQV 
PSPAPREGS *GPARYQAQRMWARTAGGP 
STPSAGASRVPSPHCPGP 


1566 


15467 


A 


1574 


3 


463 


TPAQGLRDPSNMRKHAYCGCCV/ CITLC 
VGAQNKNRAVCGL YS TCPRLCVYEHEH I 
CVNE*VCEHVCERESVRVCESTH/LPLC 
A^TCGPI FGCMSEKHVFSYTPCVHRVCV 
CVCVHLGCCVC/VCVCVCVCVCVCVCV 


1567 


15468 


A 


1575 


1 


383 


FLS FGFAPQAGGQGHNHG *GP P * P / PKA 
KG I FP PHP PEKREQRVHATPPGKFLDFF 
FFWKKGGIiNIjGPKKNLGPGGKKNL,L»VS P 
PKGGGKKKETPGPGGVFFWGGIFFFFPP 
PCS PG * PLSLLKKPKGG 


1568 


15469 


A 


1576 


35 


469 


RI PRCHQPVGPLGCREGAKPQGPGPDAA 
DSHPPAS PRAH * / P * SLHRDP I PGFRGP 
CRRNAGAGPRAHTAGI CAPQPNS * SRHH 
PWAAAPAGDDS PTTS LGSHVPVGRKSSD 
GCRQRALCTPGS PAPSEAEVGGS PELRS 
LRPAWA 


1569 


15470 


A 


1577 


473 


62 


SLEINPY I SGQLI FNQHAKTNH* SMRKE 
\SFFNKWCLDN* I STGRRMKLDPYLKPH 
I KLNSK/LKDLNIRDTTMINLHDLGFGN 
GF*TMTTK*AIKEKIN/WDFIKI*NFCA 
SNDIIKKVKRPNVTAYTCNPSTLEGPR 


1570 


15471 


A 


1578 


59 


426 


LERRSFGAPLSPFFAPQFEMKKGFGSP* 
KFFFSPKALNFGGGWPFFPPPKKRFFS 
KNPQEGFIPPPLKICKKXTSQPF/YKFGP 
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T=Tbreonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
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nucleotide deletion, V=possible 
nucleotide insertion 














PKESFKRAPPFFFFKGGPFPPIjVSNQWT 
GKPRLPPPPPKETQNPSSFQIGGQKKGK 
GGPQSFSFPIKKKKKKGGHSRSRTSPRV 


1571 


15472 


A 


1579 


2 


419 


KANKTKNAY FEG INKMDKPQVI LKKKY/ 
REKT* IPSIGEEKGIMSP\NSEDTKRI I 
KECFKQTYAHRFYSLKE/MDILLESHKL 
PKLTQ/EETDSLNS PV 


1572 


15473 


A 


1580 


129 


403 


YMFFIPINCQDHPK*KKKKKKKKKKRGG 
/RPFKKTLRGPKLNRAEKNKFFF*KGSI 
KKKCLE IliKKKLFFGGEKNCKNPPKKKK 
PSREKKKF j 


1573 


15474 


A 


1581 


317 


76 


PRFFFFFPPPKKGFFSPPFFFFSPRFFP 
PPPFLKPPPRFFFFGPFKKNFF\PPPRP 
LIFFFF*APPPFFFFFFFFFFFFV 


1574 


15475 


A 


1582 


259 


377 


PREMKTYLPTKKLGYEISHYH*WWGCKM 
VQPVWKTVWQFLKGLN IKLP * D SAVPLV 
GM*PREMKTYLPTKKLGYE/MFTLSLLI 
IANK* KQPKCPRMNKW 


1575 


15476 


A 


1583 


i 


415 


PTRPITSSICLRQSYLKALIAYSSISHI 
ALGV\TAILNPTP*SFTGAGILIIA\HG 
LTCSLLFCLANSNYERTRRRI I ILSQGH 
QTLLPLIAF* *LLARliANLALPPTINLIi 
GELSELRTTFS * SN ITLLLTGLNILGT 


1576 


15477 


A 


1584 


216 


406 


LNLLPLVLGGSSCLPPFCGGN*KPPPPP 
G\LFLEKKGFSPCGPAGF*PPALRGPPP 
PPLPRGLI 


1577 


15478 


A 


1585 


383 


3 


KKFGYPFYWIG* KILK*FPG* K* SLPHR 
KSPFFFFF*GRVLLCPFGWRGTTKGHGS 
QVT\LPAAAWTFQV\K*SSPLRLPSR*G 
YRQASPWSGNFFF/ CLVESLSMLPGLIL 
NYWAIAIKPSGPPKVLG 


1578 


15479 


A 


1586 


126 


413 


NPTLKK* KMKENRM KKNEQ S LRD L * DT I 
KLTNRCILGIPKEERKKGAEGIFGEIMV 
GNSSNLIR\ENINLNIEVAQ*TLSRIN* 
KRATLRHVIMKM 


1579 


15480 


A 


1587 


242 


409 


GWMI FRFNFFFFLRGSFTLVTQAGG*GG 
DFG/ SLRP PPPGLKRFSCLTLPRS WDYR 
H 


1580 

j 


15481 


A 


1588 


2 


338 


EIEKKGKGKKRRG* RSNKKKRGGRLEGQ 
KY*SPPCPRLSFFLVDEKGPEVLQDSLG 
GWWKTP/SGCPEMTDSSQPY/YRAFYVL 
KN/QRVGFSVDVGE IEKDQDVEKNQDPS 
CPRL 


1581 


15482 


A 


1589 


360 


0 


NNFLEQLKFPPKKKITTDSYKAPRPFFF 
FPPSQKGVFPPTLFFGFPPGFPPPPFLN 
PPPGFF/CFWAPLKKFFFPYPGG*TWVS 
LKGPP/L/RFFFFFF*DGVSLCHPGWSS 
SAQS 


1582 


15483 


A 


1590 


75 


412 


VEGQHCNFCAAQETINRVKRQHTSLLET 
FANWSPDKGLIPRTYKELKHLNRKKHSY 
* KWADDLDRHFSKETYTPPKTYSCTQYC 
*P/SITIEN*KLKTSNIKTRLQGGHKRL 
ER 


1583 


15484 


A 


1591 


309 


1 


FSTQGAHMQVCYMDTFHDAE I WTS IEPV 
TQI VNMLPNR* FFNPCPHFFPPPFCSP / 
LVSFLLLLRDRVSLCCPGQSRSVGFK*S 
TCLGL PKRWD YKCE P PHV A 


1584 


15485 


A 


1592 


2 


415 


L3ISL5IF3FLP\F SFLLPSLSLSbt'LS 
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G^GIycine, H=Histidine, I=Isoleucine, 
K-Lysine, L=Leucine, M=Methionine, 
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nucleotide deletion, \=possible 
nucleotide insertion 














FFLFLSFFPPSFFFFFHRASLCHLGWSA 
MA*SWLTAVSTSQ\VKQSSHL 


1585 


15486 


A 


1593 


3 


396 


HSAFLFFL* DRVSLCHLS WKAVAQSQLT 
ATSTLL/VLKQSSHIiNLP 


1586 


15487 


A 


1594 


300 


62 


KMTGVLKSSCGKS PKQVGWLC VCVCVF/ 
S FKRQGLAS / VAQTGMQ WGDHS SLQP PA 
PGLR*SSCISLISSWDYRRVPRRAPG 


1587 


15488 


A 


1595 


1 


412 


FDRPAADQKAASALKASGVQAQMAKGTY 
HDWSLQDWKVLWMTH* VS \QQEQDPTNL 
YISNLPIjCMDELQLENMLKPFGQVISTR 
ILRDYSGTFRGDGFARMESTDKCDAVIE 
HYNVLIiI MTPPGVSAPTEPLLCKFAE 


1588 


15489 


A 


1596 


428 


2 


QSESREPENFLLPTSTSSLIiLVHCLSRT 
CWNSTLSCVPHFFHLTFNSHLLLTSQP 
RRVHFC* LSLS S 1 1 S WKLLNTLPGVP I R 
\ASEI FGLRTIRNFPFLSHSVLFPFSLS 
ESHSVTQQGVQWHELGSLQPLPHGCQWL 
SCh 


1589 


15490 


A 


1597 


2 


442 


QGS L * PHPPMLK* SLTSAF \N YRHVS PH 
LANY FLF FF * KQGIiAVLLGLALN* AQ / C 
HLPALASQSAG I TGVSHGAWLFFFFLNQ 
CYLI * FLILI F\ ERRNSPVAHVLVNGGD 
LGLLKPPPPGLMGFSCLNLLRSGKYKPP 
ATSPGYIFC 


1590 


15491 


A 


1598 


419 


121 


NLGYPRVSPPPPFLNPPPEFYFGPPKKK 
FY/PPPPPAQKIDPP*TFPPFFFFGTDG 
SHYVAQAGLELLAS SDLPASALQSTG I T 
KHE P PHLAET I FLVFL 


1591 


15492 


A 


1599 


45 


397 


DRVSLCYPGWSAW\*S*LTAASNSW\V 
K* SACFSLLS RGDYMCMTLHI ANI KKNF 
I»GRVRGSHL*SYHLtKLCLKK*RTKEGFL 
SFFWKGKRHFLGQKNILNPRLKVFFLPG 
PPKGIrGY 


1592 


15493 


A 


1600 


3 


397 


SRRPGRFSLMLTLSWHS*VCRAALAARE 
EQWSGCFKSHF*LEVNFWVSGIDREV/C 
G Y/LKTVLGEDLNDYVSTQ I + D * LMKPR 
CPEKQDESLLKEFGGGA*RLNWHRPGA 
VAHACNPNALGAKGRRIPRSG 


1593 


15494 


A 


1601 


244 


2 


KTKTSFHSLIDSCGYLIiSYSNFQI*KKM 
IFK/YLHLRAVPRHVIVRFTNLERQEKV 
LRAAREKG*VMHKGKPIRLTADLSA 


1594 


15495 


A 


1602 


435 


2 


PQEAIYTPTQHPTYGAICRIARIHGSRD 
Q/SVEMKWHPQLTITPSGP/LGKSLLPI 
P VALCFADLEVF I PKGGMLS PGVT I I S L 
NWKLRLP WVI LQ * R I TPLARVT PPDYQR 
E I RLLLNNERKKS YVWNTRDPLGHL WL 
PCPMVIK 


1595 


15496 


A 


1603 


288 


8 


EFPLIREMQIKPTMRYRLTPV*MD\R*W 
* GCGKRGTLVHCWW * CKLVQS VCERS S K 
NITIELPCDPAVLILGMHTKERKLRPCG 
WTRGVGPGYC 


1596 


15497 


A 


1604 


411 


236 


LIPSEHFFFSG*F*AFDRSSLAPTPQLR 
RHRPPTGLLPLSLLQGP\LLTPSVLLS* 
GGRFKGPNFTPPGGQGKIFFMGPPKSNP 
GPGV 


1597 


15498 


A 


1605 


2 


437 


KCLRISPCAGPRRPWCPSFEPRVCVWPL 
GVPQVG/PEGP*GEG\GFEGGDVRQLWQ 
GKICKKKKOTLGKRGPWFPKEKRGFSLTP 
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K=Lysine, L=Leucine, M=Methionine, 
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nucleotide insertion 














WFGPQGLWEPCLNPGGPLFLIGGFQKGP 
KVLIGKGKTPGPKKKNNRPRKRGSLGPP 
GKDDFPL 


1598 


15499 


A 


1606 


31 


467 


EFPGRRFRGGGCSELRLRVCTPAWTTEG 
DSISNNI *LVKEN/HTVNSWKCSDS 


1599 


15500 


A 


1607 


418 


32 


VGQVGLELLTSSDPPASAYPK*WDYKR\ 
DPRAQPNPNILKAQSNRSPPRQR* PASS 
FSNQNSCMHRCISLP*VLFLIFFKNRP* 
NNSFGYNSSGWWSGLNIIiKSC* FWFLFG 
NSDTE I R I CRS FHRGRE V 


1600 


15501 


A 


1608 


379 


3 


FPILSPSYNPSSTELP***SLQSPLFEP 
YDFFSVNMFSFLFFNLKSHTVAQAGVQW 
RNHGSLQPRSFQLK\HPPASASAGTTGM 
CHHAW/L I YLCVYLFLEMRSHCVSQD * V 
QWHNHSSLQPETPTK 


1601 


15502 


A 


1609 


456 


99 


PFPTPFPPPHSNVJGLGLGCGL/EPTMTP 
SGLGLKGSPS*SPPAHRLSGAQLCPVLR 
APVLGPSQMPPGRKKKPTARGS PWRKGV 
FVMSGQSDPLGPSSCQELGPRQSTQGTF 
PGA 


1602 


15503 


A 


1610 




420 


FRFSDGAAGQKCSS PPRPGRGRAEVLLT 
SQTGRQGRGAPHI SDNGQPGRDAP\PS * 
M*WRPGRGAP\PS*VGWRPGGDAPHF\Q 
TGQPGRGAPHI PDDGRPGRDAPHFPDGV 
GAGQRLQTRHFGRPTAAAWKVKWTSLR 


1603 


15504 


A 


1611 


426 


3 


KNPFLLEAKVSFNPKWPPALPPGEQRDS 
VSQ KKKKNLL I HKKAH S KH * F F I CRE \ C 
ESAIiLLHQNTHAGGKS YVCNK * GRGFRN 
KSHFTYQRTHSGKKAFL*KECG*DFL* K 
AILTAYQKTHSGKKSFVCKECR*DFTQK 
TK 


1604 


15505 


A 


1612 


428 


273 


HHA*LIFKIF/CVETRVSLLCPGWS*TP 
ELK*SSLLGCPKCWDYRREPPIRPTT 


1605 


15506 


A 


1613 


311 


4 


ANKFKNLNEIKFPEAHNLPKFTQEGGLN 
NPVSLY*KN*TYSLRLFP\KKKKSGPDD 
FTGEFN*TSKEEVPSLHKLLQKI*ERNT 
LPNLFHKARVTQ VSKS EM 


1606 


15507 


A 


1614 


338 


88 


PNPPPPSKGK\GFPPPTPGRKKKTRPPP 
PPGKFLGF * KKRGFPPFERGGP \NPAPG 
GPPPPNPPKGGGTKGEPPSPEGGVFFFF 


1607 


15508 


A - 


1615 


2 


162 


KHGGTCLYSLLLGRLRHESCLNLGGGGC 
RE/PE/ SHCIPAWATE*DSVSEKKKYL 


1608 


15509 


A 


1616 


3 


399 


PEVREYLTSRGLPFKVLLILDNAHGHSE 
PQRFNTEGINVFYLLPNPRSLIQTLDQG 
VTRTFKSHYTWYSTERIANAMEENPDRT 
S * KSRIMTPL/ IDAI VMTEKAMEAIMPK 
TIISCWRKIiCPDWHDFTRFT 


1609 


15510 


A 


1617 


390 


1 


KRNCFGPFLTPPPGQRLGGFKFLK\HFF 
YYRGERGGFFFSPFNKGFPPPPPFWGFF 
LGALKF*RGVPPSKPPPPGPGKKFKLKP 
FPPRKSYGGCFKCFLGGPFPFEGPPPQK 
KKKKNFTAARDLEPNAW 


1610 


15511 


A 


1618 


468 


0 


MKLVNIWLLLLVALL*GKKHLGDRLEKK 
SFEKAPCPGCSHLTLKV2FSSTVAEYEY 
I VAFNG YFTAKARNS F I S RALKS S E VHK 
WRIIPRSNPSSDYPT*WP/VALKKKKKK 
AGV/LPLiENYSI I TRV 


r i6i i 


15512 


A 


1619 


421 


2 


SSRLSLPKCWDYRREFPRPAQPRILKKI 
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G==Glycine, H=Histidine, l=Isoleucine, 
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S* * EPFFLILKCI * LRTQGLGRGHKGKL 
NSTYVLHKFMD* *LGQLMNGLNMLDESS 
FCNVLKHTKQ\WRDYYFVCLFVCFVRQS 
L/NPVSQAGVQRRDHGSLRAPPPGFAPF 
S 


1612 


15513 


A 


1620 


406 


2 


GTKKASSNGFINGPGAAKKKSQTFAFLA 
KPGAFPSILLDEKNRPQIKPPACFP/SI 
RESQTSYPVFFFWSPSCGFFFFWGKPEP 
LLLKEAGKVPLFLFLFGFL * DGVSLCQP 
GWNAVLRSQLTAESNPHASAHAS 


1613 


15514 


A 


1621 


411 


0 


PPPPKKKPSGPPPP/PSSSSSPSPPRKF 
LGGPRVFFPPPFFKPPPPKNFWGPPKKK 
KFPPPPGGKKFFF* RAPPP 


3614 


15515 


A 


1622 


2 


403 


TAR CGLNFPCS S S L PF IAS * VAGTTGTH 
HHAQLILLIFCGDELSLM\CPGWS/PNS 
SHLGLPKCWDYR 


1615 


15516 


A 


1623 


298 


411 


LIINVCWPGLVAHACNPSTLGG*GGRI / 
TMRSGVRDQP 


1616 


15517 


A 


1624 


263 


2 


DSVS KKKKNFKE * IiIPVLLKLFQNI EEE 
GILPNSI»YKAGV\LIPKPDKDT*RKEIY 
RPISLINIDAKIVSKILANKIQ*FIKKI 
TDAW 


1617 


15518 


A 


1625 


3 


281 


PFSCLSLPSCWDYRRPPPRPANFF/VYF 
YKKNTRTQCFTVKHGFTVLTRLVLIS+P 
CDPPSLASQSAGITGVSKRTQPHTVFFL 
NNPALPKLQT 


1618 


15519 


A 


1626 


300 


20 


NPGPRGFPPPGPPKRLiDFRGGAPRPGF* 
YFLKNFLGFFWFQKNFLVFFLGNKTPFP 
PQFFFFF/RDRMSLCHPGWSEVAQAWLK 
AALTSQTPAI 


1619 


15520 


A 


1627 


394 


40 


PQPAAASLFSPGLFFFFPPVFSPPPFFK 
TPPRIFFFWPP*KNFFF\PPPAFFFFFF 
LGAPPFFFFFFFFFFFFFF* DRVSLCRP 
GWS AVAQ S * LTAALNSQT 


1620 


15521 


A 


1628 


386 


3 


I FPTRCTHLHLGTNKLPTCS FFDQAKKN 
PFCS\HHSRGVGLRARLFCERLTIEGAG 
TPACPAP * FPGEPTRP/PGRVRWIiTPAI 
PALWEAETGS * YVARTGLELLVS SNP PL 
SASQSAR I TGVSHRTWP 


1621 


15522 


A 


1629 


401 


93 


ARGVLPLNPPPWGGRGGGSP*GKNSKPP 
RPRGENPPPFLKPQKLPPPGGGPPPPLF 
LGG*SKKTPPPPKGGAPINQKPPPGLPP 
PGKKGAP FPKKKKKKDKN I RTKKKARRS 
GSLLQSQHPGRPR/RGGPPLTKNPPLAS 
PPQEKRGPPFQKKKKKKTKI 


1622 


15523 


A 


1630 


417 


47 


PPPGTISSPNP/QKNLKKGPGPGGNPRN 
PPPLGGQRG/RGLWAKKSRPPGPPRGNP 
PLFKKKK INGGGG P P PWPP PRGARAGK 
SLYPGGGPPQ*PQMGPPPPPPGAKKGFF 
PKKKKKKPKKPRKT 


1623 


15524 


A 


1631 


416 


54 


E YWCKGR * I DQWNRM * FKMDQHLHSQF I 
FN*ATKAIQWGKESLFNRLCLKNWIiTIR 
DK\IYLDACLTTY\*KINSSWHSGAFL* 
SQILERLRQEASLSPGI*VQPRQHSKTP 
SLKKFFFKFFS 


1624 


15525 


A 


1632 


2 


373 


L.VFLDNMLKLLRHSALASACFPEDLAKI 
MDEGGYTKGQIFKVHGTAFCWKKMPSRT 
F WRE QS VPGFK / ATD * IjLLGAIvAAGN \ 
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KLKPMPIYHSENPRVFKNYVKPILPALY 
TWNSKA/QMAAPLF 


1625 


15526 


A 


1633 


3 


383 


NI WN* KAWMTVHL/ FIT* PTEYPKSTVE 
NC* EKKKILFKI /LLLIDNTFGHPKALM 
EIYKEINVIFMPANTISIL*PMDQGVIA 
TLNSYYLRNTFHKTIAFINYDSSDRCGQ 
SQLKNFWKGFSILDAIR 


1626 


15527 


A 


1634 


2 


182 


GACTQL I GRLRQENRLN / SGDRGCSE P I 
LHLCTPPWATE*DPVSKKKKKNPCPKLK 
KGPPP 


1627 


15528 


A 


1635 


331 


1 


LFPPPPVLKSGPGPN*N*PPLKGKSFGP 
LKKKFFFFFF\FRDWVLLYCLGWSQTSG 
LKRSSCWDYRCEPPHLTPNF/SYF/CRD 
S VS L * PRLEGSGS I 1 ALCShKLhDSS S P 
H 


1628 


15529 


A 


1636 


80 


381 


KLSKCHAHHSRGFYKYSLFQLGASQFPQ 
VLRITHPHKGSWAAPRS *G* SQC/SHFF 
SFFFETKSHFWQAGGYGRNFT*LQPPP 
PGLKRFSHLS LPS S WDYS 


1629 


15530 


A 


1637 


381 


38 


SKRQGFHHVGQAGLELLSSSDLPTMALH 
PTCPLQKCWDYRC/ DATAPNLSSTFFMQ 
EKCIiAFL\PLFSSAPHSL* *PKAVIEKD 
CPGL * I W* * VARSQRMNVFNFI K 


1630 


15531 


A 


3638 


3 


295 


PGPDGFTAEFNQTFKE*LIPILVKLF*K 
I QEVR \ F FKF S I TLTPKSNKD 


1631 


15532 


A 


1639 


48 


380 


ILGKAISFT IE * KGLKYLGI YLTKEAKG 
LHTENYKMLLKELKKDTNIWKGILCQ/W 
TRRLNMVKISV 


1632 


15533 


A 


1640 


343 


23 


SWLTAT/FCFLGSSDSPASAGVELLTSG 
DLPALAS * TAG I TGVSHRTRPALS INTS 
TLLCSSPYCPPHLQSLQGTTQPVPFLKA 
QRTGCLLQGVILTSQAENFCNKH 


1633 


15534 


A 


1641 


3 


397 


LELFSSAHCCPSLTVMQYYPP\RPTSHD 
CQREKSPHRTKKKKPLEGVFLG* KIKTN 
LENPPPLPFFGGGPPPKGGGPFKIV*GG 
WPWPWGLKIPTLPKGPTPRPPWGPLGTF 
GGEGKPijRAFPLQRFFRGPE 


1634 


15535 


A 


1642 


2 


308 


NKWR * GNW I S VKI DNFN F YLM P YTNL S * 
IRDDLNAKATTI KLVGENIGENLGIGKN 
F*ERTLKA\LRGKKMDKLDFITIGNFCF 
SKDR I KNKNKARRGGSRL 


1635 


15536 


A 


1643 


16 


386 


EKKKLSL»FTENGIPYL*NPKESAKRLLS 
LINDFSKV*GYKNN\DEKSVAFLNTNK 


1636 


15537 


A 


1644 


533 


3 


PLSLSLSSLFLSFFL*DRACFVAQAGVQ 
WLDLGSLQPPPPGIK*FS/CGGNVAVTP 
RLSPLTLPAMTEVRLPSSKIQTNKEKNT 
VMS E I YQSLM * MGWG / M* VRMCVKFKKP 
WVDN* * VSVMGCLLPSFLPSFLSFLFLS 
PS PLS FFPYFFETES CFVAEAGAQ WLDL 
G*LQPPPPAVP 


1637 


15538 


A 


1645 


341 


19 


GIGGRPP*FQLLGRLKQEK/HLERGKGF 
NEPKSRPCISAWATKGDCFQKKKKIHKT 
LQQIQPKQSWKTDISQFRTDYKTTIIKT 
VGHGIWIDIFINGIELSENLNQCS 


1638 


15539 


A 


1646 


279 


3 


TFYHNE KDNNKCW LT I \GKI + MLTGCYW 
ECKMVQLPWKTAWQFL/R/DVNI*LPYD 
LAITLLSIYTRKRKTYVYTKTCIQMFLA 
VLFTTAKRWKOF 
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1639 


15540 


A 


1647 


392 


1 


QKLE IMLREEGMLKAKTGQKLGLLCQAV 
THLVNAKEKFLKLNVLLQGTHKW * ESET 
AKSLI LFNSMKAEKGE * AAEEKFAASRG 
WFM/RFKERNCLHHLKVQGEAASAWEA 
VANY I EDLVKKIDKGGCIN 


1640 


15541 


A 


1648 


425 


148 


SQSLI*SKALTLCNSMKT/ERAEEVAGK 
KLEASNF\LKFKKRSCV\RNVNMQGKVA 
SLDGEAAASSPEDLWFIDEGGYTKQQI 
FN*DKTSFFF 


1641 


15542 


A 


1649 


271 


462 


RRQKKRHKRCILSPLLFDIVLEVPARTI 
* QEKKI KGI QIGKKEV/ KI I SLFADD W 
LYL 


1642 


15543 


A 


1650 


70 


398 


RPEASLRHMCLNAGQLLSKRAKLGALSL 
S FFFWKKSLAFAPQPGGQGGNLG* * KPP 
LPGLRGFSGLTLLRN/WE*RWPVPPPT\ 
NFGPL I KTGFPLVGQAGFDLRTLGALR 


1643 


15544 


A 


1651 


425 


3 


FEGFPKVGFPLGPRFPSRVPPFGTLPPP 
PKRGPRCFPQPGAPPPFCIWTPPGALPQG 
GVGPALPG/ ALQKFGPKNPGGFFSGPPQ 
MAP\GGFPGGP*RPPRGGAPFFFFLRQG 
FPVAQARVHLPGSSDPPISAPQVAGTTD 
VC 


1644 


15545 


A 


1652 


385 


2 


KGNNPSPPEIKFFFFFF*KGLLPLPQGG 
GQWGYFRSLQPPPSRLKLFSCPNLPSNW 
EYRGP* \RLL*LITGRGTSGSKTKVPTP 
CGPFNLNQLGLGTQARNFS/RLVFFFFE 
TESPFVAQAGIQLRDLHS 


1645 


15546 


A 


1653 


242 


3 


KNKNFGINRGFFFTFKVPGFFFFLGKVK 
LF FF FRNLKFFLKAKP P*\VFFPIG PS F 
FFFFFFLDRVIiLCCPGWNVWQSL 


1646 


15547 


A 


1654 


3 


285 


HFI I YTKDLNRRFSKEDI * I I IKHV/KK 
CS PSLAVREMQIKTTVRPGTVAGTCNPN 
TLGGQGRRIVQDQLKQQ3KTSSLQKKIL 
FRLiARHGGTCP 


1647 


15548 


A 


1655 


1 


373 


KVSLFFFFEQGLLCCPGWSAWSSLQPQ 
CPRVKQFSHVS/LPSN*EYSCTPNTFSL 
QVCVSIHKYI*YIYIYIFKFF/ CR/DRT 
LARLHRLVSNAWSQAILPPWPPKVLGLQ 


1648 


15549 


A 


1656 


189 


2 


VQPGQQERNSISKTKNKTKQKLPTKKSP 

GPD* fkeelipilhkl/ f/hkiedkgtl 

HNSFYWTI 


1649 


15550 


A 


1657 


385 ■ 


13 


ggpppwgarsppkl*npppprkkppgpp 
pppggapplggfflffpppggpppgggk 
/s fsprffffffflggggqiflfs pggg 
prgvfsppapplvpkpaffskkkkrgss 
ggekpeadgyfik 


1650 


15551 


A 


1658 ! 


352 


2 


hlsllsnwdsirapp/rpcvifknvpln 

IFFL*RGVTMIjPRLVLNDPPISASQVAR 
I IDVSHWAKLRRSV/CYVFETGSGSLSQ 
AGVQRYNHGSVQPQPSRV S * S SHHSLWK 
YRYPPPRR 


1651 


15552 


A 


1659 


265 


3 


HSGQRDEGRMRCGEWLESHGWRARSQM 
TLKTSFLFTMAKI * NQLKCLFMDE WI KK 
MWH IHTME Y Y SAI KR/DEIPS FVATWME 
LEVIM 


1652 


15553 


A 


1660 


1 


163 


NQQNENRKTIEKI IGTKT*LFKKIKXFD 
KPLARWT / RG KKN I Q I TN I RN * RRD V I 


1653 


15554 


A 


1661 


56 


320 


kffotsagestkirclf/sclflflk/q i 
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ESRSVSQDGG*SEPRSCYCTPAWATE*D 
SVSKKQNKNKPKKNLNKCPASSLLITLP 
PARHKA 


1654 


15555 


A 


1662 


3 


421 


GIITDTFPNLEKGINIQVKKV/RRPPSR 
FNPKKTTSRDLI IKLPKIKDKGS * KQKE 
KTS KS ARL PQPHGLLGLGLGKS AS SP I K 
KERKQ\Q ITYSGAP I FLVTDFSVETLQV 
RRE WHDVFKVLKEK/ DF YPR IVYLVKI S 
F 


1655 


15556 


A 


1663 


362 


2 


VIFADEAQILKKEDKLDFIKMKTSVHQK 
TLiSTEYKDNHREKIFVSQISDKELISRV 
YKGLLKLNNTDKNLILFYF*RLGHSVTQ 
GRVQWCNHSSLHPQTPGLK\NPPTSAS* 
AAGNTGVHL 


1656 


15557 


A 


1664 


79 


355 


IHLPSLIGDFNLFTGISLLICLVLFVFI 
FETGSCSVAQ/S/GVQWHNHGLLQPRPS 
GLRQSSHLSPLSSWNHRHGPG*FIYF/C 
VEIRAHYHP 


1657 


15558 


A 


1665 


47 


384 


KEKASGPLINFFFFPKLCKLAPFFFLPF 
FLWGGGGGKFSRNPQKHFPR* KRVFVNF 
FFFFFFLRHCLTVSLSHCHPGWSAVA*S 
QLTAGSNFW\VKQSSHL3/LPSSWDHRH 
APP 


1658 


15559 


A 


1666 


163 


601 


I FCKGGVLPCCPGLADLHFSTSNS I SFY 
Y S S GLLRMTNKTET P MS TIP KGVG VAWR 
FGNSECIFQELPLTLHHLLSTMLASFIH 
SHEASANALVGRSLTVGWGCRGVGVSVD 
PAAWL WRDLXGCF* DKSHS VTQSGGQWC 
NLS SLQP * APRLKRS CI LSL PRSWDHWH 
VPPFLANF* I FCKGGVLP\FAQGW 


1659 


15560 


A 


1667 


418 


3 


S VCLGLPKCWDTGRKPLCPAPS FFY/EG 
SITLIPKSEMHLPRNENYRSGFLLN/M/ 
DAKI LNR I LANCISN * I *NH * \ * KVKFT 
PGKKDWFNNRKPTDI I YPH*QNREEKSL 
VSSTDMAKVFNKIQPVLRELTIEIKGNF 
LNL 


1660 


15561 


A 


1668 


411 


1 


LRLHVGRTTT\LFKAVSQGHLSLQRFLL 
PSVEIC PAPRGG VYRGRQ AS LS CGG LHR 
VRASQLLWFLTQASAMAGPPPPVLLPPC 
SLI*DCCANNKGGFIGVGPFEPCVGYNL 
LVSHLLRPSEKPSIRVGVT*FSRC 


1661 


15562 


A 


1669 


151 


1 


PLEKEAEITGP*PHAWLIF/CFF*TESR 
Y I AQARMKWHNLGS LQPTPPGFK 


1662 


15563 


A 


1670 


1 


389 


TFF* KLKMINLSEEGMLKAKISQKLGLC 
\7VSQWNSKEKFLKEIKSATPEST*MI 
RK*NSSVANTENV* ERSRTSHNILFS * S 
SIQNKDLTFFNSLKAWRGQEVAEEKSEA 
SSGWFMRFKERSHLHTIK 


1663 


15564 


A 


1671 


1 


363 


ECTGPKIAKIILEKKNNVGGLPI/PNFK 
I * YKAPVI * FWLKVP VI PS S SAI LMKTV 
* Y\ YFKDRNQDEWYRLRVWKINSHI YGQ 
LI\FSKGTKTIQW*KSLFNK*CWNNWLF 
TCKRMKLD 


1664 


15565 


A 


1672 


203 


2 


ALNKRTDMPS* * IRRCNI I /KCLFSPKM 
N*VFNVIT/ IQCPSGHFFTETDKSILKF 
I WKSK* PRLAKRTL 


1665 


15566 


A 


1673 


15 


378 


NYHHNQNNEHMYHSLPNFFFFFFGKGAP 
PG\PKGGGRG*REP*IPGGKGNPPIi- ?P 
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KEGGPPGGTPPPGKPGKIWDFWEKGGFP 
LSPGGAGTPGPKGDGPPWPPKGGGKPGG 
TKQPGPKKFF 


1666 


15567 


A 


1674 


2 


349 


LFLTVLARLVSNS*PQ/CDPPALASQSA 
E I TGMSHRAQPRG IR * I * CGS YTTCHPL 
LLLT \ YLFI YFS TESHS VI QAGVQWHDL 
GSLQSQLRRLT/ CILTLPSSWD* PHVTL 
PG*FCIiF 


1667 


15568 


A 


1675 


348 


78 


LGPWVPPVIPAPLGG*GGRS PRPGFLTN 
PGPKGKPRVFLKIQNLPGV/RGRPPLFP 
/GSPGSGGPKSP*PLGPSFPLT*NFSPP 
FWGPPGGPK 


1668 


15569 


A 


1676 


2 


385 


TS RRD YRP \GHHNQL I F * / S FCRDGAS L 
CCPCWSQTPGIKKSSCLVFPRCWDYRCE 
PGL* I *MGKNPT\LFSNGL* CDCIPLIH 
S IAD I RKKPHS *LQGL*LCHQQNSQTES 
GSVTQAGVQ* CDLGSLQP 


1669 


15570 


A 


1677 


386 


1 


KSTRPVLYKWNNKA*MTEHLFTA* FTEY 
FKAT I ETFCSEK* I PLKI LLLI CNVPSH 
PRALMGMYKEINWSMPTDITCILQPMD 
QGVISTFKSYYLRNTFDKAIAT/DSDSS 
DGSGKNLLKTFWKGFTI 


3670 


15571 


A 


1678 


2 


193 


EGGRIFFNSFFEVILTLI/PKPKKWER 
K* SYHP I S IMNGDVKI LAQTLLNQ IQQY 
LKRIIHYDS 


1671 


15572 


A 


1679 


561 


830 


TLLLGT\NAWDFKLKPMLTY\HS \ENS 
RGL * KS WINLGLTVFYKWG TNNAWGDDR 
HHIiVYRHGFTGIF*GSQLKTYCSENIPF 
KILLFIDN 


1672 


15573 


A 


1680 


415 


2 


TSCAWLSLYPVYLYRSSSLPRFIFCHFK 
HLWYKNITKKAAEINTNFCCYINRVLLC 
HLGWS / ATWSS * LTVTSKLLGSRDPHT 
LSIjPSS*EGRCITPRLGDLLNLFSRDGG 
GIRGTSTLPRQVLNAWPQAILLVRIT 


1673 


15574 


A 


1681 


■ 


78 


RPR I RHEVGQAGLKLLTSGQTPAS VP * C 
WDYRPEPPCPALHISYK*NHAMCGLKCL 
AVSA*RHVLGFIRG\WHVECCFPFCS*A 
GLKLLTSGQTPASVP 


1674 


15575 


A 


1682 


414 


162 


GGPGGPIPGA/AGI^PPPPPLGNPPPP* 
KAKICPGGGAPPVFPGS*KGGGESP*PP 
RGRGPFIGVQPLPFGLGHKRGLFPKKKK 
KI 


1675 


15576 


A 


1683 


378 


129 


QPFGPRNFYQIFNFPG/PPV*HSPLFGP 
KFPPFSPGGGQWGPLGNPRPPGAKGGST 
LRGPRTGGSRGGPPGPGKFFFFLKQSLA 
L 


1676 


15577 


A 


1684 


3 


374 


GISVLPGIGAPGNKPELFEEVKLYNNAR 
ERE KYDNMAEL F / A WKTMQALE KAY I K 
DCV S PSEYTAACS RLLVQYKAVFRRVQG 
S E I S S I DE FCRKFRLDCPLAMER I KEDR 
PITIKDD*GNSLS 


1677 


15578 


A 


1685 


2 


373 


PFIRPET I KLLEE I PGGKLLDLGLGKDL 
LARTPNAKINTWDHI KLKS FCT I KETIN 
TMKKQPTDCEK2 FSRLISDKGLM/SQMC 
KELVQLN* KK/TDDSVKGWAEDLDRHFS 
KKDI KMANRPGKVL 


1678 


15579 


A 


1686 - 


378 


201 


HATCLANF\CSYG*DRVSPFCPGWS*TP 
ELKRS TRLGLPKCWD YRCLGR S LLFPGA 
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residue of 

peptide 
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Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E>=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Iso!eucine, 
K=Lysine y L=Leucine, M=Methionine, 
N-Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














PPDH 


1679 


15580 


A 


1687 


1 


404 


MCVNPGGGACSEPRSCHCTPAWVTERDS 
V* KTNKKKVKRRKKI FHANS / KLKRVGM 
AVL I SDK I Y YKLKKVRRDKE Y I I KGSTY 
QEDISITNTYTLNT*APKYMKDTLTELK 
EEIDIYAIIVGDFNTLLLI ICRGT 


1680 


15581 


A 


1688 


307 


33 


DEGSCHDAHAGLKLLDSSGLPASASQSA 
GILGVSHGARPLISSYNGTSHGGLGPAL 
VTSFNLSHLFKDLLSIiQl/HVTF*GMGL 
GFAGAKLSL 


1681 


15582 


A 


1689 


11 


394 


IFILEARTRISSRTFFKS * INIHAPYNS 
AI*LIGIFPREKK/ STCFYIYTQMFXAS 
LFVTAQTRKQP KCPS TGE \ W S KNLWN 


1682 


15583 


A 


1690 


20 


391 


SEGKGWYSCTKWQSMKLGGITFFFFFPP 
PKNPPPKKSGP/QKGPFF*GKGPPWPPK 
KRGHKN\RGFPKQRARFPKPVFFLIFGK 
RGFLLGPKGG*NPGEKRNPPPWP*KGGK 
NPGNPKGGPELTL 


1683 


15584 


A 


1691 


72 


392 


IKMIGSIiFFGFAFFFFFGKKTPFFPQPK 
R\GGKP* IT*TPPPGN*RNSGPSPPQKV 
GI KA P P PL»P K \ NF * FFGKNG VTPFP PGG 
FEPPTPKEPSPPVSPKGGKTNPAP 


1684 


15585 


A 


1692 


389 


161 


HGGACLRSQLLGRLRREDCLNQGG*GCS 
EPCTPAWVTE +D/ SSQKNQKS KI KKSGL 
DNSFSIG*GILGLSTCDRYS 


1685 


15586 


A 


1693 


286 


363 


DGISL*PRPPGLRQPSQPS\Ll,SNWGCR 
STPLCLAGFFVFVETGFLEVAQACLRQG 
FTMVAQS/ASQKIHIT*GAFETIQVLFY 
WGGVGFGQWRFFFFFETESHSVTQAGVR 
GCSLSSLQVPPPG 


1686 


15587 


A 


1694 


1 


356 


ELLEPRGRGCSERRPCHCPPVRVAEQDS 
VSKKKRERKYLYFLRFNWRSLRIFIYFC 
*/KPLQHNIQNISFTLQNSFGFFSRQYC 
PSPLEI I FLTPLTENLLGLFMKGI IQNW 
FFGVGLF 


1687 


15588 


A 


1695 


3 


298 


KYFETNENKNIQYQNL\AVKLVFRENLT 
WNACVKKEERFQVNNLALY P KN * EKSM 
LNPKGKI IKVRSEKNDIE*KNDEENQ*N 
* SWYFEKI TYWQTLD 


1688 


15589 


A 


1696 


3 


405 


RLWCGWRNRHLGS * NRVEN P E TG LHR Y A 
QLI FLTKVQKQVGEGQPFNK* CGGTWAP 
TGKT/MEQPVKASSSSSSSSSSSSSSSC 
KM*NIVFFKMGENLWDH+AKSYEVRTKA 
* TI KGKVDKLDFI KI KHFC YGKN 


1689 


15590 


A 


1697 


6 


392 


LQRTLLVGLFNAAGNLKLKPMLI CHSEN 
PRALKN YAKSTLPVFYKWI KIAWMTV* T 
VAAWLTE YFKP I FENYCS/EKKI PFKLF 
LLTNNAPGHPTGLMEMYKTTNVGSLPAN 
TTSVLQPMGQRVI STFKS 


1690 


15591 


A 


1698 


390 


3 


AI IESDFLTTSREVAKKLRVHPFMVLWV 
LEPIGKVKKVNKWGP* KLNK/NKKNHCI 
EVSSSLFLCNNEPFLDGIVTCDEKWILY 
HNW* *SAQWLNREVAPK\HFLQPNLHQK 
KVT V I WWSAAG h I Qtf EH 


1691 


15592 


A 


1699 


1 


245 


GGGGEYS KI IAI KTALKNTTYLG IYLIK 
\ DVPDL YTKNYGTMLREIK YLEK * RAR P 
CS * TERFKI VKMS I /LPNLI YRFNTI 


1 1692 


}5593 


A 


1700 


a, 


324 


GTSGTSGTSGTSQTCRISRVRSTSSWTS 
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Predict- 
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acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoteucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop cod on, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














GRHLQCHVSKAESSWFVLSASSCSTGSP 
ISLSGS/GSSLSNSSKNLTLPCLFSSLL 
S*P*ANPVDSAFKIYPGLSSVGSCL 


1693 


15594 


A 


1701 


183 


372 


PGVLLCWPGWSAVTVHQCDHSALQPRTP 
RLK* S P\ STWDYRYTPLCPA 


1694 


15595 


A 


1702 


29 


382 


GKRFPCSWSPKFPLTQGGFPPLHPGQKK 
KPGSKKKKDKSKCW*G/CKEA/GTLTKY 
RQELEMVQSFWK/TVWQVLRKLNVELPY 
DPAFLLGLHPTSTQKR\DTMFMA/AITL 
/ISERWKQPRCTS 


1695 


15596 


A 


1703 


1 


382 


KKVKIIGEAAVEFPDTIKKIIEEKEYLP 
L*VY\NADESGLFV7KKKLQRTFISKEEK 
\SMDRLTLIILCKCSWVYEQDGP*AIiKE 
KGEHQLPVF*L*NKKAWTVRTLFLD*FH 
QCFVPEVRKYVASKRL 


1696 


15597 


A 


1704 


2 


330 


KLNNLLLWNS * VNTE I KAEVS SSLLE IN 
EYEDTTYQNLWDAAKAVLKGKHVAPRHF 
LQEVKK/ RLKRFQ INNLTLYLKEL / EKE 
HINLKASGRK+MTKIGDLFGLYFVLNG 


1697 


15598 


A 


1705 


100 


342 


APKWSIVCPELVGSWSH*/PSRMKPWTL 
TRQGFTPOiARLVLNS*RRDLPALASQSP 
G I TGMSH RTQPLL I NLME I F TE I LS 


1698 


15599 


A 


1706 


600 


211 


SCSVARLEFSDVIKAHCHL/RTPGLKQS 
SHLSKLSS\WD*GRVPHDLANF* IFCRD 
R VLPRLLQAGLE L \ LAS S D P P S * AS EKC 
WNYRHEPTVPRQNLGLLKTOYGWFLKK 
YI LTVS VFSLMCLDHCLLM 


1699 


15600 


A 


1707 


409 


1 


RGPFFFSPGKAFLKLGSWSFFPIKKTRA 
PKKPIFSPVSP/LNFPPKTGFPPVFPPQ 
MGGFFF/CFPSFLFFFLPPPPPFFFSFP 
PPFFFFFFFFFFFPLFFF*DRVSLCHPG 
WSAVALSQLTAALTSMDSSNSPTCV 


1700 


15601 


A 


1708 


154 


2 


IGKPLAGLTK/RKRENT* INKIRNEKGD 
MTADNTE I QS I IRDFVS * RTAHQ 


1701 


15602 


A 


1709 


263 


37 


S AQLHPLN I QNHRQS I LLHBFFLKKQD / 
G/ WPGAGAHACNPS TLGGQGGWI TRSGD 
QDHPG*HISV 


1702 


15603 


A 


1710 


390 


42 


YAGGFRAI FFFPLPREGAKNPNFPVGPP 
PFGGPFFFFFAPSQKKEPGFFGKKGVF* 
GGAKGFPNARGPFFF/G* KKKKGPKNKT 
PGFFL/MGPPNPGGPPPRGEGGKIGAKK 
KKKSIRL 


1703 


15604 


A 


1711 


3 


167 


YTCVFVCLCLCDCMC/CVCACMYICVCV 
CTRVCM*VCVCMCVRVCVQALTVLCKSV 


1704 


15605 


A 


1712 


116 


391 


KRNFFFGPQGGGEGPKFN*RGPPPPGVK 
GI FPPSPPEG/GKKKGAPPPPGI I FWFF 
KKXGVPPCGPGGV*TPD?GGPPPPGPPK 
GGAPRQGPL 


1705 


15606 


A 


1713 


401 


47 


HHYATKP FTHAHTCTC I QDTCNTCMQHT 
QVHT/HTDTHTHTRKVSSVCVLIMAEQE 
RPCPHC* GGEAGAECGVCAWLGLSS * TW 
RWRKGPAHTVRWGQEQSAGCAHGLVFP 
RSYLWT 


1706 


15607 


A 


1714 


1 


400 


CVESCEVD I EMVS CCV/ CSG * SAVCSGT 
ASAHCSLP I PGSRDS PASACQVAGTTGM 
PH/LYPGVPLKPREGLQFTELPSGQL.EI 
QPTCEI?KR*HVPCAL*VQI J TDIRPN*RY 
QFRVAAVIJVKGTRRFTAPSKHFCS 
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beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E-Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=G)utamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possibIe 
nucleotide insertion 


1707 


1560S 


A 


1715 


413 


3 


LNTFEPQRSPFIVNSSEMILLVECLFVT 
SGWIYHERFLNPGREIDWATCYSETGPC 
PVTQDGVQWSNHGSL*PQTPRLK\HPTT 
SVFPVA/RQGL/NSVAQAGVQWHDLGSL 
* CRLPSLKGSSHPRLPSSW^TYRYAPPPC 
I 


1708 


15609 


A 


1716 


421 


144 


RLECSGGITAYCSLNLPGPSSSPASAS* 
VAETTGLLHWKKKI VETGSHFI VQTAFK 
FLDSSDLPALAC/ SWDYRSELLCPACFY 
NFCLFINIPC 


1709 


15610 


A 


1717 


3 


384 


YSACVCVCVWTOVSVHRCMSISGH/ARI 
IRVGPAHKAQSSCRPVPAGCCTSAQTPP 
WCSAATADPPPR*GESLPGAYPGSHSTC 
CPGCCLDW * HSLPSTEPAACRAGAPGGG 
H*AWLGCGGRAGGR?G 


1710 


15611 


A 


1719 


3 


615 


PVGS WARS SGAGV7PPGSPQS VSDGEAGH 
S I PAPRGQCSRHRAEGRTARVCLSHCSF 
SGPRPGLVPIR*SLGRPDAVQAIVPDSQ 
EGRKTGI HAEAVMFPLGPGKGVKGCAGG 
RLCPSSPPSRCLDLGRRGMPPSSGPAGP 
RPSGVGSDLGRPGAGAATSSSSSSSSSS 
SSSSSSSSSSRDGEGP\TGSVEAPGSLL 
GPWLPSQPS 


171 i 


15612 


A 


1720 


320 


3 


GLKWLNLKDNSL YL I LAKWDDCLDEK* 
YADKMLQYTKS VWVS RSRG CQGQVGEKK 
QATEAAQE WELRK \ RL YWRKE CDALRAA 
REEQKELRDVRKAKKVYVCVRV 


1712 


15613 


A 


1721 


44 


373 


KAMGQTLWKTVWQFLTKEEIGPM*QSCP 
T/DLAIiVFLGICTIDLKAYIHTETCTQM 
I I ITLLI I AKNRKKAS CSS VGE / WNKKL 
YYIRTMESYSSLR*NELSSYKKHWGGGS 


1713 


15614 


A 


1722 


135 


396 


AQGLFCTSVKLASEQPLRILFQLDKRNK 
\ FEI YGTSG* L* S III CQNNLQSKFQMY 
HHKIMSSLGAVAHACNPSTLGGQGGQIT 
RSGD 


1714 


15615 


A 


1723 


4 


383 


LNLRAKAI KLLEENIGIHLPDLGLDDLF 
LDITPKSQATK/AKIGN*GFIKLKHFCA 
AKNI I KKMKRQ YKEWKKI FGNHVS DKKL 
VFRIYK*HLPLIIKNSSS 


1715 


15616 


A 


1724 


2 


405 


NSRTSLILNQNLE/ IKLSEKGTWKAKTG 
* KLGLLQQK\ ISKIANAKEKLLKEVKSA 
TSMNM*MMRK*NNFIPEMQKVLVW/I* 
Nl/PLCQSLIQNKALTLFNSIKAERGEE 
A\ KLEATKRWFMRFKE S CLHNVKVQDEG 


1716 


15617 


A 


1725 


90 


400 


SQLLRRLKQNSLNPRESSSKTTTRTKVS 
HWHKNRHVNQ/YNKIENSGINLHIYG*L 
TLNKGDEASQYSS /DILFNKWCWQKKKK 
\YLDPYLTPCTKISSTWTISGFL 


1717 


15618 


A 


1726 


390 


1 


TFFPKI\EKKIL*FIWGPRRPKIANVFP 
*QNKPKIEGIPLPGFKIYYRALVTKTAW 
F*HKNPPIGQRNKVENSETOFHPPSELN 
FFFF\YKGAKNIHWGEDSLFNKWCWENW 
ISI*RRMKLGPNLTPYTK 


1718 


15619 


A 


1727 


3 


365 


HASAKSNLRW 1 1 KLNLRAKT I KLLE ENM 
GENLWDLELSRGFLDRT/PKLYSIKLKL 
RK/ WNFI KI KNFCFSRDT* KS * MAGKNI 
LSGKDLYSEYLKK*CNLIIRQSKTST*M 
FIAALFI IAKH 
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nucleotide 
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correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possib!e 
nucleotide deletion, \=possible 
nucleotide insertion 


1719 


15620 


A 


1728 


368 


2 


RNCLNSEGGGCSEPRSRHCTPAWATEPD 
SVSKMK*NKKINK*IMFLY\EQQSETEY 
FKLPLTLA/SRNMKFIiGINLSKGVQDLE 
TENYKILLQEIEEDLNK*RNKTY*WIKL 
NTVKMKSILSKR 


1720 


15621 


A 


1729 


326 


30 


NPFGGPKKGGSQGREIKPPLPPMGKPLF 
F * KTKNKWVGGAPPVI PPS / SGVLSQKK 
TFTLEGGGPNKLNSPPALP/ARGPKKNF 
FQKKKKKTKQNPSQREE 


1721 


15622 


A 


1730 


1 


374 


I FNADKI A\FWKKF* KPQGTSVGREEKQ 
APGFKAGRNRLTTI /L/GASAVGFMIRA 
AL I CKAAN PQVLKGRDKHQLP VFRLLYN 
KKAWTTRTLFLDCFHQCFVPEVRKYLAS 
KGL VFKVLL I LDNGP C 


1722 


15623 


A 


1731 


389 


1 


FPPKIFFFSTLFFFFPRFFPPPPFL*P/ 
SPPIYFF/ CAPKKKNFFPPPPGKNFFFF 
KTPPPFFFFFFF*D*VCLCCPGWSAVAQ 
SWLTTTSVFRVPVI 


1723 


15624 


A 


1732 


118 


422 


D I TTHLFKWLKFKK* EKGLN I LFTKEDM 
QMEKNLKRCLT* FWKELQIKMRYHYPP 
IQMAKI * KN/STIS I AWQG YRTIGTLF/ 
HC* * EQPFW* FLSKIiNMILPYNPA\IML 
LS I YPNAIiKKHVHTKTQyi * MF I AALF 1 1 
TKNWKEPRCPSICEW 


1724 


15625 


A 


1733 


407 


1 


NIKGPFRGPLIQWGLLIWPKDSFPILGY 
PPFPSPKISFFFLARSCSGAPNHFPLPN 
QSPCFPQPSFFLGEEKKEFLPGYSLAFP 
FN/RFGGS *RVRKGNGGPI PMGESFLFF 
F*DRVLLCCPGWSAVA*SRLTATC 


1725 


15626 


A 


1734 


322 


362 


TAI Y I Y 1 1 * LFTDW I PSHPRTW I EIYKE 
INIFVPANTIPIVQHRNQGVICTFRKTI 
TVTDCDPANGSGQSKLKTWKGFTILDAV 
KN/IRDSWEKVKIG/TIN/GVRKLIPSL 
KNDFKRPKT 


1726 


15627 


A 


1735 


49 


395 


RGGPGFFFFFFFFSKKSQILPPGWKGRG 
EPRVNGTP P PRGKGNPPAQP PQEGGKTG 
PPHKPG* FLCFLKKKGVQKGG*GGPQ/A 
PGPKGAPRPGPPKGGEKREGPPGPTRPN 
LYYAH 


1727 


15628 


A 


1736 


417 


2 


FLFFFFFLFFFFSFASGPEILFTCL*HT 
H I HFLFFYSKST * * P PVFAGGMFQDPQW 
LPETKMVPNFKKKRTTLT/YIP* *KLCE 
CDLSNFFCFF*DRLLLCRPGWSAVA*SR 
LTATSTFQAQANRTRG 


1728 


15629 


A 


1737 


316 


338 


FFFLSFYFETESYLHHP+GFIVKLSKVK 
D I EN \ LKKTARGNYQVTYKGAS I RLAAD 
FSAE I SQAWREWDNMFKVLKE KTNWQPR 
IYKTLFVPHF 


1729 


15630 


A 


1738 


197 


379 


QKRAQIDKAIFI CRDIAL/P* FQ/MYYW 
ATVTKTAW * WYKNRHI EQWNRSW \ PPE I 
KSQSYSHL 


1730 


15631 


A 


1739 


4 


401 


RGYRHAPPSLANFCI FSKD/MGFTWLN 
S * PQ/ CNLPASTS * SAGITGI SHCTRPQ 
MATFLI GPHKI I P WS VLWPNL 


1731 


15632 


A 


1740 


94 


117 


KDRPMVPPVGAGEDQADEPCRGHASLWS 
QLVSAPTTP I PLPGRDVPSRATPFPAAL 
AAQQPP*ASPYPLPPGLGAGHASASPVT 
VPFSPISESTGS*ESAL/PAPRPGGSG 
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Predict- 
ed end 
nucle- 
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correspon 
ding to 
last amino 
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residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L^Leucine, M=Methionine, 
N=Asparagine, P=Pro!ine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine ; V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *-Stop codon,/=possible 
nucleotide deletion, V=possibte 
nucleotide insertion 


1732 


15633 


A 


1741 


3 


390 


VDQFL I SHDLS KLTEDEVHNLNS STT IG 

\evefkvkkl\*kkks£gpdgitgkfy* 
tvreevtpil/ sylf/heiekeetlvns 
fyear 1 1 h/ 1 pkpdknkp * tn idaktts 
kvlpnri wqyvki i iqkn* vgfv 


1733 


15634 


A 


1742 


3 


442 


DLKSRVEPRVRPS VRKQ* VVLLKVLI CA 
SKDTLKRAKRQP I GWEKI FVNHMPDKDL 
IPEYKINMQKSGVFLFTNNSYSSTTDND 
INKWAKDLSRHFSEEDIQMANKHM\KR* 
SVSLVIREIKIKTTGR*LFTPKCWQG*G 
EIITLVH 


1734 


15635 


A 


1743 


411 


2 


LPPFKHPPPEIILGAPKKKITLPPPRPK 
KCISLKGPPFFFFCRYRVLLCCPGWFST 
PGLKQSSHLGLPKWWDYRHEPYCTQSSF 
SLSFFLKQTGR*WFDLSNFFFY/CYCFI 
YFSRDRGLTLLPELVLNSWPQAILL 


1735 


15636 


A 


1744 


1 


393 


RPGGPPKGRSREQGREGERSRRRRPRAP 
WARSHMWGARVFS VPRS FDQRPQEKCV Q 
SSYKQQLPARARDGTGNLIRGAPLFFF* 
DGVLLCRPGWSAGFKQFSRLSuPSR* DY 
RRTPPHPANF * ML»/ CLRRSLT 


1736 


15637 


A 


1745 


395 


0 


PSAPSFFSTRL*LGEPPGFPPPPFLKPP 
PRN/SIFGAPKKKFFLPPPRGKKFVSLK 
GPPLFFF* DGVLLCCPGWSAGFKQFSKL 
SLPSS*DYRRTPPHPDNF*MF/ CLRRSL 
T 


1737 


15638 


A 


1746 


397 


1 


CGNFLKREKNFEARRFLQK*AARFRNIR 
*VTPEITAFCPL*HVASFLVFIiTPNFPT 
I PQL Y CLE PLGEMGGSGS KLP P F PTS KT 
PNPLI S VNLCF P \ AI KWENFFF* DGVLL 
LH PGWSA VAQS R PTATSTS 


1738 


15639 


A 


1747 


392 


2 


FTKKGGRGGNLS P/ LPPQKI F*XKKTLK 
KPFFWQRVWFFSPLFFEQKGQGPFFFKK 
PFF*KRPPDTPQPPPPSFIFFFLLPFFR 
/HLVAQAGMQWRHLGSPQPPPPGLTQSS 
QLELPHTPPHPDNFCIFGRDR 


1739 


15640 


A 


1749 


33 


403 


IKGFKKKRGKGGAPQKKGGTGAXPPPPP 
PQGF*K*QKRKNGSPPNVFFPNPGGPPP 
PPPFWGDKRGGPPRGGGAPPPRGKRENP 
FFPTLA/ HGKKEKKK 


1740 


15641 


A 


1750 


3 


396 


KRQTTNWE KV FA* KN\ ADGL I SL I RRKC 
LKVKK/W*R/DMNTQFTDKGILMTNKHM 
KRS STSL I iMEMQHKAG VI FHPSDWQKC* 
STDNTQS * QGKEEMYTIKHS W* TFDYQQ 
PF/SESNL 


1741 


15642 


A 


1752 


30 


419 


NEG I GAGHE VS FAANKCR SMRKNVEVLA 
LNRRLDGELLSGLTSTQALPGWAYLHLL 
SHHAVRPLFLCFKRGWVLLCHPGWSAVA 
QSQ\ FELLGQV ILRPHLPS S * DYRS I P P 
CLANFKNFFRERARYACR 


1742 


15643 


A 


1753 


16 


410 


VGPKKSLQQWAAVQATLPLES YDLAHP 
1 1 LKVSLAORDAI * NLWQ I P I VAS* YI P 
LGF/YSKAMPSSVDIYSSFEKKKTHFFL 
TGGGGPPLYSNYLGRLGGANHLTPGVKN 
QPGQFGK-PPPLQKVQTLAGRG 


1743 


15644 


A 


1754 


2 


17 


NSSLIKHRRIHTGERPYQCSECGRVFNQ 
NSHLIQHQKVHTR*RMYI*SR/ CGKDFT 
OKSTLJ*H 



644 



WO 01/64835 



PCT/US01/04927 



SEQ ID 


SEQ ID 


M 


SEQ ID 


Predicted 


Predict- 


Amino acid sequence (A=Alanine 


NO: of 


NO: of 


eth 


NO: in 


beginning 


ed end 


C=Cysteine, D=Aspariic Acid, 


nucleotide 


peptide 


od 


USSN 


nucleotide 


nucle- 


E=Glutamic Acid, F=PhenyIa!anine, 


sequence 


sequence 




09/515,1 
26 


location 
correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


G=Glycine, H=Histidine, l=Iso!eucine, 
K=Lysine, LHLeucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R = Arginine, S=Serine> 
T=Threonine, V=Valrne, 
W=Tryptophan, Y=Ty rosin e, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


1744 ! 


15645 


A 


1755 


138 


380 


XPXXXSPXPPPCPXPLSSLFLTKCSAY* 
TPAKRPPPPGPFVPPPKPCCPPPPPLRP 
WPPLPIjLTNPI PPPTLSFHPGPALS 


1745 


15646 


A 


1756 


194 


3 


WLCIPIRHTTEQQPGFPP/LIIWSQPYS 
PLFCLFVF* DTV S LCGPG WNAG VQ SGLT 
AASTSQAPSL 


1746 


15647 


A 


1757 


2 


403 


RVLFS PTLAYT YLLF YLGAT I LMGVN I W 
KQPNCPTK/ GQQKIKLQYIYRMEYYSAL 
KKK* ILLFAIR* VNLGDIMLSEV5Q\SR 
KKNIVLSHMW 


3747 


■ 15648 


A 


1758 


398 


65 


FFFFFFFFFP* TKGLGCIHRCDHGS PQP 
RIH\GSSNAPSLAS * VG3TTGAFHHARF 
TLIQSSSVHVSTHTLHPYPSSLP 


174R 




A 


i / jy 


456 


j * 


FAKR I TD KKLLS L I YLAT * KG E I TKAIi I 
KQ* \LKAQAKNSLKRHTI VLNHMKL./CL 
ILLI I KEMQIK/STLRYHFFFITLAK\ I 
*KLGNTFCWQGL*GTLIHCWWECK*HNS 
YVGGI WQ/ FSNKLYVQI * YNS AI SNLDG 
RVGRPG 


1749 


15650 


A 


1760 


3 


378 


QFQYFY YNGSVI KAVW YWSKNRK\ IDQ* 
NRTES PDKNLHKHMQL I FDKGTNTPQ WR 
KDDLFKKWCWNN*TSTC3KKKKKKGGGP 
* KEQNLTPPGWEDI I FLFGAPKNMPGAG 
VKTRWGGKNPGFPQ 


1750 


15651 


A 


1761 


69 


384 


YTSASV?GGARYTASAAGWKTLliLLF r LFI 
*DRVLLCHPRWSALTQPRIIAASTFQ\V 
KQSFWLMHIGDWDYRRCMPPCRANFFNF 
LCKKKKTLRRQEVNQTPALVRV 


1751 


15652 


A 


1762 


390 


1 


KFSTPGNKNLFFLKAPPFFFFCRGRVLL 
CCPDWYSTFGLKQSP/ CFSLPKCWDYRR 
ESPPQAFFVFFGLPSLPFSPPPAPSLSQ 
SSSFFF*MSFHSFAQAGVKWLN\liFGSL 
QPPPPGLK* FSCLSLPPTRP 


1752 


15653 


A 


1763 


2 


390 


PRVRGFFMRKFVDSYLVPYTKSNLKWTI 
DLHVRAKAINLLN*NVRIHLYDRELCNG 
FLEMI KKTKAAATTTKLDFI KI KNFCAQ 
QMSS /MKVXRQSTKWKKKCS YH I SDKGL 
VSRKYVKKAYNSS IRTQSH 


1753 


15654 


A 


1764 


334 


3 


WSKRSGPPL.SKNQTKKXNATKPQT* KNW 
INEIGPII/NTSPSKEKTGSNGFTD*FY 
* TLKEELLS I LTjKLFQKTEKSV I FP KS F 
YVKDHSSCLSGIKPKDANMEQHMQINQC 


1754 


15655 


A 


1765 


259 


1 


KSTLFLMKXYWKGSFLKKHFFLETRSLS 
D/SPG/GVQWRDHSSLQPRTPGLKQSPH 
LSLLSSWDHR*APPCPANFRFRLKTRIG 
RDV 


1755 


15656 


A 


1766 


402 


386 


FKKKSRLRKI / KVQDEAASTPDLAKTT / 
DYEGHYTKQQIFHVDERAFYWKKMPP/R 
TFIARKKKSMPGFRASKDKLTVLSGANA 
AGDLKLKPVLTYHS / ENPRALK/ H YAKS 
TLPELYKWNTKAWMKIQXFPS* FT* IFM 


1756 


15657 - ! 


A 


1767 


2 


406 


PRVRPRVRKLITLLKVSQRWSSEKKKKK 
KKTKKKKKKKKRGGGRL* KKKKKKPRRG 
RVNLFFWGPKKSTPPRVFKHRGGEK/PP 
PPPPKKPRERKPSLGVGSTWKGISPIKH 
SKKTKSTK 


1757 


15658 


A 


176S 


14 


409 


IASGFLFFIYFGVIGRPPKRGGVFFPGG 
GGAPKPPQGGC* KKKRGGGGGP P ? ?FPP 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 
C=Cy stein e, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=GIycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N-Asparagine, PHProline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop cod on, /=possib!e 
nucleotide deletion, \=possible 
nucleotide insertion 














SLGGLGRPRPMSKIKTFPSAGGKPP/PF 
LRNQN * TGGGGPPQ * PPLFGG PGGKNFS 
PPRGGGGGGPKTPPPPPPGGE 


1758 


15659 


A 


1769 


305 


1 


TKTGPFPSNRAKKKKIVFQKKKKEKKIN 
CFVTAP FKG I KTE ATD * E KI FAKHLSVK 
G/ LVYL* YI * I S KI YKELLKLNNEKTT I 
PIKK*AKDLSRHFTRYTD 


1759 


15660 


A 


1770 


148 


402 


FS P VS LG * GRGN I YAGMS NV * EVP P E I D 
FQHEVKRALQTS FQ VKLVKI I FFKSTI * 
KSLAK/WLAWAHAClSrPSTLGG*GGWIT 
RSGVKSEPGQHGE 


1760. 


15661 


A 


1771 


250 


2 


KKKKEKKIICFVTDTFKGIKTEATD*EK 
IFAKHLSVKG/LVYL* YI * I SKI YKELL 
KLNNEKTTIPIKK*AKDHSRHFTRYTD 


1761 


15662 


A 


1772 


407 


1 


KKIRRGGGQPPLFPLIGGGREKKFFFPG 
KRGF Y * T KAG P FP S S WAKKKKFVFKKKK 
KEKKIICFVTATFKGIKTEASD*EKIFA 
KHLSVKG/LVYL* YI * ISKI YKSLLKLN 
NEKTT I P I KK* AKDLSRHFTR YTD 


1762 


15663 


A 


1773 


1 


406 


KKKKKKPTTFPGLPFFFSLSLPPRVPGA 
RGPPSSFPSKPGLGC* IKKSGVSPAPKK 
KKPRKTNPKNFPRPNPKGGGEKFFNLPT 
HPPGGTPPFF*RRRENPPGFAPQKG/EV 
FSPGGRNSRGBPRGGPKKKKKKGG 


1763 


15664 


A 


1774 


2 


378 


AAGEWLHQGSLQSLPPGLKQSAPLGVSK 
W*NPWHDPPPPAPRFVWWGGWLRWS 
FFLGAQAGIFFFF* IEMGSHYVAQAGBE 
/PPRLQGSS*LNLPSSWDYRPVP 


1764 


15665 


A 


1775 


1 


431 


QQMRDKRNLFEHNK*GIRGIYLNIIKAR 
HEKPTVDTILSGESFSSKIKTMLISPFL 
FNTVLGVIA* ARKRXDI * VGK* EVKSYM 
FTODMI \liDNPKDSTPKKKKTGYFMGGP 
GSKPPPQRGGAFLSIiTRDPLEREFPKTA 
LFTLGQKKIKGPKFFS 


1765 


15666 


A 


1776 


334 


402 


KGGGGVGG*QGPWRLAHCTDK\KEERKR 
ERERKRQRKKERKKERKKEKKE *MKNNK 
KKNDK 


1766 


15667 


A 


1777 


406 


3 


SPSSSSSSLFSPPPFFWGGPRF/SPPPP 
VFKPPPPFFFLGPPKKKFFPPPPAV*FF 
FF\LGPPPPFFFFFFFWEAGFPFFSPG* 
GP/SGPMAGFRSLPPPGNSLSKKKKSEG 
hGEGGNS VLTR VLL ISSYQI PGNPR 


1767 


15668 


A 


1778 


70 


409 


LISFLVSSLIVRLYRPLLCLFPDPSKGH 
CIPCLLPLFWILLHIFLFMHSFIYCLIN 
DRVLLCLPGWCAWRPRLTAAS ATQ\ I K 
RSGSHLSLPSSK* \ WDHRRCPGFFFFFF 
FF 


J 768 


15669 


A 


1779 


390 


31 


SHLSLPKCWDY/RL*ATTPSQKILVFTH 
G*VLSLLSLSFLIPPD*TF*KMSLMRPS 
LTQKS S 1 1 QHDLLDKVPITI FLRQSL/D. 
S / VTQARMQWHDLGSLQPQPPGLKLSSQ 
PQAlASSWDYR 


1769 


15670 


A 


1780 


357 


1 


LTLFWGAQYLPKKGKGPFFLLSLPSPSV 
WGPLPQKKKSPPLCFFYFLNRVLLCHPG 
WIAWQSWHSSAHFSL/VLTRFK*SSCL 
SLLSSVmYRCTLLHPPNFLNFWYRQGNA 
VLPKLVL 


1770 


15671 


A 


1781 


122 


254 


RKNE \ WSG AVAHACKP S TLPG P GG Q I MR 
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nucleotide 
sequence 
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SEQJD 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQJD 
NO: in 
USSN 
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26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Ptaenylalanine, 
G=Glycine, H=Histidine, l=Iso!eucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, ^Proline, 
Q=Glutamine, R^Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X^Unknown, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 














SGDQDEPG*HEPPRPTDSFFLYPSH 


1771 


15672 


A 


1782 


45 


387 


TQTPKLRRFFHFSLLKSWHYRCSPIAPQ 
HNGYFIiTNLDIATSMFFFKDRVSL\LAQ 
AGEQWCDPSSLHPQTPGLRR\PPASASQ 
EGETTGAHHHTWRNLiIYFFYYTYKF*GT 
YPMA 


1772 


15673 


A 


1783 


1 


401 


FATLARLVSNS * PE / CDLPASASQSAG I 
TVSPAHLAISPSFPAMPSSGLGLSYPAY 
HPGLGLRCHLPVLTSPWTSDTGPSSVLP 
DAGALHCPPEPQHICPL\LSGWLQTP 


1773 


15674 


A 


1784 


432 


1 


FLFLFFFIKKKTPFFGAKPKNRGF*KPP 
EFFQIF/S/ CPVFLGRFPKPQKEFFFPE 
REMGFFFLFS PQGGGPARGFPPPPLNQG 
EAPRAGHKKKGPNLGGRTFFFFFFEMES 
\FSVTQAGVQWHDLGSLQPLPPPFKRFS 
CLTHAS 


1774 


15675 


A 


1785 


15 


434 


RLSLSCCGREEHSTLPGAPWRCTEIAWA 
DSPDPAPSPPSALPSLLPFHVYRDVCPV 
LCLRGWP* *MVERGRLGISPTWLLGWPF 
PGGA/ PHIKPE* YFLFAYTI LRS VPNKL 
GGVIiALLLSILILAIIPILHISKQQSII 
F 


1775 


15676 


A 


1786 


1 


258 


CWPETPVLK* SHLDLTKR6EDRREPVWA 
ASTTI FFFETGL/NSGAQAGVRWVHLGS 
LEPLPPIPSLFMTP*GPCPGLGSSWPLR 
EF 


1776 


15677 


A 


1787 


399 


63 


SLHNQWKSTPTLKTSKKI SH I * ST WPG 
WAHACN SNTLGHHGGRTA* /RSGVQDQ 
PGHHSETSSQRLRNP Y IKRC I KYLAKS K 
CCINDSFTVSVTSRKLIGKREVSPNNIT 
FR 


1777 


15678 


A 


1788 


3 


474 


MSISPVRWNSKEAGRAANRQFPSFSPWK 
DDSRDASPPEPASPTIG\PIRRLAESSW 
TWGSPCAEHPRARAGRRKAATDCPWAAG 
SQWRGPAGQGAPRS CLFPGSRTAARAQH 
PRVAPPPPPAPLNTRASALRSQLPRNPL 
*VMTPRPPAAAPRSPVGP 


1778 


15679 


A 


1789 


66 


395 


LVQPLFDFIWHSRLSLWSGRDLLWSGKE 
TMNPCLNHSIGVLQEWQGSDVKKRRRLM 
ESLTGPAADVT R I LKSNNPAITTAECLK 
ALEQVFGSVDSSRDAQIKFLNTYQNP 


1779 


15680 


A 


1790 


413 


1 


PSPRALITDYS*EEGPRFWQV\EKKGQP 
LKPHPGLGSPKQESPRVGPPKRGGYNPF 
* KKRAKFFAPGENKGPPFWTGRDPTF*G 
KTKKKTNPPKKGGKKGPPPKPGQFFFFF 
LRRSLAWSPRLECSAAISAHCKLRL 


1780 


15681 


A 


1791 


314 


■ 


KTKP F FLKKTP PKKKNKKKGSC* DKARY 
QTRKG/IVNLGH/HPSFLFYFILFFETE 
SHSVGKAGVRWLRANSLQTPS PGFQQFS 
RHSKPSRKDYRHPPKRPQECVQ 


1781 


15682 


A 


1792 


104 


409 


EKQS FAFDWHFDL YFDNVK * KEGEES KA 
GEFNASTGWFGNFRKRL/RFKNVRVTGE 
PASVTQEAADEFPDNFKKITEKKGYLHG 
KFLMYHEAAKYLNFWCPTK 


1782 


15683 


A 


1793 


392 


2 


GERERDRL* REREREREREREGERERAR 
ERERARQGTSTVESRF/HSYCRRDQDVA 
RPKAKGE VAAGRGS PDGLQVGRGQ * PG P 
SLRPGPWREWGFATYLACGGP I PTGBVE 
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M 
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ing to first 
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peptide 
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Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=G)utamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, J=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Ty rosine, 
X=Un known, *=Stop cod on, /^possible 
nucleotide deletion, \— possible 
nucleotide insertion 














SGEEWAQQGBWGQALGPP 


1783 


15684 


A 


1794 


405 


119 


IIKMAIVPKAIYRINTISIKVPMAFFTK 
\LKTI IKFI *N*KRARIAKAILSKKNKA 
GS IPLPDFKLYYKAI VSKPTWCWYKNRP 
IVLVHSRTARKK 


1784 


15685 


A 


1795 


455 


3 


csvtqageqwcnhsslqpqppglkws/c 
psqlpk\wnyrhvpprpaltahpaltad 
f* rrkeyklmrhrgkkwhdftlrs kkmk 
amrheyhsfltypnghlylhwqffffdt 
eshsvsqagvqwhdlgslqppppgfkrf 
krvg pggns gad 


1785 


15686 


A 


1796 


2 


134 


PRLQHCTPAWATPQDSVSKRKRDMTTDP 
AI * KG * * ATTTN I Y I HKFYNLDE I DQFL 
KKHKL PQLTWYE I / DNLNS PITR\IEFV 
ILNSKKKYPGSDGFTGEF*D*FCIiETKK 
RYDYRPCNLKRI ISDYNKHLHT 


1786 


15687 


A 


1797 


1 


404 


PTRPLTGSSASGMMVEIFPKTYLISSAC 
WVKISKIDLKLFSPFFFFGFLEGGFYFC 
PPNSRGGSPGGKFGLIEPLPPGLKGIPP 
PPPKRGGEFGPPPPTPAYFFFFLWGGGV 
/PP/AVGGGGKPPI*GNPPPWPPQG 


1787 


15688 


A 


1798 


2 


383 


SG WL WACRS PDS E PLACPAG PRQ * GAYH 
PGQQLVJMVR*VSQQCWPTPPALHSIS\P 
RP * LPPRRAGLGTCS PP/VPES PLLPPW 
APAWPKPPRRALPPA/PPVPGPIDRPRA 
EECRRMVRDWQAAPPAAP 


1788 


15689 


A 


1799 


35 


410 


ATGPSLGKVCS*AFSFIj*PKLDFWHIiYF 
QASGFLL/CHNSPSQTNSCSSFGKHYGM 
VLRVNSL* PDHPWNEFYFLNFFI FYRDS 
/SLTILPRLVSNS * AQTILLPLP/PKVL 
GLE 


1789 


15690 


A 


1800 


65 


415 


KKGVLKGGPPPQPLGV*GPPPEKGGPGP 
F\GAPKEKPPLALGEPKGKPF*RGPKGF 
FFKKGLE PGAP P P PKPRGEKPP VFKGPH 
PFCKKGGGDKREKNGGL*RKKPTELGNG 
PPPPG 


1790 


15691 


A 


1801 


417 


3 


NLGPNI FPPRAPQNWGPAPPLFFFLKKN 
FTWGGGSTPLFP / LNLGGLGGPFPGLEV 
YAPP S PHG * P R F FFKNQ KL P P P VWGAL Y 
S PFFGGWEN\RKAPPGQTLFFFFFFLK* 
GQDLPMLLQLVPNYWAQVILPPWPPKV 


1791 


15692 


A 


1802 


1 


431 


QPCTPGLK* SSCLSIiPSIGDYRC/RTTV 
PS * F F FF FLEKGVG F I PRGG I KGLDNC * 
LGPHPPELK\NPPPQNSQEVGTTGPPPR 
PG*LFFFFETFFFF*KGGFFILAQP*MK 
WGALKKTPPAFSKRGGYRGGTPMEPTHC 
FLKEN 


1792 


15693 


A 


1803 


256 


399 


AIKNIHDS*EEVKJ[STLAGIWKKLITTL 
IDDFDGFKTSVEEVTVDVME 


1793 


15694 


A 


1804 


407 


2 


FEKANLFPFLFKNSGPPNVPGRGMAGVP 
KFCCPSKRKVPGPIFFLRRLFIiFYPKCR 
RPPLFWPAFGP* KFFLPQVFAPAF* KPS 
PKKKRAPVFFFFFFF/RDKVWLCHPPWS 
AMA* SQLTVTSVSWAQAILLPQR 


1794 


15695 


A 


1805 


429 


116 


LLTKKKRKKKPLACGESGLGGP*GTVTG 
V*QAEDTHVIW\VLSSAPSLSSEEMTDS 
M PGHL PS KDS RYGMEMLTDKKWTWDGG A 
WDSSPOGANGKRGAROASGFS 
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SEQ ID 
NO: of 
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sequence 


M 
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NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 
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correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-A)anine 
C=Cystetne, D=Aspartic Acid, 
E=Glutamic Acid, F-Phenylalanine, 
G=Glycine, KNHistidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X^Un known, *=Stop codon, /^possible 
nucleotide deletion, \= possible 
nucleotide insertion 


1795 


15696 


A 


1806 


2 


408 


FVI F VFL VETG F \ S I LARALACAI SNS * 
PRDLPTliASQGAG I TGVSHRTWPRRS CV 
F * E AFLTMLMVLH I P PSLL WHSVFTHLL 
ALSS * CFLFFLRGPWAVTQAEGQGHDLG 
SLQPPLPGLKWFSCLSLPSS*NYR 


1796 


15697 


A 


1807 


1 


196 


FRLGASLDLSGGCSGVSWTPAPPLPPPP 
PPSAASGI S /SGSTS AAAGL* S CRTAFF 
SFLSS FFFFFLKKINPFPFLGG I * PFKG 
GPRLV*GNI*PPPQG\DFLGGTRGGKKP 
WGGGKFGQGGNFPLPLKKPPGPQKNPPP 
\ PP P P P PS AASG I SMAAHLQLPACDRAA 
QLS FLSSHLFFFFF 


1797 


15698 


A 


1808 


395 


3 


LGKKMKNPKFWQGCEATGTL/ M/HCWWV 
CKFVQSFWNTDS I Y* G* AAATHDSAMLL 
LGMHSMPACTFVHQKT*TKMFIAALFIIj 
PLNWKQV/ RCPS VI \DGYI PTMDQSTAM 
KMNKLHAKTWMNLRNLMLNEKPR 


1798 


15699 


A 


1809 


7 


454 


IPGSTISLQPPPPGFG*FFCLSIiLSSWD 
RQPAPPR/LANIiRR*T*LQSA*LWR\RG 
P ILDEMKS FMCNCNSLTEGVKGRVEMMS 
QNGRLLTKFCHVGQACLKTPDLK* SARL 
GLLKCWDYSCEPPCLTQMP FS FFLFF * D 
RVS/PLSPGWS 


1799 


15700 


A 


1810 


20 


355 


PQCAHGCRAVAPVCVCFCFCVCVCSHCV ' 
SMCMCGEVSAGWTLLCLCLST\ CSGAYA 
CDWGCG/ C YSACVCARVCACVCAQLLTC 
IGMWE *GQAGQGEVLLDLPLCLCSCWAP 
Q 


1800 


15701 


A 


1811 


-> 
.> 


414 


SSKNDNNSLQEFMDKIAGMKKNLGNLTE 
LNNTV*EFHRAITS INNRINHAEERISD 
LENWLSEI \NRQT* KI VTRNEQKLREW 
DYVKRLNL* I IG\ VFEREGGKAYYLQNI 
FED I VHENFPRFARDANSQIQEMQRT 


1801 


15702 


A 


1812 


1 


443 


AGKSPSPKKFKKGPGGGRWS* FLGGAGG 
GVPFRPG I QGSNYRFF/ SPPPPPLGEKK 
KPPFKKKKKTPQKLQNGNPHPYFNPRVH 
GCREQQPWEKAPDATRQPHYADKHVEAG 
EPREPPKPHSEPLFSSPRKRPLFTRSST 
/SGASPP 


1802 


15703 


A 


1813 


411 


66 


WKNNVFNKECLER*IF/IIQKKKLDP\F 
LTRYIKIKSK*IKDLNIRLEIKKTPGKE 
SVTLAKWLIRSPYLSFP*QIQSKQ*INS 
YVLIKITKEERWSTSKEYQKPWRERKAM 
YKIK 


1803 


15704 


A 


1814 


369 


20 


QEVRPSIYLSSSNRKYVKDSNARFTKEA 
IQIANIHMKNCPTSLIVGETHIKTSKGY 
HYVP IRMAKILKD \CNRROGETGTLIHF 
* WEWKMVQP FQKP F * WFP KKRKI H * QFD 
LATRS 


1804 


15705 


A 


1815 


2 


675 


GLAILGRRLRGEACTRSPFSSILIFMVS 
MGEWPGVP/ GIKGTRTGAEAVPTRRRKS 
SIWPQTGGAGESSG/PLRGLPRQGKPGS 
PRGPSGPPQNCARVJWHPQAAPLGACCFS 
GPEPSRLPPWRCGNWCLPSTPSSSA*EG 
WR/PVLQPGFL*SPSSLASICPGAERGP 
PGSSRP/ GLRGAPGG 


1805 


15706 


A 


1816 


273 


2 


MESHIMWPFTIGFFHLVQCLHGPSMLQC 
IAVLHSSYLFIYLFIY*DRVLLCHPGWS 
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AWQS * LTGASGS W \ LRRSFHLSLP * C* 
DFRHEPR 


1806 


15707 


A 


1817 


40 


404 


GLHB \ PGVQGCSEL * SHDCTPAWVTSET 
LTLKKLKI KKS KKKKRANPRGFERPTFG 
SAGPGGLLKA 1 1 SKPRVI QGTQKSGAIiL 
GFNSNGGENHPPQKQPYLGGLYAKAHSP 
RGGGPPSPCG 


1807 


15708 


A 


1818 


392 


3 


EKYNMSYDIKSTNHRKNCKLDFIKI/RN 
CCSLKDTINKMKMQASNLEKI FAIHMPD 
RGLI FKRKNSCNLVRR*QPPFFKEAKDL 
NTQKS Q * TNG /HGS KETSL 1 1 REMQI KT 
T/MNYTTSIPT*MLKIKKMNI 


1808 


15709 


A 


1819 


311 


431 


EWGRAWWLTPVTPSLWEAEAGESRDQE 
*KLCTTVEK*KTISN/HDVPIRSSWTGM 
VAHACNPFTLGGRGG* ITRSGVGDQPD* 
HGEGL 


1809 


15710 


A 


1820 


68 


410 


AKKNQGPGPMVFGFGGIKPPPKQKKVRG 
GFFAICPKEQVFFFFFFWRAWQSLNHC 
SLQPQPPGLKQFKQSSHLSLLSS*GYKH 
VLPCPANFILFSLVETGS\SIYFPGWSQ 
TP 


1810 


15711 


A 


1821 


408 


1 


T P F F FLRVLRLT P P LLGN F FG PG FPP WG 
GFS PGPLLKGPRP/ CFPI FKP IFQPGKW 
GLVFFFFPPFFFYPRGSR*NLKKIFPNF 
PFFFPVFF* ILNPPFFFFFFFFFFFL*D 
RVSLCRPGWSAVARYRLTASSTSQ 


1811 


15712 


A 


1822 


362 


76 


S E I APLH FNVGDRVS LHLETNKQSNKQT 
KKTLIFRDRVLLCRPGKNA/VEVQ* *LP 
AASN/FliRbKQSSCISLSSNWIYRHAPP 
HLATVFNFLIIFE 


1812 


15713 


A 


1823 


314 


2 


VISKPCPRELTCITYGVSLTQCSMFGRM 
KGLLLIWPPVCEVRRASGRPPLMGSEEP 
LCPAATPSGRCTQQ/LH*ERAMMTMAVL 
SNRKGGNVGKR* RNQ I VAVS 


1813 


15714 


A 


1824 


57 


389 


NLHLLQL PT YTD AD S TG PTLS GMNVKNL 
HWSYEYKYRSITGVQWLILGSLQPLPPR 
FKCS CLS FliRRWDYRCAPPRRATF\ *FL 
VETAFLERLTSCDLPTS ASQSADI TGV 


1814 


15715 


A 


1825 


410 


70 


VPIMSATQDYRHEPPRPAGRFLKKLKME 
PPHNPALLLVGI * PKNMKSLHKDVCTPM 
FSGTLFAIAKIQKKPNCPSSMDEWINCR 
N Y\MH I YDGI C YS ALKKNE I LARRSGTR 


1815 


15716 


A 


1826 


2 


411 


FLVEMGF\SMLVIAGLKLPTSGGAPASA 
SESAG I TDVSHRAW P VFFFLKRCIiVIiVG 
RS**A\WPHTNLIPPLPSGIKGDIiCPNS 
AGGWEKGAPPPSPGKF/ cef*grtgttn 
FARG / WTKTPD 


1816 


15717 


A 


1827 


276 


3 


GRPGPADFR VRPQLLQRFLFI ylfteme 
SCSVTQAGVQWCNLGSLQPLPPGLQ* FS 
CV\K* FSCLGLLSSWDYRHMPPHLDNKS 
IFSRNGVS 


1817 


15718 


A 


1828. 


1 


391 


leprrrflqcvqdcatalqpgqqs ktls 
qkkkkkkggpp/s*yqkgqgcpsgkkgr 
gvagkgafgpgfggenktppgggptgeg 
pfpqkgwgpsqgptkgpnlwgpggpkl 
ggkggppgptkggggpssf 


1818 


157)9 


A 


1829 


2 


134 


DHLS PVVHNQPGQHSEAPSI//X'INI WKL 
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AGCGGAHLSSHL/LREDHLNLEVQGCSE 
P* LHACT PAW ATE * DPVSQTKLN*N I WK 
LAGCGGAHLSSHLLGRI T 


1819 


15720 


A 


1830 


427 


107 


VQHQPGQQRESPS III IQKLARGGARCL 
*SQLPGKLRLENRFN/ SGSERSHQCTPA 
WVTDRDCLKRTG I TRAS SQRLSVG IKQS 
CLNPRTAQLQVSAQSPSTVSTNL 


1820 


15721 


A 


1831 


3 


540 


VQFPNFKIYYKATAIKTVLXQHKQR* ID 
E/MNKI * TPEINSYVYGYLNFNKDAKAI 
QWGNDS F \ FKKWC * DNW I SACKYSQTS V 
SASSASSSSS 


1821 


15722 


A 


1832 


385 


2 


AGRQSETPYKNSTIMKVSLQVCVDLSCL 
LGPGSLVQDSPNI PAQRKRFLRLGS FVJL 
PLSRFSRVGWPLPHWGQRSSGFSLPR\P 
P*SQIPAPRSPPPAGPVPARSWVCGPRP 
QTRPLPAERPSPRPRRL 


1822 


15723 


A 


1833 


7 


399 


RISRSYL3EYGGSGKEHPTLGASYARIM 
VFG/VFIIYF*RQCLA*AQWYSHSSLLP 
QTPGLKHPP\AQAS * GAGT I GTHHHT * L 
TFAF/ I FVLGCFFL*NKI SVTQAGGQGC 
NFGSLQPPPPGLKRVSCLTLPR 


1823 


15724 


A 


1834 


2 


306 


LARLVSDS * PQ/ CDPPAS ASQSAG 1 1 CV 
SHRAQPAEEILKVFGTLCLEPQRPTPDI 
FI I PYLFC\LFEMESCSVAQAGVQWRIL 
GSLQPLAPGVKRVSCLSPP 


1824 


15725 


A 


1835 


12 


400 


KKGMVPKQLKVGKKPPFSCWGPNMKKRD 
SPVFHHQDPIFPIPFFFFGNGFLFFPP/ 
LAGGQGGNLN* PNPLPWGLKEFPPPTPR 
GRGEFCGGAPPPPINFVFLKKGGFPLGGR 
GGLEPPPLGDPPPLPPKRG 


1825 


15726 


A 


1836 


220 


401 


KGS FVFI PQPEGEGP FLG * LKPRFPGLK 
QFSCLTLLRSGNYGPLPPPPVIF/ CGFIj 
R 


1826 


15727 


A 


1837 


12 


357 


GLLGGQMNGSLGTQTSYEDLMSSS\FKP 
NSPPPTPS/VRTGHLPK*PLESSNGPPP 
PQVSHSFQSQWARGHPSPPPQWNTPFSP 
PQQYTQCSKTDP*PPPSPPYLGQEGSNA 
PSLA 


1827 


15728 


A 


1838 


8 


380 


LM KTHAKL LN KRVAN * IQQYKK/ HH/NQ 
MGFILGVQIYFNF*KINLIQLINSVKKK 
KN/HSSSSSSSSSSSSSSSPPVLIKSLC 
NLGKWKRNFLCLTKGIYKNKTE\NSMKI 
ILNGEQLNAFPLRLGTK 


1828 


15729 


A 


1839 


2 


444 


VPGDAKWFSVLHLKDAFFFI PLVPESQY 
PFAFEWENPNTREK\TWAVLP * GFWDS P 
HFFAQPLERDLRGLQLEDGS I LQYVDHL 
LVYS PTQE ASDQNT I KTLHFPADRSYKV 
S KKKAQ I T LQQVHCLGYI LTPGTCK/LS 
PERVQAI 


1829 


15730 


A 


1840 


1 


642 


EIKGIQIGKEEVT/SLFADDIDYLRIDS 
TKKLLEVICELNKVG* KINM+ T* IVFLY 
I GNEHLE LE I / R / E LM PC I KT S S TMKYL 
EINLKKDV* DLYTEN YKVFPRE I KIT * A 
NQ*EILCLCSRRLNTKM/ stfpqvfcvy 
YAIPVK/I PSRNFLVLVDKLI LKFI * KY 
RGPRTAKTTLKKKKKWRLTLLI FKS YY 
KTIVITIGWYGFQDRQVD*WNRIE 


1830 


15731 


A 


1841 


3 


423 


HR I TS E* DLHLARE LI W\ SM YGS LDHKN 



651 



WO 01/64835 



PCT/US01/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ IP 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=G!utamic Acid, F=PhenyJalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L^Leucine, M=Methionine, 
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YDAELDRMAMPCLCAIAGALPPGYVDAS 
SSSKAEKKATVDAEGNFDPRPVSTLSVI 
I PEKLDS FINKFAEYTHEKWAFVKI QNN 
WSYGEN I DEELKTR PMKR\ P Y \ KTYSEK 
D 


1831 


15732 


A 


1842 


33 


179 


YMLGGRGCSDLRSHHCTPAWVT/TA* LS 
KKQYQRQQNRLQYHLQEYERKK 


1832 


15733 


A 


1843 


349 


10 


LAGGLNSMEGER \ liR * EREE CTQQQMVH 
DKY C KDLMG FGTKPRH IT P FS S FQAVQP 
QQSNALVGLLGYSSHQGLMGFGASPSPA 
KSTLVESRCCRDLMEEKFDQRK\QWVLK 
CR 


1833 


15734 


A 


1844 


15 


856 


AAEQLSFIYKLPQNPSPSTPGSSLSGTH 
GMQTMLGSTHLPNLTDMLGPELGVQGI P 
SPGCACQRGRG\GGGRECCSPPGVSPQG 
\SAVGRGAEGPGGLTRSGSGAASALVRP 
GEKGCWCRTASGAGPQRRQRTRGPGSWG 
LS FSQTS EEKCP S PAGSAGAGP VCQRRQ 
SSFAGGCGTGAGAPGST\GDAH?AQGGS 
GGPLRSLPAVGGPRPGPSFLKTSGSGSV 
PQGVP I LWSLT* RALAAFGSQGPAGLAV 
SCTGGRGYRDPQAPGTGAG* HGNSTRLR 
GP 


1834 


15735 


A 


1845 


402 


2 


SKARRQGRPLRQG* APG/AARI PEQKRI 
GGP/EERRRPSARGPRATRVAG2GPKPK 
GQTAMAGGGHDPLPLPPARSRSQESIGA 
RSRGSGHSQEQPAPQPSGGDPSPPQERN 
LPEGTERPPKLCSTLPGQGPPPNV 


1835 


15736 


A 


1846 


446 


32 


TSRKI S * KTGKQ FLMKECS CNSHHKAAF 
TKKDVLN I \LAWKHVNTKASETFHFFQ 
SGQAKVQQGFVKEGCELIKEAJLNLFNNV 
YGAMHVKTCTCMRLLDRLQYIMGDYAEA 
LSNQQKAVLM \ TERVMGTEHPCI RPL 


1836 


15737 


A 


1847 


440 


4 


VDGRHVEVSKKGGQVNYQAG \ KTVEIWA 
DKLGCNMLGTADWECLKSTRYKELIQQ 
AITAGGAPIAFGPVIDGNVI PGDRQILM 
EQGEFLWYDIMLGVNQGEGLKFVDGIVH 
NEDGVTPND* KFS VSNFVDCMRPRRGPN 
YSRFQ 


1837 


15738 


A 


1848 


526 


0 


PRRDPPPKRQTPIPTHVSPW7EKGPWGP 
APLRPDHPSLS P CPAMG* K\ PGLPRGCP 
QTQISPLFNRSPASPLICHHHPPSEP\K 
PGPEPPPTPSSS3PSLARFTRPGESSPL 
PPPQTPSGPP 


1838 


15739 


A 


1849 


417 


31 


QATGQECGCHRGGPP PGPAGETE PQAPL 
RLPGGTGIPWGAGILCPS*LPGPGSLSP 
AAGRG / SGPSAGPGAAWFSSP* /PACPS 
S SRS AVPGGAGS FRRAGPGLFYTLPAP P 
WCGRGASNKI IQMPGLVC 


1839 


15740 


A 


1851 


3 


285 


YTVCECVCVCLCVCLPVSL\SLCLSVSV 
CFFPSLCGFVCVCPCACVSLAECALCAT 
KRFV/CMAACLW*ASFCVSTWWIRPAVN 
RFRRGGSALGA 


1840 


15741 


A 


1852 


128 


524 


KI PGLGQRSEGVGQGKDDLHVTAPVPTK 
GWGEGAASKPTVLPPPPP\PDAPTVFFF 
FFFFWEKKYFFGPPPKGAGPQIYLLGPN 
?PGFKPFFPPHPQAAGI*YIKPPPWYKC 
PLLKKRGVSTLAP\GFPKPPP 
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1841 


15742 


A 


1853 


1 


1648 


MTVPLHSGLGKGVKPERKTTVRRPFVSA 
GKKYACPFKKAETPQWRRLRMRPKAPAA 
SLLATFPISQRLIPLTPARKHCPSESET 
STWWS KAPATPRTRGAGPTS PPRPTRR 
R\*ACSLQKLFAVEEEFEDEDFLSAVED 
AENRFTGSLPVNAGRLRPVSSRPQETVQ 
AQSSR/PAAVTPHCSLRGFGPARLGP/P 
ASLPPARPVLTA/ GP5CIGAAPLRPVST 
SSSWIGNQRRVTVTEVLREPARPQSSAL 
HPLLTFESQQQQVGGFEGPEQDEFDKVX> 
ASMELEE PGMELE CGVSSEAI P I LPAQQ 
REGSVLAXKARVVDLSGSCQKGPVPAIH 
KAG I MSAQDES LDPVI QCRTP\D PP * DL 
VLWVTFLFQ/PALTVPTQQLHWEVCPQR 
S PVQALQ P LQAARGTI QSS PQNRFPCQP 
FQSPSSWLSGKAHLPRPRTPNSSCSTPS 
RTSSGLFPRIPLQPQAPVSS IGSPVGTP 
KGPQGALQTPIVTNHJjVQLVTAASRTPQ 
QPTHPSTRAKTRRFPGPAGI LPHQQSGR 
SLEDIMVSAPQTPTHGALAKFQTE 


1842 


15743 


A 


1854 


235 


223 


IHKFIHQ I WLAKI TCQRTKV* KERSVLL 
PTSFSPVPSQGHYTCKQ\LCSLASDLSQ 
PDLVYKFMNLAVLHAMWNSRKVSCYPWT 
MIYFLHANRT 


1843 


15744 


A 


1855 


373 


3 


I KDG I YRY FYKALDS FCLCCEFMNQFFS 
CMDRELSQRCFLNSAY* FPSNL* CYLCY 
LFFFF*LKKYFLTFFLFLRDRVLLCCPE 
* SAKVHS * LTVPLDFW\ VKGSSCHSLLS 
SWDYRHALPHLY 


U44 


15745 


A 


1856 


378 


1 


RQRHSPAGNTGRPQVTPOG* ISWPSITK 
DRTSMSSVTSGALGHTAASPHARLLPLA 
LPS VRTQHGSPPPGQEQPTI J CPSNLPT 
HPSLPLGMHPSVRASPPLCK/P/SPPSI 
PASVHASKHPSPPVY 


1845 


15746 


A 


1857 


3 


379 


YMVRKVI EVWFLLLLLLFFFFRGGFLGQ 
GWGPPAPGFSFGKKNPQGLGSSNLRGPW 
* TNP* PVPGGALFLVGPPTPAEFPFKKF 
SRGFLLV/ALADF*TRPRLVSPHGG/RG 
AKGTPAFLESMPPWMP 


1846 


15747 


A 


1858 


452 


1 


GTHGLLLGSGPF \RQVFKPDNFVFGQSG 
AGMNWAKGH Y I EG AKLVDSWD WQEE * 
ESCDCLQGLQLTHSMGSGMGTLFISKIR 
EECPDCIMNTLSWPPPKVSDTWEPYN 
I TLS IHQS V5NTDETYCI DNEALYDI CS 
RTLKLTTRCI 


1847 


15748 


A 


1859 


1 


385 


NTSSDYIFPFFFLFRNSIHSVTQAGGQW 
HNQGSLQPWPSRLK\C?TASASICLHMP 
PFLANFLIFFVEIGSPYVAQAGSRDPPA 
LASQ/ SAGITGMGHCTQP* VFLFFFFFF 
SFRKKCSPWPPGGGPPIF 


1848 


15749 


A 


1860 


470 


17 


I EMDSRRVPRDKLACI TXCSKH I FDAI K 
IT*NELASAD/DFFPTLIYIVLKG/NPP 
CLQYNIQYI TRFCNPSRLMTGEDG YYFT 
NLRLGTHCSWLMMTMWTCFRAFRI FSSQ 
WMSYVLRFRNGITGVSHRAHP 


1849 


15750 


A 


1861 


3 


790 


CS RPE F PGRR F VE AVRS KP YLS LP F * SR 
*SFFNVPAE+TSAKDILASSEFIKQNND 
VSS\LQKFMPQEVK*LDHIHTA*ADGSW 
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KAELERGPDGS APS PFPQKPAASRQPEL 
GEIiATFLGRVDPWYQSNVNTLCPAIHKL 
AEMPPSLDTSRTVDPFILDVITYYIRMG 
TQPIYFQIYTVKIFFSDLSQDPTEDIFL 
IELKVKIQDSKFPKDGFSPRRRGVAEGP 
GAELSLCYQKALLSHRPREVTVSLRATG 
LILKAIPASMY 


1850 


15751 


A 


1862 


192 


3 


SSGSHSVTQA\GVQWHEHSSIiLP/LLT* 
PPGRK*ASHLTLASS*DYRRAPPHPANF 
*IFCREGV 


1851 


15752 


A 


1863 


82 


370 


SLCQKRAFVGEKLVHGLLVS PSGGRVPS 
CPDPWGCRPRFHAIAVYSFLKLRWI PE 
VSILPEDLEELYDLFKVRSSGKMRGSRP 
*AGLSRG/DPACP 


1852 


15753 


A 


1864 


2 


325 


IQVYS I SHLS I YLF\ I YHLSRGSMHVSM 
SLS I * S I YP CMDV* MYLSNLCMHVY I YL 
FYGS I YRFYLS I CLSVYLS I YLS I YLS I 
YLI CHPS I FKTVIDEHAI FATWRH 


1853 


15754 


A 


1865 


3 


377 


YSPWCKLFRELCKINVFD\LDSPLLSGK 
EFNDTTHNTFDHMWRTKEHNEAGWLLIiS 
SVDKVMKENDELRDSNSWLQKQV\ * PLK 
S AKTALSGS LNS CREKAE I VE KQTQS hT 
M*VADLQRKMHVQP 


1854 


15755 


A 


1866 


10 


378 


GWKNGEFIDAL* KVYGKKAPNKS AVY KW 
IT/ * FKKGQDDIEDEDHSGRASTLRKKI 
HLVYALIDKD/*RLTAEAIANTIDISIS 
LAYRILTEKLKLSKbSTQWVPKQLCPDQ 
LQRRAELPME I LK 


1855 


15756 


A 


1867 


346 


I 


DILWKLQKPQPNGKMLKAAKERKKFAF 
KGVPVRMNADFS I AAMKA/ RRRWNS I FS 
F*KENNCHLRLLYSAKKIFP/EIKTFSD 
REFVT I RS AVKE I LKDVLWAEERLSHVK 
SRNV 


1856 


15757 


A 


1868 


1 


377 


GTFFQRTQCKGIKQ\YWGLI IKASSDP 
TCVEKEKVYIGKLNMILAQMLKQEWTEH 
WPAFISDIVGASRTSKSLCQNNMVILKL 
VSEEVFDFS SGQ I TQVKS KHVKDSMCNE 
FSQI*Q\LCQF 


1857 


15758 


A 


1869 


90 


384 


QWLLFTEYSSLYHPVLFFFFFFPGGRTG 
PNPPAGGEGNETG/ PNGPSTPGGGGNPP 
PLPPGGLGLLSMPPPPRQILLMETKKRP 
PP*TNKCCSPGYSPP 


1858 


15759 


A 


1870 


2 


578 


FWKHALLLGLDFLPGKMAPWSGQGCSL 
GHTEGGTSWDFAVGGASWRLKWCVKGD 
SHKGPATPIASCKGPLGRPCPLLAQSKA 
*GS * KRG / VAPGS P*IiALGMGGG \DRLT 
LI S QVHGNQVTQI I PFSTEGETKAQRS P 
SLPPRDLIRGRHSWNLDSTQLLGYCPLL 
PPPLHPAGPLPVPFTNGEIQKENSRE 


1859 


15760 


A 


1871 


1 


382 


SGQDAGSCLLYGAGSGAMVSGAYNPYI E 
IIEQPRQRGMRFIYKCEGRSAGSIPWEH 
STDNNRTYPSIQIMNYYGKGKV\RITLV 
TKNDPYKPYPHDLVGKDCRDGYYEA* FG 
QERRPLFFPNLG I RCA 


I860 


15761 


A 


1872 


490 


1 


ADSLSKDPGRPLHPPNIWGDRGRQ*PGE 
TPQ*HRRSCASQDPGRSQ*PGKILPSPK 
PGRPPPMTGEDAGPPK\HGNPSPNDQKK 
NPPPK+ / PQGNRIHRDPGRFPPQ * PRKK 
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SFPNNLGDSNDQRSRFPVA* KNLPQSSS 
GTRAIGRELYLPPQPGSSAFPNSTT 


1861 


15762 


A 


1873 


373 


1 


GGGVPGALFSHKKKSLFFFPPPPFFFRK 
GL*NF\KIKGCGSPFLPLLFFLEKESCF 
VP\RVECGGVILGPCKVCLPGSPPFSAS 
AS*VSGATGAC/RPRPGKFFSPFFFFYF 
LVEMRFHRVSQDV 


1862 


15763 


A 


1874 


3 


374 


YMLGKEIVSKTKIGQELGLLNQ/TSQW 
NAKEKFL*ETKSATPMNT* IKRKQNSPI 
TETEKALLVWIEDQTSHNILLTQNVI *N 
MALTLFNS I KAE RGEEDTE E KLEGS RYW 
FMRFKKKKAISIT 


1863 


15764 


A 


1875 


2 


364 


IHSGKGESLWDLGLDTBFLDLPSKA* HI 
KAKNDKLDL I KM * NFCSAK/ET* / 1 RMK 
TQAIHWERIFVKNI *NRKLVYKELLKLR 
N\KKNTIRR*AIDMDRHFTK/EKMPMTN 
*YTKGCSISLVIL 


1864 


15765 


A 


1876 


41 


461 


GFLYLCSSEIDPYPSPSKKIKSEWIRTY 
WMKLLSENATIMLKDSDLSQV\FCVNIP 
FVQAAKAKI EE *DYIKLKWFCSAKGAI * 
KAKRQPTE/W/DRIFANYPCVYGLITTT 
CMEFTQLTSLITTPTITLPWNPRQSYHH 
NPT 


1865 


15766 


A 


1877 


2 


185 


VRPTKLDPLERTQYTLPLLY KWNNKAWM 
TAHL/FTA*FTEYFKS\SVKPDE*VREI 
DYRMLYL 


1866 


15767 


A 


1878 


1 


491 


IHRLLWPPLSAARPPSRESGLRCRAPRR 
PASAAAATAAS PS \ PTAPQGPPR/RRRL 
L I Q P P L YPRG L FTPG VP PL / APGG S R E P 
S * SLT * DARTLPLPVLGPRRARLLGACP 
AVQAEEGVDPVGHLSEVLAAPCKQP * TP 
PPACHCLDGEGRPSGVQAPLHKAKLYP 


1867 


15768 


A 


1879 


24 


449 


LQPPLTWALLLQPPQPKSRAAFFFFFFF 
FGKKSQFLFGPPGGGEGEKliG * REPPPP 
GTKGMPPPPPRERGKKRGGPTGRENLGI 
*RKGGVPPGGRGGGQTPNPGGGGAK\ PP 
KGGK*GGGRLPPPQIKGPKRGRPKKRKG 
RP 


1868 


15769 


A 


1880 


190 


2 


PLYCHKVGQVGLELLTSGDPPPLSLPKC 
WDYRY\DHHAQPS F*LFLS VQ I SGIKFN 
HSWQPCI 


1869 


15770 


A 


1881 


1 


458 


FAIRAGRNLPDKEFRYLRTVIVTAAVYW 
GLNS KLRCLTS \LLTFQHRAGVSPYTSP 
FGFA*TCVFAKQLLEPILC/RPCFHRAP 
IxLPKLRGHFAE/ FP*QCFFR/ QALGFSP 
/RSTCVGLRHG 


1870 


15771 


A 


1882 


3 


392 


YMMR YH YIL I RMAKVKI \ S STNDDTQ * h 
KLIHC*QECRMVQPLWKMV*QFLIKLNI 
*LP*NLAILLWGIYLIEMSTYEFR/RKI 
CIQMFMTDLI VIAKYWTQ/ PQCPSVAGW 
IKQ/ IRSIHTVEYYSAVKRNQL 


1871 


15772 


A 


1883 


2 


473 


IQGGIAAYRVDRQERSNGRQGNRRNLSG 
TNMRRNKQKGIiRRCKPRW / RSRGMAWKV 
DKPRDAR* KSDRDRQ/RGETDG * RVSGG 
LTA* EAYRH\HRGQ\EAASQSGRSRQAG 
RQTP * QEQAGNEAE I RREEEKSRGQERE 
RE \ KDRDADRHKG 


1872 


15773 


A 


1884 


138 


444 


CYLTLI KCRI I Y INDKVI VLT I VWY * KK 
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DRHID*WNRLQSP*J^PSIWVQLILLNN 
AKNTQWVKDTLLDKWYW/ENWIYQCMRM 
PLCS YTI PMNQFKMNYGLS 


1873 


15774 


A 


1885 


27 


470 


TICCYI FLFFSFFS LFFFFFGKGVSF * S 
PGGENTG / ANLG * WNPPPPGKG I S PA* P 
PKEPGMEGAGHPPGTKS* PGKFGPGPGG 
GPS FPYRG PNKTWGKPGPKWFPGG IGYP 
ATIPQKKQPEGKRGR\PGGPMSPYNGPP 
KTPTHGKGPG 


1874 


15775 


A 


1886 


478 


1 


KNVQENSALQAAMCRKMLI VCQTQCVI I 
SGESGAGKSVAAKYIMGYISKVSGGGEK 
VQVRRGKQKDKLGRPHLHGSAAPPICTP 
ASV*LLPTPPASAP\HVKDI IIjQSNPLL 
EAFGNAKT VRNNN S S RFVS L CRPAWS S C 
SLRADLSTHTPTHTHTRV 


1875 


15776 


A 


1887 


1 


402 


HSLERPHYIG* LFKNI FSRD/ RVFAMLA 
HWSRTPGLKQSTPLSLPKCWDYRCEPQH 
PAGSFFFFFFLKNGFWGCSLGGRAGGQQ 
* LKS * WRPNPLG * GNPPC * PS KEVGTTG 
AHKKI ANRE IRTRACGGTNFS L 


1876 


15777 


A 


1888 


511 


124 


GTRRQHFAGAHVP PEGP* S \MLDPKLK\ 
DDRPARDMWIREPGLLLPRAPAQDAGKY 
YCHRGWLTMSFHLKITARPVLWHCLLRT 
GGWKVSAVTLAYL I FCLCSLVG I LHLQR 
GESCPQWVCPNPTPSSPG 


1877 


15778 


A 


1889 


667 


310 


QLKP* ATKSV* KDTAFGIDVGNDFLAMT 
PKAQAMKGKIDKV7GFI YR*S ICTAKETI 
NRVKR* PRK* EKI FAKPTWQKGQI S/RI 
HKEFQQLN/KQKSNNLIERQTKDLNRFL 
SKECSKDL 


1878 


15779 


A 


1890 


462 


3 


KWFFPLGPPFLPPP/PPPI*NPSFQKNK 
KI/TRGGCARYFPPLKSPRPRIPFPFFEK 
GEGSPNSKHSPAPPFWGPKETFFFRRTP 
PPPPPPPLCSFLKNNLFSYI PKGFLGGK 
GQNFHS FLPS F FFS I KKKLLGIjGVHVRF 
YCR VNS CTGFFVQM Y 


1879 


15780 


A 


1891 


1 


455 


NTCLGFGNGFLDATPKA*SMKKI inkld 
FNETENFCSVKDTVKGMKRQATHWEK/V 
RKTHILYKDLILKIYNQLLKHHNKKTST 
I KQ*AKDLNREDIQMTNKHMKRCSGWGR 
WFTPI IRALWACKVEGSLEIiESSLGNIV 
RPHRKREREIHCY 


1880 


15781 


A 


1892 


1 


537 


RGGIQAPKEVSPEGRQEPARKSLI *TA* 
ETPP*SQ* /PIPEEP/TGVFMKKPVSVS 
LETGKHAVWPKVNGKELPDNP7IKWFK 
GKWLELGS KSGARFSFKESHNSASNVYP 
VELHIGKWLGDRGYYRLEVKAKDTCDS 
CGFNIDVEAPRQDAYGQSLESFQRTSKR 
SLYALALEDPGM 


3881 


15782 


A 


J 893 


2 


514 


VRCQRRCHE*RACGSSLVNAKKLYEDAL 
MARKVKQSLFSLDVETDEDKFQMMSLQ\ 
CSLAYGTLTKILSEKRSAKSYGMSSVRM 
RSAGQTSKAHLHQPRRVSQVLQVPAVNL 
LPFRKKGQTKDPALNTSLPQKVLGTTEE 
ISGKKHTEDTISVASSLHYSPPASPQGS 


1882 


15783 


A 


1894 


473 


2 


VMGESRGFSPPPTFTGGNFVFFWKQGVS 
LFCPEGFKTLCSSSPPPPPPQKAGVLGG 
SFHARPPPFFFFQI *IPFFFGIKKP\LC 
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SPPP*PL* KKRG I FP VPKGI FCEMESCS 
VAQAGVQ* YDHGSLQPWPPGF* * SSYLN 
LV* LS S * D YKCVP PHPMY 


1883 


15784 


A 


1895 


613 


11 


FRPRSPACGLHAVSSPKLPGAQAIiSSPG 
LNLVFTAGS WDAG / LLRLS PRANNPRVA 
LPRVHTGPSST/DLSPSCPLGLACGFGT 
SQPACL*SPFLLLAPARPSGCAWPGLPP 
VCSLHLRGGWGGADPTGCLDAWAMALAS 
LRPCLCPALPLSREPPSPLLNDLVLPRM 
PRRAPVLPLGRTPHSLLLCHVQS PSEQL 
PSKGPERL 


1884 


15785 


A 


1896 


20 


449 


KFGYSSAAARRQQLGWEAWL*YSFPLQL 
EPSAQTWGPGTLRLPNRALLVOTKFEGS 

evspcvacgiqaalsmgstssvkiilskp 
qaplpqwhqmvfarclcmcgaqunvppX 
es ftfqvs tkdvplalmacalrk/ katv 

FRQPL 


1885 


15786 


A 


1897 


393 


3 


RPTAQSKGNI * VRVAS \EALS PKLLDFL 
PGKVLNGEKVDVRPATRQNLSQFEAQAR 
KRECVRVPRGGIPPRAHSRDSSDSADGR 
ATPSENLVPSSARVDKPPSVLPYFNRPP 
SALPVMGLPPPPIPPPPCI 


1886 


15787 


A 


1898 


395 


217 


RER/ CKSFR/PPA/HLQAKIKGAQ*QVN 
QAAAAQAAAPAAAMVSRDISSLLVSSQK 
SKVSNYM 


1887 


15788 


A 


1899 


1 


375 


NTVLVQ*NNKAWMTVHPF/TAWFSEYFK 
ATVE IYCS/EKIPFKI LLVFARVH SHPR 
TLIEI*KEIYAVFIPANTPSILQPMDHG 
r ILSSKPYYLRKASRAQRLTPVI PALWE 
AEAAGSPEVGSSGLA 


1888 


15789 


A 


1900 


47 


326 


VKSI IQFNSKNEPGKHDKTPSPEKTQIV 
WWLGSELRSPSYSGAAGGAEMGRSLESW 
RSRPQRAETAPLESSPGSGSEMLSL*RQ 
HLTPTAWAGVQWRGLCSLRPRPPGFK* / 
FCPSQHHPQPPSSWDYATRCQATKQFVF 
F1>EMGF\VMFARL ILTVELNN 


1889 


15790 


A 


1901 


181 


837 


AGRVDRREPGMGTG I CKE LETGSRETRS 
ASRWGRGRWRLGQACRVPQGLPLSTFHL 
GAQAKARGGTPLACS SHLPNSHVGSLKA 
QRDEAMVQSGLAPAVS STCTRWT * GSEW 
* *GLHIVAARRQQGRjEEEPPRTTAAPET 
LCFQQTSWASSCSLEHSAQPSEVQVRAL 
S VPSHS PMW\ PQLSLPRDHRKPPGE 


1890 


15791 


A 


1902 


1 


385 


YTWGFRGKKPLIHCL*EYKLVQPLWRAA 
WRFSK* LRVEL* FNAAIS PLGVYPEENK 
LFYQNSTCTCWFITALFITGKT/WNQP 


1891 


15792 


A 


1903 


207 


3 


FREMEFLHLGQAGLELSTSGDPPTLASQ 
NVG/HYRREPLrRPANTLQS* PLGLKQPS 
CLSLPSSWEYRHMY 


1892 


15793 


A 


1904 


2 


391 


IQPLISQRKYKTLGQNSIiS/CCAHP/PP 
HFFFVLDS YFHSLLSQLKNAPFT* G* CP 
YWYSKSYSLIHSWGSSSILPCPLTSSGF 
PSLPPSYQPLPCPSLFLL*NSLSTLCLL 
FFSLLSQPSFSNRWPSQVYL 


1893 


15794 


A 


1905 


3 


424 


YRAGCLQSLLPPPLLLFIjLPCDVPFPPS 
S\LP*VKASCGLIRSQQOTGTIPCLQNR 
KSNKPLHKLPSLRHSLRAMQNRILPILR . 
IGKFF / X FFS * DG VSLCHPGWS AGVQhQ 
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FTALHLL/VLKRFSCLSL 


1894 


15795 


A 


1906 


2 


389 


I QRGLDKSCLI TDN I PSSQSLIQKKGVN 
PSFKSMKADRGKEAAEEKSEASRSWFMR 
EKERSHLHN/ 1 K* VQGKAAC YPEDLAS I 
IDEGGYTKQ* I FNEDY/MWKKM* FRSFL 
TREKLTPGFKASKDRLTPLV 


1895 


15796 


A 


1907 


458 


13 


AKEETQSVWDFPM / VPQGVYVNFFVSR 
NANLSTI KQLLWHRAQYEPLFHMLSGPE 
AYVFTCINQTAEQQELEDEQRRLCDVQP 
FLPWRMVDCEGARVNKLLSSQISLFIG 
KGVRELDSLSDPEVSDFSTKMCQFCEKS 
AAL*DQATS 


1896 


15797 


A 


1908 


409 


3 


EKTDGLYRASQRGKDLRRLCAQS FHAS W 
KDGMALCAL I HRRRQ\DL IG YAKLRKDD 
P IGNLNTAFEVAEKYLDI PKMLDAED I V 
TTPKRDEKAIMTYVSWL* I VIAGAEQAE 
TAASRI CKVLAVNREKKKLMEER V 


1897 


15798 


A 


1909 


116 


379 


HSGPRREGALLLPKCLPHAKRCLLLFKM 
CSDGATALCCPGWS SAAFS * LTQSP/AS 
TSQAK* PSHLGLPSCWDYRCI PPHFANC 
LDYYY 


1898 


15799 


A 


1910 


418 


3 


QD * YATANRWF I CMLSQAC F L P S L* PAH 
LL * L * QLMLFS FPGTPVFS YGDE I GLDA 
SALPGQHMEAPVMLWDESSFPDIPGAVS 
ALMIVKGQSEAPGSliLSLFRR\LSVQRS 
KERS LLHGDFLAFS AGPKLFS Y I RPMY 


1899 


15800 


A 


1911 


394 


3 


ILEAYPEVKDPAVKGASSKKEMYGH* /A 
AEQALPVASEQEQQRHERSEKKQPQVKE 
GNNTNKSEKIQLSENICDSTSSAAAGRL 
TCX3RKIGKTYPQQFPKKLKEEHDRCTLK 
QENEEKTNVNIMSKKNREDV 


1900 


15801 


A 


1912 


499 


141 


PGLGERDWTSKYGQGAGEGSTREWASRC 
G/IRPGGDAGQQQPRPE*SVCPRGAHSP 
GPGSV7KAS PAWHSAEPGGRCSGLGVQKE 
GFPGHLLQPGCRTPGPGIRKERFSGYLQ 


1901 


15802 


A 


1913 


127 


387 


ISFVFPPTLPKMPQLKPETISMTGLNIiF 
QHLCNLARLATSAYDGCSNSEV/CDLDL 
LY*AALFLKLDYQIRFPNYFSTKYRIYY 
LCLY 


1902 


15803 


A 


1914 


504 


0 


PGPGQRKHSTAPMCLLNI YAQI LRFVLA 
NQIYKCIKRIIHHDQVGFIPIMQSWFNI 
QNQ * \ INLIHEI WRLN* KNHM IIS FDKT 
HHLFI IKTFIKLGIEGNLLNLI KNIF/ S 
KNPAANIILNSEKV 


1903 


15804 


A 


1915 


46 


415 


YTSNKQLQIQILKITYNSTKKYKILINVJ 
I KD VKDQ YTENQKI LLRE I KDLNX/ YRD 
TSCSW/NIVQMSMFS/KLIYRFSEPPNK 
NLSTL* ILTTSF*FAWKYTGTGIYKVTM 
KNRVGRLS LPNFQN Y 


1904 


15805 


A 


1916 


420 


1 


ENADCVERARKS PDS IP* \RGGQ I SVTM 
VSPNEQEKAGQLAIGVRAVRYNGV\LLA 
KMWRKKLHLTLSANLEKIIAIGLFFSNF 
ERKPPENTFLKLTAMATHSESNLSCFAQ 
EDIAICRPHPAIKMPEKAEQYKPLTASV 


1905 


15806 


-A 


1917 


384 


3 


TRT I TSGQY STHV I RASRVPTR S * VP VF 
RSCTSNRRFSQAIEPRVMH*KVHIRAST 
VRYDSGGHVAVYPANDSALVYQLGKILG 
ANLYVVMSLNNLDEESNKKHPFPCHTS / 
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RRTALT*YLDITNPPMY 


1906 


15807 


A 


1918 


405 


105 


KAEAQRRRE I CSG PHSQADS AWHP LVT W 
WH*PPPVARSSQEPGVHSPVP/HVCRKI 
LLIRPKMAIANEG3SIYRELRWFTPWSRSR 
* VGCLTTPGMCVKHLRCV 


1907 


15808 


A 


1919 


409 


3 


GGQEEGVFRVLNSKEFETERGKP* IPPV 
KDQEI IH\PTKFNHVAHMGPGDGKQVLM 
DLPLSAVPPSQEESPGPAPTNLARQPPC 
RNKPYISWPSSGGSEPSVTVPLRSMSDP 
AQDFDKEPDSDSTKHSTPSNSSNP 


1908 


15809 


A 


1920 


9 


470 


APARNPLPRPCTWPTGP*CLRPARPPVA 
SACLCGGTWNLALWLCS PGT/PVPFLTP 
PCS/SCEVQQPASHSVASNQSKEPAKSA 
AVAHECPPGGTGSADPGWPPGATCPESP 
GPATPHTLGWEPGKSSPPTMEEEPWAP 
QGSPCWTVRQRMTMMM 


1909 


15810 


A 


1921 


556 


1 


IQLWAALGGILRRVGRDPPSHRIGKEPS 
AMAGQAGGNGDGEEGS S \GGLAAVPHLP 
*SQHAAHPLLIGPPGQQNLiGDSKV*GFP 
SPRLEENTLENGGWGSKQLHGSPGSQHA 
GGSWKNGETSLKG/PH*ADGAGRHTMPQ 
SPPSPPFKPHSV*HNPPAS/PPPKGSPA 
PSGTSPLPMSAVSLLPPGSL 


1910 


15811 


A 


1922 


567 


41 


GGWGETFSRLGNDLQAKR* SRFNAQAQE 
ETSR\VLAVSLINEALDKGSLEKTLSAL 
LLPAAGLDDVS LP VAPR YHLLL VAAKRQ 
KAQVTGDPGAV1.WLEE IRQG WRATQDA 
NTAQRMALGVAAINQAI KEGKAAQTERV 
LRNAAVALRGWPDCAKGYQRDLESAMA 
KIQRPAAV 


1911 


15832 


A 


1923 


2 


405 


I QGCG ITSSS VLHGMVFKKE T5GD / VTS 
VKDAKI AEYSCP FDGMITETKGTVLI KT 
DEELMNLSKGEENLMDA*VKAIADTGAN 
VWTGGKVADMALHYAKKYNI4MLVKLNS 
QWDVRRLCKTVGATALPKLTPPCL 


1912 


15813 


A 


1924 


510 


37 


LLGHAFHVQSSSGRPQLAEASGHSKLKK 
GECVQQRTGNVGLSPNTARWGTPLGPS I 
SSSAPPWSFSAFPGPPGSGK*AAKDGPG 
CSLKSGLKR\RSQGGLR * TRGSGPS PMP 
PSPSPS/ERPPPGDEGLLPCTPRGGLPG " 
PKINTACVCAADI S PGLEPV 


1913 


15814 


A 


1925 


74 


429 


ATI PGHELLLLFFFFFFFFFLGKGAWPL 
G*GKG\GGPIRG*GNRAPRG*REFPPPT 
PGKRGNTGGGQPGQPI FGFLKKKGAPPG 
GPGVPKTRGQIEPPPWPSKRAGVTGWTL 
GPGPKV 


1914 


15815 


A 


1926 


515 


304 


AIAASIiALALNGVFTNTIKN* IVGRPRP 
DFFYRCFP/GMG* PHSDLMCTGDKD\ W 
NEGPKETSPSGHSSPV 


1915 


I58I6 


A 


1927 


I 


433 


NTVGSNKKSKKL YFAE IEKS YLNF I WNL 
KGPQ*VKIILKKKSSWKCTLIHSKA*Y 
KV/LKI VWYWQRDTHM/DHVJNS I KQKMV 
KQFFDGDTKTVQWAKDCLYNNt«JCWEK\VJ 
ISTSNRMKFNIYPTP*ANINCHQDPNSK 
ELYRYKN 


1916 


15817 


A 


1928 


316 


356 


GGT*PPQAATPIS*LYLPITSSL/TLPP 
APSLPLQIAPIPSDSPSPRYS/TPTPPI 
SLGLAPPTTLLIPVPSLPVSPRPLNSTA 
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nucleotide deletion, \-possible 
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P P/ TFLTGSGAAVT WSLAVS LS PAPAS 
AMDKAQNM 


1917 


15818 


A 


1929 


490 


11 


PTRLTCPGSHVPGPVTFPP YETHSMP \ P 
PCHVHPLVQPPGSSHVPGPVTSSPSPPS 
ATHIHPLVQPPGSSQLPNTWVHPSQRMR 
TWLQQHPPLPAPPQP / PSPGS I CSSMTH 
LTEYLASFPAPQ/PREQAPQGGVPTSLP 
QW*APQPTAPCRATQSQQLLCG 


1918 


15819 


A 


1930 


1 


552 


RNPRKRAFPSPPIPAPPS*NQENERSRH 
PQSLIAPFVKSRKRAFPSP/LSPCSSFV 
KSRKRAFPSPPVPAPPS*NQGNERSRHP 
QSLLLLREIKEMSVPVTPSPCSSFVKSR 
K*AFPSPPVPVLLRDHTFLPLPEPRQPS 
TVP VGCFGFSRI PRRWNHTGCAFLLI CH 
SARLLCDPRVSSCWRHS 


1919 


15820 


A 


1931 


414 


3 


RVPGESRK* ERVLDSRHKD * EGRRRGN I 
\MGKRVDYTT*TVITDVTNLS IDQVGVH 
RSIAANMTFAEIVTPFNIDRLQELVRRG 
NSQYPGAKY I IRDNGDRIDLRFHPKHSD 
LHLQTGYKVKRKMCDGDIVI FNRQCI 


1920 


15821 


A 


1932 


521 


103 


ATEPAGVRLKEGGNI TESFVTVGNVISA 
LADFSQDA/ SRYS* KKKQVLVPYRDSVM 
TWLLKDSLGGNSKAI MMATI S PADVN YG 
ETLSTLRYANRAKNI INKPTINEDANVK 
LIRELRAE I ARLKTLLAQGNQ I ALLDS P 


1921 


15822 


A 


1933 


1 


490 


NTGWWRKGEGNGTLMVRRLPCVPLCSS 
AGGLTEEDEGELCAAGFPLLAEDFGQAL 
QQLQTAHSQAVGAP/KGGDQVLGGGTGA 
PPTAYPWHALSI S * I PSVS WHDEGALPA 
VKIEILGI IHLLPERHALLSLVQARSGL 
LLHG P PATG KI LLNKAGTTE CCLT 


1922 


15823 


A 


1934 


383 


151 


EVAFLENLiI KDD I ERGRLP LLLVANAGT 
AAVGHS\S*LGRMKELCVQCVNRPKMEG 
YMHVSQHPVPESHKMRAKAI F 


1923 


15824 


A 


1935 


379 


1 


Y VP WS KD KEY F FN S Q * GLTTHQ I LP Y I 
DGFRHVQKISAEADVKLNLVRIAIQILL 
*VGLQSYLGQGHQPGKS/CRGPGCEGWE 
GMVLRS*AQLSLSGTTAL*HWCPSSRKL 
IQFGI IKNLIRRLCI 


1924 


15825 


A 


1936 


376 


2 


GCLFIYMKPTAMSSSQVARSGEVSPFTA 
VPA* S * K/QGHGAVLGCHITSEACFQAC 
FPFLRPGRSTCFA*SGC*DVERSSSHSH 
GTAHSP \HGTAHTPMEQHTHSHEQHTHS 
HEQHTHSHGTAHMY 


1925 


15826 


A 


1937 


426 


1 


KHGE I IDDLLKVYRDNA3KKSAI / YSKW 
ITHWKKRRDDVGEEVHSSRPATSVCEET 
IHLVCALI *ED*LIAAETIANAVGI PRC 
SAYTILT/E/KLKLSKLSTRWVPK/P/L 
LPAQLQIREKRSMAILNKWNQDHEAFLH 
IIIAGLY 


1926 


15827 


A 


1938 


2 


469 


KRRKLVSSIAAAIHPALSS*TRQRFLLL 
LFLFHIVREDLVQLRIKKKHTIQIGNVE 
VKLL* PTDNIILCGKLYRFHRSTQKVLQ 
LI/NFSIVAAYKINMHISLVFLYSNDE* 
LENKTKQTSSFTIALKRI KYVGSNKKSK 
KFYFAEIEKSYLDFIWN 


1927 


15828 


A 


1939 


468 


920 


IPLSTHSPGMGTSTHLSTRHHLSFWTYH 
ERLLNPGRETDWA/ IMLFRWGLLMLPRL 
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X=Un known, *=Stop codon, /=possible 
nucleotide deletion, ^possible 
nucleotide insertion 














VSNS *AQVI PPTSASQSAGI TGMS \HAS 
GHLI FNY\ CSM* WLHLRNSQ* KRCTGQE 
TGPCPVTQDGVQWSNYGSL* PQTPRLKQ 
SYHVSLPSSWDYS 


1928 


15829 


A 


1940 


448 


474 


GIPGG*GSTTRNALYHVMNGEDWILTT 
CKHGKDWQKHKDSRCDRDNTEYEK\YDF 
GEMLHNATFCLVHRGRRHGS FRFLEAMQ 
AACVPVMLSNGWELPFSEVINWSQAAVI 
GDERLLLQI PSTIRS IHRDKI IALRQRS 
QFLWESCIATAIEDPNLIGWR 


1929 


15830 


A 


1941 


1 


423 


NTTLIFAGGM*NVCPGPLCARLLHRSLT 
HATPDPPLLTLP* IPTPTPSSRCAPKAP 
PHLICPTSCPLCSKPHPWPP/CSSCQ 


1930 


15831 


A 


1942 


48 


417 


RLTM YQ VL YRLKCTALQPLP PRVK* FSC 
LSLPSTWNYMHMPQCPTNF\ CI FTRDE V 
PPMLP\SWWRTP 


1931 


15832 


A 


1943 


450 


2 


VHRRSNRQNMDTSR * RGAMQS VES V\GV 
PY\EQ*TIVDGI\NSGVWEGIAYA*IEE 
RYPEEFALRDQEKYLYRYPGGE/AYKVE 
TI KLNVEAVNTRRDKPTNI FAKNQAPVR 
MRRNSFTPIiSSSNTIRRPRJSTYSVGSRPI 
KPLSPLRAQECI 


1932 


15833 


A 


1944 


451 


3 


GRITRHLPRRAEDDREREREPSPLPSRH 
PMFPPSVTPKASSDWP/PASSIPCQACH 
G/PPPVSLPRKPAHRSC\PVFPP\ASGG 
DTS IHSGKTVYVKRKSQPAWPVLPPGGL 
*APSHGAPSPSPPDQHRHCPEI\R*DLL 
PAPAPSPFSIPPLY 


1933 


15834 


A 


1945 


402 


3 


VRLLVSWKVMYP*SRSKASL/HPQITAS 
LTGSCVNCI VILI LNFFYEKIS AW IAKM 
E I PRTYQE YESSLTLKMFLFQFVNLYS S 
WFYVAFFKGKFVGYPGKYTYLFNEWRSE 
ECDPGGCLIELTNQMTIIMAGDV 


1934 


15835 


A 


1947 


1 


405 


NTGWRVFAILCSLKGRPRGIERV/GGKK 
KKKKKPGGPLGPAQGKTPKTQKGGGAQG 
NRGKPPLFSGGNFGNGRIFGNPLPAPGP 
GGGGGPR * KTR\KRKNGNWDLLKGGGKL 
VFCPMVG KKLSG VPGV* QKNFKGG W 


1935 


15836 


A 


1948 


443 


I 


LTPGAANAS LLG \ CCMEDLS VKG * RQGL 
WEALLTHNMVAGCRLEEVDNAYGHYEAF 
STIiAPKAWLSVELAEPCVPEPGLPPVFA 
N F I QLLS A/ PVVVTEGGTAWLE WWHVQP 
MLALMEAELRKSQVLNRVT*GAHYSDCI 
AAALRIKIT 


1936 


15837 


A 


1949 


396 


2 


GNGRGGSVPPNSLNEDGISCAI * /RHIN 
WL»NG*TPTI YCL* ETHLICK/DH/HRLR 
VKGKKKILHPNGNQKPAGVALLT * GQTD 
I K* KAI KSNKEGHYE I KGS VKQENI TVG 
NIYAPSTRAPRYKKQLLDLKGVN 


1937 


15838 


A 


1950 


419 




RWQ/PSARPPTPSGK*GASLPARPSSGT 
*GALLPGCPWKVRSAS3WPPSRLGSEE 
PLCPA\PSHLGSEER\PSRPPSHIGSKE 
R \ PARPP IA * D VGS ASAPP/LPSGM * ER 
PAAPPSGR*GASLPGRPSPEMWGVPLPR 
RPVWDVRAPSRAPVWEVRSVSAWPPHLR 
REKTLRLATAPV 


1938 


15839 


A 


1951 


422 


3 


QNHVIMSE/DAKIAFGKIQYPFKMESLN 
TL*MKVNFL>njIKITQKNPIANTMFTGE 
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E=Glutamic Acid, F=Fhenylaianine, 
G=Glycine, ENHistidine, l=Isoleucine, 
K=Lysine, LHLeucine, M=Methionine, 
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T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=XJnknown, *=Stop codon,/=possible 
nucleotide deletion, \~possible 
nucleotide insertion 














MLE/AFLLRQGKRSC/CSLSPILFIILL 
EVLP S * VWQEK* / 1 QG * EREE * NCDFVD 
DM I VYVTSN * KKPLYNLL * 1 1 S * VARFKV 
ILDTCI 


1939 


15840 


A 


1952 


2 


409 


CAHYSKNPPQRPLRTDARLPAV/DFWSL 
G AI L FMVGCGQP P FQEANDS ETLTM I MD 
CKYTEPSHVSKEGKDLITRTLQIDPRRR 
ASLQ* IENHPWLKGVDFSPATKNNIPLV 
*YK\NLSEEEHNSIIHRMALGDITDR 


1940 


15841 


A 


1953 


'2 


367 


IQCVRL*VHVCLCASSCVCICLYVHATL 
CVSTCL* CVAVCM/CLLCACATVSACLC 
/V/CGCVSTCVCVCPPVCTCDCVCMSAC 
VCT/ CLCVCVTVSTCVCERLCVPTCVCR 
CPCICSSLNGNEWMG 


1941 


15842 


A 


1954 


374 


2 


E AP WLLRAGRGAL P P CW S KT P PS P LL F \ 
PPLLPGTLVYQPWVPLPPWNRLACAVTL 
ST*ARAGTSNPSWHLPPVSTAPQHPSTW 
QSPGAGTMGDPSLSPWTSHLLPGCCHNP 
CHC P F PHHKPTRV 


1942 


15843 


A 


1955 


1 


411 


OTPS PELHPC * PGL \ PPLPSPPQQPPTW 
APPRTSTQQKLPILCLLKPSAHTDAPCT 
QPGSTLPLHTPHTQQAQGTAYQIHTT*A 
APPPGAKPG*RCPPPPPSRQPQRMQTPA 
PGNPQPCPRLTTLTRVLVPAVPPLPI 


1943 


15844 


A 


1956 


33 


451 


RGRNTFGPLQS PPPRFK * FS CLSLLRS W 
EE \ RD YRCM P VHPANTMLAS L VLNS \CD 
LPAIiAS QSPGI TG VSHPTRPHLS * FLMP 
LE*GHHHLLLVLLSQPPNMPP*FQRFST 
GQLECSSFRTNLIVPSPSLNFHSDFPLT 
S 


1944 


15845 


A 


1957 


3 


399 


YMQVRTTMSDHSSHSISKINTDNTKSCK 
G*GSTESLFHDRWEYKFVQLLWETVWHY 
VR * TFT I L YNP E 1 1 LKR / 1 FRHT YKNVC 


1945 


15846 


A 


1958 


47 


399 


AANPTLPAVFFFFFLETEPPFGPPGGRA 
GPQSRLTEPPPSGVKPIPR/ PPPPGEPE 
KWPD\QGGGGGGDPKPPGGPPP*ATGQN 
SAGKKKKKKI PRPGAHGADPLLPGKAGG 
EEWLDPA 


1946 


15847 


A 


1959 


407 


238 


TQAFALI /KDGGVIGGICFRMFRTQGIR 
EIVF*AVPSNEQVKVSGSPRRQPQCTAH 
S 


1947 


15848 


A 


1960 


106 


1854 


NEAVKSKPNQTRNHKEKNNYVHLPDAGP 
SQPASAGGSSSACRRSTKATLHKRWVSS 
PAGPGVQPLSWQHPPMMA*GLQGNPSPQ 
AAAPPARPGLVSGNCLT*EMAQAGA\GT 
GGSLS AS LGKRCPH I PVP CE\ PVL *GLC 
VPGRSSLGELGGNPVTVQSFPGPGAESD 
P W * GDAE CCSLLQAS FGDRAGW SIR/ RG 
SVGRPAGVPRGKGRKPTLSGA/ SGPGS V 
LGGFCCPEPLSREAESGMVQE* *GRFWT 
GQERTPTGR+GCKVAGYLHSSATVGHSG 
AGAGGS PGKTSATLDVGQGLGGT/ PSGP 
AWDRTNGYMEKAALPLCNGKVTGNTQCG 
AQP PASG P S PWWJQPLLLRQRLRQSTGL 
* LLFPGVGGGLQPGEG\GQPFPLSPWGS 
l/TGRKNSNKARPGGNEGTEG * GARNGAS 
LTWWTVPNGGYCPQAGRDFWLVDSRFK 
PSLGLRAMV*NSRKAPLSFEDGRMGTV* 
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K=Lysine, l^Leucine,M=Methionine, 
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Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W^Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














PQGQEGRASHRKGDSGLSPHSGKG*DP* 
G/PGGKERSISPPLPRGL*GP/VGERGR 
NGDWPW 


1948 


15849 


A 


1961 


76 


427 


QRQEPTPGHTTSGTRHPPGPGGRGPIQT 
HSHSHLFPWLETSAGSSPPNFLCSLPPY 
TPLLRLLPSPFLCPSQTTQRMPI PAPHP 
S PAS PCKVP*DARGAPAGGAPSTL/RGH 
HPPHP 


1949 


15850 


A 


1962 


375 


131 


FLVGRIRLLYCLRELFVYLLREQKAKRN 
SQWVHTLPISSRHVDAVPCSPS INRNRM 
GRDK KR P F PL \ CVDA* LHS P SWI QP 


1950 


15851 


A 


1963 


89 


538 


GLILLQKWHPGAVSSMVERG*LLHSGLF 
FSFPDKVSSWQAGVQWHDLSSLQAPAS 
GSKRFSSLGDKRHAPPHLAHFCI F/M*T 
QDFAMLARLGTSFIYLFGCCC\FETGSH 
SVAQAEVQWHNHS SLQPHS PRLRRSFHL 
SIPGSWDHRNAP 


1951 


15852 


A 


1964 


402 


42 


CCHYPGTPGLQGRLSASHQAS / SGPQLP 
APLPLQVLPFLALGIGVDDVFLLAHAFT . 
EALPGTPLQVGPC P \QGSS EAAQLTG * E 
PLGS S DLGLLMNLGASCPHL * TGE 1 1 VL 
CPKGYCLDQ 


1952 


15853 


A 


1965 


8 


14 


SSRAAACPPRPALSLPRPPVFTRRMGPQ 
GSGLPKLPVSAPSSRMGSFPAPSPLLLA 
SGDRHLCPDAAALPLLAIETGCDS\PPG 
H*PPPSPGGLHPCSPPS*PLIL 


1953 


}5854 


A 


1966 


521 


2 


YELYGIIKRHRREVSNNQQVRYKKHSIP 
PDYVSSVPTDPTWGPERREEESSGHFMV 
DHTGTAAGGGGGM I LASPKLGATPLP PE 
*APA/PP?PPPPPPPPGVGSGHLNIPLI 
LE ELRVLQQRQ I HQMQMTEQ I CRQVLLL 
GSLGQTVGAPAS P SE IHGTGTAS STKPL 
LPLTV 


1954 


15855 


A 


1968 


111 


395 


YLMRGFLLHHNMVKGKEDERKTKGARL I 
LFFFFFFKKESPSPPRVEGRDP\NLGT* 
NPLPPKVKKLSRPTPPEKWEPGAVPHCP 
NNFLFFRKNGGS 


1955 


15856 


A 


1969 


413 


101 


D VNRHFS KKNKHCQ * LVI KEMQ I KTKRR 
YHFS PTRMGKIKN\ KKENKFWQGYGETG 
I FTHCWWECKMVQP I KKTVWQFFLKVK* 
LLVFGPACKELGSYYSLVTKS 


1956 


15857 


A 


1970 


411 


1 


N LTPALCT KVH F KVJVMD I KMKGKT I RLL 
ENNTRKYLHNI K I W KGLNRTE \KP * TI K 
KKTDKFDYDTIKNLSSSKNKTRQTIDWE 
NIL\YLQYITDK* I ISGIHKAFLKV/YK 
/ KKTNNPVGKWARDLNGHLI KNDIHMAC 
I 


1957 


15858 


A 


1971 


3 


828 


GQACHFI FRSAQAGGSRLRIWRAKQLGL 
WRVCALI M PALEHS FPTRLLQGNS VPGP 
S I SLDTS CL/ CNRCVWEGGRG * TGPGLP 
SLGKVLVEGIPSESPGPTASHPCCSPRP 
D?DQ/LSCISAPSATPTVH*SHLTLPMG 
WPGPQ*GLHLSDQPGPRKP*PGCSGLGG 
GDAAPRGMEKP\PPPQLP 


1958 


15859 


A 


1972 


398 


3 


GPGCYFSVRLQCP/RKIPAWKRAVCSAT 
LI\SLQGPSLSAPHVLGLAALAVHLGES 
R S ALP E VDVG P PAP GAG LPVP ALFD S LL 
TCRTRDSLFFCLK*ALLPOSRJ,LSSLSK 
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Q=GIutamine, JR=Arginine, S=Serine, 
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X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














SDTCS FD K I RRDRHAD I P PLY 


1959 


15860 


A 


1973 


2 


433 


QDPTKMTKEELNALKSTGDGTLG/RASE 
VEVKNEIVANVGKREIIiHKTEKEQHTED 
TVKDCVDIEVFTAGENTEDQKSSEDTAP 
FLGTIiAGATYEEQVQSQILESASLPENT 
AQVESNEVMGAPDDRTRTPLEPSNCWSD 
LNGGS 


1960 


15861 


A 


1974 


373 


580 


TIFSRQVLRIQNALSDKPNVSTVYSNNG 
SELHGTSEASV/YHWKILISTEYHKTL 


1961 


15862 


A 


1975 


47 


413 


KWKHLRGSSHWRPQENQVHQRIAELRKA 
GLWSQRR/PAEAAGGPRPKSHWDYLLEE 
MQWMATDFAQERWKVASVKKMVRAVARQ 
LQDRTRREAGARREEPSRLRQTSPVLPE 
KSSVPGLVLRR 


1962 


15863 


A 


1976 


444 


3 


GYERSRGTTSGTHSSRDYDGIFGQATVK 
WAMLDQFRMLSPCSKEVMHQPFYLKRVE 
IMAQCEEWIADIQQYSSDKRVGRIMSRH 
SAAI KRRTGQLREELLKLPCPEGLDPDS 
GDAPEVCRAATGAEETIMHDQV\QPSSS 
KVLPSDFQ 


1963 


15864 


A 


1977 


281 


567 


PRSSLQGQELSRPWGRKGNFPGPPVPPP 
QRMFYDSELFSGISDPSCCF/RSAPTAD 
QVYGDQDMHEWRKHCMDYLVRKWRPLG 
KGQGGKI S S PHR P 


1964 


15865 


A 


1978 


357 


2 


KQILGPPHPQAQPGRWPPPHGPKDAPL 
WSSRAAPPGRGLGRAGPAAGVEAGATLR 
DSSPSTWTREGLHVQAQRKRPSHVHKG/ 
SGPG CLE DGE F P TS LR LQAQLAE I GRGN 
GLSVRRQ 


1965 


15866 


A 


1979 


29 


434 


VQAEAEGLPGDTEHPQPQLMSRSLEGQS 
DVT I KHVACGDF FTACLTDRG I I MTFGS 
GSNGCLGHGSLTDISQPTIVEALLGYEK 
AQVACGASNVLALATERELFALGRGDSG 
RTGARTKESH/YLPQQVPMP 


1966 


15867 


A 


1980 


3 


2082 


SSEGYLRGNMSENEESEISQQEGSGDYE 
VEEIPFGLEPQSPGFSPQSPEFEPQSPR 
FEPESPGFESRSPGLVPPSPEFAPRSPE 
SDSQSPEFESQSPRYEPQSPGYEPRSPG 
YEPRSPG YESES SRYESQNTELKTQSPE 
FEAQSSKFQEGAEMLIiNPEEKSPLNISV 
GVHPLDSFTQGFGEQPTGDLPIGPPFEM 
PTGALLSTPQFEMLQNPLGLTGALRGPG 
RRGGRARGGQGPRPNI CGICGKSFGRGS 
TLIQHQRIHTGEKPYKCEVCSKAFSQSS 
DLIKKQRTHTGERPYKCPRCGKAFADSS 
YLLRHQRTHSGQKPYKCPHCGKAFGDSS 
YLLRHQRTHSHERPYSCTECGKCYSQNS 
SLRSHQRVHTGQRPFSCGICGKSFSQRS 
ALIPKARSHAREKPFKCPECGKRFGQSS 
VLAIHARTHLPGRTYSCPDCGKTFNRSS 
TLIQHQRSHTGERPYRCAVCGKGFCRSS 
TLLQKHRVH\SGERPYKCDDCGKAFS\R 
ASDLIRHQRTH 


1967 


15868 


A 


1981 


2 


188 


LPETNFAELFLP Y I SQHNLIRKYKK/ WP 
GAVAYACNPSTLGGQGGWITRSGDQDHP 
GLHEEWP 


1968 


15869 


A 


1982 


3 


424 


EGQAIVERMNLCLKQQLQKQKGENRYYR 
TPHKQLN\ ALLTLNFLSLPKGR I J..SAAE 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Pbenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 














QHLQKPAAKTE AEQLVWWRDL I TES WE I 
GKI I TWGRGYAYVS PGL/NPSRHLKPYH 
ERMLRKRFREDPEDP PSCSHVKTDAEED 
PN 


1969 


15870 


A 


1983 


3 


399 


YSKLSFKGTLTKFRRIYSSSFYKEFQGC 
ISDLCHP/R/TLTNCLRGELLKENLVWG 
AFGCHPHFPCYYKKSQERNUjQALRHPT 
TVAFGETGFNYSYKCTMPVPEHHKVFKR 
QLELAASLEK/PLVI LCQQADE 


1970 


15871 


A 


1984 


1 


405 


RRHIGGGVRLYYIGGEVFAKSLSDSAIF 
AQTPNCNQRYGWHPGTVCKI ppgcnlki 
FNNQEYADLLDQSVNQGLEADY/QLTRM 
CTI LMSLLKGWGAE YRRQTATRTPCW I E 
LHLNGPLQRVDKVLTQMGYPS I LM 


1971 


15872 


A 


1985 


27 


452 


QGREHAQGGQSPGAGHLGPTPEPQPEPQ 
PRPS S QAVPAGRWE PAQE / PTRHPHPRL 
SPASRPLGSPPAPLVRSSPGRCRLHEHT 
IWSSTVGTSVEPAPSLGRPQAPLEPGTV 
TSS VRLQQ PHMHTPGKIMFDPSKRNGKF 
TFT 


1972 


15873 


A 


1986 


414 


220 


GAEQEELLSP/GSGGCSELRSCHCTPAW 
ATRAKLRLTKQNKTKRESYRQGNQNWGR 
ILGELLGRL 


1973 


15874 . 


A 


1987 


52 


412 


TRERKLFFACDNVW KHLKR YLRKNS FGE 
NLWSS RNI KKKKKKKKKKKKKKTXNFS T 
P KNF F FFKACQCMGKKNTKHD FHEQNF I 
EHALLQQLNINFSTNYIEGRVF\HPGAP 
IESLLLMCH 


1974 


15875 


A 


1988 


2 ' 


143 


EKRRGLD KRTP /AQAAFEKMQEKRQME R 
ILKKASKI PPPFVCMWSVDS 


1975 


15876 


A 


1989 


3 


163 


TEFQPSEK/WGEDLGDNTWEYIFAIDLL 
CCHQKWI CHPLFLVG WRAGAEVSGVF 


1976 


15877 


A 


1990 


1 


439 


DKTAAEDAIRNLHHYKLHGVNINEKAST 
NKSKTSTKIiHVGNISPTCTNKELRAKFE 
E YGPG\ I ECDILKDYAFVHMERAEDAEE 
AIKGLDNTEFQGKRMHEQLSTSRLRTAP 
GMGDQNGCYTGGKDGHWSKECS IDRSGR 
VADLTEQ 


1977 


15878 


A 


1991 


1 


145 


WAASKAMK/ MGDWKTCHSFI INEKMNG 
KVWDLFPEADPVLLKRLRESR 


1978 


15879 


A 


1992 


2 


425 


N I STLKKTLESDCT \ KLFSQG I GGEQAQ 
AKVDRCLSDLAAETNKFRDLLQEGLTEL 
NSTAIKPQVQPWINSFFSVSHNIVEEEF 
NDYEANDPWVQQLILNLEQQMAEFKASL 
SPVIYDSLTGLMTSLDAVELEKWLKST 
FN 


1979 


15880 


A 


1993 


3 


449 


VAGPAPGAGARFGLDLQ FLQRFLQ I LKV 
LFPSWS S QNALMFLTLLCLTLL \ LKS FD 
QFTCNLLYVSWRKDLTSHLHRLYFRGRA 
YYTLNVLRDDIDNPDQRI SQDLERFCRQ 
LSSMAS KLI I SPFTLVYYTYQCFLSTGL 
LGPVSIFGY 


1980 


15SSI 


A 


1995 


1 


410 


SSRRPFIALKTKSMRBLNPEDIDQLITI 
SGMVIRTYQLIPEMQEAFFQCQVCAHTT 
RVEMDRGR I AEP R VCGRCHTTH S MVL IH 
NRYLFSDKQMIQLPESPEDMPAGQTPHT 
VI LVAHNDLDDR/ VQPVDRVNDSGFF 


1981 


15882 


A 


1996 


1 


154 


LFFFRLLTO YTKKVAQ VS T PTLG KVS 



665 



WO 01/64835 



PCT/US01/04927 



SEQ ID 


SEQ ID 


M 


SEQ ID 


Predicted 


Predict- 


Amino acid sequence (A-Alnnine 


NO: of 


NO: of 


eth 


NO: in 


beginning 


ed end 


C=Cysteine, D=Aspartic Acid, 


nucleotide 


peptide 


od 


USSN 


nucleotide 


nucle- 


E=Glutamic Acid, F=Pheny!alnnine, 


sequence 


sequence 




09/515,1 
26 


location 
correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


otide 
locution 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


G=Glycine, H=Histidine, l=Iso leucine, 
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IiGKVGKKSWKHPEAKXKRPCGKN. 


1982 


15883 


A 


1997 


2 


475 


KKLRRQTRREAQKE \LPDKV \R WGLVTP 
SETPVGISNLSPFLGPRCL/QNPAELKP 
PWRFPWAQRPKAH\ERANAGPXTPAEQR 
KVKKIKKLKEDISQGVHISVYRVRNLSN 
PAKKFKI EANAGQL YLTG WVLHKDVNV 
VWEGGPNAPKTITRLMPLRI 


1983 


15884 


A 


1998 


3 


431 


QLRTRDRGWPSRRPEREKRTSQSARRPT 
CTES R WKS EEE VES DDE YLAL PARL TQ V 
S SLVSYLGS ISTLVTLPTGD IKGQ/SPL 
EVSDSDGPASFPSSSSQSQLPPGAALQG 
SGDPEGQNPCFLRSFVRAHDSAGESSLG 
SSQA 


1984 


15885 


A 


1999 


1 


400 


ALDLRGLQILVGFPKRRVTTCSYPTALQ 
S P I E YQRKERSTAVMRTE PDS AYQAS PR 
PYSAGPADSKKPTKGYCYNPTLP\RLEI 
MTLEGTTG 


1985 


15886 


A 


2000 


1 


372 


QN I DLVI S FFS SRLLQAGAELS VERVLE 
IIKQGWALPKDRL/RGSCAPTLSAAGR 
SSGGQS PCMPGLCVCSFWVLTVS WLVCQ 
KFPELKFKYVEEEQPEEFF I PYVWSLVY 
NSAVGLY WN PQDI 


1986 


15887 


A 


2001 


393 


1 


GGIGRGGGAGGGVGAAGSASGGVGRRGA 
GGVI ADSGAPGGGVEGGVGASGGWRE /G 
RGTSGGVGGSGGACGSV/GGSGGAGGGV 
GACGSTSDGVGRSRGTIGGLGGSGSAGG 
GVGACGGASGYVGIRGAGGG 


1987 


15888 


A 


2002 


2 


362 


WVTFISLIiFLFSSAYSPGVFRRDAHNSE 
VAHRLKDLGEENDKALLLIAFAQYLQQC 
PFEDHVKLIiNEVTEFAKTCVADSSAEDC 
DK\SL\HTLFGDKLCTVATLRETYGEMA 
DCCAQQEPE 


1988 


15889 


A 


2003 


2 


358 


EANRGWFIRLKEGSQLYNIKVQGEAASA 
DVEAAASYPEDLAKITDEGGCAKQQIFN 
VDK \ QTAFS W MKR P CRTL I AREE KS V PG 
FKL/SKNRLALLLGANAAGGFKLKSVLI 
CHSENSRTF 


1989 


15890 


A 


2004 


190 


1 


DQTCFLSFTVKAVTFNGWVWWLTPVI PA 
LWDYRH\RPANFFVFLVETGFHHVAQAG 
LKLLGS 


1990 


15891 


A 


2005 


1 


132 


GMCHHAQLIFVF/CSRDRVLHGCSQTPC 
LKQFSCLGLPKCWDYR 


1991 


15892 


A 


2006 


2 


134 


PMTFFTELEKTTLKFIWNQRRIQIAKAI 
LTNQK\NKARGITNIC 


1992 


15893 


A 


2007 


315 


127 


SEIAFFFCSFLKIILDTR/FSFFARAGIi 
KLLASNDLPSSSSQGAGITGVSYGTQPV 
CFEYNVG 


1993 


15894 


A 


2008 


3 


325 


RCSMLAVREM/QKATKRCHFLPTRIALI 
K/ND/GSNKCWK\HCWWECKMVQLLWKI 
VWQFLKKLNIELPFDPEIPLRDIYPKQL 
KTYVHTKTCX3QMF I AVLF I LNVPHLMNV 


1994 


15895 


A 


2009 


369 


3 


VGQAGLEFLTSGDPPASASQSAVI TGMS 
HHTQPIFCIFGFA\GCPDWSQTPELKQS 
AHLSLPSSWDYRCMPPHLANFYFCRH/R 
VCCPGWSQTPGLK\YPPALVSQSVATTG 
TSPRAWQDTILPV 


1995 


15896 


A 


2010 


114 


287 


APLCLCLRHLRLLI KRLTVLGTEAHTL \ 
NPITTRGRGGQITWGREFETSIjANMVKP 
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E*=Glutamic Acid, F=Phenyla lan ine, 
G=Glycine, H=Histidine, I=lso leucine, 
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nucleotide deletion, \=possible 
nucleotide insertion 














CF 


1996 


15897 


A 


2011 


218 


346 


IMKFNLLKFYFIYLFSRQSL/SSVAQAG 
VHWRNLGSLQPLPPKF 


1997 


15898 


A 


2012 


310 


1 


SSSSSNTHFGIPKYLINPDTCFLA/KVN 
NSSLTGLEYTETLKPGIK 


1998 


15899 


A 


2013 


3 


301 


SDCCASNQRDSGGVGPSEPARKHTLCV/ 
CNSLDLIE " 


1999 


15900 


A 


2014 


332 


3 


LRDPLEEAVCPFSDLQLHAGRTTALFKA 
VRQGHLS LQRLLLS F\VCLCPAPRGGAY 
RGRQAG\SLSCGGLHPVRASRLLCLPKQ 
AWAMAGAPPPAWPRPCSLISDCCASNQR 


2000 


15901 


A 


2015 


2 


325 


LLLHHAPPVNLFLRDRVSLCGPLCGPGC 
SQTPGLKQSSCLSLPKC/WDYR\RATAP 
GL 


2001 


15902 


A 


2016 


373 


3 


WAHGLQPGKPKVDSLSKKRKKKKGKKLN 
TFPLR SRTRQACLLS PLLSNI VLEVLPN 
EIRQ/QKKEIKGTHTGREELTLSLFTDN 
MI I YVDI PKQSTKKNQGSYS VARPGAVA 
HACNPSTLGGLGG 


2002 


15903 


A 


2017 


343 


1 


EFPFVSGSRATGKSSDIRATKYIWRVLE 
YLRAW PRGQRRLKS S \ HTS LLGS YHPGA 
FRGDKWSCCHQKDETGGRGBDEVLLCCP 
GVLGCSALCRSGVRTKFGIRGRPWKERE 
R 


2003 


15904 


A 


2018 


139 


2 


NS I S TKNTKNWSGMVAHAFNPS TLRGRG 
GQI \RGQDFKTSLANMVKP 


2004 


15905 


A 


2019 


3 


322 


ARELVFFFGAYRKGFFHRDMKPKNLLCM 
GPKLGKIADFGLARELRSHPPYTDYGST 
KWR/YRGNPALLRPTPMAKFPYEGWLKI 
SGSSIWNIKTSDSNFIFFKESKG 


2005 


15906 


A 


2020 


205 


377 


NIVENIVFCWPGVCFLQTCTVCINPETS 
DE/WPGAVAHACNPSTLGGQDGQITRSG 
DRE 


2006 


15907 


A 


2021 


3 


324 


KKWGKR \LNRAPNGRR YPETRWALEEDQ 
CHICKELHIKTVRFHCTPIRMAKIHTTD 
NPQSWPRCGTGTLIHCRRGCKTV/R/PL 
WKTVRQ/FL/RKLNIPLPCDPAVLSLCI 
Y 


2007 


15908 


A 


2022 


2 


382 


RVSQDGLNLLTSRSTRLGLPK/ CWDYRC 
EPPSP 


2008 


15909 


A 


2023 


1 


421 


RWN PGGRG CS ELRLH Y C TP TWVTE RDS I 
SKIAKNK/NNKRPRNNCR 


2009 


15910 


A 


2024 


339 


3 


S WDHRRVL I FVFLVQTG FCHVGHAGLEL 
LTSGSQSAGIAGVSHRA\GQKHQFRPEH 
RFLKFGFVFRDRVSLCCPGWPQTPRLKQ 
SSRLSLPKSWGPATALDPDSSYLEEMVL 
S 


2010 


15911 


A 


2025 


2 


146 


NTFGRSRQEDHLGPGGQACSELRSHHCT 
PAW\VIEQDPVSKKKKPPKP 


2011 


15912 


A 


2026 


3 


380 


RLECSGGISIHCNIiR/LPGFKRFSCIiSL 
PSRWDYTRLP/PFVFLVETGFHHLGQAG 
LELLTSGDPKCWDYGC\DHCTWP 


2012 


15913 


A 


2027 


32 


296 


DYMNSLMYFHSLVLASTDEGFLPKTVST 
QSAGITGISHCARPWIPF/CFFFFKNRK 
TRFVAQAEGQGGNFGSLXPLPPGFRGFP 
CLSLT 


2"013 


15914 


A 


2028 


2 


187 


FTLLPRLECSG\MILVHCSLNLPGLRWS 
SCUSLLSSWInYTCVAPFSIFYYFIFLTW 
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T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop cocJon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














GGTLNC 


2014 


15915 


A 


2029 


197 


3 


AAGLSQAGMQWGDFGSREHLPSRFQRFF 
CLKVPNNWDYRHGPPRPVGI FFFSF\LV 
ETGFHHPGQ 


2015 


15916 


A 


2030 


88 


362 


KFGINLKKKEKKGPQKNQNFFPTFLFFF 
FFETESHS I TQAGVQWRDLS PLQPLPPG 
FKSLPSSWDYRCLPPCPANFCIF\IPDS 
VSLCWQGW 


2016 


15917 


A 


2031 


7 


131 


GTIMAWAPGEGGCSEPRSCHCTPAWVT\ 
SKTPSQKKKKGVLL 


2017 


15918 


A 


2032 


1 


343 


LECSEPRLCHCTPAWV/ TGDSVSKKKKK 
KKKK 


2018 


15919 


A 


2033 


243 


14 


QKLAPWPPQSAGNNRRVPPHPASMAFLT 
KIEKTALKFIQNHKRPQIAK\TILSKNR 
VGS I TLFNS KIHYMTTVT KT I 


2019 


15920 


A 


2034 


238 


3 


FLAL P KVLG 1 1 G CV P PHPAS MAFLTTI E 
DTALKFIQNHKRPQIA\KPILSKNRAES 
ITLFNSKIHYMPTVIKPLRVTST 


2020 


15921 


A 


2035 


345 


2 


LVKI QKEMNVFMTANP I S I PQFMDQGVI 
SSCKS YYFRNKFLYV\ IAAMDTD/SSNG 
SWQSEWKTFWKGFI IliDGIKNTDDSWEE 
VKISTLTGIKKKLIPTSTDDFKEFKTLV 
EDVT 


2021 


15922 


A 


2036 


1 


140 


GRCC \HELRSRHCTPAWATRAKLKKKKK 
KKREKKTQ KG KNGLGFWAF 


2022 


15923 


A 


2037 


199 


348 


RSSNEGGRDCVFCFGDRVSLCLPGWSAV 
AQLWLTATSTSQ\VKSLSHLGL 


2023 


15924 


A 


2038 


3 


193 


NGLNAPI KRLRLAWW I KSQDPS VCCIQE 
AHLTCRDTH/RC YLKG/WYKAF 


2024 


15925 


A 


2039 


116 


337 


SKLLKILPRLCWGWWQAPVIPATQEVEA 
EDHLNPG /RSRPAWATQRTPVS I KKKI I 
LRPANGK/ CHGPSW 


2025 


15926 


A 


2040 


368 


252 


VFFFFALFYLFFFFFFFFFFFFFFFFFF 
QFLGFYFRFKF 


2026 


15927 


A 


2041 


204 


399 


VSSHKINGLTVCSTSPFFLSLLPPSEES 
ACFPFAFCHDCKFPEAS/SVMLPVKPVE 
L 


2027 


15928 


A 


2042 


103 


356 


WHFSPPQPLPPP/PLPNPPPPPTPPPP\ 
PPPPPPPPPPSPPPPPPPPSPPPPPPPL 
PPPPPPLSPPPPPSTPP/PPPPPPPPPP 
PLPP 


2028 


15929 


A 


2043 


3 


344 


LYKWNNKASLAAHLFAAWFTVYFKPTVE 
TYC/SGKKKIPFKILLLIDNTPCYPRAL 
LEMCEEIS I VFAPATTTSS / LKPMDQGV 
IVTFKSNYLRNTFQAGGGEKKEKHERKK 
NIIS 


2029 


15930 


A 


2044 


2 | 


349 


PRVRKS PGPNGFTANF YQTFKEL I S I LL 
KLFQKKKKIKKGENPPNSFYGAI IPRIP 
NPNMDLSKK/ETYGPVSGRNMBAKIFTK 
FLAGHFKQS FGRE I HHDQREFI PG I QGG 
FNIGN 


2030 


15931 


A 


2045 


280 


462 


CXFFLWVLVWCWLLFXVWLWLCFFG 
FWCCWFCXCGVWFFVFFWCVLCGVL 
LGCWC 


2031 


15932 


A 


2046 


3 


284 


PSPSFSLLLPPSFSLLFPPSF/SPPPPS 
FSLLLPPSASLLLPPCTSLLIHPPTSLQ 
LPPLPSFYLLLPTSISHHLPTHNLPTTS 
IQDPSTPCSIK 
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nucleotide deletions-possible 
nucleotide insertion 


2032 


15933 


A 


2047 


95 


1 


GRVDAIIAHCKLR/LPGFTLFSCLSLPS 
TWDC 


2033 


15934 


A 


2048 


211 


1 


EPTTMTGAKKGSSQTTTOYHMLQSLRIN 
VRVDFFFFFCTDR/ SLSMLPRLVLNSWI 
PAIILPHPFQVLGLQ 


2034 


15935 


A 


2049 


35 


266 


EVRRPS TWQPPRLRSEEPLRL \QPPRLG 
NIGFCHVGHAGLKLLTSSDIjPTLASQSA 
GIT\GHSYRRASEEDKKESSMSS 


2035 


15936 


A 


2050 


281 


3 


QSNFKRALLSILLKAIYRSNI IPIK/VP 
MTFFTE I/EKVI PKFIRNLKRLRIAKVT 
LSKKNKTRRITLSDFKLYYY\ IAVFVAA 
WYWKKNRLSDQWN 


2036 


15937 


A 


2051 


251 


12 


ILRFFLPKFLGFQKPPSLEKTHFFLPAF 
GNSLFWPPEKLGQKKVFFFFFFFFLRQG 
L/NSVSQAGVQWHNLGLLQSPLPRLR 


2037 


15938 


A 


2052 


2 


325 


ADHLRPGVQDQPQPGQNGKTPSPLKIEK 
\IAGCGGGHPRLREENCLNPGGRGCSEP 
RSRRCT PAW / VNDS KTLARKKKKKKGVE 
KNECGRKVMRV/LQGPKAKVKPWGENL 


2038 


15939 


A 


2053 


3 


166 


SLLLPRVECNGAI SAHCNLHL \HNLHLP 
GSSDSPASASQVAEVRGSLEPRSSSLA 


2039 


15940 


A 


2054 


227 


3 


LNENIGRITGMSHHARLILLFCE/YRVS 
ITQAGV/QHDPGSLQPLPRVFKQFSHFS 
LQS S WDHRCAP I RIiAI FWVFC 


2040 


15941 


A 


2055 


301 


0 


QRKSHMFLTINQKLEMIKLSEEGMSRVE 
TGQKLDLMC/QVSQAVNAKEKFLEKIKG 
DTSVHTQMIRKQSSINVDMEKVGIVWIE 
DQAP /HNHI PLSHLLMRAR 


2041 


15942 


A 


2056 


1 


109 


RP LRRLRQENRLNRGSRGY SEP KLC / HL 
CTPAWAT 


2042 


15943 


A 


2057 


313 


250 


PQPPPPSPPPQPSPPPPSPPPPSPSSP/ 
PPP 


2043 


15944 


A 


2058 


134 


2 


EKESRSVA\RLKGSGAISA\HCNL\CLP 
GSSNSPPSASRVAGSSGA 


2044 


15945 


A 


2059 


120 


326 


NSLVADME/KVLWWIEDQTSHNIPLSQ 
S/LIQSKALTLFNSIKAKRSEEAAEKIF 
EASRDWFMRFKIKQK 


2045 


15946 


A 


2060 


102 


413 


ERTGFRHVGKSGLEFMTSGDPPTLASQS 
VG2 T/VHEPRTRPG 


2046 


15947 


A 


2061 


311 


1 


FFKKQRFCKSGCKTPGKPRGPKKLGANQ 
FSGPTPLKNGVGFSPGPKGGFFPPPPGG 
FPAGR PKWLTRI LGKGGS LLRG / RYPKK 
GFLKP I LGDNS PQRAPKRG P 


2047 • 


15948 | 


A 


2062 


229 


380 


WYDLGSLQPMPLRFKQASRVNLPRSWDY 
RHPPLSRLN\IVFIVDTGILHVG 


2048 


15949 


A 


2063 


142 


383 


PQSCFSTHWQLLQKQEETAGAVSVCVCT 
S \VCVCVCVCVCVCAGAMCVCAGA/CFC 
VCVCAGA/ CLCVCVGA/CLCVC 


2049 


15950 


A 


2064 


408 


200 


NLIQIKALTHFSSIKAERGDKSTE/EKF 
EGSRGWFVRFKERGHLCNIKVKHEAANA 
YAEAACSQLSRRSS 


2050 


15951 


A 


2065 


348 


3 


WVSPYSPCVCVCVCVCVCVCVCRDWVSP 
CCPGVCMCVCVCVCRDW/CFTM/CAQVC 
VCVCRDWVSPCCPGVCVCKDWVSPCCTS 
WSILKLLSSGNPPTTVSQSAWITGRSHS 
AW PARA 


2051 


15952 | 


A 


2066 


73 


286 


NLIRGLLESHI LI SMRYGYCKS YTLSMR 
DIPEALNKWKS / I PMFCSWS RRL I VSMA 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

ocl 


SEQ ID 
NO: in 
USSN 
097515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
n ucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, LHLeucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unkno\vn, *=Stop codon,/=possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














VLSKLINRFNAKDVQH 


2052 


15953 


A 


2067 


329 


0 


PSSTFLLPFPRPPPSPPPPPPPPPPSPP 
/PSSSSPAPSSPHP 


2053 


15954 


A 


2068 


221 


3 


ELKVILHCLRDFSLQSSIMKVLILKILI 
LSVCCVC/VCVCVCAVCSCVC/VCSCVC 
/VCS CVCLCVLC VCVCVLS C 


2054 


15955 


A 


2069 


I 


167 


GTRENPLNPGGGGCSEPR \ SCHCTPAWA 
TKSETLSQKKKKKKKISNYKTPFKSYRI 


2055 


15956 


A 


2070 


2 


354 


ARACLGLPSSWDF/ SVENRFHRVGQAGL 
ELATSGDPPTSASPECWDCRH\DHHTWP 
LL 


2056 


15957 


A 


2071 


20 


341 


CIVTVNTRGENINICWSPDGQTIAVGNK 
DDWTFIDAKTHRF/LKQNSSSS/SEVN 
EISWNNDNNMLYPDTGNGCINILSYPIT 
ESRAI YQRP I FHVHVNQDL PHGDVLS 


2057 


15958 


A 


2072 


353 


58 


LQLLTTSDP PAS ASRGAG I ADGVWFTQ / 
SLNGAQAGVQWRDLGSLQPHPPSRL/LIi 
ASQSAEIAASARPPPRLGSEERLCLAAH 
RLGCEEPLCLAAQSGK 


2058 


15959 


A 


2073 


1 


338 


GSRLQRVCINYRLSFFPFLSQGWINFTW 
LFCLCVCFLRDRVSLCCPGWPSTSGFKR 
SSCPSLLSRWDYRHMPQHLASHTLFKKL 
/TILPR 


2059 


15960 


A 


2074 


1 


322 


GGGREAGEARGGGEGGQGSGRRRRGRGG 
PRTGGAEGGRGAGETPGGGAR PEREQGR 
GRHS ERQG PT / RQTKR P KT KTKQLS QN K 
NT 


2060 


15961 


A 


2075 


450 


225 


T P VGRGCSELRS CNCT PAW VTD \ E TLSQ 
KE KRGGVK I GWKKRRTR I I SLHLPGSHE 
KFNNLETVKNCVNHFCRLNT 


2061 


15962 


A 


2076 


2 


470 


TPQNKPHPTTKNTQPQPTQKK/ PTQTQP 
/TTPKPKTKKWTTPNPAPPNTQKNKTTK 
PTTQNPHKTKQKNKTKPNTTQQQNKPTN 
QNPKTQTQQKTTTQTKPPKKKKKKTADT 
TS PNPI STKKI KKIAEP 


2062 


15963 


A 


2077 


304 


1 


NSATPPCSPTA/KPHPTPPTPPPTSFIP 
TCQHSPPTKICPQARPRTPPYAHPPRCP 
RKI I PKGRRHPitAPPQAARRDLNYYHPI 
IWRGRVCVCVCVCVCARA 


2063 


15964 


A 


2078 


167 


1 


TI LQTNSTWSNVLLWQGAVAQAC \NSST 
LGGQGGRITRSGDRDHPGQHGETLSRA 


2064 


15965 


A 


2079 


3 


364 


HETRSRHSCAWCSAALWRAAVASRCPSS 
/IPVTPPQCLYWP/WKVPLQCPPDL 


2065 


15966 


A 


2080 


303 


2 


S KRGR PSGHECP FLGTS S S CRH VAS CT I • 
RTPRRLCKSQLDRCSPLKERHKFGLSRS 
VLSAMTQSG I Y/ WQPPPPEFKRCFS CLN 
LLSSWDYRHAPPRRARA 


2066 


15967 


A 


2081 


4 


326 


ag i tgmshrawlflyflnkfaft \ yglv 
lnfflhkiqepslg\sgsgp/lscns 


2067 


15968 


A 


2082 


1 


343 


PPPPPPPPPPPPPPPPPPPPPPPPPPPP 
PPPPPPPPPPPPPPPPPPPPPPPPPPPP 
PPPPPPPPPPPPPSPPPPPPPPPPP/PP 
PSSPSSPPPPPHTPHP 


2068 


15969 


A 


2083 


330 


88 


ACDRSIISRIHKEL\EQIiCKQK\SNNPI 
KKWPKDMI RYFSKQDVQTVKTMNKCSTS 
LI I IEMQI RSTMRYYLTTVRTPHPS 


2069 


15970 


A 


2084 


1 


340 


RVRSHGTTHLAQLIFVLLVQTGFHHDGQ 
DGPDLL/NLVIRPPQPPKvL 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=A)anine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Pheny)aJanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X—Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 


2070 


15971 


A 


2085 


2 


348 


ARALSLGANAAGNFKLRPLLRYHSQNPR 
ALKNYAKYTXPVLHRWHSESLETAFTEC 
FTEYFKPTVETYCSEKD/l/SYKALLLI 
EKALGHPRALMEMYKDIS IVFMPVNTTS 
ILQPMD 


2071 


15972 


A 


2086 


369 


2 


KGLP PS PANLFFLS VY/MG FPMFTRL I L 
ISCPCYPPPSASPSAGIKGLNPPGWPPF 
S FFYQKFVRFFVLARNRTFGFVDYFS IL 
YLIYGCSNLYRFLLSALYFFVSVFVFVIj 
TQG I EAMVPRA 


2072 


15973 


A 


2087 


314 


3 


LLESVGPTRNSRPFKGLIGRILLDPEFY 
PSLVSDDLPASAS PNAGR I LPFFFFETG 
SHS \ TQAGVQWRHHGSLQ\ LKRSS YLSL 
TSSWVYRHTPPHPANICIFSRA 


2073 


15974 


A 


2088 


330 


16 


CPCFFLSALiSVLVGWCFAFWCGCV/WW 
CVCFVWFVCVCFLGCWLCLCFV/VCL 
VGVCCWFFGGLCCVCC 


2074 


15975 


A 


2089 


1 


337 


GTRTFLPPSYKDPCEY/IWAHPDNPGSS 
SNCNMLNFSTSD/PPVHSSGNWKVLSSP 
NRPYYYSYTATPHTDPTPKLPSPNPSSP 
SPSYPLLSDSTICQTTPTTTPITSSHTL 
LTS 


2075 


15976 


A 


2090 


350 


84 


QRKENKRTRKRATERRDESREEKAGRKE 
GENQEKRNKEQQGRQRRSRDRT/EEEKE 
EAKRREHKNPKKKKTKPPPQKKKKTEKK 
KTDNLSN 


2076 


15977 


A 


2091 


1 


355 


SDP PTSASQSAG I T YVSHRT WPLLEFSG 
TS I RIAG KP AGVL VE VTGK/ S VCGGGVT 
KTH WNECHTG YP KCCWSSQAGES SLQP P 
PPGFKRFSCLSLPSSWDYRLLPP\QNFC 
IFSRDGL 


2077 


15978 


A 


2092 


27 


345 


AS I PCLKKRKKKKKXTGGKP PP \GGPKP 
KRGGGGGP PPTKRTLFGPPQGNQTPGGV 
S/GPPKGDPPFSPIPPCQRKSPPFEKGG 
KKGPPPWRGVKREKGHFLKNFQQK 


2078 


15979 


A 


2093 


3 


301 


HEHVAQAGLKLLGSSDLPTLAS PKCWDC 
KR\DYCAWPHIFPISGPLYVFFPLPRMS 
PHTHTHKYTRTFFHYAHKHCACTLNLLL 
RAQFQCHFFKGWPDP 


2079 


15980 


A 


2094 


76 


342 


WFXXFFFLLFVFFVFWFVCFCVFCGFFF 
FCFFCCVW F VFG FCLFLCFLFW FC FFVF 
FGFFGFWFFVFFLVCFLVFVCFFGCFFF 
LFCFF 


2080 


15981 


A 


2095 


115 


325 


MDERKKIRGGGRQGKECKIHCKKKLSPG 
IRSYPVEN/F/VDTMYDYLQPAYYKLND 
LTNADPCAVRYLLFDQN j 


2081 


15982 


A 


2096 


80 


227 


SCLGN/ CIHLYSHSPTLSFTHTHTHTHT 
HRERERERERERE I CI CMS VYA 


2082 


15983 


A 


2097 


308 


1 


NSTVTMENSVNIHYRTRVFTEAQFTIAK 
SWNQPKCPSILEWIKKL\WIYIYVCVYI 
CVCVCI CVOTCVCVCVCVCI YI YMMEYY 
SAI KRNELAAFAVTWTRA 


2083 


15984 


A 


2098 


2 


361 


ARACLGLPSSWDF/ S VETRFHRVGQAGL 
ELATSRDPPTSASPECRDCRH\DHHTWP 
LL 


2084 


15985 


A 


2099 


1 


221 


LLWRLRHENHIiNLGGRGCSE PRLHHCTP 
SWMTR/GKTPSQKKKTKQPMEWQNLCLK 
KVFGKTGVRSI FKG I KLS 
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SEQ JD 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, | 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S^Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possibie 
nucleotide deletion, \=possible 
nucleotide insertion 


2085 


15986 


A 


2100 


186 


2 


ERVYVCVCVCVCVCVYQCYLPQI RSb/ V 
ASSLKANARVCVCVCVCVCVCVCVCVCV 
SLWHPLTA 


2086 


15987 


A 


2101 


305 


2 


HNILLSQSLIQSKALTLFNSMKA/E/RG 
EEAAEEKLEASRGWLMRFKE\RSHLCN\ 
TKVQGEVASADTEAAPSYTEDHSKITDE 
GGYTKQQ I FNVDKKAS YWNMS 


2087 


15988 


A 


2102 


6 


166 


EQTALEILARAIRQEKEIKGIQIRKEEV 
KLCFFVND\MVLYLENSKDFLKVDAA 


2088 


15989 


A 


2103 


53 


176 


E I KNNRPGMVAHAY/ NPSTLGGRGRQ I S 
WGQEFETSLVNMVK 


2089 


15990 


A 


2104 


3 


267 


FRHVGQGGLKVLTSGDPPALVSQSAGIT 
GVSHCAQPIVGDFNTPLSIL/D/RSTRQ 
KINKD I QDLNSALDEADLLD I YRTLHPK 
/STEYTFF 


2090 


15991 


A 


2105 


268 


3 


PPKEHGSSPATEQSWMENDFDELREEGF 
RRSNYSE/LREDIQTKGKEVENFEKNLE 
ECITRITNTEKCLKELMELKI KARELRP 
ECRSLR 


2091 


15992 


A 


2106 


3 


170 


GFHHVDQAGLELLTPQVIHP \LGLPKCW 
DYRREPPCIiASPHFHQIAISQKRHREAK 


2092 


15993 


A 


2107 


1 


398 


SARGPDGFTAEFYQTFKEELVQILLKQF 
QRIKGEILL/KNHYVKPSITLIPK\PG\ 
RDITKKLLTRSFCVSLCPVLSPLQSLQS 
RPSSLSMISLHSVCFSVLSASAHPHVHL 
CPTCPTLVLSGSHCVCCVSLFFF 


2093 


15994 


A 


2108 


3 


370 


HENWNNKGWGDSTIYSMKYLYFKPMLRP 
Y/ C/SQKKI PFKI LLLFDNAPGH PRVLM 
E I MYKDMEVF\MPVNTTF/ ILQPMDQRV 
ILTFKSYYLRNTFHKTIAAINSDYSDGS 
GQSQLKTFWKGFIVL 


2094 


15995 


A 


2109 


■ 


213 


HFPVENESAPG/ FKAAGDLLTLLLGGNA 
AGDFKLKALLVYPSENPCFLKGSFKPNL 
PLVWCSHKKAWVQLG 


2095 


15996 


A 


2110 


391 


3 


KKKKKNPHRKKIKKRWKKIFNAERNKKR 
AGVIIHILDRLDFKQQTIRRDKR\GYYT 
MINGTIQQEVITILNIHAANTKAVRYIK 
QVLLKLKTELGPNTI I TGDTNTLLLSTL 
NRSSRQKNQQTLDLICNI 


2096 


15997 


A 


2111 


188 


365 


FQNTIHI CVCVCVCVCVCVCL/ CVCFCV 
FFFCLLCGGFMCGCWCDFCILFCFYGVG 
FFFL 


2097 


15998 


A 


2112 


3 


340 


RMESALDRLKQFTTWGDTGDFHAVDEY 
KPHDATTNPSLI LAVAQMPA YQELEEEA 
I AYGRKLGGSQEDQI INAIDKLSVLLGA 
EILKMITGRVSTEVDARLSFD/SDAMVA 
TA 


2098 


15999 


A 


2113 


11 


305 


FLFTDFCLFMTHILGHKINYITN/ CKRN 
VIIT/SYFSPHNRIKLKISIRKISRKSS 
NTWKLNNRLLHYPQ I KDEVSRE I RKYLE 
LNINENTNF/QNLWDIHK 


2099 


16000 


A 


2114 


3 


387 


QTNH/NIPLSQNLIQSKAITA/NCMKAE 
RSEEAAGKFEASRGWFMRFKKSSY\IKV 
QGEAASAGVEAVAIYPDLAELIDE/GCY 
TT/QIFSVFQTAFFWKKKPSRTFM/REE 
KliI PGLKASKDSSSLLLRVHAAGD 


2100 


16001 


A i 


2115 


152 


393 


VYCPICWLVFFFCFCSVLILFVMFV/CL 
CFS FFCFLGF VWFSS FYLFICVFFFI V 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
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26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=GIycine, H=Histidine, I=Isoleucine, 
K=Lysine, U=Leucine, M=Methionine, 
N=Asparagine, Ps=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LGCLFLCWFDLFLFVLFVLCLFFVS 


2101 


16002 


A 


2116 


2 


373 


ARACLGLPSSWDF/ SVENRFHRVGQAGG 
ELATSGDPPTSASPECWDCRH\DHHTWP 
LL 


2102 


16003 


A 


2117 


3 


449 


HEFDHAMLQAHRAHHLAIDAYHEFEETY 
I PKDQKYS FLHDSQTSFCFSDS I PTPFN 
MEETQQKSNLELLR I SLLL I ES WLEPVR 
VLRSMFANNLVYDTSDSDDYHFLKDLEE 
G I QTLMGRAEKRHCRTVQNLKQT YRR/ F 
DTNS/HNHDALL 


2103 


16004 


A 


2118 


146 


15 


FFFFYFFFLFFFFFI PFLFFFFFLFFFF 
FFYFIFILFIYSLLVF 


2104 


16005 


A 


2119 


406 


3 


LFSVNETGFYSKMLSRTFTATEETSIPG 
FKVSKDKLTL\LGANAAGDFKLKPMLIY 
HS /ENPRALKNYAKFTPPVLYKWKNKAIi 
MTTHLFTAR YTE YFKPTVETY / I LLLI D 
NAPTHQRALMEM YKE INVFMPANTTS I 


2105 


16006 


A 


2120 


109 


396 


YYFYFYLVLFYFFIFLFFYFFFFFILLL 
FYFFFSFIFFFFLFVYFIFLCYFFLFFF 
FFFFYLFFYSFFFFFIFLFFFFFFFFFF 
SYYLIFFLSYII 


2106 


16007 


A 


2121 


15 


413 


IVLARNTNFWLSFLFPVALGILIVLKGV 
KYI FW PLE YCQRLKMFVSYS FH Y\ FFLG 
SLLFLKYGFHMYLILLL/CIFIIIMCFF 
IECYSFFFCCLYHFFFSFYLFFLYFLIF\ 
CYLVILFFSFLLFLFLSSYCFFF 


2107 


16003 


A 


2122 


1306 


429 


SSSSSSSSHVLRI IKDEDFKILEQRQW 
LSEKEAQALCKEYENEDYFNKLIENMTS 
GPSLALVLLRDNGLQYWKQLLGPRTVEE 
AIEYFPESLCAQFAMDSLPVNQLYGSDS 
LETAEREIQHFFPLQSTLGLIKPHATSE 
/HKRGPSMVMILTKIVNAVAEWRRLMGPT 
DPEEAKLLSPDSIRAQFGISKLKNIVHG 
ASNAYEAKEWNRLFEDPEEN 


2108 


16009 


A 


2123 


3 


206 


LRRLRQENHLNWRGGGCSEPRSQHCAAA 
W / VSNSETPQKKKKKKEKKKKKKNLPNS 
ALKKTYSQRGKLF 


2109 


16010 


A 


2124 


23 


401 


IASGRPFFFFFFFFFFWPPPPGGYPHFS 
FLKKKKKRGGGGGKSLLPPGKGPNPPK\ 
WGFPLFPPPLFPQKTPPPPFFFYKTPPI 
PPQGPRGGPLKFTPPGGGGGNPPTILLD 
KKRGVLGPPPFFWTN 


2110 


16011 


A 


2125 . 


3 


439 


MFDVSLLTFTHSLFHFSPQFHRKCELST 
LCDGGELRDHI LLPTS I CP I TR/ DKCS C 
PGEGC 


2111 


16012 


A 


2126 


1 


213 


HFPVENESAPG/ FKAAGDLLTLLLGGNA 
AGDFKLKALLVYPSENPCFLKGSFKPNL 
PLVWCSHKKAWVQLG 


2112 


16013 


A 


2127 


104 


419 


NSFFFEEELYNPFGGLGKKTFFGGGEEF 
GHTPPENEALGGKNKFFTGEGGQTFSNN 
GEEKSVSFWISIEKILHRALL/AHALCK 
NCWELNFGQKEEPFFPPPEEF 


2113 


16014 


A 


2128 


399 


140 


PPPPKNFFFPPKGKFFG/RG/VGPKFPP 
PKKRVFSQKPPRGFFYPPIoKKKKNPFPP 
PGNPGPPRGFFKRPPPFFFFFFFFFFFF 
FFFFF 


2114 


16015 


A 


2129 


10 


457 


KTS WTWCPVP WPATQKAE/AGGS PEPG 
RSRLL 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A^AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline T 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V= Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possib!e 
nucleotide deletion, V=possible 
nucleotide insertion 


2115 


16016 


A 


2130 


409 


0 


PPPPSPPSPPPPPPS/PSPSPPPPPPSP 
PSSPPPSPPPPSPPPP/PPPPPPSPPPS 
PPPSPSPPPPPSSPPPSPPPSPPPSSSS 
SSPPPL 


2116 


16017 


A 


2131 


317 


177 


FFFFFFFFFFFFFFFFFFFFLRLQI FFF 
FLP I LIFLFLFFLFFFLFF 


2117 


16018 


A 


2132 


3 


542 


EPWSVAQAGVQWRDLSPLQPIjPPRFERF 
SRLSLPSSWDYRRLPAKPANF\QFIiVET 
VFRHVGQAGLELLTSGHLPALTSQS AG I 
TGMSRTS\RPGFLFKV 


2118 


16019 


A 


2133 


146 


439 • 


LKSVTS IAKTW I QPK/ CPTNSEVDKCPS 
TVKWI KKMWY I / YFTME YYAAI K 


2119 


16020 


A 


2134 


95 


1 


GRVDAILAKCKbR/LPGFTLFSCLSLPS 
SWDC 


2120 


16021 


A 


2135 


2 


380 


TKTHHTH/ THTHTHTHTHTSLS 


2121 


16022 


A 


2136 


4 


337 


KRNNKAWMTVHLFTAWFPEYFRPTVETY 
CSEKKIPFKIL/LLVDNAPGQPRVLVEM 
HKEMNWFRPANTASILQPMNQGGISTF 
NSYYLRNTFHKAIVAIDSNSSDGFGQNK 


2122 


16023 


A 


2137 


2 


356 


PVS S S QVRAS VYLKKKKKKG PPPEGITN 
TAGEKPPQSFRGKG\PPFPLISPKKEPV 
I \SFLKNFGPCTIGKKKPPHPPAKNGGP 
LQ/RPPTQGGGRGKKKGKNNQRPLSGLG 
GNRSPKPPF 


2123 


16024 


A 


2138 


2 


357 


FLGS SDPPPSAS P VGRATG / RVFFFFFL 
VEWSHYVAQNGLELLDTSNPPAVASQS 
VRITCVSHRTWLLSPLYKIIQVCVSK/S 
PELEQSEDKSLKK 


2124 


16025 


A 


2139 


379 


16 


LLQVRCFVSTVNRGSSCQKT I QVY YVQE 
AIPPSFLLSPFLT^I/PYTKINSRWIKDSN 
VKPKTI K\TLEENLGMPLNI FFTYQLLW 
LYLHPESQLE I CNS FRALQEGNLI IFFI ' 
GRVGRPGTTGL 


2125 


16026 


A 


2140 


73 


411 


NY LLNNLFF F F FLE RGLT FA PRAGGWGG 
NLTSWNLGPPGPNKPPPFPPKRP\GTPK 
PTLKEGLFGFFKTTGFPPGAQKGPELPG 
LRGPPGLAPPRGGNKGGNPPPGPLKTFN 
G 


2126 


16027 


A 


2141 


46 


421 


AGVS WRDHS S LQ PCLTSRARAV\ SHLS F . 
LSSRDYGPMPPPPKKKKKKKKKKKKKKK 
KKKKKKKKKKKKKKKKKKQ 


2127 


16028 


A 


2142 


26 


479 


LAYLLPSPVMYKPQNWRTILQSISNKGL 
I CR I YKETLQLNHKKTNNP I FFKWVKKS 
KHFTRGAIQVANK\HIKMKLNI I SVYGN 
AQKNTMRYHHTP I KMKKMGCALWLTPVM 
CQEAGY ILKMYPGWTPYPGRWGRPTAP 
ALPPRGTFPRD 


2128 


16029 


A 


2143 


9 


166 


QNRLLARLTKKKRGKNQAHP/ IKNDKGD 
ITADPTEIQASIREYHKH , 


2129 


16030 


A 


2144 


1 


328 


LEKESRPPPPPPPPPPPPPSPP/PHPGS 
LPKPLGLQPLPRPEVSFLTEYLPKLKAC 
EGGGVEIAAASFPRIYLMGMCSRQDRIQ 
KDIDWIQKSRAEDCLFAGVKAIXKA 


2130 


16031 


A 


2145 


1 


440 


KTFLRSLWQLVE/ CYISSGLIDDHRRPM 
ALTPRHPRSG ITAPRPRPQPPGRVGI PE 
PTALSPSPGPPPPPCSTPGRCQVPSLER 
RRKEGRE PPS VGRGCGGHGI S PSSDI FF 
HLNF CLAHPPDLFVFVCS I SNQLYI FKE 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenyla)anine, 
G=Glycine, H=Hi$tidine, I=Isoteucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P-Proline, 
Q^Glutamine, R-Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
XMUnknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KKKKKGGR 


2131 


16032 

i 


A 


2146 


23 


422 


IASGRPFFFFWGAPGGEEGKKNFFWGPR 
GGKKKIRGAPTPRGXGNPPPKPWGKKGG 
APPKNLGLFGKKGAQRGPPGWFKTPGKK 
KSPPPGPXKGKKNRGRP\GPPPQIFFKP 
FFSTGEKNPIX3KKTPNWRLLPW 


2132 


16033 


A 


2147 


3 


372 


KKNTPFFLTPRGPPPPPRGPLSPPPKRA 
TPPPPPFKKKKPGFPPKKKNFFPPPGGG 
PPPP/LPGEKKGGYF/WW 


2133 


16034 


A 


2148 


363 


1 


FNCRWFETKSHSVPQAGEQWHSFGPLQP 
LPPGFK\EFSCLSL 


2134 


16035 


A 


2149 


2 


361 


FFFLVETGFHQVGQGGIiEHLTSDDPPTF 
GLP\GGWDYR 


2135 


16036 


A 


2150 


387 


2 


KKSFFLVSPARVQWGDPFNPPIPGSNNF 
P FST P P KTGV I RGP P PARK I FVFFI KTG 
\FPQLGRGVLKSLPQ/ CDSPPPAPPKGG 
VSGGNPPAPPFFFFFFEMESHSVSQVGV 
QWRNLGSLQPPS PGFKQF 


2136 


16037 


A 


2151 


23 


421 


IASGRPFFFFFFFFFFFFLGGRGVFFYP 
PGGGGGPNFFKKTPFPPGKKKFFPPPSP 
KGGFLTPPPPPPFFFFFLKKKGVINGGG 
GGVKISPPGGTPPFLPQKGG/NKKGGPP 
PPRKKKIFLFFFPGAIKQRPPP 


2137 


16038 


A 


2152 


3 


366 


VIWAHCIFHLLGYTDRPCLKKKKKGFPG 
GEKMGLKNFFPPLGKKPPPPQI PNFWSV 
IEKPPRGFYLGGAAPPKKFFLLKP\GPP 
FFKEP PPPKKKGGAS PPLKKKFFKGKKS 
PPFFFSDPTS 


2138 


16039 


A 


2153 


423 


253 


FFWGGGAPIFPPPKKGFFPKNPPGVFFS 
PPKKKKIFFFP/HPVNFGPPKDFFKRPP 
P 


2139 


16040 


A 


2154 


287 


1 


KEMNKKKNPKKKKVSKKISTAFYNKTLN 
NINFF\FAFFFFFFFFFFIFFFFFFFFF 
FFFFFFFFFFFFFFFFRFLFLFNYKQTA 
QLMQHSAPTRP 


2140 


16041 


A 


2155 


433 


2 


RRVLFIFPPL/HPFSPPPLPFSPFPPPP 
PMKFFSPPPTFFYFYKSPPHPP/HPPQV 
VFS PPTPPMPFI TPPPPPHTS PS FPS FF 
FFFPFFFFFFFPSFFFFFFFFFFFFFFF 
FFFFFFFFFFFFHI ISIKGEIGRSSWR 
VMSRTRG 


2141 


16042 


A 


2156 


69 


606 


LWWPPLSRHAAYRQWPGPTAPRGLGHKV 
XGPGASPA\PCGAAAGSTAQG\GGGAAC 
LPGPAAGAVTWPAGPGGGRATGPVLQR 
PAGAGQPTQQGQHDHAGRVIiWQHG IGRP 
GAHRPGPCAP/DSAPRAPGGRCG 


2142 


16043 


A 


2157 


449 


0 


SSPGSSSSCSPPPPGGGVGFFL/YKPPQ 
KKPP P PP VGGPG FFPQKKLKS FFPLPP P 
FFLGGGGPPPPPPPKIKSFYPPPKKVSF 
PPPPKKAFLPPPPPPLPPPPPSPPQPPP 
LF 


2143 


16044 


A 


2158 


241 


5 


KKFSFFPPGGSQGGFFGSLQNPPPGFTP 
FFCLNLGKKWGQRGPPPRPNNF/SFFFF 
FFFFLVETG FHHI SQDGLDLLTS 


2144 


16045 


A 


2159 


434 


2 


LGSASQLGCSGVRDPLEEAVCPFSDLQL 
CARRTTALFKAVRQGHLSLQRLLLSF\V 
CLCPVPRSGAYRGRQASLSCGGLHPVQA 
SRLLCLPKQAWAMAGAP/PPASLQFCSXi 
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nucle- 
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sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H-Histidine, I=lso!eucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S^Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














ISDCCASNQRDSVGVGPSEPRAGYNIiLV 
RCFLSP 


2145 


16046 


A 


2160 


448 


84 


FLGGFFFFFFWFPPPS PPQKEAL\QKKI 
FFVFFSPESNQKNFFFSFSERGGPPNFL 
FKGGGR\FSPWFLFFFFFFWFFFLWFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFLGKCPLKYT 


2146 


16047 


A 


2161 


3 


156 


RGCSEPRLRHCTPAWVT\KETVSQKKKK 
KFLTLGGKTFKNFFFFNRAPGTFP 


2147 


16048 


A 


2162 


412 


37 


FFHSPPPPPPPAGGAVFPPKKKKKSPPP 
PTPLLL/RGGGGPPPPPPPKRGPPPQKP 
KRGFFSPPKKKKKFFPGPGGPPGPPQKT 
PPPPPPPFFFFFFFFFFFFFFFFSLFFF 
FLRTLWR KGRGRG I 


2148 


16049 


A 


2163 


5 


370 


QSSCVQWLVPWPTIQEAEAGGFLEPRS 
ERLQ/CSHCTPAWA 


2149 


16050 


A 


2164 


437 


36 


FFFFFSEAESCSVAQAGVQWHARSLPQP 
LPPG\SSDSCLSLLSSYMCLSPRTRGS 


2150 


16051 


A 


2165 


3 


623 


RQGFTLVAWAGVQWYDLGSLQP P PTGFK 
RFSCLSLPSSWNYRHAPPCPANFVFLEE 
TG FLKVGQAGLELPTSGDLPAS ASQSAG 
ITGMSHCA/RPKVCSYHLFF 


2151 


16052 


A 


2166 


337 


0 


PPPPPPSPPPPS/PSPPPPPPSPPS\PS 
PPP 


2152 


16053 


A 


2167 


350 


156 


AEATS KI RCQKC Y I M I AGHSGTRLQFQL 
LQRLRQENHLNPRA/ RGCNE PRSHHRTW 
PTWYISKSFLA 


2153 


16054 


A 


2168 


1 


337 


DSRSLSQRSRSWSYWGYYSDLSTARHSG 
HHKKRTKKKK/ 1 KKKKKKKKRGP FKKKG 
PLKTRKKPRGGLLKAHPFWGGPPPGFFL 
TGEGAPP PVFFKKKKKKPPLGGKGGFLW 
G 


2154 


16055 


A 


2169 


2 


606 


RVLRAVAAHEEPDKEGKEKPHAGVSPRG 
VKRQRRSSSGGSQEKRGRPSQEPPLAPP 
HRRRSQPP\HPGPLPPTNAAPTVPGPVE 
PLLLPPPPPPSLAPAGPAVAAPLPAPST 
S ALFTFS PLT VSAAGPKHKGHKERHKHH 
HH\ PAPMV I PAPAEP I 


2155 


16056 


A 


2170 


311 


2 


GLQPLGLGSVKQCMDLACVPETVCVCVC 
VCVCARAHTPVCTQGCVPESTQCVCMHV 
C/VCVWACTCVPVCMHTCVGVGASVCMQ 
RNELGRWAWENDAIRQQRC 


2156 


16057 


A 


2171 


78 


337 


NTPNSIYKAMSLKGPITGTFLPNYPGHK 
VCVCDTYLCVYQHTHTHTHTKTHTQAFP 
HI/HIYTHT 


2157 


16058 


A 


2172 


2 


278 


KNRLNPGGGGCSKLR SCHCTLAVIAT\SE 
TPSQKKKEKRKRKGRKRKRKKKIVYTKI 
EKGNAKMTGESFIHFCKKSSSVPQVALS 
AEYRCSST 


2158 


16059 


A 


2173 


2 


365 


FYHVGQAGLKLLTS SD? PALGLPKCWDY 
RRE\DCAQPNVKS 


2159 


16060 


A 


2174 


195 


389 


FRVFSKLKYYYDFFRGRISLSCPGWSTT 
PGFKRLSCLSLPSSWDY2RPPPCPAN\F 
CIFSRDGVSP 


2160 


16061 


A 


2175 


23 


374 


IASGRPLI FFFFFFFFr FLFCRGGGPPF 
NPRGKERGGFLIKGPPPPGGKKNFWAPP 
PGGGEIKRTPPPPGVFFFFLKKKGFSFG 
GGGGKK/PPPPGEPPPPPPQKGGKICICKK 
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ing to ftrst 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Pbeny)aIanine, 
OGIycine, H=Histidine, I=IsoIeucine, 
K=Lysine, k=Leucine, M=Methionine, 
N=As pa ragine, P~Proline } 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V^Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possibie 
nucleotide insertion 














TRGPLF 


2161 


16062 


A 


2176 


2 


158 


FFFLKQSLSVTQAGVQW\PVDSLQPLPP 
RFKRFSCLSLPSSWDYRCVPQCPAN | 


2162 


16063 


A 


2177 


340 


120 


PRFHFLASPSEMSQMTVKAKTTVPASEC 
AYPKIEPFFPF\DPRGSES\FDKLGVHH 
HPLFPLSGEPLMILHKKTH 


2163 


16064 


A 


2178 


3 


256 


YLKKDLNVNQKTLTLLE / EKQGK/HLHD 
MRLCKEFLNKTSKAQYILKKISQY/LIK 
VQKFNAVKDPVKRMKRQASD/WENI FT 


2164 


16065 


A 


2179 


95 


1 


GRVDAILAHCKLR/LPGFTLFSCLSLPS 
SWDC 


2165 


16066 


A 


2180 


1 


143 


ARGERERERERERERERERERERERERE 
RER VF V/ RERAAL S RERGD KCERSGL F 


2166 


16067 


A 


2181 


1 


304 


ARGQGHLSLQRLLLSF \ VCLCPGPTGGA 
YRGRQAS LS CGGLHP VRAS WLLCLPNQA 
WAIAGAPPAALLPPCSLISDCCASNQRD 
SVGVGPSKPCVGYNLLVW 


2167 


16068 


A 


2182 


3 


163 


FQRRSTSSCGWDKDARSQSRSPPRQQAH 
GHHS / HTHTHTHAHTHTHAHTHAWTRP 


2168 


16069 


A 


2183 


3 


205 


NFKLFCKGFFFLRQSL/DSVSQAGVQRH 
NLGSliQPLPAGFQLFLQPPPPGSKDTHP 
GVQRHDHNSLQP 


2169 


16070 


A 


2184 


228 


88 


WFFRIYDSFFLVFLFVFFFFFFFFFFFF 
FFCVFTPFPVYF I QYWY I 


2170 


16071 


A 


2185 


287 


3 


SLFLAI PQWEFQRKI YLRKQKI VLLACQ 
PSKGLS I VQWRGQLNSPVLKSQDPTP \ 
PRFKRFLCINFLSRWSYRHVPPRPDSFV 
FLVETGFLHLV 


2171 


16072 


A 


2186 


252 


105 


VSISCHPQ\PCLIYTCY/KIDGYSNIHL 
CTHTHTHTHTHTH TRVELQG PA 


2172 


16073 


A 


2187 


267 


459 


KHVPGSSYTWL/IFFFRGNFLKKGANFV 
PQGVLKGGNLS S L Y P PPPRLKQYS CLTL 
LRIGNYRHA 


2173 


16074 


A 


2188 


170 


15 


RS VMKDLNSHFS KENIQMANKRTKECSA 
LWFRE I K/TTMRCHLTP I RMATI KK 


2174 


16075 


A 


2189 


2 


504 


DVT I STCHASAKVGTRLVFDHYGKI IQK 
TPYPHPRGMTVSVKQLFSTLPVRHKEFQ 
RNIKKGLG\RRSCFCFDF 


2175 


16076 


A 


2190 


1 


565 


FFETESLSPRLECSGAISAHCNLHLPGS 
SDSPASASRVAGTTGACHKTWV/ IFFVF 
PVETRFHHVSQDGLDFL/NLVIRPPRPP 
KVLG 


2176 


16077 


A 


2191 


142 


382 


NTP PLL F F FV I RDRVS LC C PRW S G VAQ F 
WLS AT YAS R V \ KR FS CLS LP / SN WD YR C 
VP 


2177 


16078 


A 


2192 


138 


365 


KHQYHHCC/ LLKKKKKKKKKKKKKKKKK 
KKKKIKNKKHGGPFKKNFW/EGQPRNWA 
GGV 


2178 


16079 


A 


2193 


385 


21 


RGEI FFFKTRRKKFS PQGGRGG VFPPSP 
PKNFFFPQGGKFFGG/EGGPKVPPPKKG 
GFPKKPQGGFKVPPKKKKKI ISPPGVIG 
GPPGNFLKGAPPFFFFFFFFFFFFFFFF 
FFFLVRAVKLS 


2179 


16080 


A 


2194 


415 


56 


PPPPTTAPVFSPPPPPRPFFSPPPPVFF 
FRSFPPAPPPPPLFFPPPSPP\PSFPPP 
PPPTRPCPPPPPTIPPPPPPFSPPPPPF 
FFFFFFFFFSLLPFFFFFFFFFFFFSFL 
FLFFPSRLV? 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I=Iso)eucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
WKTryptophan, Y=Tyrosine, 
X^Unknown, *=Stop cod on,/= possible 
nucleotide deletion, \=possible 
nucleotide insertion 


2180 


16081 


A 


2195 


35 


369 


MKRPSPPPPTPPPPPPPPHPLLPPFPEK 
KTGPP/RGFFKGGERGPPPPKKKKDPPP 
QKQKKKKKWGGGGGGGGQRKKKHPQKNP 
PPPPGRGGGKNFLWGRPRGPPPQGGGGK 
K 


2181 


16082 


A 


2196 


2 


266 


SKPRSCHCTPAWATQ\ETPSYKKKKKKK 
KGLPLGGPGEKTIFFSLKPGQKAQPKNK 
IPFKKKI PFPSTPKGKKKKNFQGGPKKK 
KKEI 


2182 


16083 


A 


2197 


1 


277 


ARGERERERERERERERERERERERERE 
RERERRHP/RPRERE 


2183 


16084 


A 


2198 


1 


285 


ARGERERERERERERERERERERERERE 
RERERERERBRERBRERERERERGGAPP 
PRLi YRERVLKKQRGRFLP PVRERKK / RS 
PVQTLPGY 


2184 


16085 


A 


2199 


346 


1 


VPRHGLPCFCFPHQLQPPKRKKFNNRKR 
ALTSHTQFFVFETESCSVA/RECSGAIS 
AHCHL\CIMRSSNSPASGGAGSSELRFR 
HCTPVGQQTE I WS PKNKPI KKEEGCP YC 
QFLSC 


2185 


16086 


A 


2200 


3 


389 


HEGMILAHCSLNLPGSGD/ASHVARTTC 
VHHHVQIilLFIFVETK/FSHYAVQAG/li 
ELLSSSDP/PTSQSSRITGMSHH 


2186 


16087 


A 


2201 


3 


204 


HECHCTPAWAT\SETLSQKKKKXKKKKK 
I FPGE KTWGGWNKKKFL FLNRKF FLG KG 
LFKLKRGPLKLF 


2187 


16088 


A 


2202 


69 


379 


KKKRVFFWGPGGSKPPT/SGNPPPWPPK 
GGGLRGIjPPPPGPRGVIFKNFGXRGPPP 
PPGLKSWGPRDFFGLALQRGGI SGLNNG 
AP P FFFGVLKKLP PLFFVS KG 


2188 


16089 


A 


2203 


81 


379 


KKKKKPPPPKKKNPGPKKQEGGPLRGPP 
TFSGGGGGKKTCPQKLKAAGG/SKKAPG 
PPPGAEXKNPALGGEKKTLGPNGRGPKR 
LRGLAKKGNPPSSFGGK 


2189 


16090 


A | 


2204 


1 


341 


GQSLPVSPRLQLSNGNRTLTLLSVTRND 
VGPYECE IQNPASANFSDPVTLNVLCEY 
LLF \ PVAQATS PNPRSQR PGLS VPLS S K 
NADSPPTPRNPARHDFLPQANMGRPSLD 
QE 


2190 


16091 


A 


2205 


2 


370 


DAP PRPANFVFLVKTGFP P VGQAGFKLP 
PPGDPPPLASQITGKG/HCAQPPF 


2191 


16092 


A 


2206 


362 


24 


PRGSSSASSSSSSSSSPPSSSSSSSSSS 
SSSPF\PPPPVLKPPPPPPKKSPPPKKK 
NPPPPKKKTFFFFFFFFFLRRGLCPFGR 
GKRAKPPFQKKKKKRKRESSFYQQVIIH 
M 


2192 


16093 


A 


2207 


357 


3 


AGQALWliARV I PVLWEARS YYVRI SWYR 
NIVAMDSDSLDGSGHSKLKSFRKGFPVL 
YAIKKTHDSWEGVS\MSALIVWKSLIPS 
CVDAFEELTSSAEE VAAQWG I ATDLEL 
LVGCARA 


2193 


16094 


A 


2208 


280 


411 


GNDVYFLVFLF/ CLFEKES CSVAQAGVQ 
WHDLSSLQALPPRRMA 


2194 


16095 


A 


2209 


3 


392 


PIIYEKYTQQINEMPRKLQHQQLALVSR 
NGP I LLHDNAQLH I TQC/LQKLWELGYR 
VLPHLPYSLDLSPTDYYFFKHLDNF\LQ 
GKHFHNQEDAENVSQKFVKSQSMDFYAT 
/ GNKL I SHWPNCVD Of 
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Predict- 
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nucle- 
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ding to 
last amino 
add 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=G)utamic Acid, F=Phenyla)anine, 
G=Glycine, H=Histtdine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Va!ine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


2195 


16096 


A 


2210 


248 


417 


GILANCASIKDLISRIYKKLKQIN/KQK 
TNDPLKKWAKDMNRYFSQEE I QGVNKHM 
KK 


2196 


16097 


A 


2211 


213 


493 


GKRFYFACPGKIGALQGFFKGGGPFFFF 
FFFLTPPPNTNYLTNKLKIKSPLCTFYF 
SNILGFYPSITHRTIPDAWVTA/SQSGP 
VLMRSIS 


2197 


16098 


A 


2212 


159 


413 


LAPRVI FGPPKKPPERPPLFFFFFETES 
SSVTQAGVQWHNLS S\ PPRFKRFY/ CLS 
LPSNRDYKRPPPHPANFCTFSRDGVS 


2198 


16099 


A 


2213 


455 


60 


NPRREVGPICPPPKIRVPPQNPQVGFYS 
PPLKEKTFTSPAPVNLGPPRDPFKRPPP 
FFFFFFIiDFFFFYFTIYKTSAACPRSPP 
AQPRVARRPLVPSS \PPLPSLCIAPAPR 
GPGSLCPRGSLEGDNGSSPG 


2199 


16100 


A 


2214 


2 


243 


LTLSPRLKCNGMVSAHCNFCLLSSSDS/ 
RLR/ QENCLNLGGGGCSELRLRHCTPAW 
TTERDSVSKTNKKNSLKIFTCFFVDAA 


2200 


16101 


A 


2215 


1 


286 


FSQLLRLFIC/SQGGRLLCCESCPASFH 
PECLS IEMPEGCWNCNDCKAGKKLHYKQ 
I VW VKLGNYRQVFPRTKRKYS I IVQTS F 
ILWIQSDLEIDR 


2201 


16102 


A 


2216 


173 


2 


ITI FFFVRQGLTQAGVQWR\DLSGETLV 
ILPPLEAPCSLQSSWDYRRVLPHLPNFC 
IV 


2202 


16103 


A 


2217 


216 


4 


VYPPSFMVFSQVFLSSTHISLSFSFFWN 
YLFIYLFIYLFIYLRWSL/DSVTQAGVK 
WHNLGSLQPLLPGFK 


2203 


16104 


A 


2218 


14 


228 


KRSSHLFTDDI ILYMENTKHSTK/NLLE 
LIKE I SKVTG/ YQKSVAFLYVNNKQAI K 
KTIPLTIASKRIKNSGQA 


2204 


16105 


A 


2219 


244 


3 


EVLNQNSGLPRWPNILRAKASLRVPRQC 
SRGWFSANGAGTTRYLYAKEWGGLGGG 
GCSKLRS CHCTP VWAT\SBTLSQKK 


2205 


16106 


A 


2220 


162 


2 


INMVGNLFFFGDRVSLCHAGWSAVAPSW 
PTVASTSL\VKQSSFLSLPSSWNHRH 


2206 


16107 


A 


2221 


146 


3 


GRVDGVPWRNPGSLQPPSP\GSSDPPTS 
ASQESGTTGAHHHTRLIFVF 


2207 


16108 


A 


2222 


239 


2 


SYISKPDKFPHDSSPEIKPVTVNWRNSF 
SFIFPFFFLFPEKESHSVTQAGVQWRNL 
GVS S YW P \ AGLKLLTSGDPPALAS 


2208 


16109 


A 


2223 


2 


159 


LNRDLGGGGCSELRSCHCTPAWAT\SES 
PSQKKKRKKKKKKKKGGRGRNSKI 


2209 


16110 


A 


2224 


3 


345 


RFKLFSCLSLPSSWDYRRVPPRPA/NFF 
VFLIET\GFAILTSSKTERQSRLECIFG 
FYGLP CRE KRAS ERRSVEG/HERKI LFS 
FDFFFLGGTES CFVTQAGV QGCYLGS PQ 
PPPPG 


2210 


16111 


A 


2225 


103 


319 


FSEE YRNVTNFLMLTMSCS CLTLVE / C / 
WSEGYMATPCTILLLLFFQRHCLTLSPG 
GVQWCSHSSLQPQTPGIK 


2211 


16112 


A 


2226 


2 


110 


FHHVGQAGLELLTQWSIHLSLPKCWDF \ 
RHEPPHPA 


2212 


16113 ; 


A 


2227 


2 


178 


IFLIFTFLEMSSHYVAQAGLEFPG/FKL 
TSRLSLIiSSWDYRRPPPRLAKFFAFLAK 
GDAA 


2213 


16114 


A 


2228 


173 . 


3 


FIFIFISLFFI FFLRQS I VLS PSAVQSR 
LQPPPPRFKQFS \CLSLLSS WD x KRVPP 
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C 


2214 


16115 


A 


2229 


333 


1 


HTPPPPPTFHSLCTEGAPPPPQNT/PNP 
PRNPFI I PLSRSHTRANEPSLSTPRTHP 
HSPRPPLLTHPHPNPRASAPPGSRYPPR 
ARHRERPRBR PTQRERERER ES VCV CV 


2215 


16116 


A 


2230 


350 


3 


KKKDI PPVNI YAPNTGAHKY I KQILLDL 
NRDGP\HTITAGDFTTPLSVLNSSAESR 
CSRPFIISPSLWGLRE 


2216 


16117 


A 


2231 


277 


1 


QMHPARGHLPQAL I PVQKPAL I SQGMS A 
SGSTQVSPFLSPCFWVEVNCSNTKVLT\ 
PFCGAGTGSHS FAQAGAHWCNRGSLQPH 
PPGFKRL 


2217 


16118 


A 


2232 


3 


335 


ETRFHHVGQAGLEPIjTSDDP PASASQS V 
/GITGVSLRARPGES/GSWKTVCCNNMS 
EPT 


2218 


16119 


A 


2233 


29 


448 


CPSLRQAWHEAAIDEVRTGTYRQLFHPE 
QLI TGKEDAANNYARGHYT I GKE I IDLV 
LDR I RKLADQCTGIjQ\GFLVFHS FGGGT 
GSGFTSLLMERLSVDYGKKSKLEFSIYP 
APQVFTAWEPYNFILTTHTTLEHSDCA 


2219 


16120 


A 


2234 


1 


365 


GARLILVFLEETGFHYVGQAGLELLTSS 
DPPASA/FPKCW 


2220 


16121 


A 


2235 


28 


460 


DRL I DH I S KLGTRGLQGFLVFHS CGNLG 
\ TG FG FTS LLMERL S LDYGKKAKLE FS I 
YPAPQVSTAGVEPYNS ILTTHTTLEHSD 
CAFMVDNEAI YDI CRKNLDI ERPTYTNL 
NRV I S Q I VS S I TAS LRFDGALNVDLTE F 
QTNLV 


2221 


16122 


A 


2236 


614 


84 


LAASLAACAQLSALAASHRMWALQRLRK 
LLTTEFGQSININRLLGENDGETRALSF 
TGSALAALVKGLPEALQRQFEYEDP\ I V 
RGGKQLLH/SPHFFKVLVASRLVTLEAG 
HFCPCCAETHKW \AWFRR YCMASRVAVA 
LDKRTPLPRVFLDEVAAVRVCGHILQLG 
DTELQQHICHL 


2222 


16123 


A 


2237 


1 


393 


GPMLAQLSVFRCGSTSAPNDLWYHFIEL 
PYHGES I TMLIALPTESSTPLSAI IPHM 
S TKTIDRWMS IMVPKKVQVI LPKFTAVA 
QTDLKEPLKDLGITDIVDSSQGHFCQIT 
KAENLLV/ SHILQKQK 


2223 


16124 


A 


2238 


3 


402 


HVGQAGVQ I GKACWELYWLEHG I QPEGQ 
MPSDKTIGGGNDSFNTFFSETGAGKHVP 
RAVFVDLEPTVIDEGRTGTYRQLFHPEQ 
LI TGKEDAANNYARGHYT IGKE 1 1 DLVL 
\DRIRKLA\DQCTGSQGFLGFP 


2224 


16125 


A 


2239 


2 


478 


GRGGLHRI PWTPLTPSFARGLVPSLAR 
GVEARNGAGPIKS YPRPGSRLKMQNGS K 
GSGLQNKTFHWEICDAHVNSKIQLKQ\H 
I S SRRHKDRVAG KPLKPKYRP YNKLQRS 
PSILAAKLAFQKDMMNPliAPAFbSSPIA 
AAEAVSSALTLPPRPFCFV 


2225 


16126 


A 


2240 


255 


2 


FLFVKPHQ I SCPTKKG I KSFLVLCPLN F 
FLFWRQDLAHPGWSAVTQSWLTAASN\Y 
GLKQSSYFSLLSSWDYRCIPPHLGGKRP 
L 


2226 


16127 


A 


2241 


365 


11 


EPPPPGGEKKKRGKPPPKKTPKKDGPQK 
KSAAFFGG\GKIKKKKGAFRKKKKKGGG 
P P P KKKP PPRKKKKKKNFS PLP P P KHT P 
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SLFQKPPTKI FLFSPPLLFLNILFFFPL 
SPFIF 


2227 


16123 


A 


2242 


3 


143 


PNFLSPRLDCRGTIRGHCGLYLPGSCDP 
S / CLS PPSS WNYRSTPP 


2228 


16129 


A 


2243 


132 


2 


ms ah cnlhlpg s s d s pas vs q vag i mga 
chif\vflvqtkphhi 


2229 


16130 


A 


2244 


334 


43 


TSKHMKRCSISLATREMQNKTTIGGHFV 
PTRLAN I QKPENAKYRQGCLMC / WCWVC 
NLASPLWKTIWHYLVKCLPYSSAIFTLG 
\IYPEEVLASSVPGT 


2230 


16131 


A 


2245 


312 


2 


PPSKLPEKNFFFFFFFLRQSL/ ssvahs 

wqwrdlgplkppppeskqfscptllsi 
wetqtalsqdcatalhpggqretlshki 
cvcvcvcvcvcvcavciyin 


2231 


16132 


A 


2246 


3 


230 


RAQAMVETSRERCLLRPPQI ETR \HVAQ 
AGLKLLAS SD PPTS ASQS AG I TGVSHHT 
WPQPLTFCPHAKSLPFINQI 


2232 


16133 


A 


2247 


2 


214 


GRVDLVIQAGVQWHDLGSLHPPLP\GSS 
DLP/ASDSQVAGTAGRFKYAWLIFF\VF 
FVETGS /HTQVSNS YDVL 


2233 


16134 


A 


2248 


3 


139 


EGVQGCHHNSLQPPTPG\SSDPPTSAS I 
E AGTAG S H YHVWL I FLLF 


2234 


16135 


A 


2249 


3 


160 


EGVQGCHHSSLQPPTPG\SSDPPTSAS I 
VAGTAGSHYHVWLI FLLFCFEGDAA 


2235 


I6I36 


A 


2250 


319 


3 


GQKRYKIFLCFFLFGS^LVFLRWSLTLSP 
RLERESVS KKPKSQKPPI KKPKFTPRE F 
KEVLAKYG TKFFFVFFGLFVWFFLRWS L 
/US VAQAGVQWRDPGSLQAS PRP 


2236 


16137 


A 


2251 


15 


394 


FVSFSFFPSFPSHLFFSSSSFPPFLPSF 
HFSFLPSDRPS WPS /FLPSFLP 


2237 


16138 


A 


2252 


60 


306 


GRERILEEIMAEDFIiNLMKDLWISIQVA 
QQIPSKMNSKRPHRYQHFRSQSQRILKA 
TREKQLATYKGSSVT/VSPSPGPQTVNS 


2238 


16139 


A 


2253 


3 


351 


GFHHVDQANLKLLTSSDLPASTSQSTGI 
TGI\DHCTQPNFPI 


2239 


16140 


A 


2254 


150 


2 


RPRRPDHSRLGAGDQPGQHSKTPS\HQK 
KTKTSQAWRHAPAIAGTRQAEA 


2240 


16141 


A 


2255 


357 


1 


LNLNLSLTLYAKINLKWITGTNVKHKTT 
KFLGGKNGANLMDTRLDNAFLDLTSKAQ 
LTKEKIDKLNFI KIQTF/CS I KDLLGNL 
KRQAPEEKKILRNHISNKELVSRIHKEV 
PKLNNKK 


2241 


16142 


A 


2256 


397 


I 


FSLFPPVGGQGGFFSSCKSPPPRFRAFF 
CPNPFRKGGNRGPPPHPGKGTLGFFFFF 
FRQSFALSPRVEYNLCLPGSSDPPASAS 
QVGGAPGLPPPAWVNFGIFFF/CFFLRQ 
SRSVAQARMQWRHLGS LQAMP 


2242 


16143 


A 


2257 


2 


132 


TLTTjTPRLECSGT I S / AAHCKLHLPGS R 
HSPASAPRVAGCGGGHL 


2243 


16144 


A 


2258 


214 


347 


KISLILGVHKICCEF/CFFEMKSRSVTQ 
AGVQGHDLSSLQPPPPGL 


2244 


16145 


A 


2259 


347 


2 


FFSFFFFFSEAESRSVA\RLECSDTVSA 
HCTLHLPGS 


2245 


16146 


A 


2260 


333 


1 


SDQRWTENAFVELRDEGFACPSFSEL/R 
STPSTSGEEVENFEKKLDECITRITNTE 
KCLKELMELKAKAREPREECRSLRSRHN 
QLEERVSVMEDQMNEMKREGKFREKRT 


2246 


16347 


a 


2261 


2 


357 


S PRS CS VTGI AILLFL YFLYKliAFALL Y 
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E=GJutamic Acid, F=Phenylalanine, 
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nucleotide deletion, V=possibIe 
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GLALNSFLHEIQEP/SLASGSGPLSRNS 


2247 


16148 


A 


2262 


355 


0 


PPLPSPLSPPLSLSFFSLPPFPFFFPFF 
PLF/CPSFSFFFPSPFFP 


2248 


16149 


A 


2263 


348 


35 


YFLKKGFSFFPRGEGRGKDSRSLKQLTF 
GFKKP/SCPSFLRKW\ETRLCPPAQKIF 
FFFFFLLSMGFCYIAQGDLKGSSQSSGI 
TGVSYHIWPTFIGHLTMCLAKC 


2249 


16150 


A 


2264 


270 


92 


DRPARRKMFFTYSSRLFFFFFFFFFFFF 
FFFFR\FFFFFFFFFKFFFFLFSRWSLM 
IFIS 


2250 


16151 


A 


2265 


1 


417 


FR P PAGVQWRNL I SLQSLP PGFKRFLYL 
SLPSS/WDYRCMPPQIA/NFFVFWFDV 
WWVWLVEMRLRHVLARLDI WLI CI 
SLMANGMSYLFLCSFAIHLPSLLKCRFT 
YFAHCLFLYYFLETGSHSVTETGWQWCI 
I 


2251 


16152 


A 


2266 


344 


427 


LIGLCEDTNLGAIHAKRVTLIAQDLQLA 


2252 


16153 


A 


2267 


310 


417 


RVPYGTLGSGPLATMAGFDDKLKPHMEG 
AAAPILVR 


2253 


16154 


A 


2268 


186 


464 


NSCLSHNQRQLLFRLEKGKGP I SAQEKL 
VLHTLSGFSGLV\ VGWLVFEMES CSWQ 
AGVQWRVHPPPRFKQFSWLSFPSSWDYG 
HVPPCPANF 


2254 


16155 


A 


2269 


1 


427 


PEPPSbPPDGAKKQKTQKSKKWRICFLR 
KKKKKKKRKKQKKKKKKRGGGSQKKSRR 
GPKLTTETNIIILIKGGLKKMNYREIEK 
KLLFGGGGVIGPTPTQDIKGREEINYLE 
AVGREKQRFISLVK/TNVA/HEATRDTI 
FRGYL 


2255 


16156 


A 


2270 


399 


44 


I SFQLLLP ITVLPGHVRVLMEMYKEVHV 
VFVP\STTTFMLQSMGQGVTLTFKSY\Y 
LRNTFNKAVAALTSDSSEESGQSQLKPV 
WAAFSSLNASKNI / RDS WEAVK I PAL KG 
VWKKWSLKM 


2256 


16157 


A 


2271 


19 


420 


AAGIRHEERERERERSRK/QRERERERE 
RERERERARRKIYHTYRPRPPRVFFFFF 
FFFFKKKRRGFLVPTPPGGGGAQKKKTS 
LEGGKGVFFKGGGKKTPLKNPGGWGEPP 
QKKNGGGGPPPRDPPPPPRPLF 


2257 


16158 


A 


2272 


463 


20 


SYNIPLSQSLIQ/ SRALTLFDSTKAERN 
RRGK\RSGKLQWEGSRGWLMRFKERSHL 
HNIKVQDEAVSYPEDLDKMDALNTKQQI 
FSVHKIALYSKKMPSRIFIAVERESMPG 
FQASKDRL/LLLG/ ATAAGDLKLKPMLI 
YNSKNPRVPRAEF 


2258 


16159 


A 


2273 


474 


82 


VGVWADFLKNTSQAQATKAKMDKWDPI K 
LKSFCTAKETISKAKRQPTEGEKIFANY 
TSDKGLI /SRIDKELKQPYRKNPNNPVL 
KWAKGWAQWLTPVILALWEAKAGRTQGQ 
EIEATLALFSGLFCQVFLC 


2259 


16160 


A 


2274 


152 


3 


AEGRNGDGIIQKSS I \ RTLLSNDKNPQN 
IHRRPTRFLTMLYQQNLCHLGL 


2260 


16161 


A 


2275 


32 


361 


LGASARYEKPTVNLILNGERLNIFPIR/ 
SKTRLGYLLSLLLFNIGLAILASAINQK 
KBIQI IQI VKKKI KTQRKKKKNKTKKKK 
ALFKFKGGPEKEKGPPKNPFKTPPWF 


2261 


16162 


A 


2276 


329 


487 


EFVNITIKIATSLHYKAIVIKA/AMVIW 
YWHKNFJf IDQWNR I ESLEINPH I CNK 
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nucleotide 
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residue of 
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Predict- 
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nucle- 
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residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenylaIanine, 
G=Glycine, H^Histidine, I=IsoJeucine, 
K=Lysine, L=Leucine, M=IVIethionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 


2262 


16163 


A 


2277 


2 


466 


AHHSCSHLLTTAAHYSCSPQLLTTTAAH 
TYSPPQLLTPAHHHSCSHLLTTTAAHHS 
CSPFQVLTAAHH\GAHTCSPQLLTPAHH 
S S PQLLTP AHHS CPHLLTTTAAHHS CSP 
PQLANSYRG\SPYCSSWSQTPGFRKSSH 
VGLPEPW I FHGLQW 


2263 


16164 


A 


2278 


4 


467 


KNVTQQGKIHIRDKLDEMWGNTSVFCTN 
HMKHQTNFNAKKCNVFKECGK\TACNFQ 
LTQYQISHANQKPYECQICGKPFRKRAH 
LTQHNRIHTGGKPYECKECGKVFICCST 
LIQHKRTHTSEKPYECLECRKTFRRSAK 
LIRHQRIHTGEKPYK 


2264 


16165 


A 


2279 


383 


3 


FLCVCACVCVYVCMCMC^CACXCMCMCG 
CVCVCVCVCVGLRGLGWGAWCRSWGPP 
LCFLLLG I L P LKS RLLWL PRRT I S I CTL 
PSAQGPLPAPGFGKYASNTTGVKGSSSV 
FSLLSRITALSHLHW 


2265 


16166 


A 


2280 


47 


219 


VCSELKS CHCTPAWAT \3VTLSQKQNKT 
KQRRTLGSI FFQHTFMHLKKEKSLILQK 
W 


2266 


16167 


A 


2281 


294 


160 


NKTFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFLFLF 


2267 


16168 


A 


2282 


68 


490 


RTFAYPKPPVFGSHSGKTTIPKNARGSF 
PS/PPLPFPCPSPSPGPSPVSCLPPRLP 
SPSPSPSLSPISRLPSP/SRSPSPSLS 


2268 


16169 


A 


2283 


3 


454 


CQSAPLGGASQLGYSGV3DTLEEAVCPF 
SDIWLCAGRTTTLFKAVRQGHLSLQRFR 
LPFVWLCPAPRGGVYRGRQASLSCGGLH 
QVRASRPLCLPTQASAMAGAPPPASLPP 
CSLISDCCASNERGSTGMGP/SEPGTG 


2269 


16170 


A 


2285 


1 


452 


LKDSGRDYVSQFEGCALGKQLNLKLLDN 
WDSETSTFSKLREQLGPVTQEFWDNLEK 
DTEGLRQEMSKDLEEEKAKVQPYLDDFQ 
KKWQEEMELYRQKEEPLRAELQEGARQK 
LHEL/QEELS P/LGQEMLDRARAHVDAL 
RTHMAPYSDSLRQ 


2270 


16171 


A 


2286 


3 


266 


NS S PPS SGHS TPRLAP PS PAREGTDKAV 
SALKS PQPNRGMGRGQR/ PGLPS / DTAT 
I TPHTSGFPKQPQLSLKVQTQRAKGRLS 
HWDLEP 


2271 


16172 


A 


2287 


274 


453 


IYTFVKSSSKTLRPRHIDQWNRIENPEI 
K? / EYSQL I FDKANKNI KWEKDTLLNKW 
CWDN 


2272 


16173 


A 


2288 


81 


487 


TVYFKPTVETYNCWDKKKI PFKGLLRRD 
NTPCY PKSLLEMCEKIN I VFAPATTTS S 
/LKPMDQGVIVTFKSNYLKNTF\RLGEG 
RKKKKKKGRKKSFKKALAAINSDSSDGA 
GQSKQKT / FWKGFS I PNAIKNI QDPWE 


2273 


16174 


A 


2289 


75 


469 


SRGVAGAPPKPSPSTPSPGPLDVTPGPH 
SSHHAASPGP\PPPPEPTASSMASAP\P 
PAPQPTPL\PPATLGPPSAGPE/PSPGS 
CTSTGWGYSFCCPRCRRWMRWQPPQQGP 
AWHWWPREFP/PPPRVSGS 


2274 


16175 


A 


2290 


256 


55 


PTQPLRVELTTVTLRCDINKWDYIKLKS 
FCTA\KKRQPTEREKIFPNHVSNKRLIS 
KIYKELIHRI 


2275 


16176 


A 


2291 


497 


29 


SLTHRVAGGAAVTPLAHAG ARQ I FFLGD 
PHPTSSLLGWGPAWDPCAFQVSDHPASS 
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residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Giutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S^Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














RVGKTLLSCPGGGRLPWVTCPGGGQLSC 
LiTQMHLTPAGMCS PCVCVCVC VCVCVPN 
GMCPRCMCVCVCPCRDVY/ I CVCVCVCV 
CVCMCLCPCWDVFTEL 


2276 


16177 


A 


2292 


141 


392 


SCSTEVKQ PKIGVREVDFVAAPGIAPSS 
PQKKQTTSTSSLKSLLRLSKVRFLFNIV 
LDVLARE I RQEKE I KGIQ / LGKEE 


2277 


16178 


A 


2293 


14 


291 


YVGTEG I S F I S FMRVTNYMTRHLATLRE 
S\CYSR\VYPRFIEFLRFDIQSTGQ/RI 
TSRQHPPR/DLRDALLYLNRR I TLVRTR 
CKSVAKRPPGSP 


2278 


16179 


A 


2294 


no 


293 


LLSNRSLAIASLCGGCNELRSCHWTPA\ 
WRQRETVKKKKKKKKKPGWGFLNPPPQR 
GKLCF 


2279 


16180 


A 


2295 


96 


313 


WNGCYLLSNRSLAIASLCGGCNELRSCH 
WTPA\WRQRETVKKKKKKRKKPGWGFLN 
PTPQREKLGFLKRGPGF 


2280 


16181 


A 


2296 


234 


2 


CCLETTRSLFDKGTKENTQWGKDSPSNK 
RCWKNW I STCKRMKLDP YL I P / YTNINV 
KY/I KDLNLRP E I TKLLEENLRE K 


2281 


16182 


A 


2297 


334 


6 


KLiFSPPGGGGGPPPPPPKKGWVPPETPX 
KRGGGAPNPPP PKGGGAPKPPPQKKNS P 
PKKKKKI FCPPPKKKGPKGVFLRGPRGP 
YRVFLKGPPLFFFFLKKSWRPLAMYA 


2282 


16183 


A 


2298 


467 


8 


L PGF KAS TDRDN \ VAGDFKL KQMLj I YH S 
ESSSALKNDTKSTLPVXiYRNKEAW/VTA 
HLL IPWCTEYFKL IAETCCSERKISFK I 
IiLLI \DNAPSYPRALMKMFKINVFMSDN 
TTS I V Y S TDQGV I LTCNS Y Y LRNT FYKA 
ITAIDSDSCRMPQEGN 


2283 


16184 


A 


2299 


1 


449 


SIYLSIP/FNLSINLSIYLSIPIYLSIY 
/HISVYIYL/SIYLSSIHLSAIICLSIF 
QSISLSLSINLSTYLSTIHLSLSSIIYS 
S I I YHLS I H/LS YYLS 


2284 


16185 


A 


2300 


1 


445 


QAGLQLLTSGDPRTSGLPQCWDYRC\DH 
RSWQT 


2285 


16186 


A 


2301 


241 


22 


KWVLGTCACVHVF I CVYMCLYVCAWV/ C 
VSMCLYVCACVCMNQC/VTVCMCVCVCV 
CRFVCVGIPPHSKWISIG 


2286 


16187 


A 


2302 


3 


478 


GGQTETLLTSQRKGGW PEALLTS QMGRP 
GRGAPHIPDDEQPGRDAPHLPDGAAGQ\ 
SAPHLPDGE \ PGRGAPHI PDGAAGQRRS 
PLPRWGG / ELGRGAPH I PDGAAGQRRS S 
HPR/PGRPGRGAPHFPDGATG/Q/DGAP 
HFPGIPPDGT 


2287 


16188 


A 


2303 


440 


41 


KSHHLSFLSFLLFFFP/TKSHSVAQ/CW 
SAI S AH CNLCL PG S S HS S ALAS RVAVTT 
GVYRI 


2288 


16189 


A 


2304 


2 


395 


FFLVKTRFLHVGQAGLKLPTSGGPPALA 
SQ/SL/RITGMSHRTQPE 


2289 


16190 


A 


2305 


184 


2 


SIKKPKPLVNLRNH/NT/WQGAVTPACN 
SSTLGGRGGRITRSGVQDQSGQRRESEF 
LRLGYGLD 


2290 


16191 


A 


2306 


86 


472 


I IKLCKWQNK/RFEAGSLVPEMGFYYVT 
QAGLELLFSRDPHTS ASQSAG I TGAAFK 
QRWSVGTVLLQVDRGTPPVGDCGSRTPQ 
WPGQAFLRTALKSEAHPPHSSTDVTPVL 
WSEGSPCLLSPPSLSFTG 



684 



WO 01/64835 



PCT/US01/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=A la nine 
OCysteine, D=Aspartic Acid, 
E-Glutamic Acid, F=Pheny)alanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P^ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Va!ine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /-possible 
nucleotide deletion, V=possible 
nucleotide insertion 


2291 


16192 


A 


2307 


133 


2 


RVIHVWRSHPAI P\TTREAEAGESPEP 
GGGGCNEPRSCHCTPAW 


2292 


16193 


A 


2308 


3 


369 


LTTAF PDFPGSTHPLTSASQVAWATGAH 
HKSWLIFVFFVETGFHL/ SELLSSSSPL 
ASASQSAGITG/REPPCLTSPFFF 


2293 


16194 


A 


2309 


253 


365 


TDYFYLFI YLRQSL/ SS VAQAGVQWQDL 
GSLQPPPSGFK 


2294 


16195 


A 


2310 


362 


0 


TKAQQCVHENHFKLKDANTLWIKVWRWI 
CHSSPNQKKYGLAILNLDKSGFRSRKDT 
GDEE/HFIKIKKSVIQEAI IIINIYA 


2295 


16196 


A 


2311 


167 


2 


LCYCVI FVFFI FWRQSLTLSQGGMQWCD 
HGSLQPLPP\GLKRSSHLSLPNSWDYRR 


2296 


16197 


A 


2312 


67 


482 


DHLIPGGGGCSELQSHPCTPAWVTA/ET 
LSQKKKKKREK 


2297 


16198 


A 


2313 


188 


437 


AGFFP PENQLTNMKVRKANESDPWGVKP 
ESDESETHGSLSLSLSLFFLFIFLRQSL 
/ NS VI QAGVQWRNLSS LQR ISP 


2298 


16199 


A 


2314 


41 


325 


TTTLFXRVRQGYSSLLMFLLPFVCLCPA 
GRGGVYRGRQSSLSCSRLHPVRASRPLC 
LPTQASAIAGAPPPALLPPCSLISVCCA 
SNEQGSVG*DP 


2299 


16200 


A 


2315 


417 


0 


SPPRNWDYRCVPPRPANFVFLVEMGF/Y 
HVARPEFGLELPTSGEPPTLA/FSKCWD 
YR 


2300 . 


16201 


A 


2316 


98 


288 


LMAWPATWEAEAQESLEPGGWGGGEEG 
CSELRSCHCTPAWT\SETLSHTHTKKK 
KKNGAAL 


2301 


16202 


A 


2317 


1 


410 


LNHIPNLSLTKRKPSPHSLNLKKKKKKK 
KKKKKKKKKKKKRGGGVKKKPRGGQKKK 
GGEKKNFPSKKGGKKKKRGEFGKKNFFG 
GGKKREKTPQKKKSPKGKKKNLR/EERG 
EKNPKRGGEKKRSSSPRNNNLRGEKK 


2302 


16203 


A 


2318 


249 


3 


PLKASS PPKAFNFCREVGP I CPPPKKKV 
P\PKIPXLVFIPPPIRKKLLPCPPPLTL 
APPRVPLKRPP 


2303 


16204 


A 


2319 


2 


393 


AHLGLPKCWDYRHEPFRPAPLFLLNSYF 
GLDLLT/S/GDPPALASQSAEITGVSHC 
AQ/PEYVY 


2304 


16205 


A 


2320 


389 


2 


RGNSNIGGPGPLRGKKFSPPPPLKNWGT 
KLGPPPPPPFFFFFF\GKPGFSIFFKPI 
LKFKGPKKPFPRAPKNGGSPGPGWFFFF 
FDSNFTQLPGIKLSKGGKKNFASPPPPF 
FFFVHYKTGYPSATPDAW 


2305 


16206 


A 


2321 


389 


206 


FQWRWGF/ NHVGQANLELLTSNDPPASA 
SRSAGTTGMSHHAQLKNYFLMVRMWRNW 
IAVGM 


2306 


16207 


A 


2322 


371 


42 


SFPPQSGFFSPPPPHEFFFPPPPSFFSW 
VGVRQI PPPPKI FSSPFPPGGFFSPPPE 
R/VDFFFSPPPFFFPPPSFFLSPPPPFF 
FFFFFFFFFFFFFFFFFFFSFFGVKKRK 


2307 


16208 . 


A 


2323 


82 


386 


PFLTQKYFFTPPEEGFLKKPNRREGPPS 
PITDPTLWPNMMKGIVPKAPPI IFMGGG 
INMTFSGFVTTKAPFPRPLRFNPMLRQG 
FDLLTLKAS \ WGS S AS WY 


2308 


16209 


A 


2324 


1 


413 


RGSGDNRHGP PCRVNFVFLVETGFLKVG 
RSGLELPTSPALASQSVGITGVSC/RPP 
PQASY 


2309 


26220 


A 


2325 


58 


400 


SETLVS KKKKKKKKTPPPPKNPKKKKNP 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
Vocation 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K^Lysine, LF=Leudne y M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W-Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \~possibIe 
nucleotide insertion 














PPPPPGAKKGPGGKF/PPPPIFQKKRGG 
FKK/SPPKKPPKKKKTPPPLGKKGPPPP 
RGGPLGGFPPPPPQKKNGPPPPKKKPPF 
FFPPK 


2310 


16211 


A 


2326 


462 


306 


AHHNLRLPGSSDSPASASGVAGTT/GMC 
HHARPILYLSGDASALLKCFSSAQLF 


2311 


16212 


A 


2327 


1 


393 


SRPPSRTEKIRNFFFVETGSCYVAQAAL 
KLLDSSDPPTSASQSAG/ ITGMSHRAQP 
TS 


2312 


16213 


A 


2328 


391 


2 


AWFKETKAGWI IPRSWDRGSWVSQPYSA 
LTSSPESGFHSVTNGDSMTCLEDGKHSL 
VAPHGI PQRGSSLQDGVSFCPAPTMKPY 
LASTKM\RLHEARELGFFSFLSLRQGLA 
LS PQLECSGAI LAHCSLNL 


2333 


36234 


A 


2329 


2 


115 


GCS ELRSRFCTPAWAT \SKTASQKKKRG 
KKKKGGVFYF 


2314 

- 


16215 


A 


2330 


2 


406 


AAAPSALALRDGWAVRPELDLLPPCGEE 
VAPGAHCLGCGPSPCLFLSPSHTRQSPP 
APTSSPGLSTSPPLVPTHVSAPHSSKGP 
PSIPGAQALRGCGLGGWDR\PSSPSLP/ 
PDVSPKPLNFAP 


2315 


16216 


A 


2332 


226 


377 


KRKSKHITFLFKTLSWPDTVAHACNPST 
LGGQGGRI /TPRSGVRDQPDQHG 


2316 


16217 


A 


2333 


3 


191 


CLS PGGGGCGGLRLC YCMPAWVT\NETV 
SQERKKERRKEREKEIKKERKKRKKERK 
KEKKKKG 


2317 


16218 


A 


2334 


295 


81 


FFKPFFPPKKTCPEFS\FFWEKRGFPPP 
PLRSFFQNPPPKGGPPILLKGPPPPPSL 
GWPPPPPLFFFFFFLR 


2318 


16219 


A 


2335 


375 


8 


TQIVPLPSNLGNKTRLRLKKKKRNEQGN 
IPTDTIDNRIKQI IQ/TYYEQLCANK\N 
LDKMDKALESHNFPKLKQRESLNI/HSA 
KEIHFI/ILNISTKKTPDPTGFTGKFLQ 
IFKEKKMAGHSGSHL 


2319 


16220 


A 


2336 


399 


97 


FFFFFFFKQNLA/SVTQVGVQGQYFRSL 
QPLPPRVKPFSFPNPLSNRGYRGPPLGR 
VRQENCPKLKS KRFQLNKI PRLGKKKKL 
RFPKKKKKEKEKKIVKT 


2320 


16221 


A 


2337 


411 


57 


KKPRSFSSCSSSPPFFFFFSPPPKKKIF 
PPPQIFWGPPFFPPPPFFKPPPPPFFFS 
PQKKKKI \SPPPPKKKFFFKTPPPPFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFF 


2321 


16222 


A 


2338 


97 


354 


AKAPSLSLVLFSFSTFFLGIQGLALLPM 
LECRGAITAYCSLNLPDSSGPPTSAPSP 
TPY/RIAGATGTHHNALLLFKFFSRDGL 
PL 


2322 


16223 


A 


2339 


391 


45 


LMFFHLSHKHRSGEAPSIHWS IYLSIHP 
SIHPSLYHSSIHLSIHSTIYPSTCLSIH 
/CISIHPSIYPSICPSVHLLAHSFIHST 
I CPSMHLP IHVS I QHFLSAQ I LPVSVFG 
EVSDI 


2323 


16224 


A 


2340 


506 


0 


RDHEQLGIVRADKKKKKKKKKKIKKKKK 
KKKSSSSSSSPGTFFRG/VPLKDPVG 


2324 ; 


16225 


A 


2341 


203 


1 


VELRVRATEPGFNFLKGIYCTSMVDWIK 
KMWYIYTMEYHAAIKG/DRIMSFVATWM 
ELEAIFLSKLMQ 


2325 


16226 


A 


2342 


402 


40 


PYPPKKKASPTDA\FSSSSSSPPPFFFF 
SPPPKKGFFSKPFFFFSPRFFSFPPFLK 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=GJycine, H=Histidine, I=Isoleucine, 
K=Lysine, LHLeucine, M=Methionine, 
N-Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Vaiine, 
W=Tryptopban, Y=Tyrosine, 
X=Un known, *=Stop cod on 7 /= possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














PPPQFFFFGPLKKNFFFPPPALKFFFFK 
SPPPLFFFFFFFFFVEAGVLLCYSG 


2326 


16227 


A 


2343 


181 


1 


QLKDQASDLLGKNGDEVKETI PS FLPSS 
LPPF\LPLFLKQFLPSFLPSFFPPSLLP 
FLS 


2327 


16228 


A 


2344 


2 


407 


FVASQLGCSGV\RVRDPLEEAVCPFSDL 
QLRAGRTTALFKAVRQGHLSLQRLLLS F 
\VFLCPAPRGGAYRGRQAFLSCGGLHPV 
GASRLLCLPKQAWAMAGAPPPASLPPCS 
L I SDCCASNQRDSVGARPSEPGAGH 


2328 


16229 


A 


2345 


405 


178 


IIKFIYEKATA/ LLNG/EKLKAFPLKLG 
TRQGCLHSLLVCNTVLEVLTGTLRQEKE 
I KG 1 1 IEKKEVELS LFADR IL 


2329 


16230 


A 


2346 


2 


412 


FKASKASLSPLLGANTAGDFKLKPGLIY 
HSLK\LKNYADSILIjVIjCQWNN"KAWMIA 
HLFTAWFTE YFS PPLRPAQKKI S FKMLL 
FIDSAPSQPGVLMEMYKEINWFMPANT 
TSILQPMDQGVILTLKSYWLRDTFH 


2330 


16231 


A 


2347 


2 


397 


ESLEPGRRRLQGARIMPLHSSLDNRVRL 
CIiKKERKXEIGVLIRCWQECKIVQPLWE 
IVWYFLKKLN\ESPYEPAVPLLNIYPRE 
MKIHVYTNTCTQ I F I VALFT I AKS GKWW 
GTVACACKPSSSE/WLRWVDYL 


2331 


16232 


A 


2348 


*> 


423 


EGCSELGSHHCTPAWAT/AESVSQKKKK 
PKKK 


2332 


16233 


A 


2349 


49 


262 


QMCKGSNRRRGKRVGS RQ I S KKKTNAP I 
KKWAKDMNRE F / D I QMANKHNE KCSTS L 
1 1 REMQ I KSTMRYHL Y 


2333 


16234 


A 


2350 


356 


2 


F VTAPLHS S LGNRARS YCKE KKKVQVAA 
KAVLRGKFI IAYTVFKKRKISN INLSIS 
LKTLEKEEHTETKADGAQYVTKI \RAKI 
NKIETANETKSRSLEKTSKTVEGKCLRD 
ILLSAQI 


2334 


16235 


A 


2351 


360 


19 


LDAQFLEVGLGELLFRSTVPTLQPPGCG 
ASFPVLNTLPF\SLSPSQSSSSPASLVR 
PWVAPPFLCPHGEPDGGPDSTTSLPFPG 
PAATGPAARLIQHPASRQPRPASHTHCG 

V 1 


2335 


16236 


A 


2352 


360 


0 


NTFIiAAFRLVFCQMTSYSLALLSHKLAL 
MPLNLSDUjTRWTHCMGELFFLDILAIQ 
NPFHTVFFLGHPEWGMESRFWQAWQW 
PDLRQLQPS\PPGSMRFSCLSLPNSWNS 
PSYGRI* 


2336 


16237 


A 


2353 


34 

- 


474 


EGWRPCKELAARQVGCPHSCFSPHWQLL 
QKQEKTAGAVSVCVCTSTLCVCVCVCVC 
VCAQAMCVCAGA/CFCVCVCAGA/CLiCV 
CVGA/CLCVCAEAVSVCVQ/VAVSMCVC 
RS/VSLCVCMQGQSLCVCAGA/CLCVCV 
CGIPPPVLCLN 


2337 


16238 


A 


2354 


297 


16 


KF FFL KS F F FFS F F FLTPPR F F FF FP KK 
KKIFFFPPRKIFFFFLIPPPPXFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFXRHGG 
WFEEITILTV 


2338 


.16239 


A 


2355 


3 


315 


PVTPATRETEAGETL \HDLGEPGGRGCG 
ELRSCHCTPAWVTEQDSVSKKKKKKRGA 
RFKESNFTTPGLQRNI FFLGALKLISGA 
GVLKRRDGKTLGFPQFNRPWG 


2339 


16240 


A 


2356 


399 


154 


PGQRG E IPSLPKIQ/E IiAGCG / GHLN PG 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to ftr$t 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P^Proline, 
Q=GIutamine, R-Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /-possible 
nucleotide deletion, \=possib)e 
nucleotide insertion 














GRI FGEPRSRHCTPAWATBQDS I CKSHS 
RSGLWCYKSSSVYLSTRGVWVRGIASV 


2340 


16241 


A 


2357 


416 


2 


FFFFFSETESRSVAQAGLRT\QWRNLGS 
LQAPPPGFTPF 


2341 


16242 


A 


2358 


209 


2 


KKNLVP W PRGGY FKSLQPPPPGV/SCPN 
PPKKLEYRVLFPQPSNPFFFFCI FSRDG 
VSPCCPGWSRTPD 


2342 


16243 


A 


2359 


279 


380 


RGYNP / WPGAVAHTCNPSTLGGRGRQIT 
RSGDQDH 


2343 


16244 


A 


2360 


415 


224 


FFFVFFFFLLITFILMLNLLPCHKFLFL 
QFLLGYLFLLYIA CAFFLLVALNI FITS 
FQQLDYYTF 


2344 


16245 


A 


2361 


278 


2 


ISDPFRFWNIYRIHTGLISLIQISENCV 
SCQKFQILEHFRFQIRDAQSVLRK\RKA 
WTGAVAHACNPS PLGGRGGW I SRSGDLV 
HPGQHSETP 


2345 


16246 


A 


2362 


139 


282 


KKKKGGRGGGGGRGGGGXXWGGTKKKKK 
GGEKKNLWGGGKGGGKGGGS 


2346 


16247 


A 


2363 


19 


409 


PKPPSVLGGGGPARYPSPLGGPNRPVPL 
GPGVGAPPGPPGKTPFFLKIKKIYPARG 
GPPVIPASPGGEGKKSPLPPRPRVPLTQ 
IFPP/PPPPGGPNQG 


2347 


16248 


A 


2364 


159 


383 


HSHFKNLS S 1 1 KKLHRNNTFTEHFSLS S 
SLNQCFLNLTVFYSHLGNFKNSN/SWPG 
AVAKTCNPNTLGGRGGQITR 


2348 


16249 


A 


2365 


400 


221 


GRLRQENCLNSGGRECSEPRS/HLHCAP 
AWAT \EQDS VS KNKKQNKQKQTH I YTVL 
LCARH 


2349 


16250 


A 


2366 


383 


14 


GGRGCNAPRSCKCTPAWVTERAS PQKTK 
KQKNTHTKKRISSCCYKMEDPLRQA\TF 
LQCPRAEGPSQKAARMELMEKQEKNQGP 
ARHRRQEQPLTSRPCPDHLCWLSQVSS 
TPAQGLSIiLICK 


2350 


16251 


A 


2367 


274 


1 


PRKILKARGKEKLASRGTMIRMTSDFYL 
QTMQARREWSKIL/NVLEEKIHQHRIL/ 
PVKSSFKS EEE I KTFSDKQKLRGLVTSR 
SDLGKDVK j 


2351 


16252 


A 


2368 


161 


2 


PFFFFSETESCC\VTQARVQWRHLGSLP 
GS SDS PASASQVAW I TGTRHYAWL I F 


2352 


16253 


A 


2369 


361 


198 


NGRLIFVFLVEMGF\TMLARLASCDPPA 
SASQSTGIRGMSHNSQLKCFTEFDSFC 


2353 


16254 


A 


2370 


116 


300 


HLNGDAVEERDFMKCTVSGI IVAHCNLE 
FIiG/SSDPSASAPRVAGTTGMCHHIWLI 
FVILVEM 


2354 


16255 


A 


2371 


2 


192 


MKLDPHLS PYTKVNSRWI KDLNLRPKTI 
KILEPNIR/ITLLGIGLGKDSMTRNPKA 
IAIKTKLAR 


2355 


16256 


A 


2372 


1 


133 


AGELLEPG \GRGCS KPRSCHCTTAWATE 
QDSSPEKKKKKKKGGF 


2356 


16257 


A. 


2373 


43 


403 


LHDSPAIiASQGAGTTGVSHHARPAAGIN 
SRIGQAEDRISELEDWLSEIR\RQSGRN 
MDKRMKMNKQNLQEIRDYVETKSM/NTR 
LIGVPERVGENGSNLENIFQDLIHENYK 
GKPIRLMVDL 


2357 


16258 


A 


2374 


404 | 


215 


GQGGRITRAQKFETSLGNWRPGSEAQE 
LLE \PDDRG CSEPRSCHS TPAWTTEQDS 
VSKKRRKC 


2358 


16259 


A 


2375 


3 


397 


SKQLEFTQLYTKLI, r QLNKTKISDLKKEK 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
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X=Unknown, *=Stop codon, /-possible 
nucleotide deletion, \=possib!e 
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YPIRLFSKENTQMASRYRTKYSTWLI IR 
EMQI KTTFRHKLAALASSRCLLGLG\AT 
SAHCCTVGAPLW 


2359 


16260 


A 


2376 


413 


1 


PKKGLFPKPI IWVTPGFFPPPRFKKPPP 
KKI FGAPKKKKKS PPPPAKKFFFFKGAP 
PPFFFFFFFFFFFFFFSGDSQERVREAM 
P VAGG P \ PR PHS LS APHAPGGTAWTPMH 
PVQTHKAQSPKLPASECPPPTTPLS 


2360 


16261 


A 


2377 


398 


247 


RRFHHAGQAGLELLTSSDLPALASQ \SA 
GITGMSHHAQPSATHFQKHLVS 


2361 


16262 


A 


2378 


2 


143 


QENRLNPGGGGCSELRS CHCTS \AWVTT 
AKLCLKKKKKKKFFGKGGG 


2362 


16263 


A 


2379 


417 


90 


FFFFFFFPGGGRLQVCP PPG I LLFCFLY 
KGGSPPGGSTIFPPPPPGEVGPPGPPPP 
GGFFFFFLE/QGGVSPDGPGFFVLPTPK 
KPPPPAPQKGGEPKFKPQVWGPPWPTF 


2363 


16264 


A 


2380 


36 


427 


VHPLNHHDQKGQASSTQKKKKKKKKKKK 
KKKKKKKKG / WKGGGAF 


2364 


16265 


A 


2381 


402 


2 


NFFLKGRGWGPLPPFLPRGPPPRGGPQK 
/RGGLGGPPPFFFGSKTPPAGGSRTPMG 
QKKKGPPLPEGPLFGGAGQAPFLPPPVP 
RGGVPSPKKKKG/APPPFFPPPFFPPPP 
PPFFFFFFLRYNLALLPRLGCSGT 


2365 


16266 


A 


2382 


166 


5 


THGHVI YVDQEMQMMIENM/ WPGTVAHA 
CNRSTLGSRGGWITRSGDLMVKPRLY 


2366 


16267 


A 


2383 


2 


457 


TSQPSLLSSWDYRSTSPRLANF\ILFYF 
IYFFFAFSVETGFHRVSQDGLNLLTS/S 
/IPSIPRIPKHWDYRHDP\RTWP 


2367 


16268 


A 


2384 


2 


417 


GRVGFSQSNGNPSSLSFTLLKVDFEVTI 
PGEGKDRIFKVS I KWLAI VSWRMLHEAL 
VSGQIPVPLESV/ QALDVAMRHLASMRY 


2368 


16269 


A 


2385 


318 


32 


TME IMLDKKQ I QVT F/ FEFKMGREAVET 
THNINYTSGPETVQWWFKKCCKGDESLE 
DEECSGRPEVGNDQLRAI IAHASADAWV 
DRDSGRCCCACP 


2369 


16270 


A 


2386 


409 


3 


ISQAPSTPPRKRGFR/PPQKTSFLGPPF 
YAAFYQEKKFLFFSSDPPRETGDKGKQK 
GFPPPKVAPKKKGFFKKGPPPGKKKDPP 
SFFKHKSQPTRPPRAPALEGPRSR/SAA 
LQPGDRRR/PPSQKKPTRPPTRPPTRP 


2370 


16271 


A 


2387 


415 


10 


KKGFFPFGPFIFLFPPPGFLPPPPPPIF 
WFSGFCPPPPPLFYFFFRGGPK/HILVF 
PLFFLP PPFFFFFP I FAKES P PKKGPGP 
LNFFFFGPKKN/ SPPPFFFFFFCEMEFH 
SCRPGWSAKWHDLGSLQPPPPVFK 


2371 


16272 


A 


2388 


410 


95 


KATMDKSDHI KLKSFYFAKETPTKVKTQ 
PPEWEKVFAN YPSDKGLI P 1 1 YKELKQP 
YG\ KKSNNS I KKWAKDLNRYFSKDIQMA 
NRCMKRCSRPGAVAHTYNPST 


2372 


16273 . 


A 


2389 


362 


92 


RFLFFFSPPPKKGFFPKPFFFFSPRVFP 
PPFFLNPPPKLIF\GPPKKKIFFPAPGG 
KKI FFLKGPPPFFFFFFFFFFFFFFFFF 
P VENT FY 


2373 


16274 


A 


2390 


131 


487 


ATEHEKTEKS S LS FFS IS KRKKKMEKLH 
DIG FSSNFLG \ RP KAQATKGKTDWTSAK 
LKI CSSRDTI SRMXRQPKEWAKTFANKS 
CNNKKPEKIDKNKKKKKKKFLGGALLKK 
TNLKPRG 
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ding to 
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peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=G!utamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W-Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, V=possible 
nucleotide insertion 


2374 


16275 


A 


2391 


408 


145 


RWSLALSPRLECSGVSSAPCKVPPPGVT 
PFSCLSLPSNWEDRCESPPPAQ/IVFIG 
EGFY I LHGFF/ RRGPKI RCFI SGCPPPV 
LSFPT 


2375 


16276 


A 


2392 


1 


203 


L FF FAFS VETG FLHVGQAG LE P PTS GD P 
PVSA/ FPMCWDYRHD 


2376 


16277 


A 


2393 


386 


16 


TPSAGGRHIELSLSTCPSCAQHQGKEHL 
EGGEGGGAQS LTTAPSSATS S QDP I S AH 
AVEDKLS IRLETDI LKTKS / WPGAVGHT 
CNPSTLGRRRQANHLRPGVRDQPGLVIC 
RPRPPKVMGLQA 


2377 


16278 


A 


2394 


3 


396 


QLLERLKQEDRLSTGSQGCSELRSHHCT 
PAWATEP\DSVSK 


2378 


16279 


A 


2395 


273 


416 


FFYS FLI KIRWKKQPGKVAQACNPSTLG 
GQGGRI /TLRTGVRDQPGQHG 


2379 


16280 


A 


2396 


329 


76 


FIPIESACSQECLKPNLRQEW/YIFGTL 
KLI FFETESKP VTQ/DWSAGELL\DPRG 
RGCSELRSCHCTPAWATRAKLCLKHTHK 
IKK 


2380 


16281 


A 


2397 


210 


7 


GGKKMYCRKPGGGGFFAAWJSPKGLFS/ 
R I W KEAP I LS PQKKKKKTNN P I KKW AND 
ANRHFS KEGTQQ 


2381 


16282 


A 


2398 


48 


393 


S ILKTRKPPLKKGRGRRKKEKESVRTHV 
FFSYQSNAFPSKPLNNSMTLGRLLSFSF 
FLVHFFFYTDGILLCCPGWSRTPGLKGS 
SHLSFPKCWDYIHEPPHPAYHS/LFCR 


2382 


16283 


A 


2399 


128 


383 


EEAPKH FPKPNLH / QKKVLVTAW W AAAG 
LI HCS FLNPWET I TPEKYAQQTNEMKQK 
LQCLQLAL WRKGP I LLKHNVRLHITTH 
AS 


2383 


16284 


A 


2400 


54 


384 


LFTFI LNS VFHT YMCLYFWTLFFS VNPF 
VSMPIPQCLDDSSFIISLDSEINPNIYS 
QLI FDKKTQRGKNSLPDRWCWENCIFTY 
KRMKWDPYL/SPYTKIISNVJMKDLNIK 


2384 


16285 


A 


2401 


393 


1 


HRGENTHQQGGGLSRWRRHSRQRGTSRW 
IRHTRQWGPSRWITPSRQRGSSRWRKRS 
RAGDIEVEKTQQSEGALEVRIRSRQRGT 
SRWRS FSRQRVS FRWRI CSRQRCSS / KV 
RRRSRQRGTSRWRRHSRQR 


2385 


16286 


A 


2402 


1 


330 


RPPPPPHCWDYR/HEPPR\QPTLWVIFK . 
LS VETRLCYVAQ IGLELLGS S KESSRLD 
LPKCWDYRHELLCWMVIFQEKLV\SGFL 
FKI PRFFKAGME I FKQIEGFWS SHPLAT 


2386 


16287 


A 


2403 


3 


407 


ADAWGLRGTHGPWEQAGISGI S ?SNS FL 
FVCFRQSVALVSQAGEQWRRLGSLQPPP 
PGCGRESC\PSFQGGITPSCFFVFLEKM 
GLHRVGQAGLQLLMSGEPPALCSQRRGI 
TGVSHHARPPS KGF I WHTGAPAM 


2387 


16288 


A 


2404 


239 


573 


VCFGVFSLSHFFESE FSS I TQAGVQWLN 
LGSLQPPSRGFKQFLCLSLPSS\YWFTG 
TPPLVGGGSAGLRGG 


2388 


16289 


A 


2405 


1 


122 


PTRPGRGCGELRS CHS TLAW/VN SETTS 
QKKQRKEKKLPLL 


2389 


16290 


A 


2406 


417 


130 


QALRVKHVQLVPSSDFLAKTRAQSVNNL 
LRHSSSSSSSSSSS/SSSSSSSSSSSSS 

sssssssssssssssss\ssssssssss 

SSSSSSSSSSSSSSSSSSSFLFLFFFFG 
LRGAFLGRFCP I RR 
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peptide 
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Amino acid sequence (A^AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Iso1eucine, 
K=Lysine, LHLeucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R^Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


2390 


16291 


A 


2407 


327 


3 


RQSQS LRYMAHQP VCCFTCS I L I RAELN 
E I /ETKNKKIEKINET/RKSW FFEKINK 
MGAI TPDATERQNI IQGYYEQLYMQNLE 
NLEYMDKFLGRFKPPSLNQEELDTLNR 


2391 


16292 


A 


2408 


178 


404 


AAYRINSLQEVQHPTKRHSQPRKQSKLR 
EGNAQRNl^IRRGPGEWAHACNPSTLGGQ 
GGQV/TLRSGVRDQPGQHG 


2392 


16293 


A 


2409 


94 


386 


TSFALVALGGRSCSEPRSCHCFPAWVS \ 
SETLSQWKRKTPTLNNAKYWGGRFLRFF 
FLTRGGGGRFFFFGGTKTKSLGAGFKKG 
GGGKPGGPPNKGLG 


2393 


16294 


A 


2410 


1 


384 


PTRQVRI KLFPRI YVHTRKRLKNI YGNT 
ETF I TQPFKNHS VWQL I RNNRNAKKQEN 
VAHNEEKNKKQS IETN\QKYTHNKNLDT 
TKKKKKKKKKKGGGPLKKTI FKARGGEN 
NFFFLGPQKLNSGAGF 


2394 


16295 


A 


2411 


423 


2 


FFFFPPPQKKSLFPPFFFCGPKIFSSPP 
VFLTPPQKIFFCPPKKKKYFPPPRGKIF 
FFLK\PPPPFFFFFFFFFFFFFFFFFFF 
FIFFFFFFLGFLLSSLLIGVPFLLLNAN 
IMILLQKKILKDEVQIISSFPSTKNRNV 
L 


2395 


16296 


A 


2412 


218 


112 


LKKENNTKCYYGWGTTGTLIHC\WW\KL 
IQVLWKTD 


2396 


16297 


A 


2413 


64 


403 


LKNFFFFFFFKKKGPPGAPGGSKPRGLG 
EPPPPPPQRGGNTGGGPGARPKKNGGGF 
F FFWRGEP P / P P P PAKGGGQDLGPGPFG 
REETNNFPAPAPPGPKNKNLGPRAQKIL 
CF 


2397 


16298 


A 


2414 , 


217 


409 


VKYLSYSVLLTI ICTVQACYQELRPGAM 
AHTCNPSTLGGHGGRI /TLRSGVRDQPD 
QHG 


2398 


16299 


A 


2415 


416 


0 


ERPPPSSFFPPLAPGQVGQFFYGKGPWP 
EKSKKKNRFWGTKPFPPEGGGPPPPPNS 
PPGGGEKS PLFFFGGKGNPTWS PGPKFL 
KKRV I KGPLHPGGKG YKKTRE FGLKPG P 
PPPKFFLGGG/PGFQNWEKPPQP 


2399 


16300 


A 


2416 


277 


0 


PFVLLSSSSSSSSSSSS5SSSSSSSSCS 
LLVRYLCKVKHKSSLCAHPGTVPFLELI 
HDTDS FAH / VFPADLL 


2400 


16301 


A 


2417 


205 


486 


RHGYGFGFWKNTCMQHRHSYRRVYIHRH 
I HLTFIiA YVHRES PEAMS ALLTQ I LLFF 
KLLSFLRRCLA/SVTQAWQWCSIiGSLR 
PRPPGFKRFWC 


2401 


16302 


A 


2418 


364 


1 


KESLGDPVKDTVILFATRNPMQMGSNYQ 
FFI YIjCLTHLLSRS YCWFGCKMLQ/ PLW 
KTVWQFLKKLNAEFPYDPAIPL\DICIP 
VFTASLFI I AQRWEQPKCASTDEWINQM 
WHMHT I EH YPR 


2402 


16303 


A 


2419 


281 


492 


LFPPSLPASHPKLSTQQPERSHSVTHCG 
/VDRRGSDLGLLQPRPPRLKPSSRIjSLW 
SSWDYRPAPPRPANSC 


2403 


16304 


A 


2420 


63 


465 


RLQR FLLS F \ VCLC PAP I GGAYRGRQAS 
LSCGGLRPVRASLLLCLPKQAWAMAGAP 
PPASLPPCSLISDCCASNQRDSVGVGPS 
NPGAG YYLVARRFLS PLEKRS I QVG VTL 
FSRCRLSPLSLTRKGNSLTPCAS 


2404 


16305 


A 


2421 


61 


482 


QRAR I TGVSHHAQ/ LR FCLFDMR S J IS VA 
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OCysteine, D=Aspartic Acid, 
E=GIuta mic Acid, F=PhenyJalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R-Arginine, S=Serine, 
T=Threonine, V=Va!ine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














QAGVQWCDNSSLQPPPSGLKMb/STSAF 
QSL/WDYRCEPQHPIHY 


2405 


16306 


A 


2422 


90 


444 


YCFSSECSEKCRCPGHDLQLSPGLCHHG 
VSGLAKPLLCFWPWIWKPQPLCSPTTSL 
PFLPLPPFSQCPQGPAWLELEGRPSLWK 
QGGLQSLAIKR/REGSRAQWLTPVIPAL 
WEAEAGG 


2406 


16307 


A 


2423 


1 


444 


PGPGPGCGPVATLGSPSRPARTDSPSLP 
PHSQLREAEARNRD I EAHVRQLQERMEL 
LQAEGATGESLMC PLPRRT / WEEVGRIJj 


2407 


36308 


A 


2424 


189 


418 


SRRAEPGSFRGCLRVGVPSCTCV\SLWV 
CVCVCWG/VCVCVCVCVCARA/CLCLCA 
NFSL/CSHVSLCLSLSLSL 


2408 


16309 


A 


2425 


30 


895 


IjDEQCTSEIHRRGEATARPRAPEHPAPP 
ATAVRGRDAASQNLKRRPGSGTDGLRLQ 
GAEPSRLLRTYAGGAVIPRGTPERAQPP 
PPQDPLGRRRWLSRNTWGPWPGTTQPPS 
PQLIiRNDWGSCGFMVPEAARGKVFQDSQ 
. EGAH I RRETVS KS VCAE P WRHQRARD PA 
PTNFPLKCQKQRGASTSSGQHGDRVNLV 
FFIDDDYSPPSKR/PKTNEPPQPPVPEP 
ANAGERKMREFNSGPHNPVEETKIjICLC 
PSGHASCQVHLWTGAMLLGFQSWRKLPG 
SGLKARILQ 


2409 


16310 


A 


2426 


494 


154 


SSRVRCQCA\LLGGASQLGCSGVRJDPLE 
EAVCPFSDLQPRAGRTTALFKAQMEMQK 
S PVFC VAHAGS CRL ELFL FGHLG S SLUG 
LWN 


2410 


16311 


A 


2427 


416 


1 


PQRGPLLALEPGRQGAAPVEDLQPQGPD 
KPPPPPLPQPFRARTVVTAAVPRHPPPV 
ACHPPQPLAASKPWP/SVAGGDLLPLPG 
PERPVHAFFIGFI FVHLGLGGVSGRGAV 
APARSGPVPPRPPSSSTSRFSLFFLKE 


2411 


16312 


A 


2428 


84 


409 


DYKHAPTMPRIQNFVYHSPQQPRCENRI 
NFTPKKKKKKKKKKNKKKKKKKKK\KI F 
WGG 


2412 


16313 


A 


2429 


1 


389 


LRDLSSDRSNPGRFLSTSNSSLY/EKDK 
RNKAYFTK/RPSPVNDIIST 


2413 


16314 


A 


2430 


456 


0 


PGWPGRGAP \PSQTGWPGRGAPH I PDDG 
QPGRGAP \ PSQTGW PGRGAP \ P S QTGWP 
GRGAPHLPDNGWPGRGAPYI PDDGQPGR 
GTP\ PSQTGRPGRGAPHIPDDGRPGRD / 
GSSLPRRGGSRAEALLTSQTGWPGRGAP 
P 


2414 


16315 


A 


2431 


3 


344 


CRERRSCHCTPAWAT\SETPSQKKKKKI 
FFGNGPPGGPQAGLKLRAWGFFQXRGTM 
GPGTKNHPGQRGEPPLLQKKQKITRPGG 
GGPGAPPPQGGGAGKSFNPGGGMFQGGE 
IP 


2415 


16316 


A 


2432 


1 


109 


RPLRRLRQENRLNRGSRGCSEPKLC/HL 
CTPAWAT 


2416 


16317 


A 


2433 


1 


239 


QSFAVLPRLVSNS WAKVI CL/PSVSQNA 
EIT 


2417 


16318 


A 


2434 


3 


464 


DWLQLEMQGEIVALVHSHHGGLPWLSES 
ARRLQVQSDLPWWLVCRGT I HKFRCVPH 
IiTGRRFEHGVTDC YTL FRDAYHLAG I E M 
PDFHRENDWWRKGQNLYLDILQAPGLYP 
VPLSAAQPGDVIMCCFGSSVPNHAAI / Y 
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X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
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CGDGELLPHIPK 


2418 


16319 


A 


2435 


3 


427 


EGARTSSERHPCNKYLLLHSXiAHLFINA 
LNLVLKGLSPSPFPALPISFPIFFFSPH 
FLGTPTLEGGRADLPFLQPPGAPG/QPA 
PLNYGPGPYRNPCP/RLPQLKPAGPGHG 
LLKSPPPNPGRNWPLLGSLFDFFKKRTS 
IPLP 


2419 


16320 


A 


2436 


238 


400 


QFRWKRGKAIFFFFFLRQSPSVTHAGMQ 
WSNLSSVQPPPPGFKQFLC\PSVPSS 


2420 


16321 


A 


2437 


U 


442 


LGTRPRATDWGVRLELSRACPVLGHPAK 
HPRPQRWCKTOFSYSPEQADELKLQAGE 
IVEMIKEA/CGNPDMPSVSPGPQRPPKT 
TEDKGWWEGECQGRRGVFPDNFVLPPPP 
IKDAQPLLLFGIDQEAGPTESGISGIRF 
RRLSC 


2421 


16322 


A 


2438 


411 


80 


PQAEEGAPTPGSENFNPPPPRGGGAPPP 
PQKNFFPPRGVNPGGGGGQKRPPPKKGG 
/SPKKNPGGDKNPPPKKKKNIGEGGGFI 
GAPRGTPKKTPPPRDGYFQVFFFIVSLK 


2422 


16323 


A 


2439 


290 


1 


QLNKI KKTPLLFLPWAKEKVPKI PPNYP 
PGPKKKGYPPSI FFFFFLRQGL/NSVTR 
AGVQWRDLSSLQPPPPGFKRFSCQKPNS 
AFPHASADAWVDP 


2423 


16324 


A 


2440 


440 


213 


PFSRPLFFFSPPPKKRASPPPFFFCFPR 
VFFPPPFF/SKPPPKFFFFPPPKKKKIS 
PPPPKKIFFFFSPPP 


2424 


16325 


A 


2441 


131 


408 


GCVPPEPAFLCFVLEIVSSVAQAGVQWR 
NLSSLQPSPPGLKRFYRPRLPSS/QDYR 
RAPPLA 


2425 


16326 


A 


2442 


102 


351 


QASSSVLKLCVCVRARLCV/ CACVCACV 
CVC/VCVCECVCVC 


2426 


16327 


A 


2443 


190 


3 


PQGAREKSHRPGPIGRRLKL/DPPFLSP 
HVKINPRWIKDLNVKPTTIKTLEGNLGN 
TLLDTCPG 


2427 


16328 


A 


2444 


410 


30 


VCVAPPLCVCIGTLCVRTHTQFCVCVHT 
HSLCVCR /CQFLC VCVQ/ VAVS VCVCRA 
SLCVCR\ALCVCVCR\TVCVCVHGTVCV 
CVCRDS FCMCVCAGAVCVCVCVCVCV\ C 
LCV CVAS AQLLGRGFCSS VNI KGAGP 


2428 


16329 


A 


2445 


243 


1 


KVMVQNKGPFSKLFFFGPTINFFTPQFK 
QGGGQNPNPFIjFFFFFMRQGL/DS VTQA 
GMQWCHAGSLQHLPPGLKQFSCLLP 


2429 


16330 


A 


2446 


276 


408 


MLKNCAFWPGTVAHACNPSTLGGQGGRI 
/TLRSGVRDQPDQHG 


2430 


16331 


A 


2447 


24 


405 


LGDVCAFFFFFYLKNQLNPGGKLILPVG 
PAGENQRWEQFDKLQKGGIKMNPLRGGI 
SVPLTKKKNQ/WARGE 


2431 


16332 


A 


2448 


239 


3 


SPLCGKNVYKPSTVEKTNQVEKMPPSKQ 
IWELCVELLWQ/SNRGIAGSGAHACNPG 
TLGGQGWRIMRSGDRDHPGQHGETP 


2432 


16333 


A 


2449 


406 


144 


GCSE PRSRPCTPAWVTSETLS / RKKRKR 
RKKKSCLLRAILSTS PELTHFTLTTPLF 
SRYNDYPHFRDEKTEARRVYATCSGSHS 
WKRLG 


2433 


16334 


A 


2450 


349 


1 


GGAI FLTPRPKVPLTPITPLAFNPGHQE 
EIPLPKKKKKIVRAIYDKPTANIILHGQ 
KVEVFPLKTSTRQGCPLSSLLYQHKTRM 

PSL\ PLL FN TVIiD YLAEAI KQEKE I KG I 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I-Isoleucine 7 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
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X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possib)e 
nucleotide insertion 














QIR 


2434 


16335 


A 


2451 


382 


19 


DLGRMTAGSGDQRCAVGVKLLS FSLGAA 
GKGPKHSEAEASLPRNPGSYWRQAQWPQ 
SSLHLKGTIPDTTSLiSrrPWKMLiSSLKVP 
SWQGAVAHDCNHSTLGG \RGG WITRSRV 
RDQPGQHG 


2435 


16336 


A 


2452 


3 


396 


FKJjKLMXjIYHSENPRALKNYTKSTLSML 
YKWNNKAWMMAHVFTTWFTEYFKPIAET 
YFWGKKKITLKILPLVBNAPGHPRALME 
MYKEINV/VQICILQPMDQGVISTLRSY 
R\KNTFCKDIAAIDGDSSDRT 


2436 


16337 


A 


2453 


315 


411 


KGGNFPPPGGGGEKKI W/ PNGGPPPRGK 
RKP P P P? PGGGGKGGHHPP PGP I FFFEK 
KKKGIjFGG/ PGGAPNSHPKRNPPPWFPE 
GGELTNPPTFFFEGGAF 


2437 


16338 


A 


2454 


3 


114 


HHV/GQAGLELLTSDDPPASVFQSAGIT 
GVSHRIRSVS 


2438 


1.6339 


A 


2455 


397 


1 


WSWQK\NRRIDQRSNIESPEMNPRMYGQ 
VI FDKI AKNTEWEKDSLFNKWCWKNW I / 
LKRMKLDHPHLTP\KQKSKWMKDLIRPE 
TEKLPEETGGKPHDIGLSNDF\LDLTPK 
AQGAKVNTDKWDNIKLKNFPTRP 


2439 


36340 


A 


2456 


424 


3 


PKKKKIFFPPPGFKIFFFLRAPPFFFFL 
CLSHFLLNRSRSLSCTTSCCVSTIPTSL 
CNKSSGV\CGLHCSLLAI /CSLIHLTLC 
PFCVIiLVCMCDTVCVCVCVCVCDTVCVC 
PCPYGTLDIAFKHFFSRWSLTIjVAQAGV 
QR 


2440 


16341 


A 


2457 


266 


379 


HWPGAMAHTCNPSTLRGQSGRI /TLRSG 
VRDQPDQHG 


2441 


16342 


A 


2458 


10 


409 


SRTGPNPRAQTDRPWCVAFACFELPLW 
RSVDSATREAEAGGLL\DPGGRGCSELQ 
LCHCTPAWV/ TSETL 


2442 


16343 


A 


2459 


184 


387 


IVHFQMHKMINVAYI IPQFYSFIIiEIQS 
HSVTQAAVQWHSHSSLbPLPPGIiKQSSH 
/LSLPSSWDY 


2443 


16344 


A 


2460 


110 


1 


KNRVSFFF/ CSFETESRSVTQAGVQWCN 
LGSLQPPTP 


2444 


16345 


A 


2461 


380 


2 


F FF F FS E TE SR S VAQ AGLRT \ Q WCNLS S 
LQAPLPG 


2445 


16346 


A 


2462 


382 


29 


NGPGHGGPPVI PGTWGGQGGGFPRF\GS 
KPGFTWGNPPFLKNHKNYPGGGPPPVIP 
NFLGGKPGNFFYPGGGGFQ/ SGPGAVFP 
PPPGQRRSIFLPKKKKERKLILFFSLBQ 
RLQNCDA 


2446 


16347 


A 


2463 


228 


1 


KKGTLFKPPPPGGRFFFFFFFETQSRSA 
VTQATVQ/WMPSRLTANPTFRTQGILLP 
PGLKGSSCLSLPSSWDYRHV 


2447 


16348 


A 


2464 


395 


1 


RLRRENHLNPGGGGCSEPRS/HPAWQQS 
ETTSQKTKTKTKQKALASDTVLSPRQSS 
EERFHLSLFHSSFVTPFGI FS FLTTSPH 
? I LP VWLAPQLPAI Y C YGDSV I KSNFQL 
WLSKELQESLSRLLWTQAFS 


2448 


16349 


A 


2465 


29 


299 


ETPNEAS PKTS WD YRHVLPCLANFFI FF 
VKTGh /HRAGLELLTSSDLLCFPKCWDY 
RH\DRSTWPLSS VFGS I FLVYYWPLYL I 
ETNMLTTL 


^2449 


16350 


A 


2466 


1266 


1473 


YFVPQSTQNKAAVFRVGSLLQEGCGKIS 
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C=Cysteine, D^Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=GIycine, H=Histidine, I=Iso!eucine, 
K=Lysine, L=Leucine, MNVIeth ion ine, 
N=Asparagine 7 P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possib)e 
nucleotide deletion, \-possible 
nucleotide insertion 














KLYGDLKHLKT \ FDRGMVWNTDLVETLE 
LQNLMLCALQTVNG 


2450 


16351 


A 


2467 


49 


356 


VQVILLPQLLiRRLRQENCLNPGGGGCSE 
PRWSHCCP/PAWVTEQDSISKTKK 


2451 


16352 


A 


2468 


115 


3 


LSHTKWSAWPGAVA/ STLGGRGRQITRS 
GVQDQPDQHGE 


2452 


16353 


A 


2469 


3 


404 


FVAPGGGQGTFPGSLQPWPPRGPGASGS 
PSRAPLI FGFFGGKGVSPVGPGGFLLPG 
SRD/SGPPGPPKGWGSG/GLGPAPGP 


2453 


16354 


A 


2470 


404 


218 


FAQVVIKWGALSSPKPPFPGFKLFSRPS 
PPSYWDYR/RRPRPPRLFFFFFLRQGLA 
LLPRLE 


2454 


16355 


A 


2471 


386 


258 


AGGLLSPGVQGYNKLLSCPCKPAWTT\S 
ETLSQKKKKKKLPFY 


2455 


16356 


A 


2472 


2 


364 


HHTQI I F V FLVE TG FHHVGQ AGV ELLTS 
GDPPASWDYGR\GHRTWPYSHI FNNL 


2456 


16357 


A 


2473 


1 


292 


DRGCSEPTSCHCTPVWAT \ SETVSQKKK 
KKKKKTPLLRGPEKKNGNPPFRGFFRKN 
PGFKRKKGGAPPPGLFKIGRKKSPVGGH 
RHTFILGGPLFRF 


2457 


1635S 


A 


2474 


1 


307 


SLSCSS I VRRACFPFTFYKDCKFPEA/S 
PVMLPVKPVEL 


2458 


16359 


A 


2475 


2 


596 


MK*l^DILTMEVLKSTMKQELEAA\QKK 
HSLWELLRIPNICKRICFLSFVSSSSSS 
SSSSSSSSSSSSSSSSSSSTKITAW/LP 
PLEASFYRSTCLMPARALLFASTIPFWG 
LTLHLQHLGNNVFLLQTLFGAVTLLANC 
VAPWALNHMSRRLSQMLLMFLLATCLLA 
I IFVPQGEKSSQVEERKCLSLFSQGLPW 
SHLS 


2459 


16360 


A 


2476 


367 


44 


YQEDITIMN/TYALNIGVPTYLANI/DL 
NREIESNI I IVEYFNTSLSKMDRYRSSR 
QNIDKETVDLKYSI /HINQMDLTDRYRT 
FHPTATERDSISKKRIKIKIKINKSLT 


2460 


16361 


A 


2477 


12 


362 


HHEFG \GGGCSELRLCHCTPAWVTELDS 
VSKKKKKKKSQKKK 


2461 


16362 


A 


2478 


401 


1 


FFFFFSETRSHPVA\RLECSGAISAHCN 
QCI 


2462 


16363 


A 


2479 


3 


353 


YMCVCVCACVC I CVLP WMCLCVC/ CYCV 
CVCWYLCVCLCVSLCGHQHLAVSGKRSQ 
PPSHESFKTSLLLVJVFKNLPPLLCAWVP 
QVSQNMPLDGTRTLNNVSLPDEKVHDLL 
LPLTAA 


2463 


16364 


A 


2480 


57 


361 


MNESLLSCKRWSLLAFSHI/CVFCLICS 
TDISALCVTVCACMHVCGCRCVCMCLCV 
CLCLCVCRCMYGRVC^PMCLCVCVHCVL 
ARMCACGCGCVYKAECPI 


2464 


16365 


A 


2481 


3 


381 


YMCACG Y I CMHA/ CVPVCMCVCTCLCAH 
VCL/CCMCVCTFVPVCMCVCICVCLCVC 
TP 1 1 TLP FI»SQER I S FCTDLRS FRATAK 
RSHEEVKNCVYLQT I WNQYL I 1 F YLQNP 
NFGWAQWLMLVI PAVWE 


2465 


16366 


A 


2482 


2 


380 


IHVLGNCLiYFSEPQFPHWQQRITRSFLE 
G YGENLMRLiHMS Y I HMCLH I CAY I CI / C 
/LCVYICVLLCAYICVCTYLCVCICMCI 
FVYLCTHI CVFVCVLVCI YLCV/C/VCV 
CICVCI 


2466 


16367 


A 


2483 


296 


1 


NGTT IRRML P I FS RYR PGMPGLVQCTIQ 
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C=Cysteine, D=Aspartic Acid, 
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G=Glycine, H=Histidine, l=Iso)eucine, 
K=Lysine, Lr=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
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nucleotide deletion, \=possible 
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SCTANSVPYSHGFIYFFIYLLRDRVWLC 
YSGWSAVAPSWLTITSN\YRLKPSSCLD 
LLGSWNYRQMPPCI 


2467 


16368 


A 


2484 - 


1 


319 


NTVGLCVCV/ CLFVCVLSLCV/ CLSLCV 
/CLCVCWSLCVC/VSLCM/CVSHCVCL 
CVYLWLCVSVCVSLCVSLCVCVYECVI S 
STKKSLGAPSRAEAAKLPRNWAPAKQSH 


2468 


16369 


A 


2485 


3 


334 


NHF I I S I DI / DKHFRKIHQPFVPKAI TK 
LGIKGNFLNLLRGTSVKSTGNINM\NGE 
KPNNLPLRLVI QGDVLFKLLLE VLASAV 
KKKKKKKKKKKKKKKKKKKKKKKKKKKK 
KK 


2469 


16370 


A 


2486 


17 


409 


CLDNKKPRKEYCQKTHLIKKNPLLSRIC 
KELLKLYDKRMINPIKKWAEALYRCLSK 
GDV/HEMASKHMKRCSIS/L/SIRKMQM 
KMTL 


2470 


16371 


A 


2487 


410 


165 


LECNVTIMAHCNLRNLGSSDPPTSAS \Q 
TAG I HHS LQHF FFLiFFVETGFCYTQAGL 
ELGSSDLPTLASQSVGVTGTGHGTWP 


2471 


16372 


A 


2488 


142 


417 


VEQLLEGYRTKSLYLRS FFLF VCMFLRW 
SL/DSVSPGLTNCHHLGSLYPPPPGFKQ 
FSCIS\LWAWRRTP 


2472 


16373 


A 


2489 


1 


404 


KAGAQMGIKHPQHSSRLRLRRPSPRLPS 
SQKPLNTH YFPS FS FCAHRHTHTHTHTH 
THTHTHTGV / HLH PHT 


2473 


16374 


A 


2490 


373 


146 


YGLERKISFKIL/LFVGKSPSYPRALME 
M CK I NWFMKANTT PTLQ P I DQE V I LG P 
GTVAHACNPGNLRGQSGRNA 


2474 


16375 


A 


2491 


427 


85 


KTPPPP\RGPFFFYFFFFFKKAKNFFPP 
PPTPPLGKKKNP\PPPEKKISPPPGFPP 
PPFKKGPPKTLFKTPLKKKKGPGAPQKK 
PFFFKTPPPFFFFFF 


2475 


16376 


A 


2492 


443 


240 


FLFFFFFXXFXXFFFFFXFFXFFFFXFF 
FFFFXXXXXXXXXXXXXXXXFFFFFFFF 
FFXGCLFIYLFI 


2476 


16377 


A 


2493 


3 


135 


I SAH CNR \ LLIj PGS SNSTAS A S P VAE I T 
GACNNNRQDFS FIILQI 


2477 


16378 


A 


2494 


37 


420 


AHVCVCTCVCVCICLSRRVWCAHSHVC 
AGI FLHLINQTFTTHL/ cvckvmalrvt 
PSPCLQGA\QTQLAGQCMCAKVCMNFMY 
IFIEGHICPQTCSCLGEPIRGGLSPLVC 
IC/MCAGFFVSFC 


2478 


16379 


A 


2495 


402 


178 


RQNKTPFGVKKQ I TGGGGRGR VS YFLRG 
VSQENPLNPGGGGCSEP\RPPGWGTKNP 
LFQKKKKSSKQQEGRGED 


2479 


16380 


A 


2496 


138 


439 


RTFILGDHHHKPSAVHFHLPRRKLCTKE 
TGAAPPPPALGATVPPSVSVCLTGGGPQ 
I TP PPPALGARDIiP S VSVRLTMGGPQ I T 
PPHPDQQPL/SPPSVSVC 


2480 


16381 


A 


2497 


118 


382 


VKSFCASMRACVCECVYMCVSMCVCVCV 
CVSACL I C VCLCQ/ CDLINKCV 


2481 


16382 


A 


2498 


164 


439 


VKSFCASMRACVCECVYMCVSMCVCVCV 
CVSACLICVCLCQ/CDLINKCV 


2482 


16383 


A 


2499 


1 


411 


I STSFYTDTINRTALHFAVGRNHLS AVD 
FLLKHKARVDVADKDGMNALHFATQSNH 
VRIVEYLIQDLHLKDLNQPDEPKESPLH 
LWINNHITWNSLLSAQHDIDILMRS 
S fCP PLHVAADRGNVKL.VE/LLLKAGCD 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I^Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
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X=Un known, *=Stop cod on, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 


2483 


16384 


A 


2500 


80 


404 


RTAAAVSTVSFPQDFEGQSPKCTQGVRE 
ALRQIK/RLVPTGSLRH/WPAGSLA/LC 
QPLSDEKDLTQLFMFARNAFTALAMMDY 
PYPTDFLGPLPANPVKGRRRLPRRRERP 


2484 


16385 


A 


2501 


215 


1 


AAIQQGSLACSHSVPPATTPRAYTPVPP 
QLLVRN?\YPKTLELRSQLRCARRFPRE 
TGADCRHAGAGRQTX 


2485 


16386 


A 


2502 


423 


214 


WQAQVGGLLEP\GGRGCCELRSCHCTPA 
WVT\GEILSQNNPKRQQNKTKQXMGPGV 
PGSWGS PRAGGL7V 


2486 


16387 


A 


2503 


294 


3 


LHWNQRQRCPPPPPPPLPPPPPLPPPPL 
PLR\HR I AGAGQTGAKIRWGRWRVGA 
GLGVAAA/ DSRES CPDGAAAGGGEGGGD 
SAATAGPGGGGGGGTK 


2487 


16388 


A 


2504 


331 


411 


PFLKTGNKGAPPPrKIFFLFFKKKGG/W 
PGAVAHACNPSTLGGRGGRITRSGDRER 


2488 


16389 


A 


2505 


407 


195 


GGRGCSELRSCPCTPAWVT\GEVLSQKK 
RKSSNKKSRCLEDQAEVGGERQLVQVSV 
PSTCQRRVGAWFIYL 


2489 


16390 


A 


2506 


1 


468 


RQGS MNKLETERQ I KKAP ARN PERERER 
ERESKKGEDRHTDI CRP/RERERQNYRD 
RQ 


2490 


16391 


A 


2507 


27 


380 


FVCLCPAPRGGAYRGRQASLS CGGLHPV 
RASWLLCLPNQAWAMAGAPP PAS LLP CS 
L I S DCCASNQRDS VGVGPS E PGVG YNLM 
VRRFLSRSEKRNIRVGVTRFSRCVCHPF 
L*LGKG 


2491 


16392 


A 


2508 


396 


234 


RQENCLNPGGGAS S E PRSRLCTPAWATE 
RDSISN/QNKQTNKSHILRAAFSRPSCY 


2492 


16393 


A 


2509 


1 


421 


AKKIKPPFPLKKKKKPGRGGPPPPPPP/ 
AKFGRE TPLNPGGKGS I KPKLGP P P P P P 
GGPPNLPPKKKKKKAIPK/PLVPSHVKS 
TGFPPSCS/RSALLRAFTQ 


2493 


16394 


A 


2510 


197 


3 


TGPPDFNFFFFLEMKSHSVAQAGVQWRD 
LGSLQPPSPQFKQFYYWGLQGA/LPPCL 
NKICIFSRGGV 


2494 


16395 


A 


2511 


114 


454 


QGPLEKKGAPFKNKFFKTGVLFFGSQNI 
PPWWFKKGFSRTPFE I KKKNPPG I LGGS 
PKKFFFFFFFEMVLLHHPGWSALAQSLL 
TTASDF/LRLKQSSHLSLSS 


2495 


16396 


A 


2512 


3 


495 


FFLSRGLFIHLESAPAIFQCLLFWFLV 
FVATGSHYVPQAGLKLLASNEPPASNSP 
KCWVYRHEYPALKCLCFGLCLF/AFLP 


2496 


16397 


A 


2513 


25 


437 


PLLFSPPAGHAMEEDILPPAPSFLFYFF 
F YWLVWVKI I YLYVLAQAATTNYHRLSG 
LNNKHLFLTILEPRMP/SIKVPA 


2497 


16398 


A 


2514 


2 


256 


QAPSAEGEMTSYVLLAYLTAQPAPTSED 
LTSATN I AKW I TKHS \NAQGGFY YNQDN 
LPYSLCSGRNESTAFGTNGQDI IHPVHS 
S 


2498 


16399 


A 


2515 


2 


284 


KCQCDELCSYYQSCCTDYTAECKPQVTR 
GDVFTMPEDEYTDYDDGEENNNAIRHEQ 
A\GFTSVISDLQAQI iwiseqkhamsfv 
EEMTVLEVIL 


2499 


16400 


A 


2516 


2 


432 


RDCERESSRAAE YTASLKAS C WMGDLAE 
SPSLDPQVAMGTPPAGPCGWSVGTRSQP 
FPRGPSRGPCCVSPPSWTPEPGTGQ/CG 
EGGGRWSPQVPP 
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nucleotide 
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peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 
OCysteine, D=Aspartic Acid, 
E=Ghitamic Acid, F=Phenylalanine, 
OGIycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=G!utamine, R=Arginine, S=Serine, 
T=Threonihe, V=VaIine, 
W=Tryptophan, Y-Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


2500 


16401 


A 


2517 


401 


2 


GGEAAIRGQTKPQLEGKAPECSECEKKI 
VRS \ PRL I RRQRT PTGEKP YE CEECGKS 
FSRRYRliAQHQR\TRAGEKTYECNECGR 
GFSERSDLINHYRVHTGERPYKCDECGK 
NFSQNSDLVRHHRAHTGEKPC 


2501 


16402 


A 


2518 


398 


1 


ETFGK\SGGRSIVPGQFLAVGPKGRAVM 
ISAIEKQKLVYILNRDAAARLTISSPVE 
AHKASALVYHWGVDVGFENPMFACLEM 
DYEEADNGS TGEAAGNTQQTLTLYELDL 
GLNHWRKYKEPLEEHGNCI 


2502 


16403 


A 


2519 


384 


3 


GGGID / SDASLVIAGVRLEDEGRYRCEL 
INGIEDESVALTLSLEGEALPLPPHSCV 
AAGPPRLGLPGLLPS ISSAPLGTPAPSP 
RPRRPSSPSAPIRWPSPGSPPPPRCGVS 
VPTQPGPVPVQLLRGVY 


2503 


16404 


A 


2520 


1 


426 


GDRQMITALLRKLKQQSRESVEENRPRL 
LKALKELGDFYLELHWDFQS WVPLLSR I 
LPSDACKIYKQGINIRLDTTLIDFTDMK 
CQRGDLSFI FNGDAAPSES WVLDNEPK 
VYQRLRHEES /QEKNRTQVDCLTP 


2504 


16405 


A 


2521 


2 


425 


ALPIGRMPIMVRSSR\CVLTGKTPAEFA 
KLNECPLDPGGYFIVKGVEKVILIQEQL 
SKNRI I VEADRKGAVGA3VTSSTHEKKS 
RTNMAVKQGR FYLRHNTLS ED I P I VI IS 
KPMGVESAQEKIQTVSPQEPEWR 


2505 


16406 


A 


2522 


2 


376 


IQSFS WYGYHLCSQG I LSERVSASKFP 
LLFFFFFKKGGPPFMP/QGGKTKGPLKP 
GDPQS PG I KGFPCLAP PNKGKKGGP PPP 
RGNFLGFKKKKRFPQVGGGGSKPPREGK 
PPPLAPQGGGNRGG 


2506 


16407 


A 


2523 


403 


3 


GAPRSLSEKERQLMGMINQLSSFREQLL 
HAHYEQKKLAASQIEKQRQHMKLGKQ\Q 
QEQIARQQQRLIQQQHKINLLQQQIQVQ 
GHLPPLMI PVFPRDQRQLDALAQQGFLL 
PPGFSYKAGCSDPYPVQVIPLY 


2507 


16408 


A 


2524 


2 


369 


N AR CL I LRAAE YMAACV YVC I S VFACMC 
FCVRVCVPVCVSGCLCVCVSMNVHLSLC 
/VEQCACLWMCVSLHVCIiYICACLCVSV 
CEAVCLHLESHCRGGSRPFPWGPAFSL 
SSCLILAPPSVT 


2508 


16409 


A 


2525 


59 


380 


RNRNFNKNSSFFFFFKKGPWGGGPGGTK 
GPKQRQGEP\NHWGQKNFPGQTRKPRKP 
TPGPPPPKKIFFFKPLEKKGPGQGPQGG 
QKPGGQKKPPGGAPQKRGKTGGET 


2509 


16410 


A 


2526 


1 


371 


VMPLSRKHPKGFFSNHTLVLHVARSEMD 
KERVFQATRKSSP/ CFVPLLPWPT 


25J0 


16411 


A 


2527 


445 


3 


RWLGIiALIALKRELKKGDL\PEMRWWDS 
SI IPNG\FDLTEETPKREDYFGIANLVE 
HPAQLNPPVDNDTPVTLGVYLTKKEQKK 
LRRQTRREAQKELQEKVRLGLMPPPEPK 
VRISNLMRVLGTEAVQDPTKVEAHVRAQ 
MAKRQKAHV 


2511 


16412 


A 


2528 

.... 


1 


478 


RPTRPKRKLKSHRLQSRQESKKVRVL/T 
NAEDNEMEEETDDGPLLVPRVKVAEDGS 
IILDEESLTVEVLRTKGPCWEENDPIF 
ERGSrTTYSSFRKNYYSKPWSNKETDMF 
FLAISMVGTDFSMIGQLFPHRARIEIKN 
KFKREVYAS AI EDOATS VHT 
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correspon 
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residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Iso!eucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Th reonine, V=Va)ine, 
W=Tryptophan, Y^Tyrosine, 
X-Unknown, *=Stop codon, /^possible 
nucleotide deletion, \-possible 
nucleotide insertion 


2512 


16413 


A 


2529 


385 


1 


QKKCEDLK\DQDNPIVRPPPTPGSCGHG 
GWNIS KP YWQLNWRGFYYL YGAWGRD 
YSPQHPNKGLYWVAPLKTDGRLLEYYRL 
CNTLDDFLLYINAREIRITYVQGSGIAV 
NYNNKYVSMYNTVW 


2513 


16414 


A 


2530 


1 


1228 


FRATLRPETMFGQTNCWVRPDMKYIGFE 
TVNGDIFICTQKAARNMSYQGPTKDNGV 
VPWKELMGEE I LGASLSAPLTS YKVI Y 
VLPMLTIKEDKGTGWTSVPSDSPDDIA 
ALRDLKKKQ ALRAKYG I RDDMVLP FEPV 
PVIEI PGFGNLS AVTICDELKIQSQNDR 
EKLAEAKEKIYLKGFYEGIMLVDGFKGQ 
KVQDVKKTIQKKM I DAGDAL I YM\E PEK 
QVMSRSSDECWALCDQWYLDYGEENWK 
KQTSQCLKNLETFCEETRRXJFEATLGWL 
QEHACSRTYGLGTHLPWDEQWLIESLSD 
STIYMAFYTVAHLLQGGNLHGQAESPIiG 
IRPQQMTKEVWG\YVFFKEAPFPKTQIA 
KEKLDQLKQE FE FW YPVDLRVSGKDLVP 
NHLSYYLYNHVAMWPEQR 


2514 


16415 


A 


2531 


335 


2 


KKKALFFIHPPPFVNGRPHKNPKIKSLG 
S PTP FS LI FLKKKFCFVNQDGVRWL YFG 
SLQSLPSRFPPFFCLNLLSSWEYRGLPP 
RPGK YFFPFI FF YFLVEM\GFTVLARMY 


2515 


16416 


A 


2532 


3 


380 


AINSYIRGDDPSSYPEWQSASRSSK\W 
SPLPRALHLTDAK 


2516 


16417 


A 


2533 


3 


2083 


S S EG YLRGNMS ENE EEE I S QQEGSGD YE 
VEEIPFGLEPQSPGFEEQSPEFEPQSPR 
FEPESPGFESRSPGLVPPSPEFAPRSPE 
SDSQS PEFESQS PR YEPQS PG YEPRSPG 
YEPRSPG YESES SRYESQNTELKTQSPE 
FEAQSSKFQEGAEMLLNPEEKSPLNISV 
GVHPLDSFTQGFGEQPTGDLPIGPPFEM 
PTGALLSTPQFEMLQNPLGLTGALRGPG 
RRGGRARGGQGPRPNICGI CGKSFGRGS 
TLIQHQRIHTGEKPYKCEVCSKAFSQSS 
DLI KHQRTHTGERPYKCPRCGKAFADSS 
YLLRHQRTHSGQKPYKCPHCGICAFGDSS 
YLLRHQRTHSHERPYSCTE CGKCY SQNS 
SLRSHQRVHTGQRPFSCGICGKSFSQRS 
ALIPHARSHAREKPFKCPECGKRFGQSS 
VLAIHARTHLFGRTYSCPDCGKTFNRSS 
TLIQHQRSHTGERP YRCAVCGKGFCRS S 
TLLQHHRVH\SGERP YKCDDCGKAFS \R 
ASDLIRHQRTH 


2517 


16418 


A 


2534 


434 


3 


APLHSGKRSPTKCN\ECG\GAWNRSSLL 
DRHKI IHSEENPNKCEECGKAFKQASRL 
TIHKI IHAGEKPYKYEECGKVFSQSSHL 
TTQKILHSGENLYKCKECGKACNLFSNL 
TNHKRXHAGEKPYKCKECGRAFNISSNL 
NKQECI 


2538 


16419 


A 


2535 


46 


454 


PSTFSSKVMDKQTLCSSQATSNTSRYAA 
ALYRQGSIYPKEMKTCVHTKPYTWHSS 
I FMIAKKWKQPRCPSA\DSWRNRMWSIH 
AMGCYLTMEKNBAVMLPRR / WSI/ENI VL 
SERSH 


2519 


16420 


A 


2536 


13 


476 


ALKTYKYSHKKAFNPKKVYGKCSQKDPF 
PINHCLPTEKLHLCDKIGEGVFGKVFQT 
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Predict- 
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nucle- 
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location 
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last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F~Pbenylalanine, 
G=Glycine, HNHistidine, I=lsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W^Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 














IADHTPVAIKI IAIEGPDLVNGSHQKTF 
EEILPEIIISKELSLLSGEVCNRTEGFI 
GLNSVHCVQGSYPPLLLKAWDHYNSTKG 
SANDRP/DFFKDDQLY 


2520 


16421 


A 


2537 


162 


296 


YVCQRYKL\NPFLPSYTKINCKWVTDIN 
VKPTSVKXiLQEKRRKSL 


2521 


16422 


A 


2538 


495 


348 


FGW\HAFIVKEPRVEKLCKASARAKPQP 
PAI IAKTFKAGGITGQYAQAAL 


2522 


16423 


A 


2539 


480 


63 


ARSEAWI S IADAYMASFGDRIAQLLLLP 
YIKVGNSEIKRTGGFGSTDPTGKAAYWA 
SRVSENRPACKAI I QGKQFEGL VDTGAD 
VSI I ALNQW/PKNWPKLKAVTGLVSVGT 
ASEVYQSTVILHCLGPDNRDSTSHYSKT 
R 


2523 


16424 


A 


2540 


2 


509 


NVDADDVRLAIQCRADQSFTSPPPRDFL 
LDIARQKNQTPLPLIKPYAGPRLPPDRY 
CLTAPNYRLKS L I KKGPNQGRL VPRLS V 
GAVSSKPTTPTIATPQTVSVPNKVATPM 
SVTSQRFTVQI PPSQSTPVKPVPATTAV 
QNVLIKPSMIGHKNILI/TTNMVSSQNT 
A 


2524 


16425 


A 


2541 


501 


1 


QWQWEDPGPGPRPREEASSYKCELREQ 
\LPQYVRDFFRKKAE\SGMDSSRNLEKL 
AERFLAKTCSTKDQQFKKDQWVLiS PVNC 
CHLLLTQVKRESRAHTTLSDI YLNN IIP 
RFVQVSEDSGSLFKKVQRYFFTEVRGV7S 
NDTIFKItiLDIMLITWVTQLSVHQTPV 


2525 


16426 


A 


2542 


473 


287 


EKDFNLPPKDLRL/ KTSDV/ TSTKENEF 
EDYCLKRELLMGI CEMGWEKPS PIQVC 


2526 


16427 


A 


2543 


268 


482 


KKKKAWGLQRGGKNFPAKAELPTHGIQI 
NSCS VRL/ VDIKKEKPFS ILKVEGQAQA 
RTHLNRAFDI I VLTRGG 


2527 


16428 


A 


2544 


407 


1 


CKKI CYL I HYWWE YKMIQPLWKTVWQFL 
\KKLNILYDPAVILYDIYPEELY 


2528 


16429 


A 


2545 


28 


399 


FRHSSFQRSGRGSQLMVHFLSL/SVMPK 
IGSVAGINYGLVAPPATTGETLDVQM/K 
GEADTENH 


2529 


16430 


A 


2546 


2 


365 


FWNVDEVGGEALGRLLWYPWTQRFYE 
SFGDLSTPDAVMGNHKVKAHGKKVLGAF 
SDGLALLDNLKGS FATLSELHCDKLHVD 
PENFRLLGNVLVCVLAHH/FDKEFAPP 


2530 


J 6431 


A 


2547 


375 


1 


GFACRRMQKRVREVSHAEAESAASKKVR 
ANGSGKQNEGMNVTWTATLALSQAVISS 
ATYTQMQPHSLIQQQQQIHLQQKQWIQ 
QQIAIHHQQQFQHRQSQLLHTATHLQGA 
\QKQKQQQHEWR 


2531 


16432 


A 


2548 


3 


376 


EIiGSDVA\GAEALVDRRQERKGE IDAHE 
DSFKSADESGQALLAAGQYASDEVREKL 
AVLSEERAALLELWELRRQQYEQCMDLQ 
LFFRDTEQVDNWMSKQEAFLLNEDLGDA 
VDS/IKEILKKHE 


2532 


16433 


A 


2549 


360 


1 


RGEMLWTV\NNRFLKNFVPGKIEPFKSH 
SLYPPCYVHDVSFWIDQKKGFDELEFHT 
VGRAVSQDTI IS IQFLSRFQHPKTQQVS 
LCYRLTCQTCDKALTQQQVASMRSQIRK 
EIQQHQY 


2533 


16434 


A 


2550 


2 . 


403 


WAEEDTELRDLLVQTLENSGVLNRIKA 
ELRAGVFLALEEQEKVENKTPLVNESLK 
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Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenyla!anine } 
G=Glycine, H~Histidine, I-Isoleucine, 
K=Lysine, L= Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptopban, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KFLNTKD\LCGLEGRENLARDLGIIEAE 
GT VGGPLLLEVI RRCQQKEKGPTTGEGA 
LiDLSDVHSPPKSPEGKTSAQTT 


2534 


16435 


A 


2551 




409 


VPRNPTPLGGPGGPILRSREGGPPGLPR 
EP PS VLKKRE P PTRGGGRP PGFPLPRRV 
RPEKWVNPGGQTFPEPKLRPRG/HHPGG 
QTKNPFPKKK 


2535 


16436 


A 


2552 


256 


1 


HEMDGTIfLGPFPPPGSSKGPSLTGPPIL 
FQSPGP/APHTPSSSPANLKT/CTPVCP 
SHLPW/CCPLCLPMRLPWSPVPPVSKSS 
PPALY 


2536 


16437 


A 


2553 


398 


3 


ARQQQQLLQQQHKI / NWLQQQ I QVQGQh 
PPLMI PVFPPDQRPIAAAAQQGFLLPPG 
FSYKAGCSDPYPVQMIPTTMAGAAAATP 
GLGPLQIiQQL YAAQIiAAMQ VS PGGKLPG 
I PQGNLGAAVS PTS IHTDKRCI 


2537 


16438 


A 


2554 


20 


517 


DRPPSTKRRDTPQLRGAPDLSPRGPAPV 
PECPEH/SPRKKTSACRPPLPLRPSHSS 
PLP/SPQPSHSTPQASCPLPEALSPPAP 
FRSPQSYLRAPSWPWPPEEHSSFAPDS 
SAFPPATPRTEFPEAWGTDTPPATHRSS 
WPMPRPSPD 


2538 


16439 


A 


2555 


361 


3 


KSSQEALEEALRQRLEELKKLCLREGEL 
PGKLPVE YRLDPGEDPPIVRRRIGP \ AF 
KLDEQKILPKGEEA2LERLEREFAIQSQ 
ITEADRRLASDPNVSKKLKKQRKTKYIN 
AVKKLQVY 


2539 


16440 


A 


2556 


470 


2 


LKAAVTAGLEVPSDVSDRAFE \WIjSAFP 
L\DS P YS I HHPRR I QVSSE KE AAPDAGA 
ERITADSDIAYSSKVLLLSSPGLEELYR 
CCMLFVDDTAEPRETPEHPVKQIKFLLG 
RKEEEPVLVGGEWSPSLDGLGPQADPQV 
KVSNAI RCAQAQTGTV 


2540 


16441 


A 


2557 


2 


315 


WAEEDTELRDLL VQTLENSG VIiNR I KA 
ELRAAVFLALEEQEKVENKTPLVNESLK 
KFLNTKDGKMFSLFLFIYL\EFLiNLDTI 
WEKSG ISGLFFVFNC YRSS F 


2541 


16442 


A 


2558 


372 


3 


CNISSHFANKKCQDVIVAARNVMTSQIH 
HAV\KIIPGFNINVPGLPPPDEDTELEV 
QKVSNPQYHEVMNLELENTLDQHSYSLP 
TCRISEYVKKLMELAYHSLLEAASSSDQ 
CADQLFYSVRCI 


2542 


16443 


A 


2559 


64 


435 


WGDAS CTGRAQLG IAHKSVL PTLTDKFN 
TI PAKTPMSFFKE PDKLTP KFT / WKNRT 
PRTVK 


2543 


16444 


A 


2560 


113 


375 


VPGPARDSTQRRAVKNDKKELMS I PGI P 
GPVQVPGGGLST\ GMRGFP GSS LG FLTI 
PGGALIPFSPAFFSRVGGDLSPRNTGPG 
QKPG 


2544 


16445 


A 


2561 


431 


51 


RK I YRVYEREN FRVE I MFRFSHTS KKVC 
KGNV/DVAKFI KLHRDGWHVLNVQCACH 
QKGGIYWFRSIHVELIGYPPPRSSSHIK 
I GDKVR V KAS VTT P K Y KWGS VTHQ S VG V 
VTCGNI IWVTKFLMLTFK 


2545 


16446 


A 


2562 


1 


455 


PEGIALEWTESTGKEREHTFQPGDNVE 
VCEGELINLQGKILSVD3NKITIMHKHE 
DLKDMLEFPAQELRKYFKMGDHVKVIAG 
R FEGDTGL I VRV EENFVT LPS GLAMHGL 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=G Jycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /-possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KVLPPPLQLCSETAS/SIN/VGGQHEW 


2546 


16447 


A 


2563 


407 


1 


KWDHPLSKCEVPCGGSIISSNGPVSYPG 
FPSPY\TSSQDCVWMITVPSGHGVRLNL 
SLLQTEPSGDFIAIWDGPQQTAPRLGVF 
LRSMVKKS VQSS S I QVLLKFHRDAATGG 
IFAIALSAYPNTKCPPPTIHPCI 


2547 


16448 


A 


2564 


382 


1 


TQEVEVAVSLDRAPAFQSGCQSETLSQN 
SNSSNKTWILDHFGLSDFLDQRFSACTV 
FTPREQVS SHTRMF I AALFT I AKTWSQP 
/KYPPVIGWIKK\MWHIYTMEYYAAIKK 
DEFMS FAGTWMKLEAM Y 


2548 


16449 


A 


2565 


429 


0 


PLQSSCQTSCHQNCSRKTSLHFPGDVLA 
TPEQVGGS PAQVP I PYLDDD I PLLEVEQ 
EPVSLELGDVSLVSVSREGLQPASITGS 
RGHLIVQLQELLHHWLWSAVKSRWVIV 
GLFVS ILI LSLVFAIRLR\SASRAPVLL 
RP 


2549 


16450 


A 


2566 


464 




VIHGVI \NPFVHGDQYKKKFPLK\FYQE 
I YES PFVTETGE YYKQEASNLLQESNCS 
QYMEKVLGRLKDEEIRCQKYLHPSSYTK 
VIHECHQRMVADHLQFLHAECHNI IRQE 
KKNDMANMPVLLRAVSKGLLHMIQEHVL 
RALGRI PTS YVRMQS 


2550 


16451 


A 


2567 


3 


178 


YNPGGGVCSDLISSHCTPVWVT\SETLS 
QKKKKKKXXKKNFLPETRERALPGKKKG 
GGG 


2551 


16452 


A 


2568 


215 


411 


IHLIFIHLVFIDYLSVRHSSKILGYISE 
QIRKRNS/ WLYWAHTCNPSTLGGGGGQ 
I TRSG VQDQPG 


2552 


16453 


A 


2569 


170 


430 


TSSQLAPQCLAV/ VSG PLAACPELTSAT 
SPWLQVRTNAMASPLLKFSAEDLLFKAA . 
LSQFCVIMLNAKLSVQKYEK\LISAFSD 
SR 


2553 


16454 


A 


2570 


3 


463 


CRFFGYSTAAAPMTPSSGGSTLSGITAP 
A VPNI PS P IGVNGFTGLPPQANGQPAAE 
AVFANGMHPYPAQS PTVADPLQQAYAGV 
QQYAG PAAYPAAYGQ I SQAFPQPPAMIP 
QQQREWPEGCNLS I YHLPQE FGDS \ ELM 
QMVLPFGNVISSKV 


2554 


16455 


A 


2571 


3 


424 


LKTMKGGTGNGLE IMLDIQQDEYLtPVWG 
ETGTSPTSGAPLHGSRPQP/ PARGFLGF 
TVRPG 


2555 


16456 


A 


2572 


1667 


2046 


YI FFFTAFLWAALTFQVTTTLAPLALLV 
RSAKMMRASHDKPTANITLN\GKTGRAS 
KQRREERNRQEVKLSLFTDDMFLYLESP 
IVLAPKLLLLINFSKVLAYKINVQKLLA 
VLYIKSSRESNEEHN 


2556 


16457 


A 


2573 


1 


399 


FAI SQDHPAIiPSR PPSLHHPKPGTLTFH 
PDLPHQATCS RP I RHQRTWPEDAPLAKA 
DTVS PAEHP PAAATKAP /TR / PAPD KPG 
GTSDPQTGPAP\PASPPCSGP\APQPVP 
RKPSRAAPSKVSVTVPRRVPRTFPP 


2557 


16458 


A 


2574 


2 


452 


AKVNEMKS PMRKGHTLLKNKEEKLNQLE 
SSLWEEASDEGSliGGSPTKKAVTFDLSD 
MDSLSSESSESFSPPHLDSTPSLTSRKI 
HGLSHSLRQISSQMSSVL.SILDSLNPQS 
PTPLLASMPAQLF P \RD?KSTPTP / S YC 
GRCRGFSLTS 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine 7 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Ty rosin e, 
X-Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 


2558 


16459 


A 


2575 


2 


369 


TLVYPAITFILLSICICYWIVTAVFLTT 
SGVPVYKVIAPGGHCIHENQTCDPEI FN 
TPE I AKACPGALCNFAF YGEKSLYHQY I 
PTFHVTNLFVFLWL INFVI ALGQCALA\ 
GAFATYYWA 


2559 


16460 


A 


2576 


406 


1 


RQEGTGPS YLLLVG I TPRS FWGFSPPLG 
PSGKNP I XTFGGPLQGFFFRVQMGVLLP 
LPGPSGN/CSIKVSALNSSFSPSGVKPG 
EASLPWF / CFFEMESRSVAQAGVQWQPP 
PPGFKRFSCLSLLGSWGYRRPPLV 


2560 


16461 


A 


2577 


3 


410 


YISPFYITHMRAHTNLPGPF/KLNQRAD 
ALVS AAFADAQTFHS LTHLNAAGLRKRY 
GLSWKQVKE I VKHFS AGEVLHLPHQC-AG 
VNPRGLSPNS I WQMDITHI PTFGECLS FV 
HVSVD I YSHFI WAT YQTGEATAHVK 


2561 


16462 


A 


2578 


386 


1 


TERIRQRQYQRETEKDKRYTERQRKTER 
IRESDRDRERQNQIERDRRAT\RERDRE 
KQRESDRETYRERENQIETERDRKRQSD 
RDRETQRETGRIRYRERERQHQTESBRD 
RERESDRETQRHRECI 


2562 


16463 


A 


2579 


2 


432 


LLYFAKRYGAAFGECCQAADKAACLLP K 
LDELRDEGKASSAKQRLKCASLQKFGER 
AFKAWAAARLSQRFPKA* FAEVSKLVTD 
LT* VHTECCHGDLLECADDRSDIAMY I C 
ENQDSNFSKLNECCEKPLLEKFHCIAEV 
END 


2563 


16464 


A 


2580 


3 


426 


NLLNDALA I RE KTLG KDH P AG AATLNNL 
AVLYGKRGKYKEAKPMSKRALEIRKKVIj 
GKDHPDVDKQhmhAhLCQmGKTEQGE 
YYYPRALH I YHTKLGPDDFNAANTKNDL 
VACYLKHGT VKQA* SLNKENLTSAHE WD 
V 


2564 


16465 


A 


2581 


10 


389 


KLAG * GLW GHTL I TNPLTE PL T Y PFLG L 
YL+SI I ITSSICLLQTDIi*ALIAYSSI I 
HIALVI TAIIrlQTP * S FTGAVILI IAHG 
LTSSLLFCLTNSNYERTHSRI I ILFQGL 
QTLRPLIAL* *LLA 


2565 


16466 


A 


2582 


1 


192 


LIPTLAI ITR*GGQPERLNAGTYFLFYT 
LVGSLPLL I AL I YTHNTLGS LN I LLLTL 
TAQELSNS*ANNLI*LAYTIAFIVKIPL 
YGLHL* LPKAHVEAP I AGS I VLAAVLLK 
LGGYGI IRLTLILNPLTKHIAYPFLGLS 
L*GGQPERLNAGTYFLFYTLVGSLPLLI 
ALIYTHNTLGSLNILLLTLTAQELSNS 


2566 


16467 


A 


2583 


2 


438 


QAHGPLAGCRLRPRSPSPVLWRRRRRWW 
RQRRKWKTKTATAAAGMYASWTKACRGL 
EELINLTRLNVSYNHIDDLSGLI PLHG I 
KHKLR YI DLHSNR I DS IHHLL * CMVGLH 
FLTNLI LEKDGDDNP VCRLPGYRAS I FQ 
TFAQL 


2567 


16468 


A 


2584 


3 


415 


GRAILLLEEAIQYLSGIEIDLIDTDRGE 
WDSLTPQARR*KEAGLQMFVQLARFHNI 
VCKEAFGTLAFLTSE I KSLFGHPFLAER 
IISMLNYFLQHLIGPKMGALEVKDFSEF 
DF* PQQHVSDSGAI YLDLGDEENFC 


2568 


16469 


A 


2585 


2 


453 


DAKMYLS YKYATVI RNLREGTCPEAPSD 
ECKPEKRCALSHHQRLKCDEWSDISVGK 
I KCVSAETTE YCIA* IMNG*ADAMSLDG 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ 10 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ 10 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A- Ala nine 
C=Cy stein e, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, ]=Iso/eucine, 
K=Lysine, LF=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q^GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Va!ine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














GFVYISGKCGLMPVLAENYNKSDNCEDT 
PEAGYFAVAVVKKSASDLTWDNLKGKKS 
CHTAVGRNAC 


2569 


16470 


A 


2586 


3 


413 


MAVESRVTQEEIGKEP * KP IDR* KTCPL 
RLRVFTTNNGRHHRMDE YSR *NVPSSEL 
QIYTWMDATLKELTSLVREVYP*ARKKG 
THFNFAIDFTDVKRPGYRVKEIGSTMSG 
RKGTDDSMTLQSQKFQ IGD YLDIAI 


2570 


16471 


A 


2587 


1 


798 


LE VMLML VKAG ADQRAKNQDGMNALH FA 
TQSNHWIVEYLIQDLKLKDLNQPDEKG 
RKP FLLAAERGHVEM I EKLTFLNLHTSE 
KDKGGNTALKLiAAKHGES PAVQVLLAQW 
QDINEMNELNISSLQIATRNGHASLVNF 
LLS ENVDLHQKAEPKES PLHLVDINNHI 
TWNSLLSAQHDI DILNQKQQTPLHVAA 
DRGNVELVETLLKAGCDLKAVDKQG * TA 
LAVASRSNHSL WDMLI KAERY YAWREE 
HHES IRDPSTGFTL 


2571 


16472 


A 


2588 


2 


285 


AWS IAPSHPKSKVPPGPRRGKAEGRPGA 
AAQAAEQAEVHPPSSGPSPLPARQPPVW 
QIPPTPSLKTTTRRGAQPQHSRKRLA*S 
RSVSVLFRKM 


2572 


16473 


A 


2589 


11 


438 


AYYGLNWHLGATLSQKKKKKKKKKKNFP 
RGGGPPRNPKFWGIGGRPGTGPPRGEKN 
GLKNQKKEKFLAPR*KKSTGXPLKPPQG 
KG WGGG P KF P KKKR PP RAGGGS LKPKG K 
NREPFLALKSLPGVFXGGFGGNPGMGLK 
NQ 


2573 


16474 


A 


2590 


313 


391 


VH LVRVKIiGL * S R I YKE LLQLNNTT * DN 
PI * KWATDLDRH FS TKYTQM VDRLME KC 
STS 


2574 


16475 


A 


2591 


62 


616 


EVKQGTEVRDSEVRRRPQARGPLMPAER 
AGRQRWLVPALQPRRGGLRR* RGAVRQH 
GAHPHGLLLQDQKI PALPGRKQAGSLHA 
PGTEGEPDHGGDPVLDAGIQHHRQQRHP 
TADHLNPGEHRRGEAHVRAAV* PAAGAE 
GAAKERRAHQANTALQVHRR * LGS FAEL 
RLLRKPGRTSVWPSPM 


2575 


16476 


A 


2592 


345 


438 


HKRWLPVPILYQHLF*VFGHPEVYILIL 
PGF 


2576 


16477 


A 


2593 


479 


313 


QDGLDLLTS * S TR ? S L P KC W D YRRE PPL 
KLINLQSSGVGLHVQLFPPS FCFDQLL 


2577 


16478 


A 


2594 


3 


419 


LTLHSNTLPPPEMSGLT* PPATNAYWTF 
LPSAIRLFPEIFFIAVLLSVSLFDETET 
LSDAHS WRLTFKYERDANYHLLMSAQER 
LERLFGLHGGTI P I VPTADFHDS ISGAS 
DTDI AHSGLAYTMERS ARQI MRTAMKY 


2578 


16479 


A 


2595 


1 


419 


HKCEGRYRGKTYGGYWSLCATVNKALDE 
RI P I TSAS YYATVTLDHVRN I LGSDTDV 
SMPLL*ERHRILNETGKFLVDKFGGSFI 
NC VR * RDNI AEKLMHLWE S F P S YRDVT 
LFEGN * VAFYTRAQ I LVTDTGTVLEGK 


2579 


16480 


A ! 


2596 


199 


397 


SPTALNTDDWPFMGSSSRGHHRQFQTG 
I VSDHPAEAGP IS* DEQE LP YAVLHFHK 
VQPLEPKVTD 


2580 


16481 


A 


2597 


154 


3 


MLS LRG F FCLFVCFETGS * YAAQAGLKL 
LGSGSPPISSFQSARITGVSHL 


2581 


16482 


A 


2598 


425 


223 


OITGHGGKCT.>*SQI.X.GKFGQEMFLMLGS 
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SEQ ID 


SEQ IJ> 


M 


SEQ IP 


Predicted 


Predict- 


Amino acid sequence (A=A)anine 


NO: of 


NO: of 


eth 


NO: in 


beginning 


ed end 


C=Cysteine, D=Aspartic Acid, 


nucleotide 


peptide 


od 


USSN 


nucleotide 


nucle- 


E-Glutamic Acid, F=Phenylalanine, 


sequence 


sequence 




09/515,1 
26 


location 
correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


G=GJycine, H=Histidine, I=IsoIeucine, 
K-Lysine, L=Leucine, M=Metbionine y 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














RSFN* PKLLPGTSGWVTMLNSLGKKKKV 














TAYLNKWMPTP 


2582 


16483 


A 


2599 


3 


441 


QGPSRDLVSELYQINAFDTPESLLMIGK 
DHSDPIHHTFDHMWRTKEHNEAGWLLLR 
SVDKVMKENDELGDSISQLQKQILSLKY 
AKIALTERIiISCPV*TEIVLHYT*SLIM 
LLTOLL* KVHAQSHHAHSGQASNCNVMT 
LIGLQH 


2583 


16484 


A 


2600 


131 


423 


GPAPMVILFHLTGFLLAFLPLSHLLTSY 
LVPWILSGTDGHTFRSACLPRWLEAEWI 
FGGVKYQYGGNQEGK* CFFTG * S YVYNG 
SSGKVPWETFSRT 


2584 


16485 


A 


2601 


222 


447 


SG I P S FGL* WEQNRPGRLNAQ I LKDLG 
VSPGSDYGKLKNGI SSGLDNGVTTSDHD 
VLKMAIVGRKIWILGDWSG 


2585 


16486 


A 


2602 


2 


457 


FICDPLIKAIGTEGDTDVLSEIMNSFAK 
S IEVMGDGCLNDEHLEELGG I LKAKLEG 
HFKNQELRQVKRQEDNYDQQVEMSLQDE 
DECDVYILTKVSDILHSLFSTYKEQILP 
WFEQLLPLIVNLICSSRPWPY*QWGLCI 
FDDIVKHCSPSL 


2586 


16487 


A 


2603 


1 


297 


DHRQKLYAHTECGKALLWKSVICVHQKI 
LEEEKPCEGTKYDNI FSNRGCPPVPRMV 
HAVEIPCK*TECEKATGVHGPRGASEFL 
PERPTGMNMAKCESR 


2587 


16488 


A 


2604 


1 


405 


RFDVSDGLELRPKYNGIAHRLTTIWYLD 
GLRGLYKGVTP I I WS AGLSWGLYFVF YN 
AIKSYITEGSSERLKASEYLASTAEPGP 
MTLC I TNPLWVTKTGLMLQ YDAWNS PH 
*QYKGMSDTLVKIYKHGGVQGLY 


2588 


16489 


A 


2605 


165 


3 


PEELLLVFFPVLKNTGIRPGAVAHTCNP 
STLGGRITRTGD*DHPGQHGETLSL 


2589 


16490 


A 


2606 


3 


455 


KRYGCFSKRMNKRSATNVFFCARKGEVL 
GLLGHNGAGKS SSI KVTTGDT * PTAGQV 
LLKGSGGWDALEFLGYCPHENALWPNLT 
VRQHL* VYAAVKGLRKGDAE VAI TRLVD 
ALKLHDQLNSPVKTLSQGLKI KLGFCLS 
ILGKPSCGILA 


2590 


16491 


A 


2607 


1 


429 


VDYTVRKFCIQQEGDMTNRKPQRLITQF 
HFTS WPDLGVP I TP I GMLKI LQKEKACN 
PQYAGAIVDHCKA*VGRTGTFGDIEAML 
, DMMHT * RKVDVNGFESRNRAQS CQKVQT 
DMQYVFIYQALLEHYLYGDTELEVTCLE 
TPW 


2591 


16492 


A 


2608 


166 


435 


KFLSNK YVHFKQNFKKVLKFI KHLVLNY 
FKN I VLGQV* RATPGI PALWEAETGES L 
EPRSSRPTWAMWRNSISTKNAQIKNIVL 
RILDPG 


2592 


16493 


A 


2609 


228 


431 


IFSKICIFNGFNYFPIVGHIGRIYLFIE 
TGSHCVTQAGVQWCDYGSLQP * TPGLK* 
SSHLRLPSSWDH 


2593 


16494 


A 


2610 


439 


175 


RNGGLHLWSQLLRRLRWDYHFSPGSQGC 
SKS * S YHCTPAWVTQGDSSLKKQKRK I S 
TYTSILSVLNETVYI IVYYTKKLDTSFK 
RKKL 


2594 


16495 


A 


2611 


1 


439 


LLGSILSSMQKPRGLVDQETLRKAR*QA 
ARLNKLQEHEKQQKVEFRIRWEKSVSHV 
V * DSGHITT* VQSMNS I ERS I LHEWEV 



705 



WO 01/64835 



PCT/US01/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ JD 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
OGIycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /-possible 
nucleotide deletion, \-possible 
nucleotide insertion 














DGLTS FY FGEDDDCR YGMI FTKDAAP SD 
SELDSYRRAEEWDPHMADDKRKLTELAQ 
RPDDDA 


2595 


16496 


A 


2612 


1 


439 


VIRKVTGTEGSSSTVLDYTI PSSTGGMP 
VRKSEDQTDTKRTVI KTMEDYNNDNTAP 
AEDVI IMIQVPQSIWDQDDFESEDQDDK 
STQPISSVGKPATVIKNVRTKPSAIVKY 
PEKESEPFAKILKFTMDVSHEVI PHEVK 
SS*YSA 


2596 


16497 


A 


2613 


2 


453 


EKPEKEECNLWTEMWQENVPGSFGGIRL 
YLQELMTITQKALHSQPWKMKAQGAIAM 
ASMALQTNSLVPPYLGMILTALVEGLAG 
RTWALKEELLKAIACVETARSAEL* KSV 
PNQPSTHEI IQADLKECSKENGPYKI* P 
VICAADILHA 


2597 


16498 


A 


2614 


2 


441 


LEPALPGRWGGRS AES PPSGS VRKTRQN 
KQKT PGNGDGGSTS * APQPPRKKRARAD 
PTVESEEAFKNIMEVKVKIPEELKPWLD 
EDWDLVTRQKQLFQLPAKKNVDAI L* EY 
ANCKKSHGNVDNTE YAVNEDVAGI KEYF 
NEMLGT 


2598 


16499 


A 


2615 


1 


430 


RGDRDLH CTESQS E AS TEEGHD S L»S VG I 
FEEDSQLEFILDPPKSKPPAWLNGIMTC 
*DFELLNPRRVRFLLEVKDLALTRRQIL 
LHKGLSDYEKSTTLQELVLKKSSRSGPP 
LSIEDLGLNFQL+PSSRVYGFTAEELKP 
SGE 


2599 


16500 


A 


2616 


373 


552 


I CKI KYKLFNSALLFFRQGliALLPRMEC 
SGAIKVHCSLDPLGSSNPL*RKIKESTL 
NLEKSLCTRGIFLCKYEEVPKQFLKICF 
Q I FLETGS RS VS QAE VQWHDQNS LQPRT 
PGLKSFSCLNLPKCWDYRCEPPRRALCS 


2600 


16501 


A 


2617 


498 


157 


QLIGSSTHQAALRSRSLPPPAGPGTFHF 
HYQGKAWWKGGTS PDSLPSLLGRGVSV 
QLHPRGKEQRGASDT*HKCPVKLWTIGG 
KYRVSETSRIFSLPPTTLQRAGLDSGSG 
SL 


2601 


16502 


A 


2618 


255 


389 


LSEFYTYEGPSIRPPIG3*GTNLPLPLS 
Y I PRS P S AVDDENLLDE 


2602 


16503 


A 


2619 


3 


207 


QHSSLIIIRAIAIKI J GIAPFHF*VP*VA 
QGTPLTSGIiLLLT*HKLAPISIIYQISP 
SLNVSLLLSGT 


2603 


16504 


A 


2620 


3 


207 


QYSSLII IMGVGIKLGIAPFHF* VPEVA 
QGTPLTSGLLLYT*QTLTPISMIYQISA 
SLNVSLLIiSGT 


2604 


16505 


A 


2621 


45 


447 


WRRIDCRLNLCVTTASLKHFFS IAHKFH 
LHNGSHLQSQHSCEAQIiRRTARFI ICCL 
PYGESGQSWPTLTLERANLNWLTVYFNN 
WKDWEYDSDQYMSNFDRSSEMMNI I VW* 
RSIDNLPHI I CSTLINTKNSFL 


2605 


16506 


A 


2622 


3 


142 


GNQATPKTAPATMSTPT I LVATAVHAYR 
* * VAEKEHPLKFGGRACS 


2606 


16507 


A 


2623 


2 


453 


HGPGGLLD Y I DKER I RDFLNGE CMCE VP 
DGGLVPKSLYRTA*DLENEDLKLWTDTI 
YQSASVFKGAPHEILIQIVDASTVITWD 
YHVCKGDIVFNIYHSKRSPLPPNKDYLI 
AHSITFPDGNNVHLNNIC/WMLGPDYIMW 
ITTXtXTNNBN 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQJD 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
VSSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenyla)anine, 
G=Glycine, H=Histidine, I-Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


2607 


16508 


A 


2624 


1 


315 


EEKQAPGFTTGRGKKTPLFCAHAVRFI I 
S TS L I Y KG AY P * ALKGKEKY * LAVF WVY 
NKKGLNNNNLFLDGVHLCFVPEVRKYLA 
GKGLP FS I GLYHL I LLLFFFF 


2608 


16509 


A 


2625 


232 


478 


LDCGNYFTV*MYIKTSHCIP*IYIILNF 
HLYNFRYNLGDYWGEIVSEVMAQRQPMK 
PrYAIPI IPITNSSQFKHQEAMDVKE 


2609 


16510 


A 


2626 


3 


430 


TSTPNVHMGSTSLPGDSTTIEDAIQSHS 
ESASPSALSSHPNWLSPTGWSQPKTPVP 
AQRERAPVSGTPDRDKLRPCGQRD*GYY 
WEIEASEVMLATRIVSGSFGTVYQGKWH 
GDVALAILQWDSTPEQFQGFRNDVAGL 
RI 


2610 


16511 


A 


2627 


2 


341 


ALQKHEDTDCPCWVSCPKKCSVQTLLR 
SE * S AHLSERVIAPSTCSFKRYGCVFKG 
TYQQI KAHEAS F WQHVNLLKEWSNSLK 
XKGSSFDKECVDKNKSIHCWHIQICSSV 
I 


2611 


16512 


A 


2628 


2 


179 


RHTGLWVTSLPAVFPGQVRRTLFITGLP 
RDARKETVESHFR*AHWLVGDFSPCCIS 
RTGEADPVHHRTPQRCQEGDCGEPLPVS 
RW VGS FLIHY IQEHCL 


2612 


16513 


A 


2629 


1 


418 


GFSFCHPVPVKWRHRDSPQP*TPGLK*S 
CLGLPKCWDYRHEPLRPANISY 


2613 


16514 


A 


2630 


3 


442 


FTCGTI 1 I AIPNGVKEFT* LATLHGSNM 
K*SAAVL*ALGYIFLFTVGGLTGIVLSN 
SSLDI VLHDTYYMVAHFHYALS IRAVFA 
I IEGFIH* FPLFSGYTLHQAYA* IHFTI 
I FIG VNLTLFPQHFVGLFGMPRQYSD YP 
DAYTTR 


2614 


16515 


A 


2631 


2 


454 


AAPMELI CWSGGWGLPSVDLDSLiAVLTY 
ARFTGAPLKVHKI INPW* SPSGTLPALR 
TSHGEVISVPHKI I THLLKE KYNAD YDI» 
SARQGADTLAFMSLLEEKLLPGLVHTFW 
I DTKNYLE VTRKWYAEAMP FS LNFFLPG 
RMQRQYM*RLF 


2615 


16516 


A 


2632 


131 


410 


WMWSSKAPHCFRLPSIGDADTVHQCAMS 
FQKGHSAL3GVLHLVFKPDLVYQTLLQM 
PPRKCCLWPGAVTHACNSMTLGGRSR* F 
TRSGVQDQP 


2616 


16517 


A 


2633 


2 


230 


FFSETSSLLEIQGIARHGGTWRLRQENN 
LNPGGGGCSELRSCSCAPAWVTVRLDLR 
KKRNRNP * KILKNYLKI FVI 


2617 


16518 


A 


2634 


2 


370 


GTS S S DPAQPGDDKEF I DASRLVYDG I R 
DIRKAVLMIRTPEELDDSDFETEDFDVR 
SRTSVQTEDDQLIAGQSARAIMAQLPQE 
QKAKIAEQVASFQEE * S KTJDAE VS KWDD 
SGNDIIVLAKQ 


2618 


16519 


A 


2635 


2 


376 


MTDTDALY * RELFDPADKDKMDHSRRGI 
ALVFNHERFFWHLSLPERRCTCSDRDNL 
TRRFSDLGFEGKCFNDLKAEELLLKIHE 
VSTVIHSDADCFVCVFLSHGERNHIYAY 
DAKIEIQTLTALF 


2619 


16520 


A 


2636 


1 


461 


DMAFLILTERKILGYGQGR IGPNVACPY 
GLLQPFGDAI+LFTKEPLKPVTSTITLY 
ITAPTLALTIALLL*TPLPIPNPLGNLN 
LGLLF I LATSSLAVYS I L* SR * ASNSNY 
AIiIG ATjRAVAQT J S YEWLTI ILLSTLL 
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SEQ ID 


SEQ 10 


M 


SEQ1P 


Predicted 


Predict- 


Amino acid sequence (A=Alanine 


NO: of 


NO: of 


eth 


NO: in 


beginning 


ed end 


C=Cysteine, D=Aspartic Acid, 


nucleotide 


peptide 


od 


USSN 


nucleotide 


nucle- 


^Glutamic Acid, F=Phenylalanine, 


sequence 


sequence 




09/515,1 
26 


location 
correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


G=Glycine, H=Histidine, I=IsoIeucine, 
K=JLysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
VV=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 














IKGSFNLRSLITT 


2620 


16521 


A 


2637 


2 


384 


ENFKALAMIAFGQYRQKRPFEDHVKIAN 
D*LNFAKTCVADESAENCDKSLHTLFGD 
KLFPIATIiRETYLELADCCAKQ*PDIHE 
CFLQLKYDNPYLPRLVKPNVDVMCTAFH 
DTEETFLK* YLYE I A 


2621 


16522 


A 


2633 


1 


373 


TF I YLLFLFS SAYS RG VFGRDAHKS EVA 
HRFKDLGDENFKALVL I AFAQ YLQQCPF 
EDHVKLANEVTE FAKTC VADE S PEN * DK 
SLYTLFGDKLCTVATLRETYGEMADCCA 
KQE PDRNECFLQ 


2622 


16523 


A 


2639 


2 


375 


KGPCYRLVSTGTQRRHPGAVYLNKHLC* 
CNVGKA*GPHCEKCTLFCTFNEEPLKAL 
TFFREHGP*VSDPEVATAPTEKEIPSLD 
QETTKLEPGQPQLSPGISTIHLHPQFPV 
VIEKTSPPVFVEI 


2623 


16524 


A 


2640 


215 


478 


KYFLASHTSLFI I CYTAHLTCTIAEPKQ 
IESHFGKRLDADLVF*KSDDSTWDVIP 
SKPV*TSGLFSGKCL*HIVEGIIRAVDP 
RKLY 


2624 


16525 


A 


2641 


208 


376 


I LRNL I KNHF WPG WAKACD PNTLE AM * 
GQ I TRSRD WDHPDRHGEAPS VLRIHRLA 


2625 


16526 


A 


2642 


424 


2 


KGEPLPPLGGWKT*GPRAFPQAQIPHAG 
FKTRGCPFPLPQGRNKAQVPNPFCPRER 
FLPGKGSGRVAPLKNGAQLGGPPFYPHP 
FGG PS PRVPLG FGVQTHLGNKPKPPFFP 
KKKKGGTGCSVGNWGPKSSQTLPSPQAS 


2626 


16527 


A 


2643 


190 


3 


ELSTRGFQGPKRRIAGLKQCLVMFPELE 
RSGGLSEH*HLHLPGSSDSRASAS* I AG 
ITSVCH 


2627 


16528 


A 


2644 


2 


339 


CCEKPLLEKSHCIAEVEKDEMPADLPSL 
AADFVESKDVRRNYAEAKDVFLGMFLYE 
YARRHPD YS WLLLRLAKT Y * TTLEKCC 
AAADPHE CYAKE FHEFKPL VEE PQNL I K 


2628 


16529 


A 


2645 


81 


369 


VEVTGQPQNASFVKRNRWWLLPLIAALA 
SGSFWVFRTPDG * TSWR* IHNHMTQLI 
NNHLRA* YS VYRDI YFLWIAI * * SS * PA 
SVLLFTADYCPE 


2629 


16530 


A 


2646 


1 


348 


DMDMNPLRPQNYRFG*ELKAD I DYHFKE 
DNDEDDHQLSLRTVS LWAGAKDD * HI VE 
S EAMN YEGCP I KVTLATLKMS VQ ATVTL 
GGFEITPPGDLRMKCGSGPVHIMGLHLV 
AGEE 


2630 


16531 


A 


2647 


1 


178 


GYTDTILDVRSQRVRSLLGLSNSEPNGS 
VETKNL *QI LNGGES PKQKGQE I KRYDL 
YLT 


2631 


16532 


A 


2648 


181 


244 


TIKRYKNVMIFYYFNFF*R*GLNSLAQA 
AAQWCNHGSLQNQPPGLKGSSCLSLPCS 
* GYTI FYLSIHQLI D IWIVSTFSKLFLH 
VSAYSSIKMS 


2632 


16533 


A 


2649 


2 


369 


KWITFIStiLFLFRSVYGKFEFLLDAPKN 
DVAHRFIDFGEQHFKAIiAE I AFDHYFLH 
CPFEDLVTLVNE*TEFA*TCLADESA*N 
CDKSLHTL FGDKXCTVATLR * TYGEMAD 
CCAKLEPERI 


2633 


16534 


A 


2650 


1 


349 


VTFISLLFLFSSGYSRGVFRLDAHKSEV 
AHRFKDLGEENFKALVLIAFAHYLQQCP 
FEDHVKL VNE VTE FA * TCVADESAEMCD 
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SEQ 10 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H-Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W^Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














RS LH TLFGDTIjCT VATLRE T YGETAD C C 
AKQD 


2634 
i 


16535 


A 


2651 


21 


403 


C VC C VC VRE SRTE S LPGDNED FNV KN AS 
VKDVRCVHFDCDSQNDP PMEATGFTAQV 
TIYSLINLLHRLKYFETLDLH*IAIFFS 
KVSILNHPGQI I ADYAPALDCHTCK I AC 
KYAELKEMIDRRSVK 


2635 


16536 


A 


2652 


1 


355 


ARMS I TDTYGQHLIAGGLMTQEDVS E I K 
SSYYAKSNDKIJWMAHYRPTALNLQAffW 
HGLAQAS AQ I T TWS SGVP FDLLLMVGMM 
TVQVP * ELQMHSHLLKTHDQSRMENMMY 
GIKLDW 


2636 


16537 


A 


2653 


2 


360 


LFEHLG* YKFQNALLLRYTMKVPQVLTP 
TLVKVSRNL* KVGI KCCKHPEAKRMPCA 
QDYLSWLNQLCVLHEKTPVSDRVAKCC 
TESLVNRRPCFSALEVDETYVPKEFWAE 
TFTFHAD 


2637 


16538 


A 


2654 


360 


314 


YTCAIVPCKLFWWCCRDRVSLCCLYWSQ 
TPELK*SSCLSFPKC*KYGHEHTWPARI 
IFLNLYNKSG*TATRKCPPTKVTPSSHQ 
Q*KCTWPYFYFK 


2638 


16539 


A 


2655 


1 


353 


NWTL* TLKKI FSLKDAI KRI KIQSVRWK 
KI FAKHTSDKG VESKYI * RTLINQ* ENN 
SIKNWSK3STLNRRFTKDIQKANKHMLSAT 
SLVH * ES KLK* Q * YHYTS IRVAKMKKTD 
YVTSW 


2639 


16540 


A 


2656 


340 


10 


GREDKS E KC YLKPGRSQ PDRG APKSS PG 
VP* PPAPPALPGPGRSSPFPQGSLAGKD 
LRPRQPSQPGGPGELIFPVKTKKKKIKE 
RKTLLAWTRSNERPEIAQREGGCLRQ 


2640 


16541 


A 


2657 


509 


261 


IPYFKIHCLHSALGVAETEKETAEHLDL 
AGASSRPKDSQRNSPFQIPPPSPDSKKK 
SRGIMKLFGK*VKQ*TSGMGPVLHVA 


2641 


16542 


A 


2658 


345 


3 


S AHLSHPKCWD YRRE PLHLAR I S FLFKA 
SS PLYGRATFYLSTPPLMDGRWVAS I FV) 
LL* I VLP * AWVYKYLSETPLS I F*KTGS 
LSVAQAAVQWHNHGSLQPQTPGLKRSSH 
L 


2642 


16543 


A 


2659 


271 


56 


VILYIVMPIPRPCPHPHPYIHTHTHTHT 
HTHTHTRI»GKREF*tiYSW* LPGYSTNGN 
KIFLEIYLLPILLKM 


2643 


16544 


A 


2660 


366 


68 


PGQQS KI MPQTERKKGRKEEREEREEKE 
EREGRKEGRKNEGTETVIEGGSSKTQEQ 
TR I KAS * LNSLYTTVKPQRRLKGGKKHP 
KDSNFKD*RNISPHR 


2644 


16545 


A 


2661 


351 


199 


LIjRRLRQENLLNPEGRGFSE PRSHY YS P 
DWATE * DFWSNKI K*QKVQYSKY 


2645 


16546 


A 


2662 


1 


160 


ERAWLHLSPGDGVCSEPRSHHCTPTWVT 
E * DS VANQPG WDR VRLCLKPT WVTE 


2646 


16547 


A 


2663 


233 


3 


GRJDNWLKTVAKPESLSGKRGPSLLRKVF 
NPFPVYILFLPQAPKGKWLSPFPPFFFF 
FFFLRGSLAM* PGCSAVAQS 


2647 


16548 


A 


2664 


96 


349 


QMPSSVLTRVANAYYSGFFFWFLKQKFC 
FVTQAEVQGRS IS* PGPLVPGFKRFLGL 
TLLSTWNCRRAPPGLVTACGFKMSLLTL 


2648 


16549 


A 


2665 


322 


74 


DTYTLPRLNQVESLNRPITGSEIEAIIN 
RPPTKKSPGPDRFKAKFYQRYKEELVSF 
LLKLFQ* IEKGGIGWARWLMPVIPVL 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=GIycine, H=Histidine, I=Iso!eucine, 
KHLysine, L=Leucine, M=Methionine, 
N-Asparagine, P^Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 


2649 


16550 


A 


2666 


2 


228 


NLALSPRLECKSQGFTMLIRLVSVS* PC 
DLPASASQSAGI MGVSHHAQPFLHSYLR 
PLILIKGDQFKYAFNLFSC 


2650 


16551 


A 


2667 


129 


301 


VRPLAGLGSPLIFFFFFEKSLAFVAQAG 
GQRHNLG*LLPLPPGLKGFSCLNLPSTW 
D 


2651 


16552 


A 


2668 


318 


21 


FSLLVKDFQNDLAVWSSSIFPSTFLFLV 
RS*LDFVNLNFFNFYLFRDEFSLCCPGW 
TRIPRLKQSSRLRLLSSWGYKCVPLHPA 
INRFL*CDFSISGY 


2652 


16553 


A 


2669 


2 


322 


QDGLDLLTS * STRLGL PKC WD YRRBP PR 
PAPINKCLLSIYHGPGPSLGTEGEKGQD 
YIPAIEQIiTDQWRRLTWTGCYCAAVAED 
SMGWWPLLPGYEAEKEAGRFH 


2653 


16554 


A 


2670 


29 


261 


EFNTLSKCVWIY*LFSLSLSLSLCVCVC 
VCVCVCV CLCVSVCLFYGS P I CL.HFMHP 
DTLSEESFHFLQI ISKELLKC 


2654 


16555 


A 


2671 


5 


224 


GSAWWLTP VI PALWECKAGRSPEVGSLR 
PA* PRDPPALASESAGITGVKKNLIAEL 
WEVKSCILLIFISLDKV 


2655 


16556 


A 


2672 


126 


321 


WGRRGARARRDSHRALRVSPELALLELR 
PSTMAHI CNPSALGGRGGW IT* DQEFET 
NLANLEKPC 


2656 


16557 


A 


2673 


360 


74 


S RLLRKLRQEVGLN LGGRG CSG PRLHH C 
TLAWATERDPDS KNKTNQKNKQRR * *TL 
PETNSLLAWEPDCVCKTNKLATRLEIMA 
QKSCSQRPQYH 


2657 


16558 


A 


2674 


192 


357 


RIRKWLYYYYYYLLRWSLALS PRLECRG 
T I LAQCNFHL* S S SRS PASAS PVAGI T 


2658 


16559 


A 


2675 


236 


375 


EEVLYSRKDRHVASYSRK*W*RSGAEAH 
ACNP S TLGGRGGW I TR SG 


2659 


16560 


A 


2676 


236 


2 


ATKVS INS IGTLGVKVGGLLEGRRLRPG 
PKQSSCLGLQSSWDHR*VQPHPGNNFTL 
FVETRSMLARLVSNS WPQAI FQ 


2660 


16561 


A 


2677 


164 


389 


LTVQHGLRGLRKLTVMVESEGQASHLLH 
KAAGRRS ECQQGNCQMLI KS S YQP CAEA 
HACNPSTVGG*GERITRSG 


2661 


16562 


A 


2678 


208 


1 


CSVYEGSCFCILRVRRVCVCVCVCVCVC 
VCARSCIYVRSENH*KQLPSSSILT*RF 
* IQSTFRSNKRLR 


2662 


16563 


A 


2679 


159 


382 


RWLIKNHPTQARIQVRGLLDRDCQTQTW 
LWESGVQPLATTPRRHSEMGGWPGAVAH 
ACNPS TLGGRGG * ITPPRPESKLEDCLT 
GTARHRLG CGSQGCS LWLPPPGG IQICWG 
VGRAQWLT P VI PALWVAEAGRS LEVR 


2663 


16564 


A 


2680 


29 


367 


DCQSEQLRKl J RWEHHW r SQSV*GCSQP*S 
HHWTPAWVTEQDPVQNKQTKNYNTHQRA 
GEVWRB I NCLSTEQLTEL YSE I TQ I LAIi 
SVRN* KLL * YVKS S VI SAETQI SGGKI F 


2664 


16565 


A 


2681 


256 


1 


ENIVPVRFLCIHRLFAILIMVQ*LERKR 
DHIVIHLNLTLETVYLKXWQTRPNAVAH 
ACNPSTLGGQGRRI TRSRARDYPGQHGE 
T 


2665 


16566 


A 


2682 


1 


395 


LLIEHILIAMALLILTERKILGCIQLRK 
GPNWGP YGLLRPFTDAI KLFARE PLQP 
VTSTITLYMTAPTLAVTMAVLL * TPLRI 
PNPLVNHDLGLLCILATSSLADY* IL*S 
R * ASHSNYAL IGALRACPK 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location . 
correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence <A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
OGIycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=XJnknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


2666 


16567 


A 


2683 


3 


349 


VRIDNAAVCLLY* AWAEFIWALANLI IR 
PDLDHPGNLLGNDHI YNVI VTAHAFVI I 
FFIGIPI I YGGFGN*LVPLI IGAPDMAF 
ARITNIRF*LLPPSLLLLLASAIVEAGA 
AGT 


2667 


16568 


A 


2684 


99 


2 


LTVAHACNPSTLGGQSG* ITRSGVREQP 
GQHG 


2668 


16569 


A 


2685 


325 


4 


SSFPMSF I INS Fill FLYTT I FLI FYDV 
VYSPEFKP*VCFCLTECWDYRREFQCSF 
LPCKRV*VLAPFIKNFFFFFFFLRWNFI 
FVAQAGVQWRHLGSLQPPPPNFK 


2669 


16570 


A 


2686 


382 


269 


MGFHHVAQADLELLGTNDPPALASQSAG 
ITGVSHHAQPKLTF* TFS Y* ACVGGS S P 
QFILWQHGLAGNSFCCF 


2670 


16571 


A 


2687 


3 


381 


GHL YALTFFFNSQ I CLFPRI YKTCKKSF 
WLPFLNLVSLCAIEGYKTKKVPSNV* KG 
IFIHHQTGFILRMQGWLNI*TSINGIYH 
ISRIEDKNHKTMFIDAEKWFHNI*HPFI 
IENIRPGWAKICN 


2671 


16572 


A 


2688 


289 


91 


IKYKELGVACSKTSRNVSYYCYHHHHHH 
RCHHYHHHS *RLAALCEESGWRKALGST 
SMGRVAFHRH ; 


2672 


36573 


A 


2689 


3 


211 


LHHVGQDGLDLLTL*SAPLGLPKCWDYR 
TEPPFPAPNEFLLLHSLLSKHYALGRPH 
TPRERKKASGILR 


2673 


16574 


A 


2690 


2 


197 


DLLTS*SAHFGLPECWDYRREPPRPASY 
AF I K YFC PALS C F WCI ETGP YL I FAYLR 
EESFVRSRT 


2674 


16575 


A 


2691 


253 


356 


AVPVKMAIVKKTRNNRCW*GCGEIGTFL 
HCWWEC 


2675 


16576 


A 


2692 


211 


339 


PGHPLS 1 1 LQW I WGEDTRRG PMHEALH * 
PCCSELRSCHCTPAWPTE*NS I F*KKKK 
KWLGELKEKQKNMACANDP* PGHPLS 1 1 
LQWIWGEDTRRGPMHEALHGKSGDGVYA 
VRLLTAY 


2676 


16577 


A 


2693 


1 


175 


RHEGLNLGGRGCGEPSSCHCTPAWVTE* 
DSVSKKKRKGKKKSALILIFSNGVLFCH 
LG 


2677 


16578 


A 


2694 


234 


13 


KTSREPWSEFLS IKGEQQCSFGLLYVFC 
FQMESHYAAQAGV* WCNLG * LQP PPTGF 
KQFSCLSPLSPSSPASAS 


2678 


16579 


A 


2695 


149 


2 


SQGEDFTKTRALQWRSRRSWAHACNPS 
TLGGRGWWI T*GQE FKTSLAN 


2679 


16580 


A 


2696 


2 


194 


CIGLGWAGACS SRLRQENDMNLGGGAC 
S ETR S HHCTP AW VT E * DS I S KKKRKRNR 
WGGTRFSN 


2680 


16581 


A 


2697 


338 


357 


YLIRRKKISNSKS+ FSWPGWAYACNPS 
TLGGRGRRTTRSAVRNQPGQHGDPKFFL 
VSS 


2681 


16582 


A 


2698 


53 


293 


TISIKSKKWKNKQGEYNKWCMEWWLAIC 
RRLKLCPYVLVHS C I AI KN YLRLGNL * R 
KEV*LAHSSTCCTGNMAEKPHGRR 


2682 


16583 


A 


2699 


2 


335 


LYAAKVPLQKACIRTFSYISFFIFFFFF 
FFGKQSSFGPPGLKARGGTPLIGTPPLG 
VKRNFLPQFSGKPGITGCPPLPQKFWFF 
KKKRGLIF*PGGVPNSDPKGIGPPNPP 


2683 


16584 


A 


2700 


166 


3 


DYRRVPPRLVKR * GFTW*AGV* WRNHG 
SL.QTQTI GLKQ3 SHLS LATCWDYRYE 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=G!utamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Th reo n in e, V=V aline, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 


2684 


16585 


A 


2701 


209 


340 


YRMTVFPARHGG5HTCNPSTLGGRGGWM 
T* CHE I ETTLGNMVKP 


2685 


16586 


A 


2702 


78 


341 


EVACNCLLPAIGCFASVTYV* SSVLL* F 
EL*VLNENSSF*NFIH*FF*NRVLICCP 
DWSVMA*S* FTEALTSQTIRSSHVSLPS 
SWDY 


2686 


16587 


A 


2703 


258 


185 


TMIAPVHSSLGDRARPYLFKQKR*GLAL 
SPRLEWTGAI IVHRNLEFLCSSNPPTSA 
SGVTGITEMEVERHGWIQETFWK 


2687 


16588 


A 


2704 


361 


41 


NFGPVYKTNPGPSRPGWGPLGKPPGPQK 
FFPFGVPPSPPFPKKGSVFSPPGGQKGP 
FKGVFWGFP * PRGKKTPTQQPGPSPKVP 
NWDFPKGGFPKKKKKKRKILRN 


2688 


16589 


A 


2705 


12 


329 


SCTLQLTTPRLK*SSRPSLPNN*DYRHV 
PPHPASFP*FSEMF*DSTQIFLNSIjKFS 
S * YYLVIFFFKMEPCSITQDGVQWCDLS 
SLQPLPPRFKRFSCLSLPSTRP 


2689 


16590 


A 


2706 


232 


1 


KRKRKS FKTY YKKFFLNSSFKNQCPYAI 
GHI FRS I IFFFFLRQSFGLVA* AGVQWQ 
DLGSPQPPPPGFKPFSHLRLR 


2690 


16591 


A 


2707 


135 


1 


CFFLIFFFFSSDTQAGV*WPQPPGIKRS 
SQLSSPRSWDYRRISPR 


2691 


16592 


A 


2708 


1 


357 


LDLLTS *SARLHHPKCWDYRREPPRPAD 
YKYFLKEVASLIVKLYLFCKLNFE 


2692 


16593 


A 


2709 


88 


344 


MQEHYYFIKRGNKMI IRKYVQLHANKLD 
SLDKIDKFQVKNHQN*L* KK* KSWPGAV 
AHACN PS TLGGRGGW I T KS * DRDHPWLT 
C 


2693 


16594 


A 


2710 


1 


383 


LHKVTTVLRFLS KFHILVLYFYYTCVHV 
SVSDMWCWFLY* I FTDMYNAVFQ* LDFF 
KLS I KFLRCI C* CVY* I FI PVFI YCFFL 
YLICLFLSYFLFNIYVMLLFYILCIIYF 
YVIFSSSSIF 


2694 


16595 


A 


2711 


227 


1 


IFFFSNSFLKISLGFFLNFAFF*KPKFF 
PFLSFKINQNFFFGFPFNFLKI FFKFFP 
NKFFKF* IYLTFFKKTKPK 


2695 


16596 


A 


2712 


1 


136 


KNTKISQGWWPTPVIPATQEPEAGGSLE 
RGRQRLQ*RNLGSLQPLPPTFKRSSCLR 
LLSSWDYRRRPPPLANFCIF 


2696 


16597 


A 


2713 


3 


349 


LI PRGQGSTWLPYNPATS I FGNDLNE I 
KMYGHAKTCIWMFMTSSFI IDRTRKRQ* 
CSSVREWINKQ* CIQTMELVFGRNFFFF 
FFFFFFFFFFFFFFGGPPPPKIQIFKEI 
NPQ 


2697 


16598 


A 


2714 


181 


342 


TSQLLNLPLP I I KA* AIRQEKE IKGIQL 
GKEEVKLFLFAHDTIVYP 


2698 


16599 


A 


2715 


3 


205 


FRHVAQAGLELLSASSLPTLASRSAG IT 
GVSHCVQP* TVS SLRLGFYDTS FKI AD I 
QSCFSWVLYYCLQSAEL 


2699 


16600 


A 


2716 


388 


23 


ASNQSKNSFEKGEKNQQSVMVKTSQQAL 
NKRAFFDMIRSVYQKTYK*RNA* *ERQS 
FSLKVENKKEYPSSLLLFNIVLKVLVNT 
GHKEK* KAHRFDRNI IQFANDMI VYVEN 
PKDSTKRLS 


2700 


16601 


A 


2717 


4 


194 


FSCLSLLSSWDYRSMPPHPANFLYF* *R 
RGFTMLARLVLNSWPHSAG ITGVSHHAQ 
PFNYFFI 


2701 


16602 


A 


2718 


i 


212 


LCCPGV2 WETARLKRSS RLS LPKCWDYRK 
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nucleotide 
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ing to first 
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Predict- 
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nucle- 
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sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V^Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possibte 
nucleotide insertion 














ESLHLAFGFFLL*YFGPYLMVYVFSFNL 
YDFSLVYSL.YGTSI 


2702 


16603 


A 


2719 


53 


299 


KAVHVNIFFITKVHLTMETQIVSLSLFP 
FPSFLHPSLSLSSFL*RDRVLLCYPG*S 
AWQS*FTWPQIPGLR*ASCISLVSS 


2703 


16604 


A 


2720 


139 


320 


QPTMRHSFSF*KQEISIFLKVSSSLHHT 
LRFLFFFFLLDRVSLCHPGWSTVALS*L 
TVAW 


2704 


16605 


A 


2721 


1 


99 


IILAYSSITHIG*IIAVLPYNPNITILN 
LTIYM 


2705 


16606 


A 


2722 


215 


361 


LTFFFFFFERESNFVTQAEMQGGNLV*L 
KPRPPGLKHFYYLSLQSTGNY 


2706 


16607 


A 


2723 


335 


24 


AIPLRPEF*THPGQNGKTRFPLKPQNLT 
GVGGNARYSQFPKRLRLKIPLNPEVGPS 
INPNFHPGIPFWAQKENFFPKKKKKKEI 
IMLNFSLICYNLFFLYICY 


2707 


16608 


A 


2724 


27 


314 


WKQLKYPPSDEWINKMWCIHAMEYLAIK 
RNKVQLCVT*RWRWLMDTKKK 


2708 


16609 


A 


2726 


1 


112 


GFHHVSQDGLNLLTL* STHLSLPKCWDY 
RHEPLCQAR 


2709 


16610 


A 


2727 


271 


3 


RS PMLKNQAI KSVI MVYIGTI WRSAGPF 
AIFFIAEYTNIIIIITLTTTIFLGTTYD 
ALSHELYTTYFVTKAVLLTSLFL* IRTA 
YPRFR 


2710 


16611 


A 


2728 


121 


1 


ENKYR PGAVAQ AY I PNTLGG * GG W I T * G 
REFETSLTNVEK 


2711 


16612 


A 


2729 


2 


290 


NRHFTKENIQMPTRHKKRDPQSSLVIRE 
MQTKTKVR YY F I LTR MAXVKK I DNTTC * 
* GYEKLNS PKLQTTKCPI FEWVWKLWHS 
QIVEYFSGIQHA 


2712 


16613 


A 


2730 


98 


264 


LVSSE * * * I YQILFFFPEMECHS VSQW 
VRGSNLC* liHPLPPDFKRFSCFSYWEG 


2713 


16614 


A 


2731 


2 


286 


PFYKATVIRTVWYW*KN*QIDQWNRKES 
PEIDLHK * SIiL I FAKGAKAIQWRKDSLF 
NKWC * NRW I STCKNESMLG WAHTCNPS 
TSGGQGGRIA 


2714 


16615 


A 


2732 


323 


181 


RDFVLLVE I TMLARLVS NS * P CDLPASA 
AQSAG I TGVSHRARP FLDL 


2715 


16616 


A 


2733 


272 


131 


GRVDRLNPGGGGCS * LGSCQCTPAW VKE 
TPSQKKKNKI ICFYAFLFI 


2716 


16617 


A 


2734 


227 


87 


AASTLTLTSLI P P I LTTLVNPNKKNS YP 
HYVKS IVASTFI ISLFPTTIFMCLDQEV 
IISN*H*ATMDFT 


2717 


16618 


A 


2736 


25 


415 


FSKYLLDHQLSAGLLVEQHRGSLCPLSL 
LHSDSSLSFAVSGKVSLAASIRNKLELP 
ET*RALMM*NHGHPLEFGGM**PWT*EP 
GS*RC * ELGQPGQARRLACNPSTLGGRG 
GWIMRSGVRDQPGQDVET 


2718 


16619 


A 


2737 


390 


1 


KGFFLETKRKFFFTNGFFPLLGGKGVPF 
PGKRLGPPILGFYPPQRVFPFPKPPFFP 
PGPFKRGQKPPGGGKNF*RGPPGPPKKG 
GAPTGKKKKKEEERKEKEKKTPRNISKP 
LSHPFPNFPFCATVRGF3 


2719 


16620 


A 


2738 


96 


331 


QFTYTKGLFPFNFNRLCTFFFS * KLVGG 
V*LCCPGWL*TPGLKQSSCLSLLSTWDY 
KGVPPCLAEHFFFFEKKSPLSL 


2720 


16621 


A 


2739 


379 


14 


LREVFPFFSPPKIKCFSKNSPQKYFPPP 
\nn?KTPPTPPFFFPPLFGKNFFFPSLFN 
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Predict- 
ed end 
nucle- 
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correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AJanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, KNHistidine, l=Isoleucine, 
K^Lysine, l^Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X-Unknown, *=Stop codon,/=possib!e 
nucleotide deletion, \=possib)e 
nucleotide insertion 














FCPPRGFF*RPPLFFFFFFFFFFFFFFF 
FFFFFFFKVFYSLLLLYFLFILK*MLTF 
KTS I STQCY 


2721 


16622 


A 


2740 


198 


38 


ANPLFFFFFFLRDGVLVYRPGWSAWQS 
* LTADS TP WAQV I LQPQRNP I EHDF 


2722 


16623 


A 


2741 


290 


39 


MSKVGPLLGVLGTSFRVLLKGPEIKGDP 
LLANPLFFLFFFLRDGVLVYRPGWSAW 
QS*LTADSTPWAQV I LQPQRNP I EHDF 


2723 


16624 


A 


2742 


1 


399 


ILIRKHSDIGTLYLLFDA*TGTIGTDLS 
VLNRTELEPDQATLLGNDPVHEVIDTAH 
ALGI I FFIVIPII I GGFGN*LVPLI IGA 
PDMAFARINNI SF* LLPTSLLLLLASAI 
EEAVPGTG* TVYPSLA*NYCR 


2724 


16625 


A 


2743 


398 


2 


SPPPLIFFLWGFSYIFPPPKKFFFLINP 
PPKFSPPPFFFKP?PPFFFFFPP*EEKK 
NFCSPPPFSPPPPFFLLPPPLFFFFFFS 
LKKKKKKKKKKKKKKKKKKKKKKKKKKK 
KKKKKKKRARSRSRTSPTRP 


2725 


16626 


A 


2744 


295 


185 


FFFFFFFFFFFFFFFFFFFFFFFFFFLF 
FFLFXFFXF 


2726 


16627 


A 


2745 


374 


4 


LKKHKSTRVP*NVKSGK*NFSPPFKIRP 
WAQKRAKKKGAREKKAD*EKGEFGKLKK 
KKS FRPGKKV* G * PKGFQRNFEKKPVKS 
QGPVNMGVIPAFEKKPRIPPVAKIPI*G 
AQKKKGRPLAI 


2727 


16628 


A 


2746 


2 


382 


QDATAPIIDELISFHDHALIIICLICFL 
ILDALFLTLTAKLTNTNISDAQEIETV* 
TILPAI I LDLI ALPSLRI LY I TDEVNDA 
SLTI KSIGHQW Y * TYEYTDYGGLVFNS Y 
ILPPLFLEPRDLGLL 


2728 


16629 


A 


2747 


2 


374 


DWPPTVKRKTNPRAQSTAADRFILLFTV 
RGLTG I GLANS S LD I VLHDT Y YWAKFH 
YVLSLGAVFAI IGGFIH*FPLFSGYTLD 
QTYAKIHFTIIFIGVHLTFFPQHFLGLS 
GMPRRYSDYPDA 


2729 


16630 


A 


2748 


55 


230 


QI PAK* INKYLQN* KKKKKKKKKKKKKK 
KKKGACL KKPHGGAKEKS WGRQ KLFS FK 
GG 


2730 


16631 


A 


2749 


161 


3 


NGNYTL VKKCFNTKEKIKWVKN * I FFFF 
MRWSFTLVAQAGVQWCKLGSLQPL 


2731 


16632 


A 


2750 


362 


1 


PEKKSFAI KGVKKAGKQCKFCDAPKKGN 
PPKKKNPTGTPKRVLKK* PPREEKKNS P 
QRKKKNTAKNKEKEAPQEE I * KGPPRYI 
FFFISFFFFFFFYFFFFFFSIVINLFGT 
FIATTLEA 


2732 


16633 


A 


2751 


311 


110 


ITLPRHGGPHVQSQLFKRLRWKDHMSLG 
DRG CSELSSCHCTP VWRQS KT F S KI KYG 
RNGTDKAATCNPGRFRSWGRKVA*GQES 
ETSLSNSRTRL* KNKI WSKRDRQGGYL 


2733 


16634 


A 


2752 


223 


16 


KKTPQKKKNTEGKGPVKKSPPQKFLKST 
PP* IFFYVFCFFFFFLYFI YLLFRIFSF 
FFILFVYYLFILP 


2734 


16635 


A 


2753 


367 


10 


PAPRGGVYRGRQASLSCSGLKPLRASWP 
LCSPTQA*AMAGAPPPASLPPCGLISDC 
CASNERGSVGVALS * SGAGDNLLVCRLL 
SGKCRNHPSSASLTLAAVDWSCSYSAIL 
APPLKNS 


\2135 


16636 


A 


2754 


14 


341 


PG PN F * FGGON SPG KNONLS LPT* KGNG 
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ed end 
nucle- 
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correspon 
ding to 
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acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
&=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Iso!eucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, ^possible 
nucleotide insertion 














PGNPKNGGPTVPGTPPETRGSLLRTLQG 
PLRPSGTPPTRRAPGTPVGNPGF* PRDF 
FAQKGPNVREGLKQGPNPKGLNPPP 


2736 


16637 


A 


2755 


15 


172 


HSWWECKLVQPL*KTVRRFLKKLKIDPP 
YYPAI PLLG IY SEERKSVKQSRRG 


2737 


16638 


A 


2756 


263 


2 


QKPLPPRNDPVFPKGPERTPRPTWGFIP 
GGVPLGKFFTLLVSPFFFFFSPSPNLFF 
F*NRVSLCRPGWSAW*SWLTAASTSQA 
QAI 


2738 


16639 


A 


2757 


311 


106 


FVCFLRQRLALL PLE CSCV I LAHCNLRL 
LCSSDSPG*QSETVSKKKKKKRKKEKET 
GNL * RS LDMKGR 


2739 


16640 


A 


2758 


23 


328 


NITYYT* LFVLY FYVPGHTWFFFXFXF 
FFFFFFFFFFFFFFFFFXFFFFFFXFFF 
FFFXFFFFFFFFFFFFFFXXFFFFFFFF 
FFFFXFFFFFFFFFFFFF 


2740 


16641 


A 


2759 


1 


338 


DR * L FS TNHG YI G TL YLL FG A * AGALG T 
ALSIjLMRAELGQPGNLIjGLDHTYNGIDT 
AHAFAI I LFMS LA 1 1 1 RS FGN * LVTL 1 1 
GAPVLAAPRITDISW*L*PHSVLLLLA 


2741 


16642 


A 


2760 


120 


2 


I N KYAWLGTVAHACNPSTLGGRWTA* GQ 
EFKTSLANMVK 


2742 


16643 


A 


2761 


251 


3 


CRGQIGTPTLKYSSSSSSKWIKDLNVRV 
KTIKLSENTGVNLRELELGDCFLDDTKS 
KNS TR KI R K * S L KLE T FF LGGTR 


2743 


16644 


A 


2762 


n 


303 


ALVLKR S S CLDLLKCWD YRHE PLHLALS 
EILMYRFCL* I FDI FFYWYSREIDLI FV 
Y*FFFFFLETGFWFLLPRGDLG*WEPPP 
PGYKGSSCLGLPR 


2744 


16645 


A 


2763 


236 


356 


DWLI FVFSVERGFCHVAQVG1»*LLGSGD 
PPAS AS QGAG I T 


2745 


16646 


A 


2764 


2 


332 


LTQTPGLK* SACLCLPECWNYRCEPPHP 
AGFLF * RH FDKGVS * LLI Y PGTGWS 


2746 


16647 


A 


2765 


297 


19 


KKKSWFFFKKKKCWGGATKFKN*RGFFL 
KFVFYFFFIG*GVFFFFFFLI S I IIFFF 
FFFFFFFFFFFVFFYFFFFYFFFFFFLF 
FAMKINVFY 


2747 


16648 


A 


2766 


1 


147 


Q I LRRLRW ENC VNLGGGGC S E PRLHH CT ! 
PAWVTV*NSVSKKKKNPLIiFF 


2748 


16649 


A 


2767 


357 


183 


NWDYRPLS PRPGKFLDF* VNRGFPLLTK 
LVSNF * PCNLPPSTS PKAG I SGFNPRAQ 
p 


2749 


16650 


A 


2768 


2 


193 


RWES CLN PGGGG Y S E PRS CHCTP S WVTE 
*DSVSTTTTKEKERKQNRTWSESNSLQK 
YKDTNHF 


2750 


16651 


A 


2769 


333 


173 


VS QDGLDLLTS * STHLGLP KCWN YRRE P 
PRPAQKLLMVIWIiGFMSSPKSPLEL 


2751 


16652 


A 


2770 


3 


331 


CMENCMV I PPKKVKHKNYHM I QKF YF WK 
AVQQLSTELNIL* PYDPAITLLGI YPKE 
IiKTSFRTKTCT*MLTASLFVIAKTWKQP 
RCPSVGE * LSNL * YVQTMECYS VLK 


2752 


16653 


A 


2771 


209 


55 


RPGRPQVLRRVRPQNRLNPGGGGFSEPK 
*GPCPPAWGAQRDFISKKKKTIT 


2753 


16654 


A 


2772 


213 


57 


RPGRPQLLRRLRLQNRL^PGGGGCSEPR 
*GPCPPAIWTQRDFISKKKKTIT 


2754 


16655 


A 


2773 


148 


1 


KATGRKTtWKSRCWGTVAHTCNPSTLG 
GRGR* TARAQEFKNSLRTLAK 


2755 


16656 


A 


2774 


2 


364 


W 3 AVRRDVJ I TALQ PERQRET P SQKKKKK 
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beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
^Glutamic Acid, F=Pbenylalanine, 
G=Glycine, H=Histidine, ]=lso!eucine, 
K=Lysine, L=Leucine, M^Methionine, 
N=Asparagine, P=Profine, 
Q=Glutamine, R=Arginine, S=Serine, 
T^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KKKKKKKKKKKGGVRHGVKSRKP*TSGR 
PGRKNHWRQ I QGQP KTPRERAEALKNRV 
GLKKLFKTPGHVNGAGNPKFRNGKSGKP 
PEVHLNGAW 


2756 


16657 


A 


2775 


121 


2 


HLRSGVSDQPGRHGKAPSLLKIEKLAGS 
GGTCL*SQLLG 


2757 


16658 


A 


2776 


195 


3 


GRVDINTLLALLLI IITF*LPQLIGYIE 
KSTPYE CGFDP I S PARVPFS I KFFLAAI 
PFLLFDL 


2758 


16659 


A 


2777 


3 


285 


FLNIRNKVSLCCPS *A*TPGLKQSCCLG 
LPKODYR 


2759 


16660 


A 


2778 


135 


379 


HWSATVKLYAI LLGLLKWESGTLLCSKD 
VLFFt,*G*SSAYSI3SVIMQIKAT*VKG 
KARVQLGAKKLKAYWQRKSPGI PAG 


2760 


16661 


A 


2779 


364 


1 


GSFTGAVILI IAHGLTSSLLFCIANSNY 
ERTHSRI I ILSQGLPTLLPLIAF* *LLA 
SLANLPLP PT INLLG ELS VL VTTFS * SN 
ITLLLTGLNILVTALYSLYI FTTTQWGS 
LTHHINNIK 


2761 


16662 


A 


2780 


356 


1 


GCLRAHIWPQKGNHEGQVHLFIDKVCRQ 
PMTEDCINEITTQVAQIFLVHFLLRQSL 
TLS E KRKKRKKKRKKRNR I E S PE I NTH I 
YCQLIFNKGAKKIK*G*NSLFNKTYLDP 
WISTCK 


2762 


16663 


A 


2781 


1 


204 


AQVGLQDASSPRIEELITFHDHALII I F 
L I C F LVL YALF LTLTT * LTNTN I S DAQ D 
SETDCYSQSYIL 


2763 


16664 


A 


2782 


1 


522 


YKCNECGKVFNQQAHLAQHQRVHTGEKP 
YKCNECGKTFSQMSNLVYHHRLHSGEKP 

* KCNECGKTFSQMSKLVYHHRLHSGEKP 

* KCNECGNTFKHNSTLVSHKAIHTGEKL 
YKCNECGKVFNQKTTLARHHRIHTAEKL 
YK* EECDKVFGCKSNLETHKKMQFSKTD 
SAFSLQ 


2764 


16665 


A 


2783 


1 


327 


ENRLNSGGGGCSEPRSHHCTPAWATERD 
SVSEKEKEQKQNFHLNAQSNCQKLKSP* 
KYLKHPEKTDILSKASQYNNQLTADQSI 
LKFKTKTFDQKKGRP KGHGMT YSNS 


2765 


16666 


A 


2784 


335 


128 


FSLIFCRAGILPCCPGWS*TPVLKQSSC 
SSLPKCSDYKQEPPYLACATLKCYQI PN 
FYCWPHI FKRMFY 


2766 


16667 


A 


2785 


208 


3 


RPICPLSLWGVSFLFFFFFLRDRVLICH 
TGYSAVAQYCNTAHCYSPRLK* SSCLSL 
PSSWNHNLIPPR 


2767 


16668 


A 


2786 


49 


332 


VEMGNSPINRKYVYPKSYNRCLKCNTEE 
GVUVDIG IAEFNTCSKSUjLLI FFKGR I 
LLCCPGWIEWQSQLTASSTLGLK*TFH 
LSLFGSWDHR 


2768 


16669 


A 


2787 


331 


187 


GCSEPRSPPCNAAWATK3DSVSKEKKKN 
KKREEKKH*HNTSGEKINLI 


2769 


16670 


A 


2788 


325 


34 


RSQLRGLPSMS I S * I * NSRLR * LR PRRL 
FIFCRDRVSLCCSGWSQTPGFKQSTHRS 
LPKCWDYRREPSPLAQIRYLGLFFSQGL 
SSAFCYVAMVSGF 


2770 


16671 


A 


2789 


3 


150 


AASTRQLIFHFTSKHHFGFEAAA* YWHF 
TOW* LFLYVS I YWLG 


2771 


16672 


A 


2790 


86 


311 


NI HPLNDI FTRLKNGFKKKE I SLVKNEP 
NKGT*SLISIKVLFKT*K*FIRPGVAAK 
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Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=G!utamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, l=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P^ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
AY=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop cod on, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














T*NPSTLGGQGGWIT*SQE 


2772 


16673 


A 


2791 


2 


123 


GGGACSEPRSHLCTPAWETE*DSISEKK 
EEKK*LQIYTNI 


2773 


36674 


A 


2792 


302 


162 


PLLRRLRPENCFNLLGGGFN* PRSRNCP 
PAWATKRDSLLKKNLKNF 


2774 


16675 


A 


2793 


1 


258 


GGCS GLRLCHCTPAWETD * DP VSKKKKK 
KNFTRAKIaLIFGTLVLGKVKRGGPFKPR 
ILKLPWETWQNLVFTKKKKI FFLKNRGG 
GE 


2775 


16676 


A 


2794 


199 


3 


VGP I FRHDKPPHSKQLCTY S PAFTQLFS 
SGKILKNIHLWPG?VAHACNPSTLGG*A 
RWITRSGDR 


2776 


16677 


A 


2795 


1 


302 


GGCSEPRSCHCIPAWATEPDSI*KINK* 
INK* K* *KINRNKKGKNHFLSSST* * PQ 
VASS FLTGQHRYTKL* SSQKVLLDSATL 
QSKSNAEVKRVNRTTD 


2777 


16678 


A 


2796 


228 


326 


NDNGQSGWAHACNPS ILGGQGGWI I * G 
REFKT 


2778 


16679 


A 


2797 


270 


1 


KFGISAPFAPSPKV*KRGFF*NFLGNGP 
PVKIPPF*TPL*NFSKPGKNFPPFF*KT 
RFSKIFPNGFFFFFFFFEGVLLCCPGWS 
ALTRSW 


2779 


16680 


A 


2798 


307 


36 


FKNFCCG*EVSLCCPGWSQAPGVK*PSC 
LGFPKCWDYRCELPCLASCSL*CDRNQT 
RISAWLWI EFHLSP I LPVLPLFLKKPQ 
AGLLCF 


2780 


166SI 


A 


2799 


132 


5 


I FEVTV* CTK*HKRNMQPGWGHACNPS 
TLGGQGGWIMRSGV 


2781 


16682 


A 


2800 


32 


314 


KQHPGNGPNPSGKGPGRASRFLKKKNFF 
F * KT P P KKPHT PCKP S PKGKG P E RL PWH 
FFNQKFGPLPMGPNPDQRAGFCLRDP*G 
GGKNRSPPTL 


2782 


16683 


A 


2801 


139 


3 


AASTFY I FFETTL I PTIAI I TR * GNQPE 
RLNAGTYFLFYTLVGEG 


2783 


16684 


A 


2802 


238 


3 


AASTSHVISSMYNIYIIIQFKTFPVFFF 
RDKVSLCHTGWSAVAQS *LTAAS AALTS 
CVQGILPTQPPEYHTRPLRLFL 


2784 


16685 


A 


2803 


110 


291 


KKLGVFGFGFGFKTKSNFIMQAKGQLPN 
LG*LKPLPRGFPQFSGLSLSGTWDYKHT 
PATR 


2785 


16686 


A 


2804 


97 


2 


S YNNQDS WLEKE KTNRS VKQN * AP I PE 
KKKKKNKVGGLTLPNCKTYYKATIIKTV 
WYWRKKRQIGQ*NRIESPEIDPHKY 


2786 


16687 


A 


2805 


268 


1 


ATFLLFYYVFRPQIPYYL*IFILSLF*Y 
RHSGPPYVGPPQQYPVQPPGPGPFYPGP 
GPGDFPCRLPIRKWWLVWLMPQLMPTVW 
PTIIT 


2787 


166S8 


A 


2806 


153 


3 


I PLPKGLLVPLFGVFLKVFFFFFFFF*D 
RVSPCRPDWSAVAQSRLTASS 


2788 


16689 


A 


2807 


3 


216 


NAARDHI VKPSP* PLTGALSALVMTSGL 
AM* CHSHS I SLLILS LLTSTLT I YQ * RR 
NVTR * TS YQGHHT PTDQKGLRYG 1 1 LF I 
TSEAFFLS * FF*AFYHSSLSPTPQLRGH 
WPPTGLSPLNPLEVPLLNTS ELLAS *VM 
PLSLHIASNTKPTNQHTNHIPMKAQCNT 
INLIPRPPHTN 


2789 


16690 


A 


2808 


168 


354 


FLERESGFVAQAEGQGQNLS*WKPFFPI 
LXKFSCLSLLRSWir/GPTSFVPAKVGGF 
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OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
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N=Asparagine, P=Proline, 
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T^Threonine, V-Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /-possible 
nucleotide deletion, \=possible 
nucleotide insertion 














WRNKIY 


2790 


16691 


A 


2809 


61 


360 


YVSNSKCSNHRK*SLFFFFFFERESSFV 
PQVELHGRDLG * LQLWLPGFGKFPGLTP 
LRNGDDGPRPQPPANLGLLVKTGFSPVA 
HLGVNLGTLGDCPALP 


2791 


16692 


A 


2810 


236 


3 


KMFFQMRYTMLKK*QATFGFILALKNVL 
KTFLAP I FFFLRRNSALVAQAGVQWRDL 
GPLPPLPSGLKRFSCLSLPSC 


2792 


16693 


A 


2811 


140 


210 


NAKITKC*KGYGKRGTLMHCCWQFKMGQ 
LLWKTGYQFLKKIiKLELLSNVPI P YL* I 
W KKGNP Y AL L LA I 


2793 


16694 


A 


2812 


230 


1 


VKLCPVSLKTGVRPLLTSSEVIFILNCK 
FNIGV*LLPGSLLLLLASAIVEAGAGSG 
* TVYPPLAGNYSHAEPYALV 


2794 


16695 


A 


2813 


3 


345 


HEVRIDVHTRTCFTFGTI I IAI PTGVKV 
FS*LATLHGSNMK* S AAVL * ALRF I FLF 
TVGGLTG I VLSNS S LDIVLHDTYYWAH 
FHYVLSIGTVFAI IGGFIH* FPLFSGYT 
LD 


2795 


16696 


A 


2814 


2 


184 


AR VG FHH I DQAGL KLLTL * SAYFGFPKC 
WDYRHEPPRLALKFNNIKKKKKKKKKTP 
GEKI 


2796 


16697 


A 


2815 


276 


3 


ARVHR IDHGLM * HQ PLGLK* SLCLSLPS 
SWD* * SMQPCPDDYGTRSDQKKPTFKSP 
YVSQTGLEHLGSSDLPASASQRAKITGM 
SHHALV 


2797 


16698 


A 


2S16 


3 


94 


HENQYSSLIIIMAIAIKLGIAPFHF*VP 

EV 


2798 


16699 


A 


2817 


93 


1 


GGGGCSEPRLHHCTPVWVTE *NPVSKKK 
TRA 


2799 


16700 


A 


2818 


1 


197 


GTRAQSLLLGRLRQKNQLNLGGRGCSEP 
KS GKLTPAWAT * PDL VSTKS CTL I YL VT 
QALHITLLP 


2800 


16701 


A 


2819 


109 


2 


GVF * G V FIiAQGLTLVAQAE VQ WHD LGS L 
QPCPPRLKRLV 


2801 


16702 


A 


2820 


2 


347 


ARAPGFAENEVFVFS S CFVWMVPGGKGE 
NAELMQPSSY*ESQHFFPAPPRCSSCV* 
LCSLGPSLLGTLI FCQFTLSELPRFRKG 
* FSSSLKKVFCFGDRVLLCHPGWSAWQ 
TQL 


2802 


16703 


A 


2821 


1 


353 


CTRRDVTRESTYGGHHRPPVQKGLPYGI 
ILFITSQVFFFT* FF*TFYHSTLTPTPQ 
L*GHWPPTGITPLNPLKVPLLNTSVLLP 
SRVS IT*AHHILI *NNRNQI ISSLLITI 
LLGLY 


2803 


16704 


A 


2822 


261 


400 


VEHSNSNKSNFLGQGTGCHACNLNTLGG 
RGGRITWRSGV*DQLDQH 


2804 


16705 


A 


2823 


273 


97 


LNTPRMQRLTQLATVILRFHKDQGFAML 
PRQVSNS * AQAI CHLGLPKC* DYRREP P 
GQK 


2805 


16706 


A 


2824 


159 


355 


KKFFSIFFFFFFFF*DRISLCHLGWSAV 
VQSWLTAAST 


2806 


16707 


A 


2825 


67 


338 


LSPEL*LY*IF*KPFS*QLSLTSHSPSL 
FSALLLLS ISPPTWLHSQSDKNLHLFSF 
LFRDS VLLCCLGYSAVT* SWFTVASNSW 
AQWLS 


2807 


16708 


A 


2826 


3 


359 


HEQKYS FLHDSQTLFCF * DS I PTPSNMD 
ETQQKSRLELVRISLLLIEFWLEPERLL 
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RSMVANNLVYDTSDSDDYHLLKDLQEGI 
QTLMGRLEYGSRRTGQMLKQTYSKFDTN 
SHNHDAL 


2808 


16709 


A 


2827 


3 


338 


LERNLDTTFLDPAGGGDPILYQHLF* FF 
GH P EAY I L I LPR FGI VS H I VS Y YFGRLE 
PSGYLRMG*PIISVGLLGFIV+SHHIFS 
VGVDI HARAYS YFGSLVMAI PCTLBVFT 


2809 


16710 


A 


2828 


1 


406 


RHEGEKLRRPTFGPRHRGAGTAKMSASL 
VRATVRAVSKRKLQPTRAALTLTPSAVN 
KIKQLLKDKPEHVGVKVGVRTRGCNGLS 
YTLEYTKTKGDSDEEVIQDGVRVFIEKK 
AQLTLLGTEMD YVED * LS S E FVF 


2810 


16711 


A 


2829 


332 


3 


GIIVMQATIATALIGYVLPLGQISL*GA 
TVIAYILAAIPFIGTDLVQ*I*VGCSVD 
SPTVTGFIiAFPFVLAFI IAAVAALLLRF 
LQGTGSPNPLGGASQSDKIAFQFSC 


2811 


16712 


A 


2830 


186 


349 


YSEREVAFFFFGNGFCF*TQAEWNGGNL 
G* LNLLPPR* KE FSCKI FPRTWNYS P 


2812 


16713 


A 


2831 


303 


590 


N I LTTLLNKQS KSNQLQNKI I HTVY I KI 
KIFFFFLRQSLSVTQADVQWHCLCSLQP 
PP PGFK* FS CLCLLSS WDYKHAPLHPAN 
FCIFSRDGISPS 


2813 


16714 


A 


2832 


1 


370 


EE L I T FRDHAV 1 1 S FL I CFLVLYAL FLT 
LTTKLTNTNI LDAQDIQTV* TILPAI IL 
VLIGLPFLRILYITDEVDDPSLTIK*NG 
HQ WD * TY * YTDYGGLI FNS Y I LRPLLLE 
PGELRLRDVDN 


2814 


16715 


A 


2833 


207 


I 


QF F I FLRHS FTLFAQAGMQWRDLGS LQP 
SPPGFK* FS YLSTLRGLARKITLAQEFQ 
TSLGNMGGPHPRA 


2815 


16716 


A 


2834 


223 


I 


DN LAH KG KTR VYLKS RNKLG KGGGAGNL 
VSLDS IGSRRDHRRAPPCPANFVFLV*M 
GFPHVGQDGPELLTSCSC 


2816 


16717 


A 


2835 

j 


1 


321 


GTR KPS P * P VTGALS ALLMTS GLTM * LH 
FHS ITLLILGLLTNTLTI YQ* WRDVARE 
STYQGHHTPPVQKGLRYGI ILFITSEGG 
LFAGFF*AFYHSSLDRTPQLGGQ 


2817 


16718 


A 


2836 


57 


370 


I WMGRVLWKDRLYGVFCRAFNRAVTSRV 
WAEDPWRVPKTLSVDPRKLPPFS* ISVR 
GQIYFT*FFFFFETESHTVAQAGVQWWG 
DLG* LQPSPPGFKRFCLSLP 


2818 


16719 


A 


2837 


2 


345 


ARAHRQLDEP*L*RRPGERHPSW*SEET 
VERQRTKT*SESSQTGTSITSSRNARRR 
ESEKSLHLETLNKEEDCHSPTFKPSTPD 
HPLKVMPAPSPKENAWVKRSSNPPARSQ 
SS 


2819 


16720 


A 


2838 


15 


376 


AKIEPLYSSLGNKSETPSQKGKKKKKKK 
KGKGGGEKKKKKKTEGGTRGLTQETQFF 
GNPKG P KHLGARN* KHRAKQAKRGNQPR 
EYRFTEEKKGEKLFNPGGGRSQKPKTGQ 
WNSAGGKK 


2820 


16721 


A 


2839 


3 


336 


HELLAS I LLI Y * * CRDVTRERI I PLAHH 
TPPVQKGLRYGI ILFITS* VCFLAGLF* 
YLYKS S LP STAQLRGHWPPTG I TPLNPL 
EVPLLITSVLLASLCSFI *AHHSLIEH 


2821 


16722 


A 


2840 


338 


3 


KI IFYLFFLRS I AFLAQIEGHWAVFSSF 
KLWPPGFKHFSASTFLVNRVFKGGPRAR 
VNFGFFI KSGFFP I F* VGF* LFFFFFFE 
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TES RSVAQAG VQWHDLCS LQAPPPS S C 


2822 


16723 


A 


2841 


2 


149 


ARGCSEPTLHPCTPAWVTE*DSISKKKK 
KKVECSNGNVCRTMCILCCFW 


2823 


16724 


A 


2842 


348 


175 


SACKWSITGMHCHA*VIVSVLLV*TGS 
LKVGQVGLELRTSDDMHTLAS VSGG I TG 
MS 


2824 


16725 


A 


2843 


338 


118 


D PGGG A CS E P R LRHCT PA W VTKRD FVS K 
KKKK* N I KC * HAK SGTVLYS FLYLFYS * 
YFLMRWSVH I PGPSTK 


2825 


16726 


A 


2844 


189 


47 


KCVALLLSLFLSCS VM* RNACFRFAFCH 
DCKFLEASPAMLPIQSVEL 


2826 


16727 


A 


2845 


335 


139 


EVSPSWPG*SOTPDLKRSACLSLSKCWD 
YRPGVSHRARLNGSFLTQLYEMLTYFPI 
IWVTLQVFR 


•2827 


16728 


A 


2846 


3 


352 


HEQRLT PEWKKAATALGD WKVGAVDAD 
KHHSLGGQYGVQGFPTIKIFGSNKNRP* 
DYQGG I TGBAI VDAALS ALRQLVKDRLG 
GRSGG YRSGLQGTS YRS S KKD VI ELTND 
SFDN 


2828 


16729 


A 


2847 


2 


269 


ARGLFSTNKKDIGTLYGLFGA*AGVLGT 
ALSLLIRAELGQPGHLLGRDDISRIHVK 
R I I PGCGTLDG I S VGRECACLWTPACSR 
KCARG 


2829 


16730 


A 


2848 


317 


28 


SFLPN* KLAGHGGRPL* FQVLRRVRLEG 
CFSLEG*GCSEPWSCPCPLTWATETDPF 
SREKKKRRRREEKAEKRRENRRVPSLSV 
FLEEYVELTLIL 


2830 


16731 


A 


2849 


326 


107 


FHHVGQDGLDLL?S*SAHLGFPKCWD*R 
CKPPCPASRNFKTLKLYH I FSNNLI E*N 
*KS I TRGTLETVQLPGN 


2831 


16732 


A 


2850 


224 


3 


SILQAWLKVPSPPGNPLGLLQSQVDFFF 
*RQGLALS PGLDCSGIY1I MAHCS LKLLGS 
TDPPTLAS*VAGTPSSC 


2832 


16733 


A 


2851 


27 


329 


VRTAAINRPGFLLPCFIGQNFFFY*RFE 
TYRLGV*MILTPH*YLLSSLSGLDCVIF 
FSFILFFYWFIVYCHVFMVYLFFFLFVI 
LFVLCVFWCFYFLLFV 


2833 


16734 


A 


2852 


351 


60 


NLLSSLQGGRLPRPPGLS PHPQI FTPHG 
GVPL*LPGPWKSRPEA£LNPRS*PPPGQ 
PRGNPFPPKKNFFFFETVLLCRPGWSAV 
AQFQKKKNLLIC 


2834 


16735 


A 


2853 


186 


365 


LTQLLNLI*YQNQIEYPQPYAQCGPRSG 
NAQQIGHSFPNG* QVPASGMHGLPWTHQ 
GLKK 


2835 


16736 


A 


2854 


17 


239 


HFTTLQISMSLLTETLARHDSTHLWSQL 
L*RLRRNDLLKSGAGGCSELGLHHCTPT 
WTTEQDPVSRPGAVASEH 


2836 


16737 


A 


2855 


325 


140 


GFHYVSQDGLDLLTS* SACLAFPKCWDY 
RHKPPCLAAYIFIQSRVFLAVISISTHS 
SFQLL 


2837 


16738 


A 


2856 


1 


157 


GTRGFHHVGQAGLEVLTSDNLPVSASQS 
AG I TGESH* AHERSDGYANTNS PG 


2838 


16739 


A 


2857 


157 


361 


LLLR I VS LFWRKS S PLGVQAEM YRPDLP 
VTKPPP?D*DKDSCLSLLITWDYRLAPP 
LPANLE F * I * TG 


2839 


16740 


A 


2858 


3 


346 


HEGNHKFKI INVHLR* LNQNL* SGLEGD 
KVI WRN I * TS VTFLFC *MNGXMNPDLFR 
QFTYEVGRIFFFSLKD*VV»LCCLGSSTV 
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DLGSLQPPPLGFKQFSCLSLPSSWEPPC 
PA 


2840 


16741 


A 


2859 


18 


233 


KN PT I LCCKGGGAQA* KPRLLGRLRQE K 
GLTPGGEGCRDLKSPLGIPPWATRPKLL 
LKKKKKKGGKKKTLF 


2841 


16742 


A 


2860 


356 


3 


RVNFKIFGKKGFCPGGPGGFEVLTPGDL 
GPLFFQKGGDS PSKPLKPFFFCFALKST 
TLNYFQFAFCVLRPTPN*SVTQAGMQ*H 
DHSSLQPQPPSLK*SSQLGFPSSWVYRH 
APPSC 


2842 


16743 


A 


2861 


115 


374 


IVCGKILNV*YTQ*HNKLQNKNKVD*PT 
IKLMLISQSLGILIFIQKRDFEVNYGMI 
LKKRGWPG PVAYVCNS KTLGGQGGQI T 


2843 


16744 


A 


2862 


163 


360 


LKRIVJLLILHSHPHTSPHDRGVTQLEAI 
GPRW Y * TYEYTDYGGLI FNS Y I LP PLFL 
EPGDLRLLDA 


2844 


16745 


A 


2863 


1 


236 


GTSPFFSFFFLSFFPSFLPFLSFFNPTT 
LLLSLHSNIS*KKVLFSFEMESHSVNRL 
ECSGT I SAHCS PAWATGTDS I S 


2845 


16746 


A 


2864 


67 


364 


VCVCVCVCVSVS VCVCVCVYI CTYI CMY 
VCA*YRIRRYIMLPTLVYNSCPEIHDSK 
S CALHTG AI TRATR * PCLCIHI PVCLT V 
CLTVWLHICLSVCMD 


2846 


16747 


A 


2865 


2 


155 


ARGLTQENRLNLGGGGCSELRSHHCIPA 
WATE * DALSQKRKKKKGTYRG 1 1 


2847 


16748 


A 


2866 


344 


81 


IQFNS F I FLKI FI SRDGALLCCPGWSPN 
SWAQQSSRLNLPKCWDYRHEPPHLWQF 
L* KHYMYKWHKVHCVFSHYNKI FTGQTK 
TEN 


2848 


16749 


A 


2867 


217 


340 


RVFSDFYMNLKWSLRKRRTWPGLVAHAC 
NPSTLAG*RGQIT 


2849 


16750 


A 


2868 


3 


349 


HEATSP I IQELI TFHDHALI I I ILICYL 
LLYALYLTLTTKLTI TIMSDAHE IHTA* 
TILPAIILDLIDLPCLRILYVTDEVNDP 
YLT I TS IGHQWY* T YEDTD YGGLI FNS Y 
ILT 


2850 


16751 


A 


2869 


1 


352 


GTR I D VYTRAYFTS ATI GIAI PTGVKVF 
S*LATLHGSNMK* SAAAL*ALGFIFLFT 
VSGLTGIVLANSSLDIVLHDTYYWAHF 
HYVLS IGAVFAI IRGFIH* FPLFSGYTL 
DQTYA 


2851 


16752 


A 


2870 


2 


342 


ARARFRTSLLLAFALL*LPWLQEAGAAQ 
TVPLTTLFDHTMLQDHRAHQLAIDTYQE 
FEETY I P KDQKDS FLHDSQTS FCYSDS I 
PTPSNMEETQQKSNLELLRIFLLLIESR 
L 


2852 


16753 


A 


2871 


254 


2 


YPGYQAISQSDMQSYAYRHL*LQPPGVK 
*SSIVSLLSS*DHRPVPPCLANEKNFFF 
* RQGLAMLQSRLVLNSWLLLYSHAEPRA 


2853 


16754 


A 


2872 


282 


3 


HTHILHYIYIHHGHIST*HLSPPTNIYS 
LCNIKILIYTPYDH* IIRIPLHQEHAVH 
RRRLIHIHIHTSVCVCVCVSVCVACVCV 
CVCVWVLV 


2854 


16755 


A 


2873 


278 


3 


LWPLKNSGPQPWPQMDLGKPQVKTPWY 
YKKGP*KKFFRKF* KWPGFFFFFFLRQS 
HSIAEAGVQWRDLGSLQPLPPGFKRFSC 
LNQISSC 


2855 


16756 


A 


2874 


2 


348 


ARA* S L I LV S L 1 1 PNATSNLLGLL P Y S F 
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TPTTQLS INLAMD I PL* AGTMVIGYSSK 
IREALAHFLPRGTPTHLI P ILAMIESMS 
liLIQPIAMAVRVTDNITAGHLTMPLTGS 
PTV 


2856 


16757 


A 


2875 


1 


260 


GTRENHLDPAGRGCSKSRSHHCI PAWVT 
D*DSLKKKKKKKPQI ILGNGGTEGNQK* 
RGNPFRGPWGPSRQKIKELGPGGKHGKS 
KF 


2857 


16758 


A 


2876 


176 


1 


EHIiFCKKLLGEEKGSLSKEFGKCWVNPR 
* KWNSRPGWAHTCNPSTLGGRGGQITR 
SG 


2858 


16759 


A 


2877 


168 


1 


GVRVFGFS SVPE FS FDTCAGAQWRNLGS 
LQPLPPGFK*FSCLSLLSSWDYKRTPRA 


2859 


16760 


A 


2878 


67 


333 


WQGLGRAVARVPRSYEITWAGNLKGAA* 
FQQLFFFFFFHLLFFFFFFFFFFFFFFF 
FFFFFFFFFFLFSFFFLFFFFFFFFFFF 
FFFFL 


2860 


16761 


A 


2879 


335 


1 


GRKEGRREKEKERKKKKERKKERKKT I F 
KKHIPSPLAFPRKAWKLKEIQIKLVAH* 
KINRQAIRGCPPNKRPHPFQNMIKPGRE 
EGEGGLETSRDGERNLWIFGPSVGPLR 


2861 


16762 


A 


2880 


3 


345 


HERHETASIILLIAILFNNILSGQ*TIT 
NTTNQYSSLII IMAIAIKLGIAPFHF* V 
PEVAQGTPLTSGLLLLT*QKLAP ISIIY 
Q I S PS LNVSLLLTLS I LS 1 1 AG S * GGLN 
QT 


2862 


16763 


A 


2881 


347 


156 


WLIFVLLVEMGFFHSGQAVLKLLiASSDP 
ASMTSQSSGITGLRHCAQFR*AFLIALC 
LPPSAKI 


2863 


16764 


A 


2882 


338 


2 


IKKAI>*HE*LREKKKSPIRTSQSSKAP* 
NLPRSKKKKESSNNYQLL+AHKLQKVKE 
MGKLLETPNLPLLSQKVAEPLNQPITSS 
KIETLIKKPYHPEKSYGHDEVTAXFSRA 


2864 


16765 


A 


2883 


206 


1 


IRTRPFTPMFIFYMFVYLATRSCPFTLA 
GVQWHKHGS LQL * PPGLKESSCLSLLSS 
WDYYACLHAELV 


2865 


16766 


A 


2884 


341 


140 


DGGCREPRSRHCSPAWVTE*DSKQNKTK 
QKQKRYTNISSPQTSPLCYSFSAYTPLH 
NMPGNIMFSSL 


2866 


16767 


A 


2885 


360 


247 


FHRVSQDGLDLLTS * SAHLSLPKCWDYR 
REPPRRPKAL 


2867 


16768 


A 


2886 


22? 


351 


KLNNRPGWAHAFNPSTL*KLNNRPGVV 
AHAFNPSTLGG*GGWIMRSGVRDQPNQH 
GETP 


2868 


16769 


A 


2887 


359 


2 


FYSSSSSSSELVPCRQDVQVPHYLSGLF 
LRSCFMEIKYDT* KRKIKYTHVHTHTHT 
HTHTHTQREKERDFPRFTAKWKANLEAG 
SGYATASS IRAVLWTTITLV 


2869 


16770 


A 


2888 


4 


305 


ADSHMWKYKAPG I TKI LLKKSKI * GYHQ 
DWISSRYQDLFHKDMIFKIMSYWYKKRQ 
VDHWDRIESPETSSQMYEHLIYWKDNIA 
E W* EKDS FLSKWCWDN 


2870 


16771 


A 


2889 


1 


436 


IAILT * YDYTLL* RVGPRGQEFGTRATV 
ITNLLSAIPYIGADWK*G*GGYSLDSP 
TLTRFFTFHFILPFI IASLATLHLLFLK 
* TLSNIPLRITSHSHQITFHPYYTDQYS 
LRLLP FLLS FTTLLL I LPNFLF YPP * TT 
LTFPF 
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Q=Glu tannine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop cod on, /=possible 
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2871 


16772 


A 


2890 


1 


218 


RKISPLIKLINHSFIDLPTPSNISA**N 
FGSLLGACL I LQ I TTGLFLAMHYSPDAS 
TAFS S IAHITRDVNYG * 1 1 RYLHANGAS 
IFFICLFLHIGRGLYYGSFLYSET*NIG 
I ILLLAT I ATAFIGYVLP *GSLLGACLI 
LQITTGLFLAMHYSPDASTAFSSIAHIT 
RDVNYG 


2872 


16773 


A 


2891 


3 


368 


LSNS * ANNLI * LAYTIAFI VKI PLYGLH 
L* LPKAHVEAP I AGS I VLAAVLLKLGGY 
GIIRLTLILNPLTKHIAYPFLVLSL*GI 
I ITS S I CLRQTDLKSLIAYS S I SHI ALV 
VTAILIQTP 


2873 


16774 


A 


2892 


3 


342 


HENLI*LAYTIAFIVKIPLYGLHL*LPK 
AHVEAP IAGS I VLAAVLLRLRG YGI IRL 
TLILNPLTKHIAYPFLELSL*GIIITSS 
ICLRQTDLKSLIAYSSISHIALWTAIL 
I 


2874 


16775 


A 


2893 


2 


361 


ARVCLRQTDLKSLIAYSSISHIALWTA 
I LIQTP * S F1GAV I LI IAHGLTS SLLFC 
LANSNYERTHSRI I I LSQGLQTLLPLI A 
F* *LLASLANLALPPTINLLGELSVIiVT 
TFS*SNIT 


2875 


16776 


A 


2894 


229 


3 


YT FVQ S F I FSWL F I CKVCLL * PAHS MV 
CACECMCVCVCLCVCLSSLKIIPIuLECL 
I YLQL I LVLSS AI YNSRFE 


2876 


16777 


A 


2895 


3 


391 


G FL I TNN I S PAS P FQTT I PLYLKLTALA 
DTFLGLLTALDLNYLTNKLKIKSPLCTF 
YFSNILGFYPS ITHRTIP YLGLLTSQNL 
PLLLLDLT*LEKLLPKTISQHQISTSII 
TSTQKG I I KLYFLS FFF 


2877 


16778 


A 


2896 


381 


2 


GRCRVSGS I IDHRMAPQKWRYTKGGPHQ 
WHNFCFF* KKGVFPYGPKVFLFRAPVFS 
PPGPPKRWEIKV*TPPPALFFFFFFFFV 
EMGSHCVAQAGLKLLGSSTPPTSASQGA 
EIAGVSHRAQPHAS 


2878 


16779 


A 


2897 


324 


1 


LYTNNTKH I NPQQNNTK* YQRNTKNRRK 
KEQQTKNNIKRHITESARKNS**HRNIQ 
KANKSQQSNKHYRARSRHNNWNNNNNNN 
KKKKI ENS HAAS D* ITSSGGRSRA 


2879 


16780 


A 


2898 


120 


345 


PPAPSS*YTTSLIQDRLFLMMAVLSSAS 
LMRGNVGSNIMNALSHFLPQGTATLFI P 
VLDIMEAISLLIOPIALAV 


2880 


16781 


A 


2899 


381 


219 


CVSQDGLDLLTS * STRLSLPKS WD YRCE 
P PRPAGS GHLYSS FKKQTGDTHHNF 


2881 


16782 


A 


2900 


1 


224 


LSWCLTLYFAYLLALFYFYFLKILDLAI 
LPRLVPYSWP*ASSPASASQSSGITGMS 
HYTWPLPQTLNPFLFFVP 


2882 


16783 


A 


2901 


1 


228 


KYLINNRLITTQQ*LIKLTSKQE4ITIHN 
T * GY * YNRSGSS FNGVYDYKLLRSESEH 
PWMIVDNTEYDEIYTRGGIE 


2883 


16784 


A 


2902 


2 


332 


LPPSFSLLLAPSPSSSLPPSPSSSLPPS 
PSSSLPPSPSSSLHTSHSSTLTPSASCT 
LTITESFTQRIiAQSYITILP*ASRTTMQ 
IiLKYRTLQTTRNNTHSMTRPKALTKI 


2884 


16785 


A 


2903 


3 


243 


DIGFGTDFSDMTPKAQTKQVDQLDFIKI 
*NFWPGTVAHTCNPSTLGGRGM* ITKLG 
V*DQPGQRGETTQKLAGHGGVDAV 


2885 


16786 


A 


2904 


179 


2 


CGYFNAYICYSLLCFLYLSIiCNQPFWKK 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K^Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=G!utamine, R=Arginine, S=Serine 7 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /-possible 
nucleotide deletion, \=possible 
nucleotide insertion 














IFLPLFGLVFFFFFF*NRV* FCCPGWSA 
VSQ 


2886 


16787 


A 


2905 


108 


2 


GRVD I KLTS KQM I TIHNTKGRT* SL ILV 
SLIIFIA 


2887 


16788 


A 


2906 


318 


2 


YKESNSKNNEKKKITRQRKRQKKNRPSM 
MVHTCNPSPMAGRDGQI PS I LFNQPHS P 
RI IRLTLILNPLTKHIAYPFLVLSL*GI 
I ITS S IGLRQTDLQSLI AYSS 


2888 


16789 


A 


2907 


17 


308 


KLAG YGGMCLWSKLLERPTRMNHLS PRS 
EGLMEP*SHNCIPDWTP*QNPVSVQITG 
FCFLTPNLAMPPRLNLSGPICCQLSLRL 
LGPRLFAASVSGV 


2889 


16790 


A 


2908 


378 


1 


RSLHS TETAAKTHGHQTHPGLSTFS K* T 
PTP IAS I PI PNPHNCL PAFNS TTSASYF 
SECVLTSSFKE I IADFR* KYSSTN* HP Y 
HLFLLKFVFFCSRDGSLTMLP IRPGLNS 
RSQAILPPWPPKVL 


2890 


16791 


A 


2909 


1 


129 


HLFGTNHRDIGTIiCLLFGA*AGVLGTAL 
SLLIRAELGQPSTRP 


2891 


16792 


A 


2910 


1 


323 


CVCWGYRIPRCNIGHHLKFFFFSLFFFF 
FGNKVWFCSEG*RAGDQLLIMEPLASGF 
KGIFFLNLPQNWE*RVSPKLPGKFWIF* 
LKTGFPLVAQWFELRTSGDQMA 


2892 


16793 


A 


2911 


367 


136 


ETPGNHLSPGV* GCSEL* SCLCTPAWAT 
EQDP I SKQKRTKKS KTLLKNTKADLTRW 
KH IHKS ACV C I CLCMYMCLRE 


2893 


16794 


A 


2912 


258 


140 


FLIFFCLDCFLNIKVLGVPGPFFSLKKK 
QT*RQGLALSPRIiECSGLI IAHCTLEPL 
GSRNPLTLAPQ I AGATGMSH * VQGYNEL 


2894 


16795 


A 


2913 


48 


242 


YSM I F CVL I LP VS FTS SDG I FLIjINLFL 
FY* STYPLAFFKTESHSVTQAGVRWHHL 
GSLQPPPPR 


2895 


16796 


A 


2914 


309 


76 


GFI ICVCVCVCVCVCVCWFIFTEYSFH 
SLFC* IRCCVFYFLALSLLIGFYIiFWWR 
YLEVYNI SFEVWASLFNRFLLL 


2896 


16797 


A 


2915 


1 


122 


FSVETGFHHVGQPGL*PPTSGDPPSIiAS 
QS AG I TGMSHHA 


2897 


16798 


A 


2916 


167 


1 


GNMCSKESVSGTNINRKPD* * YNPRLGT 
VAHAYNPSTLGGQGRWITGGQEFKTSL 


2898 


16799 


A 


2917 


2 


356 


ARGILLLRIMLTTLTRYP * * RDVTREST 
YQGHHTPPVQKGLRYGIILFITSEAFFL 
PRFLPPFLSSSFPPTPSLLKPFPPSSFP 
SLPS FFPPFLPSSLFL 


2899 


16800 


A 


2918 


2 


351 


AREYTSLQLILQMTFIMAFTCTDRTLYE 
LAFEC*LMRTLDIITR*GNQP*RPNAGS 
YWLFYTLLGTLPLLIALIYTHNTIjGSLN 
ILLLTLTAQELSNS * ANNLI * LAYTIAF 
IVKI 


2900 


16801 


A 


2919 


32 


321 


ALMGITFFFFLGKGVLAPRGGGRGGNPG 
LWGGPPPGLGPFSG*SLQGGGA*GPPPQ 
AGANFGFLILRKTRGSPGCPGGFWIPGP 
GDRPAGAPQGGG 


2901 


16802 


A 


2920 


1 


361 


GTSTRLGVLLLSLHHAGS INPLGITLHS 
DKVTLHP YYTI KDALGLLLFLLSLMTLT 
LFSPDLLGDPDNYTLANPLNTPPHIKPE 
* YFLFAYTILRSVPNKLGGVLALLLS IL 
ILAIIPIL 


2902 


16803 


A 


2921 


3 


152 


HERLYSVPLRILRRLPDPLSIPQGWRHV 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Pheny!alanine, 
G=Glycine, HHHistidine, Wsoleucine, 
K=Lysine, L=Leucine, MNMethionine, 
N=Asparagine, P=Proline, 
Q=G!utamjne, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X^Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














IRESAYKSHLTLPARRGLQDGI ILFITS 
EWFFAGFF*AFYHSSLCPTPQL*GKWP 
PTG I TPLNPPES PLLNTS VLLASG WI T 
* AHTKAI SHYLREEAFKTG 


2903 


16804 


A 


2922 


410 


1 


RAGYRQKWGSLATVIGQLGLPVERGWYG 
PSGEGGTPQRHCSELVLDDLHVI TQKRA 
NHRPRCGGSLLSELKFAPLALQPGRQSK 
TLSQKKNKQTTTKKNTKKQPGSVAHACN 
PSTLGG*GRQITLPSGVRDQPGQH 


2904 


16805 


A 


2923 


2 


145 


ARAARGDVI * S FHVFAQRDLNLLSSRDL 
PALASQSTGITGMSHQCPG 


2905 


16806 


A 


2924 


29 


340 


EMNEVERRGRK* LFHDS IQSR * EYRCAP 
QCPANS CVLM* RVJGFAMLPKLVS S DLPA 
LASQSAGITGVSHCAWLVFLPLLSTSFI 
SQRHLYLPRA 


2906 


16807 


A 


2925 


2 


337 


ARVLILPGFGIISHIVTYYSGKKEPFGY 
IGMD*AMISIGFLGFIV*AHHIFTVGID 
VDTRAYFTCATI I IAIPTGVKVFN*LAR 
LHGCNMI * SAAVL* SLGFI FVFI VSGQA 


2907 


16808 


A 


2926 


76 


388 


RYCTPAWATETPSQKKEKKKKCQTFEPE 
S S F P VCLENGGGGN F I SL F FL FL 1 1 LS L 
FFYFY* FFFI IFVYSYYYFYFCLFSYFF 
FYFFINLI IFLLLNYFFFFI 


2908 


16809 


A 


2927 


4S 


374 


KDQBPTDMGSAHFQVFKGWRQVGGANED 
RINSHGGPVMFSGQYYGQLVRFLVY* I I 
I FFMLLFFSLFYSYFL.FI S I FFVLVFY C 
LFIFLFYFSFFFIFVFFFYYFIVIF 


2909 


16810 


A 


2928 


3 


294 


HEGFHYI SQDGLDLLTS * SARLGLPKCW 
DYRREPPRPARFPTLRIWLYILKALYTV 
LGCYREYEAYRPWCWTHRMDGGQEVEGR 
REGTGGGHI LGF 


2910 


16811 


A 


2929 


7 


420 


I RRRDVLLLTLTSLS PTAAR I CYNGGRR 
GNRLNSLAYLSKELLAAWSLRKPSHGLL 
TP IRCVLY I RGHHF * LLPPSHLLLLAS A 
IMGAAGGTG*TV* PTLARNYSQPGVCVN 
LAMVSLHLSGVS S I LGAITFI TAIMNME 
APAITQYQTPLFV*YGLITADLLVLSLP 
DLTAG I T I LLTDRKLS TT F FD P AGGGD P 
ILY 


2911 


16812 


A 


2930 


11 


253 


GLLHKAPSP*KFFFSPKPFNFFWKFSPI 
FSPPKKKFLSKNPHIVFKFPPFKGKIFT 
FPPPLKFGPPRVFFKAPPPFFFFFFFFF 
FFFFF 


2912 


16813 


A 


2931 


70 


341 


DSSFFS * LATLHGSNMK* SAAVL* ALGF 
I FLFTVCGLTDI ALPNS SLDIALLDTYY 
WAH FH YVLS LGAV FA I IRGL IH* FPLF 
SGYTLDQS*AKIHCAIIFIGVYITFFPH 
HSLGL 


2913 


16814 


A 


2932 


3 


340 


N YSHPGAS VNL 1 1 FTVHLAGVS C I LRAI 
NFITTIINIKPPAITQYQTPLFV*SVLI 
TGVLLVLS I PVLSAG I TI LLTDRNLNTT 
FFDPAGGGDHILYQLLF*LFGHPEAYIL 


2914 


16815 


A 


2933 


2 


364 


RVQKGLRYW I ILS I TSE VLLFAG YF* AL 
YHSSLAPTPQLGGHWPPSGITPLNPLEV 
PLLYTYVLLASR VS FT * AHHRL I QNNRN 
HIIQALLITILLGLYCTLLQAS*NFEPP 
FTISDGIYG 


2915 


16816 


A 


2934 


2 


367 


PR VR P FArR YLLFGA* AG VLGT ALSL L I R 
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Predict- 
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nucle- 
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ding to 
last amino 
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peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D^Aspartic Acid, 
E=Glutamic Acid, F=Pheny!alanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop cod on, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














AELGQPGNLLGNDHI YNVIGTAHAFVI I 
FFIVI P 1 1 IGGFGN* LGPLI IGAPDMAF 
PRI NN I S F * LLP P S LLLLLAS A I VEAG A 
RTG*TVYPPL 


2916 


16817 


A 


2935 


3 


365 


YHIV* PNP * PLTWALSALLMTSGLTM* F 
HFHSITLVILGLLTNTLTIYQ*WRDVSR 
*STYQGHHTPPVQKGLRYGIMLFIT*DR 
FFFAGVF*ALYHSSLAPTPHLGGHWPPT 
GITPLNPLE 


2917 


16818 


A 


2936 


3 


144 


DSHGRHVING*TCVTICFIRQLIGHFTS 
KHHFGFEAAA*YWHKKKKK 


2918 


16819 


A 


2937 


3 


422 


QRLLATNHHDIGTLYLSFGA* PGALGTD 
LSLLIRAELGQPRDLLGNDHIYNEI*TG 
HALGILFFIGLPIIIEGFGN*LHALIIG 
APDMALPR INNISF* LLP P YLLLLLAS A 
I VEAGAGTG * TVYPPLAGNYSHPGAS VE 


2919 


16820 


A 


2938 


3 


382 


RTRGLFSTNHIDILTLYLLFGA*AGVLG 
AALSLLIRAELGQPCNLLCNDHIYNVIV 
TAHAF VI I FF I VI P I I I * CFGN * LLPL I 
IGAPDMAFPRINNISL*LLPTSLLLLLA 
S AI LEA * S GTG * TV 


2920 


16821 


A 


2939 


225 


3 


NISWRMKQIVPKGEPYNGVPVFVSLWFH 
KMFIFETGSHSVYQAGVQ*RHLGSLHPH 
PPGIKQFYLSLPSSWDY 


2921 


16822 


A 


2940 


1 


247 


PTRPPVIYSTIFAGTLITALSSH*FFT* 
VGLEINMLAFIPVLTKKINPRSTEAAIK 
Y FLTQATAS IILLIAI LFNN I KKKKK 


2922 


16823 


A 


2941 


3 


369 


TRDS T Y LGH HTP P VHTGLR Y WR I LF I TS 
DAFFFAGFF*AFYHSSLAPTPRLGGHWP 
PTGITPLNPLEVPLLYTSVLLASGVSVT 
*AHHSLVEDDRYQI IQALLITILLGLYF 
TLLQASEDFE 


.2923 


16824 


A 


2942 


3 


401 


LTVCCVIVLRLKTLFFFFNKPFLTQKGY 
FNTPEEGFFKKPNRRVGPPSPMTDPTML 
TNLIKGKVPKAPPRILMGGGINMTF*GF 
VTPRAPFPLTLRLNPMLQQGIDLLTLKA 
SGGSPASGNFLKELGLRSINF 


2924 


16825 


A 


2943 


189 


348 


RNARGWVTDKE KRLRLGMVAHACNS STL 
GGDGR* IA*GQEFETSLANMVKPCL 


2925 


16826 


A 


2944 


2 


349 


ANSNYERTHSR 1 1 1 LSQGLQTLLPLI AF 
* * LVAS LANLAL P PT I NLLGE L S VLVTT 
FS*SNITLLLTGLI ILVTALYSLYIFTT 
SQWGSLTHHINNIKPSFTRENTLMFILL 
SPIL 


2926 


16827 


A 


2945 


621 


929 


GCSSGTGCCPILCDLPRPWSCRGVGGSP 
SSTAHLCPRGWRSGRCFLPPLSAS*VDS 
AMSLIQAAKNLMNAWQTVKASYVASTK 
YQKSQGMASLNLPAVSWKM 


2927 


16828 


A 


2946 


379 


395 


SQHFGRLRREDHEVRSS*PRDPPALTSQ 
SAG I TGMS HCAR PL VATS I H K I DNR Y I D 
DRS*NINIGTIFINLLIFYLSIYLSIYL 
S I YLS IMYS FTVAQAGVQWRDYGSLQPE 
SPGPK*SSCISLSSSRNHSKTPPHGWVD 
PKIP 


2928 


16829 


A 


2947 


2 


357 


HTYD I RKPRR + TLACALSALLTTSGLAM 
TDCFHYITLLILCLLTNTLTIYQ*WRDE 
TRESTYQGHHTPPVQKGLRYGI ILFITS 
EAYFFAGFF* AFYHS SLAPTPQLGGHWT? 
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PTGITP 


2929 


16830 


A 


2948 


2 


357 


PRVRYSTDHSDIGTLYLLFGA*AGVX»GT 
AI SLLI RAELGHPGNLLGNDHI YNV IGT 
AKAFVIIFFIVIPIIIGGCGN*LGPLII 
GAPDMAFPR INNISF* LLPAS VLLLLAS 
AIVEAG 


2930 


16831 


A 


2949 


2 


217 


KNLKIVQYGDMCLWSQLLRELRWEDRLS 
SGG * GCSELCS CHCTPAWTTQQDCLKKK 
KNFWPVGEVKGRHML 


2931 


16832 


A 


2950 


2 


354 


AI PMTTARLT I HEAYL I ILERTTTTTKD 
VKNPRRI AAAI TAS CLGGGLEDAISCQY 
GIATKDRPTGLGTPEVLLGALPGAGGTQ 
RLPKMVWPAALDMRLTGRSIRAHRALK 
M*LVD 


2932 


16833 


A 


2951 


3 


150 


LFSCSPTFSSDPLTTPLLILTT*LLPLT 
IMASQRHLSSEPLSRKKKKKK 


2933 


16834 


A 


2952 


283 


344 


EKKKKKKKKKKKKKKKKKKK* KLRKKKK 
KKKKKKKKKKK KKKEK 


2934 


16835 


A 


2953 


3 


378 


DAWADAWNQT P Ii FN * S VL I T AVLLL LS L 
RVLAAG I TI LLTDRNLNTTL FD P AGGGH 
PILYQHLF*CFGHPEVYILILPGLGIIY 
RIVTDYYGTKEPFGYIGMV*AMRSIGFL 
RFIVRAHHIVTVG 


2935 


16836 


A 


2954 


3 


387 


I LYQHLY * FFGHLEGY I L I L PGFG 1 1 SH 
MVTYYSGKEEPFGYIGMG*AMISIGFLG 
FIGRAHH I FTVG I DVDTRAYFTYAT III 
AI PTGVKVFS * LiATLHGSNMK* SAAVL* 
ALGFI FLFTVCGLTGI 


2936 


16837 


A 


2955 


2 


432 


PRVRPRVRKKIAYPFLVLSL*GIIITSS 
ICLRQTDLKSLIAYSSISHIALWTAIL 
IQTP * S FTGAVI L 1 1 AHGLTS S LLFCLA 
NSNYERTHSRI I ILSQGLQTLLPLKKKK 
KKKKKKKKKKKKGGGLLKESLGGPNLTG 
EGK 


2937 


16838 


A 


2956 


12 


391 


SYFISSSKPHLSPPWLSSPDEATSKKKK 
KKKKKKKKKKKKKKKKKKKKKKKKKKKK 
KGGGFK*KRIGGSFKKGGGEKKIFFFKG 
GKKKKTGGFFEKKFFLGGEKY*DNLPKK 
NKTVGEKKNFLGGG 


2938 


16839 


A 


2957 


1 


352 


PTRPYFPVDAGEAQHHPRTCRRPLRALW 
SSHHERWKVTLCTHCSLGVFFLYCCTYY 
IFVLFIP*SSCGLTIiIFITCIILFGSIS 
FFLFFTI VFS I I I VTTFKFRLLYSI I FL 
SYLLC 


2939 


16840 


A 


2958 


1 


243 


NLPRLNPKE I E I LNRS I I CNK I KAI I KS 
LLSKKSSGHNDVTAEFY*TFKEELITIL 
LKFYLTPKKKKKKKKKKKKKKGGPF 


2940 


16841 


A 


2959 


373 


3 


FSSLKKRVTPPPPPKTGFSLEGLHLLKK 
NFPQKPPPPKKSFSQKNPPPPKKKPPF* 
XKKPPPPPPI*HPPPKILQPPPPPFFFF 
FFFPFFFFFFFFFFFFFFFFFFFFTDMR 
GFIVRKFRTRG 


2941 


16842 


A 


2960 


2 


136 


PRVRSTLPISYKWNNRALMKAHLIMK* F 
TE Y F KTTDEL YLHDNT 


2942 


16843 


A 


2962 


1 


358 


HTPPVRKGL* YGI ILFITSKVFFVPRFI 
*AVYHSSlrCPTLQLGGLWPATGMTWLNP 
L* VPLCTTFVLIiA* *VSIT*AHHNLIDL 
NRCHVI YALV.T T T LLRLYFTLLPASE YF 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Iso!eucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, V=ProYme, 
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W=Tryptophan, Y=Tyrosine, 
X-Unknown, *=Stop codon,/=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














EAPLTIS 


2943 


16844 


A 


2963 


2 


373 


RAYD IVRPRP * PLLGALSALLMT YGLAM 
*VHWDCITLLILGLLTNTLAIYQ*WRDV 
SRESTYQGHHTP PVQKGLRYG IILFITS 
EDVFFAGIV*AFYHYSLAPTPQLGGHWP 
PTGITPLNPLEV 


2944 


16845 


A 


2964 


403 


61 


LFFPLEKNFLPPGFSAFFSPFSP*KFFF 
SPKALIFWGNFSPFFPPPKIRFLPKIPP 
WVFFSPPFWEKLFSSPPPLNFGPPRVLF 
KGPPLFFFFFFFFFFFFFFFFRFEGECW 
RL 


2945 


16846 


A 


2965 


2 


360 


LI PNLAMVTR* GNHPQRLNAGTYLLFDT 
LDGSLALLIGLTYTRNTLGSLNVILLTI 
TAQELSNS* ANNLV* LAYTLAFI EKI PL 
YGLHL*LPKAHREAPIAGSIVLAAVLSK 
LGGSGKN 


2946 


16847 


A 


2966 


321 


2 


STGMHFPHINMAINPPPPPRPPPPLFPP 
NPKTKPNPTQ* KGGFPPLGPF* KKLS PF 
LTLFFFFFFFFFGHHPGPWQKKKKKKVF 
FFCVCFS FQRVHNI YKNTHQQQ 


2947 


16848 


A 


2967 


200 


■ 


RRT YT S HLLACLRQGLAFS P RLECGGK I 
RAHCSLQLYGS SDP PT *APQTAGTKQHN 
QRIAQCNADN 


2948 


16849 


A 


2968 


2 


354 


LLTASSSEIAPLQSSLGDRARLCLIK*K 
EGVLNSI * SGNQGKS YANVYRLLYLDPI 
PKIYAEAYTP*NVNSTNLETKSPKTIQK 
FPEDREFKNDFFEKTKNGDRETGAFHPF 
LLFLV 


2949 


16850 


A 


2969 


99 


400 


ALGIHFI FTVSGLTG I GLANS SLHI VLH 
DTY YWAH FH YVLS LGAAFAI IGGFIH* 
I PL YS C YTLDQT YAKI HFT I I F I G VNLT 
FFPQHFLGLSGMPRRY 


2950 


16851 


A 


2970 


2 


377 


N I LLLTLTAQELSNS * ANNLI * LAYT I A 
FI VKI PLYGLHL* LPKAHVEAP I AGS I V 
LAAVLLKLGGYGI IRLTLILNPLTKKI A 
YPFLVLSL*GIIITSSICLKKKKKKKKK 
KKKKKKKKRGGPF 


2951 


16852 


A 


2971 


1 


407 


GTRS YTHLYRVFLELIKMSVYDLNHTVI 
MVI SGHVRLAF YG I VHLTL I LNLLTDH I 
LYPFLVLSLSGVI I I S S I FLRRTDLE SL 
IAYS S I SHIALWTAI LI QTP* S FTGAD 
ILIIVHGLTSSLLCCLANSNYER 


2952 


16853 


A 


2972 


10 


259 


SRSVAI YFKGMASA* *RMFSSKKKKKKK 
KKKKKKKKKKKKKKTAITKKTTTAKWKN 
*RTEKIRHTRLPLISSRDAKCVDFLYT 


2953 


16854 


A 


2973 


12 


400 


LNCRTPSLYLQRAGELLSVENPH I WCQK 
CVRKNWFFLFFFFFFKTDLYCPH I CI AL 
SITCFLAISISGLFAFFLLNSFNYHFII 
VP*NFSLS I YLCLHSLFYVGFFFS FDMM 
IFSILT* TLFFCLLFHL 


2954 


16855 


A 


2974 


125 


3 


RPRRPAAQVGVQ + KNLSSLQPVP PGFKL 
FFPLSLLSSWAS 


2955 


16856 


A 


2975 


2 


332 


RECTNYPAQPPPPPLEAEEGFQPLPPTD 
TG * LECLPPLGGGQHRALGLGATWHLEN 
ACALVLADLGTGPKPSGPFSGQACGASG 
RLPLLSGPILPPLGQGHTLSAWGHHA 


2956 


16857 


A 


2976 


7 


329 


SRDILVMTEDGEFF*GHV*P*DRSRRAD 
TPRLGGHW PPTG I T PLNPL * VPL LNTS V 
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N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V~Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possib!e 
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LLASGGSIT*THHRLIEYIRHQI I HALL 
I T I LLGL Y FTLLQ AS E Y FE S S FT 


2957 


16858 


A 


2977 


2 


339 


AHH I FTARIDVDTRAY FTS AT IIIAIPT 
GDKAFS * LATLHGSNMK *SAAVL * ALGF 
I FLFTVRGLTGI VLADSSLDI VLHDTYY 
VGAHFHYGLS IGAVFAI IGGFIH* CPLF 


2958 


16859 


A 


2978 


394 


0 


QGCSEL*SCHCTPAWVTE*FPVSKKKKQ 
RK 


2959 


16860 


A 


2979 


149 


383 


PLCFPSQHWGKS SNCLSSLKYFPSSGLP 
LRCKAELDDVKQKADKELDVVAHACNPS 
TLVGQGGRI TLRSGV* DQPGQH 


2960 


16861 


A 


2980 


2 


378 


ARVS IGFLGLI VGAHDMFTGRMDLYTRT 
YCTFATILIAIHTGVPVFI *LATLHGCN 
MK*SAAVL* ALRFI FISTVCGLTGIVLP 
N* SLDIVLHYMYYAVAHLKYVLS IGAVF 
AILGRFIHRLPVF 


2961 


16862 


A 


2982 


2 


409 


PAVAEAYLKP WDD S KGS FL WGKP DLDG 
IRECCQRNFGWNRTRTDESLFPVLKQLD 
AQQTQLRIDSFFTLAQQEKDDAKRI KSQ 
RLNRAVTCMLR I E I EAASSE I EAVS VD * 
QKELEL I DKAVACHLE YMYETD P 


2962 


16863 


A 


2983 


3 


453 


HASAHAS AHAS G QR KGAAPAE KKCG AE A 
QHEGLELRVENLQAVQTDFSSDPLQKW 
CFNHDNTLLATGGTDGYVRWKVPNLEK 
VLES KAHDGE I ED L VL VPEGML * I VCPD 
HYS P VG INDFM WLL * LNWGNRL YFPH I S 
VYLMFHFRTF 


2963 


16864 


A 


2984 


3 


417 


LILPGFGIISHIVTYYSGKKEGFGYICM 
V* AMI S I GFLGL I VRAHHI FTVG I DVYS 
RAYFTSATI I IAI PTGVKVFS +LSTLKG 
SNMK* SAAVL* ALRF I FLFTVSGLTG IV 
LANS S LD I ALHDT YYV* THFH YVLS I 


2964 


16865 


A 


2985 


273 


408 


MKIFIF*VSPPLFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFLAVFTFI 


2965 


16866 


A 


2986 


176 


335 


TFQPSELWRTALLPRLECSGLI IAHCSL 
ELLGSSSPLASAS * IAGTTGSILRY 


2966 


16867 


A 


2987 


65 


398 


KKEFKIGRKAAEKMTRNINKAFGPGTAN 
ECTVQWWFKKFCKGNKSLEDEECSGRPL 
EDDNDQWRAI IEADPLTTTREVAEELS V 
NHSMAVQHLK * VGKVKKLNKWVPHEIjS 


2967 


16868 


A 


2988 


2 


158 


PGWSLTPDLR*STCLSLPKCWDYRHKRP 
SPTSRQ7FDHAPNKNSSHSPPIYM 


2968 


16869 


A 


2989 


435 


23 


GWPPNPQKSFYFPQRLKIWGGGGRKRP 
PPKKKGFSQKTPEGF* KPP PKRRKKI FH 
DPGKKGPPKGIFKRGPPLFFFFYFFFFF 
FFFFFFFI ITIECLHSRFPHRHHNKKFP 
PNPPSPRFWPQHLNTSLPNPKNKEP 


2969 


16870 


A 


2990 


410 


3 


GGRGHFFSGAFFIKFPWNKKGISQPLCF 
PRGGGSPPPLGPVRGGGPPCWGPLCHKG 
PVKKTGAPRGKNG I S P FFCFP PLGS FNR 
SGFFWALLGSPPLFFFFFF*DRVSLHHP 
GWNAVAQS * LTKTSTS WQVILL 


2970 


16871 


A 


2991 


371 


132 


KQSLPFKVXGFFFPPE VENAI YPDSHTA 
FYSW* KRSTFSKKKKKKKKKERKRKKEE 
RXEGRKEGRKERKKEGXKEIENA 


2971 


16872 


A 


2992 


336 


1 


CPRWSRTAGLK* STCFGLPKCWDNRHES 
PCPAS S VLND * QLSTVQRAT * WQTKS LV 
LKAPPARGALPVSRAVSPPPSSHHFAYF 
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LHMI *QVSCLFLR*SLALVPQAGVQ*RD 


2972 


16873 


A 


2993 


317 


440 


HLGRLRQEHHIjSPG* LRKFFFFFGRDGV 
LMLSMLVFNSWTQVMLLPQPPQVLGLR 


2973 


16874 . 


A 


2994 


2 


181 


FHHVGLDLLTL* SACLGLPKCWDYRREP 
LCPALVI LLTAKFTNFRYRVKNVCT FHV 
ESN 


2974 


16875 


A 


2995 


1 


165 


GFHCI SRDGLDLLTS* YARLGLSNCWDY 
RHEPPSPAPLFISYSICLFLSKLIQFL 


2975 


16876 


A 


2996 


405 


2 


KKGFP I FSRMFFLYKHIE KPPLASQKFG 
DPKWSPHPRPIFFFLLKKGVLYVWREGF 
KFHPP*FPPPGPPKRWE+RVKPPHPPPF 
FFFFFLYVEMGSHCVAQAGLKliLGSS TP 
PTSASQGAE IAGVSHRAQPHAS 


2976 


16877 


A 


2997 


2 


415 


CLFTGGGIjTG IGSAGSSLD I VLHDT Y Y V 
VAHFHYDLSIGAGFAIIGGLIH*FPLFS 
GYALDQTYARIHFTIIFIGVNLTFLPQH 
FLGLSGMPRRCSD* PDAYTT* SMLSS VG 
SVITLTAVI VLI IMIREAFDSKRTVL 


2977 


16878 


A 


2998 


349 


370 


HHFY*SITAFRPFQHMELRNFFFFFLFF 
F FFFF F F F VFLF I FLFLF YFLFL YE I YL 
ILFFL*HHFYSAV 


2978 


16879 


A 


2999 


36 


139 


LCHCTPAWETFSKEKKKKEKRKKKKKKK 
KKKKKKKPGGGS * KTALSKDCATVLQPG 
RLFQKKKKKR KKG KKKKR KKKKKKKK 


2979 


16880 


A 


3000 


104 


373 


SADREXXSKTDNIaLGH*TNVNKCKVPRV 
I QSVFSSHSGLKLE INNRKMKEKSLNT * 
KINNISYWVQCSLYNLKSNSYTKSSDNT 
TTQYMY 


2980 


16881 


A 


3001 


1 


396 


LDCS KISS YLQKS S SHVLFFS FS FFFFF 
GGGGGFALENPYPPAGIjGPQKKKTLiSPP 
WLVGGPPFPEKAPP*GGHFLGKKPQKNP 
GGQNPKRPSGEGNPFFQPPGGEKKQIGP 
PTGFGGKP FFS F * PRSPPGP 


2981 


16882 


A 


3002 


412 


104 


FFFFFLPVRQTFYPQPFSVFFPLFPFKF 
FFFP*AFNFFWGFFPIFSPPKFGFFSKI 
SRLVFFSLPFWEKFFFFLPPFFFAPLRF 
FFKGPPXFFFFFFFFFXXY 


2982 


16883 


A 


3003 


400 


46 


LFFFFFYKLFPPPAFGGFFPPFPL* NFF 
FPPGPFFFLGGFPPFFPPPK*VFFPKIP 
PGFFFPPPF*KKFFFSPPPFFLPPPGFF 
FNPPPPFFFFFFFFFFFFFFFFFFLCW 
VQVEAY 


2983 


16884 


A 


3004 


411 


69 


YSPLPPLFFSSPPSKFPWPPFSLFFLTR 
VYKGFFFFPFFPFELFGPPGLSLGSKPP 
PVFLGGDPAFFSIPHPRVGSLPPPPNWP 
FIGPSFW*ALFPAPPLFFFFFLSSFFFF 
F 


2984 


16885 


A 


3005 


3 


200 


DAWG*LFSTNHKDIGTLYLLFGA*AGVL 
GTALSLLIRAELGQPGNLLGNDHIYNVI 
VTAHAFVII 


2985 


16886 


A 


3006 


259 


146 


FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFGLLNDILIi 


2986 


16887 


A 


3007 


118 


414 


QNQPPQNKATHTVKIEKKEKPETKTVAK 
EHNKAKTAEKSEE* TKKEVKGGKQEKVN 
HTAAKVKEVQKTPSKPKEKEDN3CKAAVS 
KHEQKDQ YAFLRYM I 


2987 


16888 


A 


3008 


427 


111 


FFFFSPVGNFSPPQQFPFFFPPFPPKIF 
FFPPPL* FFWGGFPPFPPPPKKVFFPKS 
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PPGFFFPPP*GKKFIFFPPP*FCPPPGF 
FLSPPPPFFFFFFFFFFFFFS 


2988 


16889 


A 


3009 


3 


302 


SLAS CLS YLVCV I FLGQPKPT I * LQNST 
PHRK*NPTERYVKTCTQIFIALLFKKEK 
QPRCPSAGEWINKMLYACTIEYWLAIKR 
YEIL I YATV* MYLEKI 


2989 


16890 


A 


3010 


1 


409 


RLHDATFP 1 1 E EL I TFHDHALVI T F L I W 
LLVL CAL FLT LTT KLTNTN I S HAQE I E T 
V* T I liAAI IIoALMVLPSLR I LY I TDEDN 
DPS LT I KS IGHQ WY * TYE YTDYGGLMFN 
S Y I LAPLFLE PGDLRLLDVDNRW 


2990 


16891 


A 


3011 


157 


2 


GRVDLKIQKLARCGGACLQSQHTQQNHL 
NPGEKGCSES*LPPCPPD*VTKQ 


2991 


16892 


A 


3012 


2 


423 


ARAARAHI VTYYSGKKEPFGY I GMG * AM 
ISI GFLGFIGRAHH I FT * * I DGHTRAYF 
TSATIIIAIPTGVKVFI*LATLHGSNMK 
* SAAVL * ALGFI FLFTECGLTGI VLANS 
SLYI VLHDT YYWAHFHYVLS IGAVFAI 


2992 


16893 


A 


3013 


2 


140 


ARANILLLTLTAQELWDPRANNLI*LAY 
TLAFIVKKPLYGLHL*LPKAHVETPMDG 
PILLAAKLLKLGGSGIIRLTLILNPMTK 
HIAYPLLGLSL*GIIITRSICLRQTELK 
WLIAYS*ISHIALWTDILIQTP*SKHF 
TTNSHCPRTMGPQSQQLNMTSLHTSFYS 
KETSLRTPLMTP 


2993 


16894 


A 


3014 


2 


420 


PVLAAG ISI LLTDRDLLTTLFDPGGGGD 
PILYQHIrF*FFGHPDDYILILPGFGIIS 
HIWDYYGRKEPSGYIGT*V*AMTPVGFL 
GFI E * AHHI FTVGI DAHTRAYLTS VS I L 
I VI PTWRQVFS * LATLHGSNMT * YAALF 


2994 


16895 


A 


3015 


6 


292 


AHHIFTV* IDVYTRAYFTSATI VIAIPT 
GVKVFS * LATLHGSNMK* SAAVL * ALGF 
I*LFTVGGLNGIV* SY*LLDIELHDTY* 
FVMGCPRKVYF 


2995 


16896 


A 


3016 


3 


422 


TPI I IGGFGN*LFPLI ISAPDMEFPRIS 
NISL* LLPPS ILLLLASAIVEAGT*TD* 
TVYPPLAGNYSHPGAS VDLTI FSLHLTG 
VSSILGAINFITPIINIRPPAITQYQTP 
LCA * CDLMTAVLLLLS LPDLAAG I TILL 


2996 


16897 


A 


3017 


21 


490 


TPFPGRHLTMFSLHLAGGCS ILGAINYI 
TTL INI RPPAI TQYQTPLFV * S VL ITAV 
LHLLS LPGLTAGVT I LLADQNLNTTFFD 
PAGGGDPI LYQHLF * FFGHPE VYI LILP 
GFGIISHIVTYYCGEKEPFGYIGMC+AM 
ISIGFLGIIV*AHHIF 


2997 


16898 


A 


3018 


98 


402 


LRSQHSKSFQI SGKPSQEEWPQI S PDST 
DYIINT*HFNAQMLKNIY*KQPHGLHDF 
FPKKKKKKKKKKKKKKKKKKKKKKKKKK 
KKASSSSSKKDPPGGGPT 


2998 


16899 


A 


3019 


399 


229 


PPPPGGGGPQGPPPPRG3FLPKSPGGVF 
YPPPRGGKIFSPPPGFFGPPRGFF*GAP 
P 


2999 


16900 


A 


3020 


2 


401 


SDAVL*ALGFI YLVLEGGLTGI VLADSF 
LDLELHDTYYVGAHFHYVLS IGAVFAI I 
GGFVD*FPLFSGYTLDQrYAENHFTI IL 
IGANVTFLPQHFLGLSGMPRRYSDYPDA 
YTT*NILSSVGSFITLAAALL 


3000 


16901 


A 


3021 


413 


67 


PPPPGKIFFKKTPKKKIFPPPOF*IFFP 
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PLPPKKFFFSPNP* FFLGGFSPFFPPQK 
KIFFPKIPPNFFFSPPLKKKIFFFPPP* 
IFPPPRFFLKPPPPFFFFFFFFFFFFFF 
FFF 


3001 


16902 


A 


3022 


2 


332 


LTLS I LS 1 1 AGS * GGLNQTQLRKI LAYS 
SITHIG*IIAVLPYNPNITILNLTIYII 
LTTTAFLLLNLNS S TTTLLLS RT* NKLT 
* LT P L I P S T LLS LGGLP P LTG FL PKW 


3002 


16903 


A 


3023 


1 


412 


RGPPFFFFFCVFFFFFFFFFFFFNKGCR 
HSKRFFFFFKRKKKSSRPTRDRV* FYPK 
GWRSPLFLFFSPGGRGFFPLSHQVGFSN 
EVLWFKKNLEL FRGS PARKKKKKKKHL 
E 


3003 


16904 


A 


3024 


279 


1 


LGRNTELWKSGKGMDILKTNCGKIiANEP 
FRQPRVLGIGGEAPRAGSGPPSRAPPA* 
TPGPSSAGS WP* PPGTGRAPRGPAPSAP 
GARSPGRPG 


3004 


16905 


A 


3025 


151 


401 


KKPLGGPNLTGEGKKKFFSLKGGKKKPP 
GKFLKKTFFLGGEKMGKTPPKKLKP *GK 
KK I FKGKRG KKNP KTLAVKKFS KKKKK 


3005 


16906 


A 


3026 


416 


140 


YLSPLKKFFTPPPLRMFLPPNPLKNI FF 
PPQLKIFWGGWAQNSPPPKKGFFSKNPK 
RVFLPPPIRKRYNFPPHGKILAPPKNL* 
SAPPPIFF 


3006 


16907 


A 


3027 


379 


31 


PPPRRAGVFFFKNPKPKPPPPREGGRF* 
PFPPLKF*FFPKPQNFLGGGGAIPTPPP 
KRGLGQIPTERFNLSPPTQKRINFPPPG 
KGGPPPPLLKPPPPPPFFFFFFFFFFSL 
NSFI 


3007 


16908 


A 


3028 


28 


420 


MQQTTMAHI FLCNKFANCAHVPRT* SET 
KPMSTPLQFD*TYKGEKSCKYAEHERTW 
KQ*CVFSLYQI IPT*EKTWKCNQCGTNF * 
NQFFKQTTHL *NHTRDNQ ICFSKIGLE Y 
YYRITTRQHLLKLRTVCIL 


3008 


16909 


A 


3029 


I 


401 


LGNNGEAVSEKRKEKSQKEKSHNVWGF 
FFFFWGKPPFVPQAEGQGRNFC* PKPWP 
PGWGE FPGPAPRGGGNGRQKQPGRGNFG 
V*REKGGSMGGPRGA*NPGPKNPHPWPP 
QGPEITGGTTRPHPSGIFKKT 


3009 


16910 


A 


3030 


285 


397 


MFIKGDGLNKLRPGAAAYACNPSTLGG* 
AGRITRSGD 


3010 


16911 


A 


3031 


265 


2 


KKESS C I KATNSNSLFFLF* S VFFFFVF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFF*HFKSALLWVTDLGFLNSD 
PRV 


3011 


16912 


A 


3032 


410 


49 


GFSPPPP* KFFFSPKPUSIFWGGGGPNFP 
PPKKRFFPKNPPGVFYSPPKKKKKFFSP 
PP*NLAPPKIFLKSPPPFFFFFFFFFFF 
FFLSNVSNGLTNM Y I LPCKDPS CPTTFP 

ILGSLISL 


3012 


16913 


A 


3033 


177 


2 


VTPPFFFFFETRSHSVARTGV* *SDRCS 
LQPHPPSIK*SSHFSLPSSWDYRSMPPH 
AS 


3013 


16914 


A 


3034 


75 


395 


MS YKHXXKKKKKKKKKKGGAP * KKPWGG 
PKLTRDGKKKFFS LKGAKKKPTWKFWKK 
TLILGGRKMGTTPPKKLKPLRKKKIFKG 
*RGKKHPKSLPVEKFASRGRIKK 


3014 


16915 


A 


3035 


400 


S3 


KKEG PAR WS P VTPPLLEGPVGRS PQTR 
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Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptopban, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














NF*PTPPTHRKSLPS*KSKIDWRGIPLY 
PPPPRVKPKKSFNPGNRRFC*TKIFPCP 
SPWAPKTHPPFQKKKKKRVYK 


3015 


16916 


A 


3036 


3 


196 


DSMPQT * NKS FAKARKKKKKKKKKKKKK 
KKKKKKKRGGPPLKKTLGGPKLPGGKKK 
NFFFFRGG 


3016 


16917 


A 


3037 


1 


411 


FCYDVCVESGCADYS I VI IMKKKKEK* K 
KKKKKKGGGPLKKNPGGAQN Y PGVEKK I 
FSLKGGLKKTPRGNFEKKPYFGGGKNGA 
PPPQKNKPLGEKKKFKREKGGKKLQFPW 
GKKISLPGFYLKKIYPPGRGFFNFS 


3017 


16918 


A 


3038 


3 


429 


NFFFKKPRGGNFFPPPKKGFFSPPSPLK 
FFFFPPPFFFFGGGGPHFPPPPKRFFFQ 
KPPRGFFFPPL+EKNFFFPPPVFFAFPP 
VFFLTPPPP 


3018 


16919 


A 


3040 


1 


111 


IGLSGMPRRFSDYPDAYAT*NILSSVGS 
FMYLQQDNK 


3019 


16920 


A 


3041 


76 


967 


QLLKGGVSGVCPLLE4FRCVRS FFLLVGS 
WSSLASGVKPQTFAVSVTVLKAARLELF 
I PPRGLWSLASGVKLQTFAVSVTAHKS 
SVDPKNSGAQLAS PSGSRTRAAGGAACQ 
SRCRVPALLSPWWDGTGRRGAGGGARR 
GGSGRTGAHGVGGRLRHGGLHVPS PAPW 
KGS * GLARNRAQR R WAGTAGGP S T PSAA 
AGPGAKSLTALCEQGWPAAPSAGPTKPT 
PTRNSSWPASVARSPGSRSCLSLHTSLQ 
AEGVGSSLGQPS KGLPQCSGGAEGLLKC 
RQSGSPGRGGTESERGL*GLPQCSGGAE 
GLLKCRQSGSPGRGGTESERGL 


3020 


16921 


A 


3042 


39 


141 


LS IRGLNI I IKRQRL* DWI KQQDSTLCC 
P*EIH 


3021 


16922 


A 


3043 


2 


405 


LFSTNHIYIGTLYLLFSTRAGVLGTALS 
LLIPAELGQPGNLLGNDHIYNAIVTAHA 
FVI I FFIERP III GGYGN*LGPLI IGAP 
DMAVPRI NN I TF * LLP PSLLLLLAS AI E 
EAGAGTG* TVYPPLAGNYSHPG 


3022 


16923 


A 


3044 


3 


134 


HLNPGGRGCS EPRLHHCTPS WATE * DSV 
SKKKKSRKGWTGLFI 


3023 


16924 


A 


3045 


44 


187 


DPRVRQYQTPLFV*SGLIFAGLLLLSLP 
TLGAG I TI LLTDRNLHTAVFDPDGGGDP 
ILYQHLF*FFGHPEVYILILPGFGIICH 
IVTYYSGKKEPFGYIGMV*AMISIGFLG 
FIAGAHH I FTGG IDVDTRAYFTSATI I I 
AIPTGVKVFS*LRLIRPNLCRLTSPISP 
NPRGRHHYTTNRPQPPHRRV 


3024 


16925 


A 


3046 


434 


40 


. GAPPPPPGRFFFFLNPREDTFPPPPQKG 
GFSPP PPPKFFFS PPALFFFGGGWPNSP 
PPQKNFFF* KPPEFFFFPPFLKKKIFFP 
PPLFFPPPQIFF*TPPPLFFFFFFFFFF 
FFFFFFFRGCKINFIVRGF 


3025 


16926 


A 


3047 


1 


399 


LFTGGGLTGI VLTNS SLD I VLHDTY * W 
PHFHYGLS IGAGFAI IGGSIH* FPLFSG 
YTLDQT YAKIHFT I I FIGVKLTFFPQHF 
LGLSGMPRRYSDYPDAYTT*NILSSVGS 
LISLTAAILI IFMS *EAFASK 


3026 


16927 


A 


3048 


116 


367 


GASMILSSMIFLECIVGGFPVFSVYLFK 
LQILRQSSTMCFVLFCFFEIRSCSVTQA 
GV*RRGHGSL* PQPPGLSHPSSRDHGHV 



733 



WO 01/64835 



PCT/US01/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
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E=Glutamic Acid, F=Phenylalanine, 
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K=Lysine, ^Leucine, M=Methionine, 
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3027 


16928 


A 


3049 


6 


345 


SQLLRRLRKENCLNIiGGGGYSHQR*HQC 
PLAWATS * D YLKKKKKNFSLFLKTGG FY 
PFLKKAPLGTTLSPKNKLPLSPLYKKRT 
GPWNWERKNFGKKKGGVIGLQGGKTI PK 


3028 


16929 


A 


3050 


261 


2 


NKKSPPV3STLWWKMGFFFKFAKRVVLSWK 
GGGGFFFFS WRRS FTFVAQAGMQWRNLS 
SLQPRLPDLR*SACLGPPDC*DYRREPQ 
YP 


3029 


16930 


A 


3051 


160 


2 


ICVDEQAGVQWRYLGSLQAPPPGLATLS 
CLSLMSSWECRQPPPLG*FFVCPR 


3030 


16931 


A 


3052 


89 


3 


PLTSGLLLLT*QKLAPISIIYQISPSLN 


3031 


16932 


A 


3053 


344 


2 


HVEAPIAGSIVLAAVLLKLGGYGI IRLT 
LILNPLTKHIAYPFLVLSL*GIIITSSI 
CLRQTDLKSL I AYS S I SKI AL WTAI LI 
QTP*SFTGAVILIIAHGLTSSLLFCLAN 
SN 


3032 


16933 


A 


3054 


266 


2 


FQP P I SAYTK I S PSLNVS LLLTLS I LSI 
IAGS * GGLNQTQLRKILAYSS I THVG* I 
IAVLPYNPNITILNLTIYI ILTTTAFLL 
LNLN 


3033 


16934 


A 


3055 


3 


32 


KYNS L I MPTM I AT I TLLNLYF YLSPLLY 
**SSCPP 


3034 


16935 


A 


3056 


3 


33 


KNNSLI I PTI I ATITLLNLYFYLSPLLY 
**SSSPPS 


3035 


16936 


A 


3057 


57 


445 


ANVWAPHGPAKLTNKDNYHIWKSKRLKI 
ANMTIKKLNE VIGLTLPDFKTYVELVQ * 
RQNAID*RKHKQPVKQSPEA* PHSYSQL 
I FHWGAKANHGRKDSLFYK* CW *NWTI S 
SQKLNLHTDLTNFTKIN 


3036 


16937 


A 


3058 


311 


1 


RVGLLLKLNKISWPPPPFYGPS * EKEPL 
CFSQ I GLFLTRTMVLNNLHS P PVKTRPY 
NKIAPFRELFFFFKDRVS I CLPGWSAW 
* SQLTATS TSQAS SDPGRV 


3037 


16938 


A 


3059 


138 


411 


WERPWKAQEAVFWI *VSAFWAPPPLMEK 
QIPPDLEQHYRNVPGVNRNQPFVSFFLR 
WSLTVAQAGVWWRDLGSL* PLLPGFKRF 
LCLSLLS 


3038 


16939 


A 


3060 


1 


189 


FCRVGQAGLKLLTSSDPPASAS * SAE IT 
GVSQRAWSKITILKSSSFSYFPNSCKMC 
FWLICUNT 


3039 


16940 


A 


3061 


3 


406 


DAWADAWVLILPRFGRTSHIVTYYSGKK 
EPFAYIGMV*AMISMGFLGFIV*AHHIF 
TVGIDVNTRAYFTSATI 1 1 AI PTGVKGF 
S * LATLHGSNMK* SAAVL * ALR F I FLLK 
KKKKRGAVLKVPWGGPSLPGCG 


3040 


16941 


A 


3062 


451 


82 


PPPTNYFSPPPAFLPGGGGPPRPPPFKK 
WSPPTPPPWIMPPPKKKKKFFPPPRGW 
GPPPKIF*KPPPFFFF*KKNPPFSPFGE 
NRGVFFSTKPPPPWGKKNFAAPGAPPPP 
PFFFFFPRGGG 


3041 


16942 


A 


3063 


430 


1 


FFPPKQLI FWGGGGPKSPPPKKKFFPKK 
PPGVFFSPPKKKKKFFFPPPLNLAPPKI 
FFKSPPPLSFLFFPFFFFFFFFFFFFFF 
FFFFPLS VQTLLKRTRAPPQP P LLD * E K 
APAPRVP*TGEGMPAVNVAFAPPPFYKE 
RPS 


3042 


16943 


A 


3064 


1 


409 


PTRPRESTYQGHHTPPVQKGLRYGI ILF 
ITSEVFFFAGFF*AFYHSSLAPTPOT.£?G 
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HWP PTG I TPLNPLE VPLLTT 


3043 


16944 


A 


3065 


1 


137 


HTFNFS I YQKATVI KTVW YW YNNRHTVE 
SPEINPYIYS*LIFFLF 


3044 


16945 


A 


3066 


4 


426 


KLEN * KMVLKE I KEDLNKQTD I LFS * LQ 
RLITVRMSILPKLIYKFSAIPIQIPA* F 
L* I KI I IKCMRKGK*TRIAET I FFFFFL 
SQS F I LSPKLDHRGG I TANCTP P WAI KG 
KLLLKKKKKPPKTKPEKFFIQKATGAEG 
GVH 


3045 


16946 


A 


3067 


411 


187 


RNLPNVPPRPTHFVLLVKTGVSQVGQGG 
GKLLASKNPPSPAPPKSWDYRGEPPRPA 
PRKFFF*LNKFKIYGGPKN 


3046 


16947 


A 


3068 


193 


471 


QCTCIKVHSGQKTGSTPLVIGELQIKIT 
LGCYYTPTLMAQIKKTDRTKCW*GYGAI 
GMLILCWRECKIVQSL*KRVWQFII*LN 
IYLAIKLNI 


3047 


16948 


A 


3069 


323 


478 


FFXFXFLFFFILXYFYYF* KLFYLYLFX 
YIFKIYFIYNFIYIILFYIIFIIF 


3048 


16949 


A 


3070 


1 


378 


GTRRFFFHS I TLL I LGLLSNTLTI YQ * W 
RDVTRESTYQGHHTPPCQKGLLNGI I LF 
ITSEVFFFAGFF*AFYHSSLAPTPQLGG 
HWPPTGITPLNPLEIPLLNTCVLLASGV 
SIT*AHHSLIENNR 


3049 


16950 


A 


3071 


201 


2 


TTPIQLFLKHYHT*NFNYNFFFEIGSCS 
IAQAKVQWCDLCLLQPQTLGLKHSSHLS 
LQSSWDYRHA 


3050 


16951 


A 


3072 . 


261 


1 


EKAMGGGPFKVKKS PGEGPTTKGWPLKG 
PLEGGQRGLTGPFKSNGRLFFFFFGSNE 
VSLCCPG*SRTPDFK*SACLSLPKCWDY 
RHG 


3051 


16952 


A 


3073 


333 


52 


EIFKKKKKGGGRFFNKRVFPRPRVSNGR 
PRAQFFLETFFFFPERGFFFFFFFETES 
YS VTQAEV* WNYLGSQQP PRFKRFSHLS 
LPSSWNYRCK 


3052 


16953 


A 


3074 


7 


214 


SQLQENRIiNPGGGGCGEPRS CHCTPVWA 
TE*DSVSKKKKKKKKRGGWVPPSLGGGP 
KKKNPFFNQEGGL 


3053 


16954 


A 


3075 


180 


419 


QKHLS ILHYLFKRDVFFFFLKGSFWSQ 
VGGQGHNLG * LKAP PPRLTHFS CLTLRE 
TWKNRPRYFFCFFI KTGFHHVTRE 


3054 


16955 


A 


3076 


371 


208 


QKLSGHGGSRL * S * PF* EAKAG * QLVSG 
GRGCSQL* SHQCTPAWVTE * RLVCKR 


3055 


16956 


A 


3077 


349 


3 


TFCWQKYTMCRSLCHPHPPRTWSTKKKR 
PPFQKGQGDAPPYKKVQRGNPPPPLKGR 
PSRGPPKKCKVFKAPVFLFPRFS PPPFF 
PPPLFFFFFF*DRVLFCCPGWSAVAQLT 
ATS 


3056 


16957 


A 


3078 


124 


1 


ISSETPAVKSSFWPGVWHACNPSTLGR 
*GKEITRSRDRDH 


3057 


16958 


A 


3079 


205 


1 


CLVQW I WACVSHWRY I MS CKG C * RLGLM 
CVCVCVCVCVCVCVYFCCCCCCC YC* DE 
LSLCHSDCSAVA 


3058 


16959 


A 


3080 


3 


413 


GHHTQ P VQKGLL YG 1 1 LF I T * EG FFFSG 
FF*AFYHSSLAPTPQLGGHWPPTGITPL 
NPL* VPLLNTS VLLASGVS I T * AHHRLI 
* NNRNQ 1 1 QALL I T I LLGLYFTLLQAS E 
YFETPFTISBGIYGSTFFVATGFHR 


3059 


16960 


A 


3081 


1 


321 


NSLNPGDGGCS * PRLHYCTPAWGTELDS 
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ISKEKKYPYHVYHPSMKMTIYNYRQQNV 
NHIYIMLVEHSQTQENTCFMISGNFFFN 
LP I VLGEGEKNQH SIS FKL FFN F 


3060 


16961 


A 


3082 


117 


484 


VLKYFIDSEWAVLFSISCSFVTDFVFL 
FFFFGKGVSFCPPAGIKGGGFGFLEPLA 
SGFKRI FFPNPLEKWE*RARPPPRGKFW 
NFFFFFFLKRELFFAPRWEGRGKILVYL 
KGPPWGYPHF 


3061 


16962 


A 


3083 


381 


227 


C I SRDG F YHLGQAGLE LLT S S D P P ALD S 
*SVGITGVSHRTRPI>LLRLNVQF 


3062 


16963 


A 


3084 


2 


391 


SHAYHIG*PSR* PVTGALSDLLMTSGLA 
M* IDFHS ITLLI LCLLTNTLS I YQ* WRD 
VTRESTYHGHHTPPVPKGLRYGI ILFIT 
SEVFFFARFF*AFYHSSLAPTPQLRGHW 
PPTCITPRNTLDVPLLNT 


3063 


16964 


A 


3085 


2 


248 


I MRSGDRDHPG * HGETPSLLKYKRLAGH 
GGMRLWSQLLGKGGTADSHHHVLLILET 
FYSLRERRHLTSVPTLGMNYWAQDIR 


3064 


16965 


A 


3086 


276 


243 


EKWPD*SRAACPVLCRGNGQYSKGRCLC 
FSGWKGTECDVPTTQCIDPQCGGRGICI 
MGSCACNSGYKGESCEEAPRYIPEKE 


3065 


16966 


A 


3087 


3 


130 


GFYHVGQAGLEKLTL * SACLSLPKCWDY 
RREPPRPAHTPPHS 


3066 


16967 


A 


3088 


277 


2 


SSSSVFCLLVWTSSSSSSSAARLPPLTG 
FLPKWAI IEEFTKNNSLI I PTI I ATI TL 
LNL Y F YLRLI YS TS I TLL P I SNNVKI K* 
QFEHTKP 


3067 


16968 


A 


3089 


404 


60 


FSKGKMRFFS PPS PKKI FFSPQS FYFLG 
GGGAKMPPPKKRFFFKKTPRGFYFPP * K 
KKNFFFPPPVKFGPPRGFLKSPPPFFFF 
FFFFFFFFFFFFFSPTFFTVFHLMLKSD 
ND 


3068 


16969 


A 


3090 


405 


186 


INKKPEAFTNTVDQMVLTNSHRTFYPTA 
TACSLSGAHRTFSRMDHV*DHKTSLNKF 
KTEITLSTLSNHKLEP 


3069 


16970 


A 


3091 


371 


1 


S PPLVQKGLR YG 1 1 LFITSEVFFFAGFF 
*AFYHSSLAPTPQLGGHWPPTGITPINP 
LEVPLLNTSVLLASGVSI T * AHHSLIEN 


3070 


16971 


A 


3092 


263 


3 


NKRSPPVNLWWKMGSFFKFAKRVKISKK 
GGGGFFFFSWRRSFTFVAQAGMQWRNLS 
SLQPRLPDLR*SACLGPPDC*DYRREPQ 
YP 


3071 


16972 


A 


3093 


1 


392 


FFFADFKKMFILINHFKMELTTYFELKR 
NEATASENC*DAVKAVLGGKFIVL»STYI 
RKEERPRINNISFQIKHWTKKN*T*GKQ 
KKKKKKKKVTGPE I PKFLI VKSGKPPKV 
I LTGAWGP I KFLS F I TRL 


3072 


16973 


A 


3094 


3 


367 


EM* IE ITMRYHYNTARRLKF* KTDNIKC 
* *GHGTSGTLI YHWQE * KMVQPLWKIV* 
QLL 


3073 


16974 


A 


3095 


3 


257 


HEVSQDGLNLLTS * SARLGLPKCWDYRR 
EPPCLAWLILPDDCVIFQKLKLLHHNLL 
NLSCIDVLMGIYSLSNFSQSNFPFFFFF 


3074 


16975 


A 


3096 


189 


414 


KGLVMVTS * CKMFFCLS I FFFFFERGFC 
FFAQAGVQGHNLSSLEPLPPQLKQFFCL 
TLPRS * KYRPAPPCPANFY ! 


3075 


16976 


A 


3097 


1 


299 


ENYR PIS FMNTDAKILNKILAWQ IQQCS 
KRI THRD QVGFL PCMQGQFY I * KS I ICS I 
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NAIHHISRLKKRNHMIPSIEAKKAFNTH 
S * FMKKNKKLAFQET 


3076 


16977 


A 


3098 


362 


126 


FIHTHTHTHTHTHTHIYI YI YI IASFQW 
NAIRGRMKCINKP * KDMEELKCI LPSER 
SQSAKATYLLYDSNYITLEKAKL 


3077 


16978 


A 


3099 


3 


423 


RHEHAYH I VKPTP * PLTGALS ALLMTSG 
LAM* FHFHS I TLL I LGLLTNTLTI YQ * W 
RDVTRESTYQGRHTPP VQKGLR YGI I LF 
ITSKVFFFAGFF*DFYHSSLAPTPQLRG 
HWAPTGITPLNPLEVPLLYTCVLLASGV 


3078 


16979 


A 


3100 


392 


83 


LRNTNCGHGAP FKNS PFFPQFLVKNS PA 
YFQNKEKDVGKPPFFFRAPDGAPFFKKK 
KK*GLALSSRLEYGGMISLELLGSSDTF 
ASASRVARTTGQCHHAGPT 


3079 


16980 


A 


3101 


81 


247 


GGWGGPPLKPPLF* KKTGKNFWPPFFKG 
KEKPPPNPRGGGKKGGPKPPPVIFFFF 


3080 


16981 


A 


3102 


3 


388 


HEKKEPF* YIGMG *AMIS IGFLGFI V*A 
HH I FTVG I YVDTRAY FTS AS 1 1 1 AI P TG 
GKVFS *LATLHGNNMK*SAAVL*ALGFI 
FIFTESGLTGIVLiSNSSLDIVLHDTCYL 
VAHLHYVLS IGAVFAI 


.3081 


16982 


A 


3103 


3 


387 


HERHEELSNS * ANNLI *LAYT IAFIVKI 
PLYGLHL * LPKAHVEAP I AGS I VLAAVL 
LKLGG YG 1 1 RIiT L I LNPLTKH I AYP F L V 
LSL*GIIITSSICLRQTDLKSLIAYSSI 

shialvvtailiqtp 


3082 


16983 


A 


3104 


3 


381 


HEQSHAYH I VKP S P * PLTGALLALLMTS 
GLAMGFYFHFITLLILGLLTYTLTIYQ* 
WRDVSRESTYQGQHTPPVQKGLRYGIIL 
FIT*EAFFFARYF*AFYDFRLGPTPQLR 
GHWPPTGITPLNSL 


3083 


16984 


A 


3105 


412 


111 


FLGHPLFKKKMGGKKRGLPKMGV * HP PA 
PKGKPPPLKKKKKKPGGGGAPLYSPFSG 
GEGKKNPLTPEGGGPKNPNSPPPPPPGG 
KKKNPPPLSPKKKKKK 


3084 


16985 


A 


3106 


298 


98 


FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF . 
F*L*IIY*AICAINNIRGAGAELYRSRD 
FVCY 


3085 


16986 


A 


3107 


2 


417 


ATHVGLQD VTS PM I EQLITFHDHALMN I 
LI ICFLDLGALFLTLATKLTKTNM* YAQ 
EI*TD*TILPAIILNLMALPSLRMLYIT 
DEGDDPSLTIKS IGHQWY*TYEYTDYGG 
LIFNSYIRPPLFLEPGELRLLDVDNR 


3086 


16987 


A 


3108 


8 


430 


VGLVLFLS ELPLNGGI LTFFHQG I YSP F 
PGGRTWALMVGSWGLVMASTDLLGPLCH 
AFTPATQLLLNLAVAS PL* PAALR IGCH 
SKTTNALTHFLPRGTPTPPRP ILVTIET 
MSLLIPP I APDQRLAAGFTARHLLLHLS 
V 


3087 


16988 


A 


3109 


3 


453 


PRAIKFYRDWPGHERKRIAWKGDPCHMV 
LIKDEKGLMCQKKKKKTPFFWAPKIPLV 
FPPAQKNQGS* PNPPGG3GNPPLRPGP* 
RKKPPALMGPPPSSPGGINPKSFFYLNP 
GPAH* PRGEN* LG PVFENLFLPLLKKI F 
LWGELRCSQT 


3088 


16989 


A 


3110 


398 


2 


SLFQKNPNPLVG* KKKGKALPQGPPPLF 
PPLGGAGPGGSQGRGWGPPRPPWGNPFF 
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G=GIycine, H=Histidine, I=Isoleucine, 
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F*NPKIWPAKGGAPFSPPSEKG*AGGFF 
LPRGGGFPLTQIPPPPPHLGGKSKPPFQ 
KKKKKNRQGTSWMMQGWGPQ 


3089 


16990 


A 


3111 


3 


371 


SSDPPTSAFQGSGTTDMCHHHHAQLI FN 
FFVETGS CYVAQAGVQWHDHGS LLSQTP 
GLR * SSGATVLNV7APALGPRR PDPTRME 
SLVLMKPRGSLIRSACPDCFLVF/JFSFF 
HEAEGCASEC 


3090 


16991 


A 


3112 


233 


449 


FALFS * LATLHG SNMK* SAAGL * ALGF I 
FLFTAGGLTGIVLANSSLDIGLHDTYYV 
EAHFHYGItSIGAEFAIIGGFIH* FALFS 
GYTLDQTYPKIHFTIIFIGGNITFLPQH 
FLGLSGMPRRYSDYPDAYTT*NILSSGG 
S FNALN AG SIT 


3091 


16992 


A 


3113 


458 


2 


RGPPPPPPPKFFFF*TPGKNSPPPPPEG 
VFFPPSPPPNFFFSPPPLFFFGGVPPIS 
PPPPKSFSPQTPPRFFFPPPPLKKNFFS 
PPPLFLPPPPFFFYPPPPFFFFFFFFFF 
FFFFFFFFFFCGDLEGELPGTGMLACVI 
LLRANRKARTRG 


3092 


16993 


A 


3114 


1 


418 


LNTTFFYPDGGGDPNLYQHLY* F*GHPD 
PDIHILPRPGIRSHIDTDYSGKKEPYAY 
VGMGWAMTSIGFLWLMVRARPLFTVGVG 
VDAQAYS S FAS I TI AI PTGAEVFS * LCP 
LPLSGMK* TGAAVWALGLRFI FTCSGR 


3093 


16994 


A. 


3115 


1 


425 


PRINN I S F * LL P PS LLLLLAYA I VEAGA 
GTG+TAYPPLAGNYSHPGASGNLTIFSL 
HLAGGSSILGAINFITTIINIKPPAITQ 
DQTPLFV* SVLITEDLLFLSLPGLAAGI 
T I LLTDLNLSTTF FDPAGGGDP I IjYQHL 
F* FFDPAGGGDP ILYQHLF 


3094 


16995 


A 


3116 


2 


383 


GLSCTNHKDMGALYLLL3ARAGVLSTAL 
SLLIRAELGQPGYLLGNDHIYWDIVTAH 
AFVIIFFIVIPTVLGGFGN*LGPLIIGA 
PDTAVPR INNI SI* LLPPCLLLRLACAI 
E E AGAGTG * TVYP PL 


3095 


16996 


A 


3117 


1 


259 


PTRPALWTAILIQTP*SFTGAVILIIA 
HGLTS SLLFCLANSNYERTHSR 1 1 ILSQ 
GLQTLLPLIAF* *LLASLANLPAPTPTP 
HQ 


30% 


16997 


A 


3118 


3 


174 


L I RGGRGCHELRS RHCTPAWATRARTLS 
QKK* KTTNPKKKLCLI FFGGKKKKLKKG 


3097 


16998 


A 


3119 


155 


1 


PDFFNKSMDKKKTARGWEDSSSSFCFFK 
RDRVLLCCPGWSAVAQS * LTAAS 


3098 


16999 


A 


3120 


420 


2 


PPPKFFFPPTPPFLGEGGAKTPPPKKIF 
FLKKPPGVFFFPPLKKKKFFFS PP * FLA 
PPKIFFKRPPPFFFFFFFFFFFFFFFFF 
FFFFFFFFFLVFIVLFQVKVKFLKKCFN 
IQFPLASDNS*PSMIHEKFYCESNIEF 


3099 


17000 


A 


3121 


48 


387 


RDPVLQKKEKKKKKKPXNQKXKKKKGGP 
F*KIP* GAKI KPGKEKKNFS PKRGAKKK 
NPGNFEKKTNFGGGKKWG KP PPKN * RFK 
GKKKFLKGKGGKKTQIPWGLKIFFNGFD 


3100 


17001 


A 


3122 


237 


2 


P FP WLPPFPLKASSSP * SL * FLLGGWP 
NLP SS PNKGS FPK F PS WFLFR P P * GKNF 
YLALPR* PWPPQGFL*TAPP 


3101 


17002 


A 


3123 


448 


130 


PPRFFPEFFYSGPPKPPFFKTPVFLGVK 
PGVFFFSPYQKKPTNFG?KMGAL*RIPL 
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N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possib!e 
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FETPIWFPIKGFHKKKPPVLN*PPTRX 
P P PDKI LKKKKKDDC I S RLARN 


3102 


17003 


A 


3124 


453 


3 


SSAREGGGVPPPPPKIIFFPPPPIFSWG 
GGGTKSPPPEREVFPKNHGGCFFSPP*K 
RGKYFSPPPRMGPPPGVFFKGPPPNFFF 
FFFFFFFFFFIFFFFFFFFFFTKKKNWF 
FFKAFRMSPKPVPPFFFFCNYRWFRPR 
VRPRVRPRV 


3J03 


17004 


A 


3125 


443 


2 


YFPPFGRVI WGDSLGAGVLNPPRPHKGT 
PFFPKKIFSIRPGWGGGPPLPPPQRQMW 
GGPPPPLFGLFRPQGKI PFPPKKKIKPP 
GGVCVGVGVCV * KKKKNPFPPFLWAGLK 
EPCFF F FF F FEME FCS CCPGW S AMARSQ 
LNCNLHL 


3104 


17005 


A 


3126 


3 


186 


PVIYSTIFAGTLITALSSH*FFT*VGIjE 
INMLKKKKKKKKKKKKKKKKKKKKKKKR 
GGGL 


3105 


17006 


A 


3127 


171 


38 


KKKKLFFPPREKWGPPKNFLKRAPPFFF 
FFFFFFFFFFFFFWSERSS*VA 


3106 


17007 


A 


3128 


16 


189 


ILGEVIWV* *FF*FIKKKKKXKKKKKKK 
KKKKKRGAPFKKTPGGPQITPGEKKKIF 
SL 


3107 


17008 


A 


3129 


401 


85 


LVNFFSPQEKRGFFPPPPPKNFFFSPRG 
FFFLGGGGP I FPPPKKS FFS KNPPGVFF 
SPP*KKKIFFFPPPLFWAPPRFFFKGPP 
PFFFFFFFFFFFFFFFFFSQF 


3108 


17009 


A 


3130 


2 


312 


ANNIil * LAYTIAFIVKI PLYGLHL* LPK 
AHVEAPIAGSIVLAAVLLKLGGYGI IRL 
TL I LN PLTKH I AYP FLVLS L * GGGV F 


3109 


17010 


A 


3131 


449 


3 


FFFFFFFLGGPPEIFFFFFCPPKKPKPP 
LGGGKKKPP FF * NFPQKPLGVLGAP P PP 
LCFFIKKKKGGGKKFF£PPLF*KGGPFK 
KFFFFPPPPKGPPPFLKNLRGWVFFKPP 
PKKKALSFFKKKKKKKKKKKKKKRAAAR 
DLELADAW 


3110 


17011 


A 


3132 


95 


448 


VINRE*KCVC*MKKKKKKKKKKKKKKKK 
KKKERAG KGGG 


3111 


17012 


A 


3133 


60 


442 


LGG F F F FGGKKG FCLWC PRWGAKAG I P V 
NGTPPRGV*RNFIAQPP * EGGITGPPPL 
PQ* FGFLRENGVPLRGFGGFEPPILGEP 
PPLPPQKGGKNGRNPPPPLKGFLVLFKK 
EFSSLVPSWKARGDP 


3112 


17013 


A 


3134 


236 


45 


FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFGAAD * VILFRYFQ 
FEATIMGV 


3113 


17014 


A 


3135 


441 


88 


KQQTPPGLIFF*KAPRREIFLPPPIMVF 
FSPPSPFKFFFFLSPFIFFGGVLPFFPP 
PKKGFFFKNPRRVFFCPPLKKKNFFFLP 
PFFFGPPRVFFYPPPPIFFFFFFFFFFF 
FFFFFF 


3114 


17015 


A 


3136 


2 


328 


TMLSPKPCX^LNQQNCSFEHYEPQLKTQR 
TWR * KKKKKKKKKKKKKKKKKKKKKKKK 
KKKKKRGGGQKKKMVGGEKKKPG* KI FF 
FF I KKVKKKTALGDKKKTQ FWGGS 


3115 


17016 


A 


3137 


281 


2 


KATKSGTPIPSQGQQSLAWSWAGIGSAQ 
PPALLHS * P IGKI FKNCKP VGRKSPQLP 
RNTSWOLGAVAHPSNSSTI.GGRGGRITR 
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SGVRDRTRG 


3116 


17017 


A 


3138 


448 


3 


FFFLPPSPFFPY*KRGFSLFGRWFNFP 
PPVFPPPWPSQIFGFQALIFLPPPPPLF 
FSFPVGFFQTALFPFNGFFPGFFHGLFP 
LLRFCPRK*VFWGGFFFFFFFFLRDKVS 
LCHPGWNAVAQSEFTTALTSKAQAPTRP 
PTRPPTRP 


3117 


17018 


A 


3139 


2 


436 


DR * LFS TNHKE IGTL YLLFGA* AGVLST 
ALSLLILAELGQPGNLDGNDHIYNVIVT 
AHALGKI FFIAI P 1 1 IGGFGN * LAPL 1 1 
GAPEMAI PRINNI S *GLLPPS ILLLLAS 
AIEEAGAGTG * TDY P PLAGNYCHPGASG 
DLIIF 


3118 


17019 


A 


3140 


362 


2 


KPRRGKF F P PREGGGG FPP P P P KNF F F P 
KGGKFLGGGGGKNS PPQKKGFFQKNPGG 
VFFPPPKKKKNIFFPPGKMGAPPGFF*R 
GPPPFFFFFFFFFFFFFFFFFGQSGQVK 
LKSPKCKL 


3119 


17020 


A 


3141 


440 


102 


PTPPPCCKFSFKRPPKKTLFFPTTNLVF 
FSPIPP*NFFFSPQALIFVGFLAPIFPP 
QKKFFFSKFPPLFFITPPLIKKFFVPPP 
PFILSPLKIFYKPPPPIFFFFFFFFFSF 


3120 


17021 


A 


3142 


1 


79 


FKLDYFS 1 1 FI PVALFVTWS 1 1 E FSL* Y 
INSDPNINQF* KLD YFSI I FI P VALFVT 
WSIIEFSL 


3121 


17022 


A 


3143 


3 


441 


FFFPPPLKKKKFFPPPPNIGPPPKSL*K 
PPPPPFFFFF 


3122 


17023 


A 


3144 


223 


3 


LPYWKLPYLKH* * *LQDTNQESRG*HFL 
RPRPFKNQMKSGTVAHACNPSTLGGRGG 
RITRSGV* DHPGQHGEI 


3123 


17024 


A 


3145 


65 


414 


KKKKKKKXKKKKKKKKKKKKKKKKRGGP 
PKKKTRGGPQNPPPKKKKFPPQKGGKKK 
PPLGF*KKTPPLGGEKIPPPPPKKNTPP 
KKKKKF*GGGGQTPPPPPPGKKFSPPRN 
KKKK 


3124 


17025 


A 


3146 


410 


31 


RANQKAFRGKPLCDLAVGKNLS SRTQIA 
LTI * KW INWTILKLRTSGH+ KTPIKTIK 
RYPI EGEKISDEEL YLQY*KVL * IGKKK 
PDNPVEKWANDLKRSFMKGN ILTVFKGM 
QR*LGSMAHICSLS 


3125 


17026 


A 


3147 


182 


241 


SHPSHHSTINITNKGLL*TPLPIPNPLV 
NLNLGLLF I LATS SLAVYS I L * SGGASN 
SNYALIGALRAVAQTIS YEVTLAI ILLS 
TLLIRGSFNL 


3126 


17027 


A 


314S 


254 


1 


KTKKGLKI KDPLTRF * I S VC * ITKS I * F 
FKLLPFFPLLKEHIPLKYLFFFFFLFET 
EFRSCCPGWSTMV*SLLTATSTSQIQAI 


3127 


17028 


A 


3149 


499 


2 


NTPPAAGGGCFFFFFFGEKNFPPPHPTP 
RFFPPPPLKNFFFFSRVFFFWGGGAQKA 
PPPKKVFF*KIPRGFFFPPPKKKKIFFF 
SPGFFGAPPGFFLSGPP3FFFFFFFFFF 
FFFFFFFFFFFFFFFFLINFLLSLQGFF 
LVSKELFLFGLTVKFTRGFRGFCGQ 


3128 


17029 


A 


3150 


256 


146 


KNAKVTQ VG PEFN KG PG * HTHTHTHSHT 
HTHTHTLQ 


3129 


17030 


A 


3151 


333 


1 


TI SCLCTRGEHPLS PRRAGPYTGSPLHC 
CVDWDKVFS S WKDLTDWPLGDLD I E YF 
TDGSSFI LRGVCRAG YAAVTLDS AVEVL 
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S VSAETSA* KAEL I ALTRALWLGKEQK 


3130 


17031 


A 


3152 


43 


379 


NKTSLFGWDYIWEWGAPEPETPPKRAAG 
ALHSLAQPFSVALPPCFFDPCAPSLSPG 
*HALRPLPLPGLASEIQTAPSWHVPPKS 
LSPAPQPCPIPTLVPVGYKTPP 


3131 


17032 


A 


3153 


398 


12 


NTTPGGGKFFLKKTREEKFFP PKKKRGF 
FPPPPPKNFFFPQGGNFFGGGGGQISPP 
QKKGFFQKNPRGVFFTPPKKKKI FFSPP 
GKMGAPPGFF*RGPPPFFFFFFFFFFFF 
FFFFFFLKKSWRPLAI 


3132 


17033 


A 


3154 


371 


105 


SPSPQVNFIKGPKPPPPK*IL*RAPNPP 
LPKKKFSNPPTWGPGPQPPPSKFSKFAR 
FPFLPPPFPFKKEPPKKKIFFPTKEGTV 
I*KNPPFSGFQSPDSIK 


3133 


17034 


A 


3155 


-> 


371 


DVGADPILHTSTGL*LAMQY* PEA*TAF 
S S IAH IT* DVYYG * V IRYLHANGA* IFF 
ICLLLHIGRGLYYRPFLYSKT*KIGLIL 
LLATI TTAFIGYVLP *GPI * F *GATVMT 
NLLSAIPYIGT 


3134 


17035 


A 


3156 


1 


398 


I AI PTGVKVFS * LATLHGSNMK * SAAVL 
*ALRYIFLFTEGGIiTGIVLPNSSLNIVL 
HDPYYWAHFHYVLS IGAGFAI IGGFIH 
* FPLFSGYTLDQT YAI IHFTI FIGVNLT 
FFPQHFLGLSGMPRRYSDYP 


3135 


17036 


A 


3157 


401 


46 


P PGGRNF FKKT PG KKI FST KKKKG FF P P 
LPPKNFFFSPGGFFFGGGGGPNFPPPKK 
GFFS KNPRGVFFS PP * KKKI FFFPPG* I 
WAPPRVFLKGPPPFFFFFFFFFFFFFFG 
KKSVRA 


3136 


17037 


A 


3158 


404 


IS 


FFSPPPPPPGGGVFPPNPNKNPFPPPPP 
PFFLGGGPPPPPPPPP*FPPPPTPPNVF 
FFSPPKKKNFFPPPPGPPPPPPKPPPPP 
PPPFFFFFFFFFFFFFFFFFFFLFFFEP 
MEKGPRPGDIGSNPPS 


3137 


17038 


A 


3159 


3 


192 


SLVIAGCPR*NLSSTLNLPTEPSKSPCK 
FNC* KKKKKKKKKKKKKKKKRGGALKKN 
PWGGKK 


3138 


17039 


A 


3160 


205 


80 


VQRDNFGFLQPSPSGVKLFFCLSLPNKW 
DFRCGPPNPG* FFS 


3139 


17040 


A 


3161 


3 


384 


LIVPTIILLPLT*LSKKHII*IINTTTHS 
LIISIIPLLFFNQINNNLFSCSPTFSSD 
HLSHP I LKKKKKKKKKKKKKKKKKGGGA 
FKRTPGGAAHWGGGGRETFFPKGGEKKN 
RPGVFWKQTFFWGGK 


3140 


17041 


A 


3162 


348 


70 


GPPPKKRVFSKTPKWLNKPPQKKKKFI 
FPPPVNLGPPKNFLKGPPPFFFFFFFFF 
FFFF F YFFWMGCDR * CSWRHSSPPRLSG 
TPKCSPSVT 


3141 


17042 


A 


3163 


2 


353 


LKTI PLTSTCLTIGSLALAGI PFLTGFY 
SEDHI I * TANVLYTNA* ALS ITLIATSL 
TSAYSTRI I LLTLTGQPRFPTLTN I ETK 
KKKGGPFNR YPLGAQVYGGGQN EKF FL I 
GREII 


3142 


17043 


A 


3164 


1 


221 


PTRPRDCSELRSCHCTPAWATEGDS ISR 
KKKKLSTRTAF*YTEAINSLIYSLNIGT 
FKTYFPKI KTYDRHFDF 


3143 


17044 


A 


3165 


26 


383 


I P FYQ * SL I * YTRKKKKKKKKKKKKKKK 
KKKKRGG 
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3144 


17045 


A 


3166 


251 


381 


GNLCAGWARWLT P WPAL WEAEAD * S RG 
QEIGAI IiANTVKPHL 


3145 


17046 


A 


3167 


391 


115 


LFKKI SPHAG I WGFFS PLTP *NFFFSLE 
PFIFGRGLAPIFPPPK*RFLSKNPPVVF 
IPPPLMGKPFPPPPPVRLGPPIYSFKGA 
PPFFFFFF 


3146 


17047 


A 


3168 


94 


389 


S PG I LGQKGQ I GP I GNKVPGGIiAAPVTP 
* FRFKPRLP *GFGPKAS PPLALKPERAQ 
VGGT P P PGPRG PNGQP P PFKENQPGLG F 
RFRGKLAEKRGFHL 


3147 


17048 


A 


3169 


3 


363 


WATALQPGQQSETQSQKKKKKKKKKKRG 
GPPFFFFFKKKIFFSPPGAKNKRGKFF* 
KPPKKKKKIFFFPPPPLKKKKKKKKKKK 
KKKKKKMGGAFL KKPRG AP P P PGKRKL I 
FFFLKGV 


3148 


17049 


A 


3170 


2 


171 


KEPLGYIRMV*AMISIGFLGGIV*AHHI 
FTVG IDVDTRAYFrSATI 1 1 VI PTGVKV 
FS*LATLHGSNMK* SAAVL*ALGFIFLF 
TVGGLTG I VLTNS S LD I VLHDTYYWAH 
FHYVLLIGG* FYLR YHNHCYPHRRQS I 


3149 


17050 


A 


3171 


390 


27 


QSLTVKSPYPWILIKTKGHH*VMNAGL 
TRYQSLIiCENPHIRSEVCITLNPPPLLP 
VSESPVKHSCVQVLDSVYSSGPNL*DHP 
* TS VDWEL YVDG I S FANPCKVS LKKMTS 
PAPVTPRS 


3150 


17051 


A 


3172 


127 


310 


KNP GGAK I LRGGE RKN F FLKRGGKKKHL 
G I FGKKTFFWGGKKWAKP PK* WRFFSPF 
SP* KFFFSLKALI FFGGFCPFFSPPKKS 
FFSKNSQVFFFSPPFKEKIFPFPPP*NF 
GPPRVFLKGPPPFFFLGVFFFFFFF 


3151 


17052 


A 


3173 


376 


<■> 
j 


FFFFFLRRQSLAVTLAGV*RCNLSSLQS 
PPPGIK 


3152 


17053 


A 


3174 


365 


14 


RENFFSPEGGAPKNKPPPPPPPPGGKKK 
I FFQKKKKKKI FYPWKNFPPPPKKKKKN 
P * KPG PLKSQG FFFFKKNQNLP WGPPQ I 
SFPKKKKKKKAKTVQERKYNSNTQLVSA 
ETQLL 


3153 


17054 


A 


3175 


1 


377 


VPLHSSLGGKARLHLRKKKKKNPGFLKN 
FGPIiALLGMGVGN I KGLKGQKGKNPAFG 
AHTGGGCFSLRGRNTPFPKRAEGIICYN 
SPH*KET*KALEPRGYKGL*QAIiALPNL 
KSGKMEH I LRGAP 


3154 


17055 


A 


3176 


352 


125 


GHEVLDSSDLPASASQSAGITGVSHHAL 
* EILLSMFETTWACDLLFQN ISFIKSSI 
PCFIGLDFIMPHRYCRFFF 


3155 


17056 


A 


3177 


139 


366 


TAHTS * GY * VKNY I NLSFCFF FFFLERN 
HSALQPGGQGHNRSSLQPWPPGLKQFSC 
LSLPRSWDNGLVPQHLVNFF 


3156 


17057 


A 


3178 


206 


1 


KGTLFKKDFFFKNFPKKVFLGVPKNSLF 
* KILTLPPVLNPYPFFFFFFFETVSLCH 
PGWNAWPRKEP 


3157 


17058 


A 


3179 


330 


1 


IVSTLETCYIAYNEEEKDTFITLRIYVI 
GGNGKFLG 1*1 KQH I KKI IHHDQ VGF I P 
GMQGWFNI CRS INVIHHI SR IKGKSRAQ 
WLTPVIPALWEGDAGGSPEVRSSRPA 


3158 


17059 


A 


3180 


3 


399 


HAS AP LQS S LGY RARP CFKKKKK KKKGG 
GGGGFFFPKGGFGPLPKKGFFSGKGGLG 
KWGLGGAGKTPG I KKPLGKGP PKKEGGK 
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KGQKGPHF*KKKKKKGGKKLKKGPF* KP 
FKKPLLGRGVQP *NPHLLGG 


3159 


17060 


A 


3181 


2 


204 


CPTACPFW*NKELLMPKKKKKKKKKKKK 
KKKKKKKKKRGGAPFKCvSLGGPHFPGAG 
KKNFFFFWGGY 


3160 


17061 


A 


3182 


71 


377 


PKRGGQPKQKKIWGPPPPGAPPQKGMGF 
FNPRGFKKQITFFPPPPPPPEKKPPFF* 
KKKKKKKKSNCSD 


3161 


17062 


A 


3183 


151 


2 


FFFLEELVPLLLKLFQKIEEEGCSPNSF 
NEAF I ILI PKPFR* TTKKENF 


3162 


17063 


A 


3184 


152 


387 


YFQGFIYWACICTSFLFFFFFUCREFC 
FVTQVEVQGPNFT* LNPPLFGLKKFFCL 
TLQIGWNNRPLPPPQVIFCFFK 


3163 


17064 


A 


3185 


22 


336 


YEKCTALLQMVSSFIWMEREGTHQYSFY 
RKDFSLASKVNI VSYYLSPIVE* FFFFL 
RGS PFAPQAGGQGPNLGSWKPLPPGLMP 
FSCLTLPGGWNCRPPPPGPVN 


3164 


17065 


A 


3186 


344 


1 


WVLKKI FFY PGRGGPPFI PP PLGGQGGP 
IPWARGF*PPRGPPPKNGF*KKKKKRGG 
GGPPPGFPPPGGPRGGVPFFLGGGGPRK 
PKKITKKKNPGEKKKTSFKNQKRKTKI K 
TT 


3165 


17066 


A 


3187 


296 


1 


NPKKI LTLPKKTKVYKCEGENQVP 1 1 FQ 
GIKNI FWKGI F* PKKEREVCV* SMRHVI 
PVFPKKRGSKRSNKSCCYKDTCTRMFIV 
ALFTIGKTWKQPKY 


3166 


17067 


A 


3188 


2134 


1 


G VAAHACNAS I LGGQGGR I I *GWEFETS 
LANMVKPC 


3167 


17068 


A 


3189 


1 


159 


LQDHPG*HGEPPSIVKIQKLARHGSLRL 
* S * LLGRLRQRMRQETCLNPGAR 


3168 


17069 


A 


3190 


119 


340 


QIKKNRLVSARGKNNKRK* I YKP* VDI F 
FKEDIQMAGKQMKRDLISLI IREIQIKT 
TITCYLIHARMGTITRD 


3169 


17070 


A 


3191 


75 


1 


LSVNWFWPGTVAHACNPSTLGG *GG 


3170 


17071 


A 


3192 


343 


1 


I F I LGGG P CCS PV FFQFFGGGGGGFFL P 
QNFFPPRGKIFRPFFF*KKKLKRPNWGF 
F*NFFNPPLGFFNLFFFFFKKKPKNFFF 
LGGFFFFFFFFFFFFLAGGDSLALSPRL 
EC 


3171 


17072 


A 


3193 


105 


368 


KFKDPPFPPPFFWPPPKQLPPPPGKIGA 
PHF*TPKGPPPPKKKKC*KKKILKGGRG 
KKKKKTPPKRPQKIWGPSKKKNPWGGGK 
TPPL 


3172 


17073 


A 


3194 


1 


365 


FCRDEVLLFCPDV7SPTPGLKQFSHLCLP 
KCWSYRC* PPYPAQGVFLK*HLTKSLSC 
LKLFMASLCLQDKVPAPQPCVKGLSKFF 
LCHLLSTLI PFTHSLFFLFLGWFLRQHR 
SVTQAGVQWR 


3173 


17074 


A 


3195 


1 


362 


GNQPKRLNAGTYLLFYTLEGSVPLLIAL 
MYTHKTLW SLNI LLLTLTGQELSNC* AN 
NLI * XiA YTRAL I VM I PL YGLHL * LPKAH 
VEAPIAGS I *LAAVLLKIiGGYGI IRLTL 
ILNPLTNT 


3174 


17075 


A 


3196 


137 


3 ' 


KGQPRFPPISLKKGSQGKKGFFFFFF*Y 
RIPLCHPGWCPWQSW 


3175 


17076 


A 


3197 


1 


266 


EGRGCSEL* SCHCTPA* VTVRYPVSKKK 
KKKKKKGGGKGKKKGGEKNTLFGPKKGK 
LRGPQKRGKKIGPEKKVGNMLKKG I FFR 
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BKTF 


3176 


17077 


A 


3198 


217 


446 


YWSEHYAMTQVLEGFSYSLQDHFYFCFR 
SIRRIIFYSLIKPSIND*GERELEPITT 
SQALQIAGRAGRFSSRFKEG 


3177 


17078 


A 


3199 


345 


314 


QPGPEGKIRFFLKI PNLTPSGGKSLKFP 
LFKRVKPENCLSLRG*GCN*PI 


3178 


17079 


A 


3200 


2 


330 


SRH YT PAW* QS KTLS PKKKKKKKKKKKK 
PPFPKNTQKKPKKKKRGGLNWGVKTPPP 
LKTQKMGISPGKKLKKNLFPLKKKKPEG 
NPLF* FRGF * KKKKAPFKKKKNPPK 


3179 


17080 


A 


3201 


2 


385 


FPPFLGGPQIPKFLKFFLRAN*NLFGIiF 
FLGGVLKKIFLRKPFSLWIKPFPTPFKG 
KKI FFKTFFQKPLFFF* KKKFFFCFPPF 
FFLSRGFFVFFSPKKPFFFFFFFFFFL 


3180 


17081 


A 


3202 


355 


2 


FFFFFSETESCSVAQAGVQWHYLGSG*A 
LPPRFTP 


3181 


17082 


A 


3203 


156 


1 


LKSLLWEAKVGGLLEVRSSRPA*ATWRD 
FISTKN* KISQVWWYILVWTTWE 


3182 


17083 


A 


3204 


352 


175 


QPRGRPAPAKPP*CPLRLALPC*CPCPA 
CCPPWAEASPSGVQASPARAPACPARAL 
LNE 


3183 


17084 


A 


3205 


280 


373 


QRGTRIFSDLQT*KKKNKSPFKILLLID 
NALGHSRVLTERYKDIFRPANTTSFLRP 
MVQELIPTFQS 


3384 


17085 


A 


3206 


1 


367 


EIESIQIDGKTKNKFLGIHLTYLTKEVK 
DLCKKNYKTLLKE I IDDTEKYDMLMD *N 
NIVKMAI LPKAI YRFDTI STKLLMS FFM 
ELEKI F*NLY * KA* MAETTI SKKNKAGG 
ITLLDFKLYY 


3185 


17086 


A 


3207 


368 


12 


FAQKKKKKKKKKKKKKKKKKKRAPPQN* 
RAPQKPLKPPPRVFKEPPPLGSPPPPAF 
FWRGGGPPPGPFSKKKKKMRLTGGGRLF 
LLA 


3186 


17087 


A 


3208 


2 


356 


KYLFSS I PEGKEKMKGIANLFWE I I SEN 
CPSLARDLDIQKTRHANPYNLKKSSPQH 
IIVKLSKVKDKERILKTARKECFITYKR 
TPIRPGMVAYACNPSTLGG*GGWIMRSG 
DQDHPG 


3187 


17088 


A 


3209 


359 


170 


INIKQLPRCX-GTCLWSQLSRRLRRQDCY 
SLGGQGCSEL* SSQCSPAWATERDSVSK 
YKINNFL 


3188 


17089 


A 


3210 


295 


122 


VIKTV*NWNKDRHIDQCSRIESPKIHLH 
I DDQL I FDKDVKT PE * RNNGLFNKWCWE 
N 


3189 


17090 


A 


3211 


218 


466 


ATFDTTLPSCYQNAPNNRFDDLSD*EQE 
IDTMTVNIILPLRSLNIVITNPYNI *HH 
QHDLDYTYPDTTGQLVNCAYFINLLQP 


3190 


17091 


A 


3212 


340 


355 


LGLQG*REGNFGSLQTSPFGFKRFSGLS 
liLSSWDYRV 


3191 


17092 


A 


3213 


254 


40 


WPGTMAHACNLSTLGG*GGC I TRPGDRD 
HPG*HNNNN 


3192 


17093 


A 


3214 


198 


25 


YNEKEKPVKMLKI I ANS LG AVAHAYN P S 
TLGGQGGRIMKSGDRDHPG*HADAWVET 
VL 


3193 


17094 


A 


3215 


376 


3 


?PGVF*RGPPGFRTLPPKSSSSSPPPKI 
LGGPPFLAFFGGAPPQKKPPSSSSSASS 
PPSSSSRAPLKKGGPFNPAGGFPLFGGP 
PRGGPPFSSSSSSSSSSSSSSSSGRRSR 
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SRTSRTRGRTRG 


3194 


17095 


A 


3216 


393 


260 


VSQDGLDLLTS * STRLGLPKCWDYRREP 
PRPACTLCISSTYNDP 


3195 


17096 


A 


3217 


3 


387 


HASASSGSRAP P FFFFFNKKTGSKELET 
TRGTVLKTCLFFKKKKKVLGGPKPPLGN 
PKGVGKPLKGFPGPNPWPPP*KGPGCHF 
* KNGF* DPGPKKSHI PS WVGGKGSSLLP 
GPLGGPSEVYFSPLRA 


3196 


17097 


A 


3218 


77 


406 


RMARPELGLPGNLLSNDHIYNGIVTAHA 
FVIIFLIVIPIINGGFGN*LIPLIIGAP 
DMAFPRINNISF*LLPPSLLLLIiASAIV 
EAGAGTG* TVY PPLAGNYSHPGASVK 


3197 


17098 


A 


3219 


394 


2 


KRRYFPDGLNFFWGPGILKIFVKKKVSS 
LKKKKKKNFPPVFLQGWGGNKNFKGGGL 
KFSKPNLI *FFFPQKEAR*KVFFFFFRS 
PQKDPLRE FFFF FLRQVSLCHPDWSAVA 
RSQLTSAS ISRAQGAGRV 


3198 


17099 


A 


3220 


376 


157 


KFFFS P * KFFFS PKPLKFGGGVGPI I PP 
PKKRFFFXNPQGVFKKPPQKKKKIPFQP 
PVNFGPPRDFLKGPPPFFFFFFFF 


3199 


17100 


A 


3221 


394 


1 


VPPPQKFKTPGPPPPPREFFFF*KKKGF 
PPLGGFLNPAPKNPPPGPPKKVGFPGGP 
PPPPGGFFFSFPLSF*NPGERVFFGPKI 
PKRKFLLKWGKGGKFFPKKPIFPPPPKK 
KKKKKKRAAARDLELADAW 


3200 


17101 


A 


3222 


207 


402 


SILM* LCCLFPLPGVTPIDGAPHRSYRE 
CYPVLLDGVMVGWYDKDLAPGIADSLRH 
FKVMREKRI 


3201 


17102 


A 


3223 


309 


3 


YPFFHLIDIAIKPCVCFTKFYKATVTQT 
AWSWYQIRYIDQ*NGTEISEIPPHIYNH 
VIOTKHDKNKQVCGKDSLFNKWCWEN*LA 
I CRKLKLDPFPTP YAKI 


3202 


17103 


A 


3224 


3 


382 


LDRER P P FFFWGARHMDI PQLVNLS INK 
GHWAKFNFLGYKKKGGWEKKKKKKKKKK 
RGGPP * KKPPGGPK* PPGGKR* IFPYMG 
GKKKPPGGFLEKPPPLGGAHLGNPPPQK 
YTPPGKKKNLKRET 


3203 


17104 


A 


3225 


133 


2 


FFFETESHSVTRLECSGTISAYCNLCLP 
GS SDS PASAS *AAGI 


3204 


17105 


A 


3226 


349 


2 


AGVPPGNPPLWGGEGGGSPRGGGLKPGF 
PQRGNPFFFKKSQPTPPGGGPPLIPPPW 
GGGAGGSPLPQGQRFQ* TKIGPFPS PRG 
KKKKP P FPKXKKKKKKRKEKSEMPG FMV 
LNA 


3205 


17106 


A 


3227 


3 


239 


LN KVGRG CS E P R S RHCT PAWATE * D5 1 S 
E KKKKKP E S RG I LiKV KG LTQAL F YLALW 
LYYLHPTAKQQIWGFFIYFSKP 


3206 


17107 


A 


3228 


205 


1 


IGLKIQNSCPLKDSLKKIKRQATDWRKY 
LQNTSDKAS VFI I YKEHLQLSNYKAVDP 
IK*WAKEMNKLH 


3207 


17108 


A 


3229 


174 


1 


VQMLEDKSFEETF * FSSETIjVLKI PHVQ 
PGAVAHTCNPSTLGGRGGRITRSGDRDH 
PG 


3208 


17109 


A 


3230 


2 


340 


KNHS VYLLC VFS I PFPTFCFF* F * VFNT 
F*FKLN* IN*FTYIDRVLLCHPGWSAVA 
* S * LTAALNSWAQAVPCLSIiLLAHHHAW 
LI IFKKTAYIIHIWYVWYDTTYPFKVYN 
S 
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3209 


17110 


A 


3231 


1 


313 


KKSTPYQRGFDPISPARVPFS IKFFLEA 
ITFLLFDLEDALLLPLP*ALRFIFLFTV 
GGLTGWLANSSLDIVLHDTYYVAPHFH 
YVLS I GAVLAI I GGFIK* FP 


3210 


17111 


A 


3232 


207 


355 


ELS PFRLKKTL Y * LGMVAHACNPS TLGA 
RGGQIA*AQEFKTSLGNMAKP 


3211 


17112 


A 


3233 


3 


355 


TTQQ*LIKLTCKQTIAIHNTKGRT*ALI 
LISLI I FIATTNLLGLLPYSLTPTTQLC 
INLAMAI PL* ADA2VIGFRSKI KNALAH 
LLPQGTPTPLIPILVIMBTINLLIEPIA 
LARRL 


3212 


17113 


A 


3234 


2 


355 


KYLINNRLITNQQ*LIKLTSKQMITIHN 
TKGRT * SLILI SLI IFI ATTNHLGLLP Y 
SFTPTTQLSINLAMDIFL*SGAMVIGFR 
SKI KNALAH FLPQGTPTPL I P I LAI I ET 
ISLLI 


3213 


17114 


A 


3235 


43 


370 


QGCVKGWVLEEQVRRGWILDSSEGKTDL 
KQRGSPGSWPEHVGGWSGVMG*SEAWTG 
QARWLTP I ITALWEAEVGGS LRPGVQWH 
NLGSLQPLPPRFKRFSYFSLPSSWD 


3214 


171 15 


A 


3236 


15 


356 


LIQPSLKLMISIHNTKGRT*SVILLSLI 
IFI ATTN LLGLRPYLFTPTTQLSINLSM 
VIHL*AGAWIGFRSNIKNALAHLLPQG 
TPTPLMPILVI IETIRPLILPIALAVRL 
TA 


3215 


17116 


A 


3237 


1 


376 


GTRTNTLTI YQ * WGDGTRE ST YQGHKT P 
PVQKGLRYGI ILFITSEVFFFAGFF* AF 
YHSSLAPTPQL*GHWPPTGITPLNPLEV 
PLLNTCVLLASGVSIT*AKHSLIENNRD 
QIIQALLITIVLG 


3216 


17117 


A 


3238 


1 


358 


GTRNG * YTNA* ALS I TV IGAS LTS AQG S 
R I I LLTLTGQPRFPTLTN INENNPTLLN 
PIKRLAAGSRFAGFLITNNISPGCPFQT 
T I PLYLKI TBLGVTFLGLLTGLDLNYLT 
NKLIIKA 


3217 


17118 


A 


3239 


258 


390 


RQGLLMLAGLVLNS WPLQS SHLGFPKCVJ 
DYGREPPCLGN*LIL 


3218 


17119 


A 


3240 


2 


372 


ARARFHHVSRDGLDLLT* *NTHLRLPKC 
WDYRRE PLRPGKTFFLKKKKKNS I FLFF 
REGFKEKS ILGI KFFRPTGGVLI LTGNH 
GWGCKTGTELLVPSRFPGLAFKI CGLWA 
HDTPHRVRNWL 


3219 


17120 


A 


3241 


3 


283 


HERLWGGWKTGAAGLGRTSSRPTASLTQ 
T*TTMTH*SRTTGCSTGSGRTWTRSRKS 
CAWGMAGLYRVAVASRGPRGMM * PTPPH 
WLCWAWKVP 


3220 


17121 


A 


3242 


170 


3 


I KSQAGLVGFLGP FS FQDSLNLVFVFG F 
FCF *DRVS FCSPGWSAWQSEFTAALV 


3221 


17122 


A 


3243 


2 


385 


ARAJDVDTPS YLTSDTIMRDI PTGGQVFS 
* LSTLHGSNMK* TAAPLLTLGFI FLFTA 
GGLTGLELTNS WLDI VLHDT YYVGPDFH 
YVLSIGAVFAI IGGLLH* YPLFSGYTLY 
RTYAQIHFAIIFIGEN 


3222 


17123 


A 


3244 


1 


106 


GTRYVGQAHLKCLTSSD3PASTSQSAG I 
TGVSIiSA* PASTSQSAGITGVSHSA 


3223 


17124 


A 


3245 


2 


360 


ARANTLTIYQ*WSDVSLENTYQGHHTPP 
VQKGLRYRIILFITSEDYFFSGFL*AF* 
BSShS PTPQL * GH VI P PTG I TPLN PL KM P 
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LLNTS VLLASGVS I T* AHHS L I ENNRNQ 
IIQALLI 


3224 


17125 


A 


3246 


17 


160 


GG* GCS CSEL* S CHCSPAWVTEQDFDS K 
KKPAILASCLKHLNPLSSH 


3225 


17126 


A 


3247 


236 


2 


WAHIYMTPLSPPSFLKPVQ*KNFYIYST 
YSLDN*NPSSPKAKRAPKKSYTLPYLHL 
CVCVCECV*VCVCVCVCVCVWV 


3226 


17127 


A 


3248 


2 


2220 


FFGGGRPSPPQGYFLLNNHSSPSPPVKL 
NPGPA* FYPPTKGKNFPPPQR* PSPPKN 
IKTPPPSFFFSS 


3227 


17128 


A 


3249 


2 


372 


AYTISFIGKISFYGLH**LPKAHVEAPI 
VGS IVLAAELLKLGGFGI IHLTLIFNPL 
TKHIAYPFLGLSL*GISITSSMWLRQTD 
LKSLIAYSSISHIALWTAILIQTP*SF 
TGAFFLI IDHG 


3228 


17129 


A 


3250 


293 


56 


EGSPKVIFNKSPPHHLFLFLFFFFIFFF 
*FFFFFFFFFFFFFFFFFFFFFFFFLYL 
LAMFYLSFFFKQDNQRYRQYSIK 


3229 


17130 


A 


3251 


169 


370 


LKMTELRGAPASKPRGQEPPPHYPCHHH 
HHHHHHFL* VTKGQGPHHWPS PTRDPGW 
L*SPS*EDQRR 


3230 


17131 


A 


3252 


22 


156 


GER I GLGLGGQG CS E P* LCHCTLAWVGD 
TVRPCLKKKKKKKGPPF 


3231 


17132 


A 


3253 


3 


400 


QNQTPLLD* GGLI TAVLLLLYLAVLTGG 
ITI LLADRSLDSTLFYPAGGGDP I LYQH 
LF* FFGHPBV Y I L I LPGFGI VSH I ETNY 
WGGKEPFGFVGMV*AMIAMGFLGFIG*A 
HHI FTVG VDVKTRA * FTSAT 


3232 


17133 


A 


3254 


373 


31 


REVGPPTP*KI FFFPKGLNFWGGGGPKF 
PPPKKKGFFKKS PVGVFPPPGG * KSGPG 
PGFKKPPQKGKNISFPAGGKIGPPRGTL 
KRAPPFFFFFFFFFFFFLLWVWQVERP 
TL 


3233 


17134 


A 


3255 


1 


379 


LNLIQRQ*R*V*KFL*LPPQT*KKKKKK 
KKKKKGGGAFKKNLGGAKFNGGRKKKI F 
FLKGGVKKKKAGGGFKKRGKGKKCYLG I 
FEKKPFFGGGKNWENPPKKIKGLREKKK 
F*GEKGEKKPEKAG 


3234 


17135 


A 


3256 


42 


376 


FC Y I S L VH HC I YNDLS FERKKN I F VPG Q 
INSISSIA*EAHCKNKSLLEAVKXKKKK 
KKKKKKKKKKKKKKKKEKKRGEKKKKKK 
KEGRSSLKKEK 


3235 


17136 


A 


3257 


353 


67 


C YPLS PLKFFFS PRSLKFWKGVGP 1 1 S P 
PKKKVPSQNSQEAGFPSPNVLKRRPGPN 
FKTTP*KEKKNPFPPPVKFGPPKESLKR 
PPLFFFFCGLQ 


3236 


17137 


A 


3258 


2 


129 


FHRISQDGLDLLS S * S ARLGLPKCWDYR 
CEPPRPAKNKI LLS 


3237 


17138 


A 


3259 


205 


415 


QQKNRRLLHFKGARTHN5YNRGQPTTPS 
ITAHMYPRLRQSHTIYVLRVHHP*VPSA 
I EGP VS V* ALLHS S T I WAGNLLL I RFH 
PLAE KS PL I QTVTL CLG ANTTLVAGGWA 
LTQNDSKRIAAFSTSRGLGLI IVTIGVN 
QPHLALLHICTHAFVKAI LFMCSGS r IH 
NLSKEQDIRKIGGLLITI P 


3238 


17139 


A 


3260 


3 


393 


S FNLS TL I TTQEHL * LLLPS * PLAI I *F 
ISTLAETNRTPFDLAEGEAELVSGFNIE 
YAAGP FALFLI AE YTN 1 1 1 INTLTTTIL 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=A!anine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, LHLeucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possib!e 
nucleotide deletion, ^possible 
nucleotide insertion 














LGTTYDALSPELYTTYFVTKTLLLTSLL 
h* IRTAYPRFRYDQLIHL 


3239 


17140 


A 


3261 


2 


400 


ISDLSEK* FKRLVVKLIMEAPEKGKAQC 
KEIQKMTQE VKGE I FKE* IA*RKKKSKF 
QETLDTL I EMQSALES FSNR I KQ VEERN 
SELKDKI FELTQSNKDKGKRI RKYEQSL 
*VAWDYIQ* PNLGI IGIPEEE 


3240 


17141 


A 


3262 


450 


129 


NNLAFN* V* EFCVLAI KLLKVKRYPIEW 
EKM FANH I SDKGLVSGI YKEL FRLSNKQ 
AI DLTF * KWAAGHGGS PL 


3241 


17142 


A 


3263 


238 


3 


KEKKIGLKKCLQGSHFSIHTAWS I I I YM 
FSPLTI ISRKRMGQPGI VAHTCNPST*G 
G * GRW I TRSGVRDQTGQHGKTP 


3242 


17143 


A 


3264 


350 


3 


SPTLLGSKDPNLLGFRFPLWKKGKI IRA 
PLSLGLN* RFFSE WLI P* KPPKNWPGG 
TFLWCFLKRGFPFLSQKKKKKNPGAVA 
HACNPSTLGGRGGRSQGQELETSLANTV 
KTR 


3243 


17144 


A 


3265 


69 


200 


RLECSGVI SAHCNIiNLPGSGDS PAS AS * 
LAGITVMVKLPVIAK 


3244 


17145 


A 


3266 


223 


408 


GGFPFFPPGGGEGGNFGELEPLPPGLRK 
FFCLGPPRRGD*GPRSSSPGSFWVFKKN 
GVSP 


3245 


17146 


A 


3267 


185 


420 


DQGLWGFI I YFYRQSLA* VHVfNNPSSL* 
PRTPGLKHTPVPSLLISWDYGRTPPHLT 
NFCIFFDRGSFFF*DRVSATHA 


3246 


17147 


A 


3268 


3 


392 


TGCHS IPQAGVQWHNHGLLQPQPPGLR* 
SSCLSLPSSWHY 


3247 


17148 


A • 


3269 


1 


398 


KFSCISSKHQKLKLTPKPPKPPPKKSPL 
VLPIGKKIRETFWGAFKS PPPNQPKGAQ 
TIiPLKIWDKMGGGGGLALW*KAPPGNF 
KGPPGGKPMEQP * LGPGPPLKWKGGLPH 
QKGGFSKAPGEKKKGGEGRL 


3248 


17149 


A 


3270 


422 


183 


ETEWSLFKVIITEKSPNLEKDTNIQVQ 
ESYRTPSRFILMKTTSRHLIIILPKVNN 
TER IL * MQQDRGNN I QWS SGCS A 


3249 


17150 


A 


3271 


3 


35 


KNNSLI I PTI IATI TLLNLYFYLS PLLY 
**SSSPPS 


3250 


17151 


A 


3272 


155 


1 


KDFFFFFLRQSFTLVAQAAAEWCDLCSL 
QPKPPGFK* FSYNSLSSSWDYIG 


3251 


17152 


A 


3273 


413 


71 


PPSTGLFLTEEYEVSFPPFPL*KFFFPP 
SGLFFGGGVPPFFPPPKKGFFPKYPRLV 
FKGPLLGGGGLPPPPP* ILPPLGSFLPA 
PPLFFFFLFFL 


3252 


17153 


A 


3274 


336 


45 


DRVLSCSPAWRAMARSHDFG*LQLPPPR 
VKVFSCLSNPSSWDPRHVPPRKGNFVFL 
VKTGNPWNLGGQGCRERRLCPCIPAWGT 
DKDSVSKKKKKS 


3253 


17154 


A 


3275 


206 


366 


SVFFFVLFGFFYGALLCFPNPWLECSGT 
I LVHCNLH FPG S KDS S AS PS* VAGT 


3254 


17155 


A 


3276 


334 


75 


ENTRRVERRRRTHI I YYNFF*EFMNRFF 
FFI FFDGRFSCVFFFFFLFYFFFFFFFF 
FFFFFFFFFFFFFWFSARSFIYFLFPRH 
VT 


3255 


17156 


A 


3277 


144 


17 


KAPPLFFFFFFFFFFFFFFFFW*FRG*T 
HWNGDACMCNLTKS 


3256 


17157 


A 


3278 


2 


109 


YHIVKPSP*PLTGALSALLMTSGLAM*F 
IIFHSITL 
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SEQD) 
NO: of 
nucleotide 
sequence 


SEQ1D 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQIO 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=A!anine 
C=Cysteine, D=Aspartic Acid, * 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lystne, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /-possible 
nucleotide deletion, \=possible 
nucleotide insertion 


3257 


17158 


A 


3279 


404 


65 


NLVSSPKRSVFFISPSQKIFFFFFNWKE 
SKISLPKEGGGPLLLIiIRQRTP I FFFFF 
FYFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFL* TACLF*TFKFTKTK 
R 


3258 


17159 


A 


3280 


226 


383 


GI * KNSMDGCCLGPFWFETGSHSHPGW 
SIVWRNLGSLQLLPPKFKRFSCLS 


3259 


17160 


A 


3281 


91 


26 


LFFFFFFFFFFFFFFFWSERSS 


3260 


17161 


A 


3282 


3 


321 


HTVIYYFGKNEPFGYIGVA*AMISVGSL 
GFMA* AHHI FTGGIDVYSPAYLTCATI I 
VAIPTGVKVFI * LATLLGSGMKRSAAVL 
* TLGFIFLFTARGLTGIVLSNS 


3261 


17162 


A 


3283 


393 


112 


PPLRVFFPPFPLKNFFFSSRRFFFWGGV 
APFFPPPKKGFFSKIPPGFFFPPPLKKK 
IFFSSPP*FWAPPGFFLKGPPPFFFFFF 
FFFFFFFFF 


3262 


17163 


A 


3284 


360 


47 


PPLHRWVPPPPP * KFFFS PQTKFFGKVF 
SQNPPPPKKSFFPKNPPGVFYSPPKKKK 
KFFLPPPIFFPPPRFFFTTPPPFFFFFF 
FFFFFFFFFFLPLFFWSVSF 


3263 


17164 


A 


3285 


386 


157 


FFSPNPLI FWGGGGPKFPPPKKRFFSKN 
P PG VF FS P PLKKKKF F FP P P * NLAPPRD 
FLKGPPPFFFFFFFFFFFFF 


3264 


17165 


A 


3286 


3 


385 


DAWVPCKLYLGVFFCRFVKISAILILKI 
LI FN I ENKS I * YLLKKKKKKKKKKKKKK 
KKKKKKKKKKKKKKKKKKKKPT P KKKKK 
KDSR 


3265 


17166 


A 


3288 


81 


270 


1 1 KLEKKKKKKXKKKKKKKKKKS * KGNB 
KTRGPKKKVHRDVTKKI LCYKI CPFNKG 
ISLIGT 


3266 


17167 


A 


3289 


3 


145 


1»GR LRQENHLNLGG * GCRE PKSHHRI LA 
WATEQDS I S KKKRKI GGPV 


3267 


17168 


A 


3290 


404 


47 


GPGGKI FFKKNPEEKI FSTQKKKGFFPP 
FPPKNFFFPPGGFFFWGGGGPI FPPPKK 
GFFPKIPRGVFFTPPKKKKIYFFPPREI 
WAPPGIF*KGPPLFFFFFFFFFFFFFLV 
LNDILLA 


3268 


17169 


A 


3291 


106 


362 


KKKKKKKKKKKKKKKKKKKKKKKRMKNK 
KKKKKGRGGGKKKXPRGAQI JRGXKKII 
FFFKKVFK* KGGGDL* KKKS FCGENNLE 
NTHKE 


3269 


17170 


A 


3292 


3 


226 


NPLVNLNLGLLFILATSSLAVYSIL*SG 
GASNSN YAL IGALRAVAQKKKKKKKKKK 
KKKKKKKKKKKKKGGGVL 


3270 


17171 


A 


3293 


337 


2 


FFNRELLKSKNKETEKNHDYHKERIWHF 
DRPYKFYMLFFCFGVLIFFLYLPFFIYF 
FFFFFFFFFFFFFFFFFFFFLNSLNI*F 
FS INTHKCGNRGQGGGAIGPRLSNGRV 


327! 


17172 


A 


3294 


327 


21 


KFFFSPPNKYFFFVFPHKFPTPTKMFL* 
KIPPPPLISPPPKENHQCPPPPHIFAPP 
SHPFFGPPHFFFFFFFLFFFLWRTGSRY 
IAQAGLELLGSSYPPAS 


3272 


17173 


A 


3295 


367 


101 


FFFS FF* KKKNFFPPPPLWPPPPNFL*N 
PPPPFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFLVNSLCRRYRG*SLLY 
YAWL 


3273 


17174 


A 


3296 


178 


2 


KGPPFFFFFFFFFFPDSITLFIITKNEK 
KTRCTSIGE* ITWYTKTGE YYSVMKRNE 
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SEQ 10 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN' 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K= Lysine, L=Leucine, M=Methionine, 
N=Asparagine, PHProline, 
Q=Glutamine,R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /-possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LS 


3274 


17175 


A 


3297 


147 




RGPPPFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFTW* FALI FLV*NF*AI 
ITF 


3275 


17176 


A 


3298 


3 


347 


PFGEGETNGRFDLDTKKALVSKKKKKKK 
KKKKKKKKKKKKKKKKKKKRGGALKKKN 
RGGQKKTGEKKKKFFFKKGEKKKTPGNF 
* KKKKFLGGDNLAQT P P KKKNLWGKKKN 
FWG 


3276 


17177 


A 


3299 


405 


2 


WRPKPENSPSPGGPNPPGQKN*TPFPPK 
TPKLGRGCGGAPQFPLPVRARGENPPNP 
RFLGFN*PKFPFPPFAL*GGGNNETFFQ 
KKKKGPRFWAGVLPTGGAEGESIPHLSP 
ASGSSRHPWCFMLCRPSRLCLL 


3277 


17178 


A 


3300 


125 


376 


DQPVQNSETSSLQKKKKKKKLFPGGGGP 
PLFPH FLGGWGGGGTRP PG * RKRGKSGV 
PQPPPPGGRVPPPPHPPKKWGNKGGPPP 
PGKSFFFFFFFWRDEVSLFCTGWSQTQT 
PGLKQSIHLSLPRWLDYGREPPHLAIHF 
FPTRPRTRG 


3278 


17179 


A 


3301 


348 




PQRLKILGGGGPPNSPPPKKGFFPKTPR 
GF*NPPPKGKKKNFPFPRKIGPPQGFF* 
RAPPLFFFFFFGPWFFFFFFFFLFLENG 
VSLYCPGRSRTPGLKLSSRLLASLRAGI 
TGVS 


3279 


17180 


A 


3302 


2 


354 


GRVGGRVGS PTTLTT*NPHSHEKTPSCS 
YPKKKKKKKKKKKKKKKKKKKKKKKKKK 
KKKKKKKRKRKKKKKKT * GGGAYI KKKG 
GGS PRTEEENQ YCFLLGGKKNI LRGS CL 
KKTPH 


3280 


17181 


A 


3303 


362 


26 


SSQAAHPTKKKKKKKKKKKKKKKKGGAL 
KKNFGGGKNKGGEKKKIFFFKGGGKKNP 
GGNFKKKNFFGGGKIGENPPKKK*RFRK 
KKKFLRGKGGKKS 


3281 


17182 


A 


3304 


371 


3 


NQKKPPPPVFFFFFFKKKKNPPPATPPP 
GRP* FFPPQKKFFPEGGPPPFFKKGAFF 
PPPKKNPFEPFFPGGVFFF*KKKNIKVF 
PPPKKKKKKKKKKKKKKKKKPKKQKKGR 
PARDLEPHAS 


3282 


17183 


A 


3305 


2 


196 


DR* LFSTNHKDIGTLYLLFGA*AGVLGT 
ALS LL I RAELGQPGNLLGNDH I YNV I VT 
AHAFVIIF 


3283 


17184 


A 


3306 


4 


339 


LFSTNHKDIGTLYLLFGA*AGVLSTALS 
LL I RAELGQPGNLVGNDKI YNVI ATAHA 
FVIIFFIVRPIIIGGFGN*LNPLIIGAP 
DRAFPR INNIS F* LLPRS ILLVRASAIA 


3284 


17185 


A 


3307 


2 


367 


KPPAITQYQTPLFV*SGLITGGLLLLFL 
PDLAAGITILLTDRNLNTTLFDPAGGGD 
PILYQHLF*CFGHPEGYILILPGFGIIS 
HIGTYYSGKKEPLGYIGMV*AMRSIGFL 
GFIE*AHHIL 


3285 


17186 


A 


3308 


243 


370 


KGHRPGMVTPVCNPSTLGGQGGWIRRSG 
I *DQPGQQGVTPSL 


3286 


17187 


A 


3309 


2 


363 


LVPL 1 1 GAPDMVFPR INN I GF * LLP P CL 
LLLIAYAI MEAGAGTG*TAYP PLAGNYS 
H PGAS VDLTI FSLHLAGVS S I LGAINFM 
TTI INI *PPAITQYQTPLFV* SVLITAG 
LLLLCLPG 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenyialanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, LHLeucine, M=Metbionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unkno\vn, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


3287 


17188 


A 


3310 


1 


364 


LYITGPTLALTIALLL*TPLPIPNPLVN 
LNLGLL FI LATS S LAG Y S I L * S GGAS NS 
NYALIGALRAVPQTISYEVTLAIILLST 
LLIRGS FNLSTL I TTQEHL* LLLPS * PL 
AII*FISTL 


3288 


17189 


A 


3311 


215 


2 


GMAKLKARFLPKKGGKKSLSPLLWGSGF 
PTSGTGETFFFFF* DRVLLCQPGWSAVA 
QS * LTTASTFWAQAI 


3289 


17190 


A 


3312 


153 


3 


MNKCI CMCVCMCVCFRSHKHTVCAWVCV 
Y*V*TYTVCVCVCVCVCVCVC 


3290 


17191 


A 


3313 


149 


1 


KTPPGFFPGFKGPF*FKGPPLFFFFFFF 
ETEDGVLLCRPGWSAVSRDRA 


3291 


17192 


A 


3314 


282 


69 


KIGKTAPFFSWPRRVPFLKKKKKKK*GL 
TLSSRLE YGGMI SLELLGSSDTLASASR 
VAR I IGQCPEAWLT 


3292 


17193 


A 


3315 


479 


60 


MMQHS I KHALVE I TGWVRWLR P VI PAL W 
EAE VG E FNS I S KTNNRKRS N I VKMS I LF 
NLIYRINAPFFKFLIALLNKNSKSKSIW 
NLKS * * STQ* SSRNlSrVGGI TI SDF* THQ 
QATELRQEFLHTGLVHRSTVSR FRFKA 


3293 


17194 


A 


3316 


129 . 


3 


S FKQKVW PDAVAHACS P S TLGG * GGR I T 
GSGDQDHPGQHGE 


3294 


17195 


A 


3317 


3 


156 


EFHRVSQDGLDRLTS * STLLGLP KC WD Y 
RCEPPRPASFPSFLTVRISPFL 


3295 


17196 


A 


3318 


36 


340 


TGLVIAEHLLFFFFKKGFWFCPPGGRAG 
PQGNLMEFPVSGGKGIFCPGPPKSGE*R 
APP P S RG KFVi FFKKKGGLP FW PGG V * TS 
DPRGTPPPLGPKGGEYR 


3296 


17197 


A 


3319 


326 


2 


KAFFLNFKPHPGFA*GLKFKPNSLLPI * 
SFIRKT*AFLICL* IQLNRADGKIPLKP 
GVIPFKKCFPNFTPPLFFKKNCSSTPNL 
FFFFPFFF FGDG I S LCRPGW S AVA 


3297 


17198 


A 


3320 


175 


312 


QAGVQWCDLSSPXPPPSGLT*FY*CSLP 
RTWEYRCPPSRPCNFSIF 


3298 


17199 


A 


3321 


363 


51 


RWG P RL P VAWL P S VS RRAE ECS PGRGAH 
GQGGGGGGQAR*REGPDEWTGGAPVGMG 
SPSLNRGAGRGPPQPGPSSKSRRGRVRC 
TPHSCLIGCGLSFFICKMG 


3299 


17200 


A 


3322 


344 


187 


LRQETCFNPGGKNFG * QKLPPCPPAWAT 
RGWVS KKKKKRKKRKEKIRI CVLT 


3300 


17201 


A 


3323 


3 


313 


TRRERERERERERERERERERERERERE 
RERE FFFFFF* KKNLS PPPGGKKRGGGG 
FHSPPFWGEKDFLPPRPPKGGERGGRPP 
PPVFFFFFKKEGAPPGGGV 


3301 


17202 


A 


3324 


365 


1 


SLGWWRVTPPPFPLGPRGGSPPRPDKRG 
SPGRVKFGGVKPFPKVKP*PPPKRLDPS 
GA* FFFS PKKI CQGPPPRKKKKKEAELS 
ILCTKFPLQEEEVMQVPPPPCTCSSEPG 
INCICKRHC 


3302 


17203 


A 


3325 


1 


269 


ARALSLSLSLSLSLSL5LEFFFFFLAPT 
QILGGGLFYPPKGGAHLSNRLGGGRVFF 
GPPGKKTAPLGGGRVIKKKTPPGTPL*L 
SPPPGGPFFFQGGQKKLSPPPAGLTGGP 
PPWGGKKAPPPKFGSGPKKKKKTLERER 
ERERERERERELV 


3303 


17204 


A 


3326 

1 


1 


316 


ARGERERERERERERERERERERERFFV 
SPPGPDTLQDRGGVSLFASDPTHEIYLP 
CPLRGPLSERDPEELKGVCFFSEPSHRW 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

etb 

od 


SEQED 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Giutamic Acid, F=PhenyIalanine, 
G=G)ycine, H=Histidine, I=Iso leucine, 
KHLysine, LHLeucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














FPEEGVGDICQC*HRGECGSP 


3304 


17205 


A 


3327 


2 


369 


ARGDVTRESTYQGHHTPPVQKGLRYGI I 
LFITSEVFFFAGIF*AFYKSSLSPTPQL 
RGHWPPTGITPLNTLEVFLLNTSVLLAS 
GV* IT*AHHSLIENNRNQI IRALLITI I 
LGLYFTLLQA 


3305 


17206 


A 


3328 


1 


259 


GTRKKHSTILIIREIDIKFMPRFHISPI 
RLVKI * AIANIRC*QQCX3KVGNLLHC* F 
SKLVQSTGRAIWHALVLSLARSRALSLS 
LH 


3306 


17207 


A 


3329 


2 


352 


ARGTLLLLCLLFFALTIYPRVfCDVTRKS 
TYQGHHTP FVQKEPIiVlJGGKLF I TS * V I 
FFAGFF*AFYHSSLTPTPLLGGHWPPTS 
ISLFNPLKKPLLNTSGLLASGVS IT*AH 
HSLLQ 


3307 


17208 


A 


3330 


3 


361 


HEEPLGYIRMA*7^IISIGFLGLIVRAHH 
I FTGGIDVDSRTYLTSATI I I AMPTGGK 
VFR*LATLHGSNMK* SGAGL* ALGFMFL 
FTVGGLTG I VLTNS * LDIGLHDT Y YWA 
HFHYVLS 


3308 


17209 


A 


3331 


1 


340 


GTSGDTRA Y FT SAT 1 1 1 AI PTGVQVFS * 
I±ATLHGSNMK*SAAEL*ALRFIFLFTVS 
GLTGI VLANSSLDIELHDTYYWAHFHY . 
VLS IGAEFAI IRGS I H * FPLL * G YTLDQ 
T 


3309 


17210 


A 


3332 


2 


352 


ARGDVDTRAYFTSGTI I IAIPTGVKGFS 
* LATLHG SNMK* SAAVL* ALGF I FLFTV 
RGLTGI AIiANSS LDI VIrHDTYYGGAHFH 
YVLSIGAVFAIIGGFIH+FPLFSGYTLD 
QTYAK 


3310 


17211 


A 


3333 


85 


370 


QVSHRVRPCFERERERDRYRHTQRECER 
AKEIYSGFFLSASEMESCSLAQAGGCKE 
LRMCHCTPAWVTQGDCVS *NKK* HWEKK 
GLWFLGVNIST 


3311 


17212 


A 


3334 


362 


3 


QEVKVSMNRDHTHSLQRGEQKRNSVSKK 
KKKSSQEKLKRVHTQTHT*IFIAVLFIV 
AKNGKQFKFPSTCEWINNM*YIHTMEYY 
SATKNQLIHVTT*MNLNNIYTKFLKARQ 
KMITYSC 


3312 


17213 


A 


3335 


3 


351 


HEGTTLHHLFUjETGSNNPLGITFHSDK 
KTFHPYYTI KDALGLLLFLLSLMTLTLL 
* PDLLGDPDNYTLAHPLNTPAH I KPE* Y 
FLFAYTILRSGPNKLGGGPALLLSILIL 
AIIP 


3313 


17214 


A 


3336 


17 


350 


FI FTLTETNRTPFDLAEGESELFSGLKI 
KYAAGPLALLFITEYTNI I I IDTLSTTI 
VLGTTYDALS PEL YAT YFVTKTLLVTSL 
FL* IRTAYPRLRYEQLIHLL*KNFLPL 


3314 


17215 


A 


3337 


2 


349 


ARDSLISPILRLHPRPNLRQNAFHYHIH 
RGKSNFLPTTLSRP IRNAPTLLGLPRCI 
HHMKHPIICRLIH* FPLFSGYTLDQTYA 
KMHFTI IFIGVNLTFFPQHFLGLSGMPR 
RYSDYPDAYTT*NILSYVGSFISLTAAI • 
L 1 1 FMM * EAFAS KRKVL I VEE PS I NL E * 
LYGCPPPYHTFE 


3315 


17216 


A 


3338 


2 


386 


ARAILNAMAFLILTERKILGYIQLHRGP 
NWGPYGLLQPFADAI KLFTKEPLKPAT 
STITI.YISGPTLALNIGLLL*TPLPIPN 
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PLRNLNLGLLF I LATSSLADYS I L* SG * 
ASNSNYALIGALRAW 


3316 


17217 


A 


3339 


175 


16 


ILGDLFPPAWLHPFLFLLLLFSRPSLAV 
TEAAVQWRNLGLLQPLPPGY* SSWC 


3317 


17218 


A 


3340 


1 


369 


GTSNLSLLFIVATCGLAAN*ML* *ggki 
N* IYTIiRVALAAI IHTILYE I TLS I ILL 
* TLLI SGSFNLSTLITTQEHL* LLLPS * 
PLAI I *FISTLAETNRTPFDLAEGESEL 
F*GLNIEYSAR 


3318 


17219 


A 


3341 


1 


363 


GTRGGTILPAMVLMLIDVPSLRILYITN 
KVNE P SLTI KSNGHQWY* T YQYTDYGGL 
ILNSYILPPLFLQPGDLRILDVDNRVVL 
P IETP IR I INT * QDGLHS *AGPTLGLKT 
DAIAGRVNQ 


3319 


17220 


A 


3342 


181 


315 


PKSPPTPPPPTXXXXXP3GNPPGPTL*G 
PFYWGGERGPP F * TGGP 


3320 


17221 


A 


3343 


3 


183 


HEVSQSCRELLTSGDPPASASQSAGISG 
MHHRAWPVS*TFHIFNIYCGPGTVLKFY 
LNL 


3321 


17222 


A 


3344 


83 


354 


VLGARCPCAGVSRVSMEPWFLWQQACLL 
PWWRCCLSTLYANRADRDVP* TSRTGPV 
MVAHACS PSTLGGQGG WVTRSGV* DQPG 
QDGETP 


3322 


17223 


A 


3345 


124 


379 


GQGI CFMWSLGLSHRLAHCRYTVGVNN 
LLCS S SL P FLL P LL P LPP PEG PGWDD IN 
IIIFII FRDKVLLCCLGWSAVT* SQLTT 
A 


3323 


17224 


A 


3346 


3 


368 


YEPI P* PLTGALSALLMTCGLAM* IHLH 
S ITLLILGLLSNTLTI YQ*WRDVTREST 
YHGHHTPPVQKGLRNGIILLITSEAFFF 
AGVF*AIYHSSLAPTPQLGGHWP?TGMT 
PLNPVQGPLL 


3324 


17225 


A 


3347 


1 


352 


GTSAGDVNYG* I IRYLHADGAKI FLRCR 
FLHSGRGLYYGSYVYSKT*NIAINLLIA 
TIATAFIGYVLA*GQILF*GATVSTNLL 
SDIPYIGTDLSQRI*GGYSVDRPTLTRF 
FTFHF 


3325 


17226 


A 


3348 


2 


360 


ARAE I S PLHSNLGNESETPSQKKKKKKP 
PGGGGKNWNPYPSGTPPPNPPKGGE*RE 
EPPPLTKRVF*KKKVFFEGPRVKKNLGG 
KGKKKRGFPAR* KNPVFKAKGKGRNFGP 
QAFFFKK 


3326 


17227 


A 


3349 


226 


366 


PQSFCSVASWPAGDLMEI*AWQKFEYKP 

GKVPMDPEGCLLPLAKKKEEEEEEEEEE 

i 

EEEEEEEEEEEEEEEEEEEEEGEEEELV 


3327 


17228 


A 


3350 


2 


374 


ARGGGYSPHRATLTRTVTFHS I LPF I I A 
GLAAIHLLFLHETGAYTPL* ITSHSEKT 
TFHPYYTMKDALGVLLFPLSLTTLTLCS 
PNLLGEPHDYTLTNPLNTRPKI KPE * DC 
LFAYTNMRSGPN 


3328 


17229 


A 


3351 


1 


266 


LGLVEHFLTLTTKLTSSNISDAQEIQSV 
* NI LTAI I LNLMVLP S LRMLY I TNEVFD 
PSLTIKSIGHQWY*AYEYTDFGGLIFNC 
YILP 


3329 


17230 j 


A 


3352 


207 


362 


ILFLLKQKIYVLFGFSN*GSSIPELAHS 
DAYQTKE I CSSGKVYKYLQCWEKW 


3330 


17231 


A 


3353 


178 


365 


NHGIDKGLRYRIYTVQQEDNSVKR*LKD 
FNQHLS KDN I RSADKHMKR CPTS L V IN I 
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MPIKPP 


3331 


17232 


A 


3354 


359 


3 


LQVRSLMPLGQFHPP *NFFCSFFPKPKI 
PFFFLFFLKGVFVFFPGWRARGQFWVFG 
PPPPKFKKFFFFTLLRNWGFRDFPPSRG 
KFFFFFFFWYF**RLRFTMLTRLVFNS* 
PQVISC 


3332 


17233 


A 


3355 


2 


354 


ARE P S P * PLTGALSALLMTCGLSM * IHL 
HSITLLILGLLTNTLTIYQ*WRDVTPES 
TFQGHHTPPVQKGLGYGIILFITSEVFF 
FAGFL*AFYHSSIiAPTPHLGGDWPPTGV 
TPLNAL 


3333 


17234 


A 


3356 


1 


364 


GTREM I CCSALS PRIHLS FHRR WPDWHC 
ISKLITRHRTTRHVLRWSSLPLCPINRS 
WICHHRRLHSLISPILRLHPKPNLRQNP 
FHYH I HGRK * S AAVL * ALGF I FLFTVGG 
LTG I V LANS S LD I GLHDT YYVGAHFH YV 
LSIGAGFAIIGGFIH*FPLFSGYTLNQT 
YAKIHFTI I FMGVNLTFFPQHFLGLSGM 
PRRYS DYPDAYTT 


3334 


17235 


A 


3357 


2 


376 


ARGGQPEDYILILiPSFGI I FHI VTYYSG 
KKEPFGYICMV*AMISIGFLGFIA*AHP 
I FS VG I DVNTRAYFTSAT 1 1 IAIPTGVK 
VFS WLATLHGSNMK* SAAAL* ALRFI FL 
FTVSGLTG I VLCN 


3335 


17236 


A 


3358 


357 


139 


AQPWFFFLETVFCHAAQAGLKS IGSSDL 
PTSASQSVGTTGMSHHAWPERSC*QNTF 
TYYVQSIYSDDIRKSMF 


3336 


17237 


A 


3359 


277 


1 


SCQKD I S IGFLFVPGFRVKGVAQEHL VG 
VRM Y VL VPTVGGDG KERER * RG IRHTP P 
PSERVRQRERERGRDRDREKERASQRKP 
EMDRETSC 


3337 


17233 


A 


3360 


261 


2 


TICSKCAWLCFSKTLLTNTCASRSDIxAH 
GP*CPNLCSGIFSSAWTDISRLLLFFCL 
FVLRQSLTVAQAGVQWCNLCS PQPLPPG 
SC 


3338 


17239 


A 


3361 


199 


354 


HLP YWPMFMATFNFFFFFETE FHFFPQA 
GGQ WGD FG* LKP P PARLKQFS CLN 


3339 


17240 


A 


3362 


3 


364 


HEDRDTPAYFTCANI 1 1 AI PTGVKVFN * 
LATLHGSNMK* SAAW* ALRY I FLFTVS 
GLTGIVLSYSSLDIVLHDTDYWAHFHY 
VLSIRAVFAIIGGFIH*FPLFSGCTLNQ 
TYAEIHFT 


3340 


17241 


A 


3363 


3 


360 


HEETYIPKDQKYSFLHDSQTSFCFSNS I 
PTPSNMEETQQKSVSGCRLP*AGMGETC 
GQGSRAAQPLPSLPLQNLKLLRISLLLI 
EAWLEPERVLKSMFANNLGYDTSDSDDY 
HLLKDLE 


3341 


17242 


A 


3364 


352 


27 


VYLSSQRNSAASVNRMTWTDRSLSPYT 
LGWHRPDRLFHFVFSLFVYTFVERGSQS 
IAQAGVQWCDKSSL*P*TPGLK**TCSC 
LP*VAGTTGTHHYTGPLILLNLFN 


3342 


17243 


A 


3365 


347 


99 


GGPPLPRGFSPPPGARPLAPPWGPPQES 
PPPPPPKTAPQKKKKAPS I PAGQWLVEQ 
NPRPQPFKKPPGVGF*NPPPPLNMGF 


3343 


17244 


A 


3366 


3 


319 


HEECLAHTLCFFNL.^4S*RTTVIIlHRI / iFS 
IVTAI * YLLSSQYVIGYYI YVCIHTHTH 
MYIHTHRHRTFI CVCVCVCI YI YTS IC 


3344 


17245 


A 


3367 


3 


474 


YELLLLNGP ILNARAFLILTERXI LGY I 
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QLHKGPNAVSPYGLLQPFADAIKLFTKE 
PLKPATSTITLYITAPTLALTIDLLL*T 
RLPIPNPLDNLNLGLLFILATSSLPVYS 
IL*SG*A*NSNYALIGALRAVAQTI * *E 
VTPTI ILLSHYYKWLRN 


3345 


17246 


A 


3368 


2 


348 


ARGLQDATFP I I EELIT FHDHALI I I VL 
I CFLGLYALFLTLTTKLTNTN I SHAQE I 
ETV*TILPAI I L VL I ALP S LRMLY I TDE 
VNDPSLTIKSIGHQWY*TYQYTDYGGLI 
FNS 


3346 


17247 


A 


3369 


64 


363 


KKWGFVPQQKKGGGRISVNGTPIFKGKG 
NFPP* PPGEGGTKGPGPKPK* IWIFKKR 
GGLIFWPKQFPTPGPKGTHPPGPPKGGK 
*RGGP PS PGG FNFKRG 


3347 


17248 


A 


3370 


34 


355 


AKGKEY I ET VAEKXKKKKKAP PPI * KAP 
*NPGKKPPLALKFPPPSQKAPPPALFCP 
*KGPPPRVFPKKKKKFPPFGRKGFFFTG 
GKPP 


3348 


17249 


A 


3371 


1 


374 


FLHSLHLSLRLLYLP*KFSMFIILLKAY 
YTLCVCVCVCV CVCVCV * PLNKIH INHF 
TCSTVRGRSNPIFFPPLLN*RPFYTT 


3349 


17250 


A 


3372 


429 


3 


FFFFLKKQKISRVFPRGPEFFGQRKPFW 
AKGPPRAPQKGGEKGKNPSPPPPLIFFK 
KGEKPQGGPLFKKGLGFFSPPKVFFKKG 
VFKSGPGGP*NFGFRGFSPPSPPKKGGK 
KRKPQSPGFFFFFFFFEILGCSGRSRSR 
T 


3350 


17251 


A 


3373 


2 


404 


LGTDLS LI IRADLGQTGDLLGNDH I YNA 
IVTAHAFEIIFFIEIPIIIRGFGN*LIP 
LIIGAPDMAFPRINNISF*LLPPSILLL 
LASAIVEAGAGTSGTDYPPLSGNYSHPG 
ACVDLTI LSLHLAGVSS I LGAI 


3351 


17252 


A 


3374 


2 


417 


AFV * I AH I TRD VYYG * 1 1 R YLHAKGAR I 
FFMCLFLHIGRGLYYGTFLYSET*NIGI 
ILLLAT I ATAF I G YVLL *GQ I S F* GATE 
ITNLLSAI P Y IGTDLIQ* I * GGYSVDS P 
TLTRFFTFHFILPFIIAGPNPLHLTT 


3352 


17253 


A 


3375 


85 


1 


FWPGAVAYACYPSTLGG * GGW IMRSGVR 


3353 


17254 


A 


3376 


396 


1 


KEGYIKGPRGENFF*KNPERKIFSRQRK 
GGVFS PLPPKKFFFFPKGLI FLGGGCP I 
FPPPKKRVFSKNPPGVFINPP * KKKKI F 
SPPPENWGPPRVFLKGPPPFFFFFFFFF 
FFFFFQTLFLGGCGYNTKLR 


3354 


17255 


A 


3377 


3 


118 


LL P P S LLLLLAYA I LVAGAGTG * TDY P P 
LTGNY*KTKA 


3355 


17256 


A 


3378 


301 


82 


ERKGPPQRGGKKGPKAQKPEKKKPKKRP 
PPPQN*FFSNPPKKKKGGKDPPSKKRGG 
GKGVPPPRKKKKKKKKE 


3356 


17257 


A 


3379 


2 


195 


DR*LFSTNHKDIGTLYLLFGA*AGVLST 
GLSLLIRAELGQPGNLLCNDHIYNVIVT 
AHAFVIIF 


3357 


J 7258 


A 


3380 


21 


411 


VFCTNH * DIGTLYLLFGA* AGVLRSALS 
LLI RAELGQPGLLLGNDH I YNV I VTGHA 
FVMNLFI VIP I 1 VGGSGN* LGPLE IGAP 
DMAYPR INNIGF* LLPPSLLLLLAYAI V 
EAGAGTG* TVYPPLAGNY 


3358 


17259 


A 


3381 


2 


421 


GRVGGRVGSTNHRD I GTLYLLFGA* TGV 
LGTALSLLIRAELGOPGNLLGNDHMY^r 1 / 



755 



WO 01/64835 



PCT/US01/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylatanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N^Asparagine, P-Proline, 
Q=Glutamine, R^Arginine, Sf=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop cod on, /= possible 
nucleotide deletion, ^possible 
nucleotide insertion 














I VTAHAliAKI FFI LVPI I IGG FGNGLVP 
L I IGAPDMAFPR I SN I SL * LLP PS VLLL 
LESVIAEAEAGTGRTVYPFLAGTYSNPG 


3359 


17260 


A 


3382 


3 


185 


VIYSTIFAGTLITALSSH*FFT*VGLEI 
NMLAF I P VLTKK I NPRS TEAA I K YFLTQ 
ATASIILLIAILFNNILSGQ*TITNTTN 
QYSSLI I IMAIAI KLGIAPFHF* VPEVT 
QGTPLTS GLLLLT * QKLPSS I SSRKQPH 
P 


3360. 


17261 


A 


3383 


5 


398 


FFGHPEVYILILPGFGIISKIGAYYSGK 
KEPFGYLGMA*AMISIGFLGFIA*AHHI 
FTVG IDVHTRAYFTSATI I IAIPTGVKV 
FN*LATLHGSNMK*SAAAL*ALGFIFLF 
TVGGLTG IVLANS SLDIVL 


3361 


17262 


A 


3384 


1 


250 


LGNTGETLSLQKNV* KLARCGGTCLRSQ 
LLRRLK* EHCLTPGGRGCSE PRS CHSS P 
AWATGQDLSQQQQQKYPLRIFSAIIKS 


3362 


17263 


A 


3385 


456 


0 


I FFFSQVETGIHHIGQAGLQLLTSGDAP 
CVGLQKSWDYRPEPLHLAWLIHSLLSNF 
YYC* FGFCKLSVF I LINVFCNKTCNSSM 
FLFSYS *NPCAQ* KNEGPLLNKNPCRIT 
DAWADAWVAPVF 


3363 


17264 


A 


3386 


1 


225 


PSRNFLVLVDKLILKFI *KYRGPRTAKT 
TLKKKKKWRLTLLIFKSYYKTrVITIG 
WYGFQDRQVD*WNRIESLE 


3364 


17265 


A 


3387 


3 


400 


LLI LGLLTNTLF I Y P A WGDATRES TYQG 
HHTPPVQKGLRYGI ILFITSQVFFFAGF 
F*AFYHSSLSPTPQLGGKWPPTGITPLN 
PLEAPLLNTSVLLASGVSIT*AHPGLIE 
NNPNPIFRALLITNILGLYF 


3365 


17266 


A 


3388 


24 


407 


IASGRPFFFFFFFPKGKFFI SRGGGPFS 
PPKI*KNWGKGKSFFPPPQREPKKRGVP 
KNPSLSFYPFGGKPPLGPPPKLVPPSS* 
PGKKAFFFF*TPPPGFLTPLGGGLSPLK 
GGYLLAPHFVPTEGPL 


3366 


17267 


A 


3389 


148 


413 


I^LNNNNNGKNHLLNTYSCT* LEEKKKK 
KKKKKKKKKKKKKKKKKKKAPGGGFF 


3367 


17268 


A 


3390 


157 


14 


RVSACCQVWW*MPWPATREAEARESLE 
PGRHRLRS CHCDRDPVS KS 


3368 


17269 


A 


3391 


2 


398 


LFSTNHKD I GTL YLLFGARAGVRGTALS 
LL I RAE LGQ PGNLLGNDH I YNV I VTAHA 
FVINFFIVIPIIIGGFGN*LVPLI IGAP 
DMAFPRINNISF*LLPPSLLLLVAYAIV 
EAGAGTG * TVYPPLAGN YSH 


3369 


17270 


A 


3392 


2 


395 


DQKDIGTLYLLFGA*AGVLGTAVSLLIG 
AELGHPGYLIGNDHVYNVIVTAHAFVI I 
FFI VIPI I IGGFGN* LVPLI IGAPDMAF 
PR INN I SW * LLP PS LLLLLACAI VEAGA 
RTG*TVYPPLAGNYSHPGA 


3370 


17271 


A 


3393 


396 


54 


GYPPNLTFVTSSLTSVPSPSHPPSHVCS 
PTPAIPQTPRSPPSESTPPTEVSLTTPT 
LS LGAI S S LAS AGPQGAS PAS PGF WGTS 
P?*PYS*NR?QTAACGVFSQLFFIRPEV 
P 


3371 


17272 


A 


3394 


417 


60 


KGWVFQRILWPGPGGSPRYSRPFGGPRG 
LIPLGSGV* TPPGPHWETPFFFNNQPIS 
GQGGGGPLIPLLGGVKPEKIFYPGNQSF 
H* PKIPPHPPPLGAKQNFPPKKKKPQKK 
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TKGFWV 


3372 


17273 


A 


3395 


417 


3 


FFFFFFFQHHFGFEAAA* YWHFVDW* L 
FLYVSIY**GSYSFSIKS 


3373 


17274 


A 


3396 


229 


1 


RSFWTKTMGFSCYKDFF*TNGNKDTTYQ 
NLWDTAKAVLRGKFIALNVPINKLERSQ 
ISNLISQVKEGQAWWLKPVI 


3374 


17275 


A 


3397 


344 


165 


ELKS PQPRKRNGCFFLAVDPDEGF *NLN 
FCGNGQGLNAPGG YPCE I PMGFVPNAYG 
KTF 


3375 


17276 


A 


3398 


1 


409 


HKDIGTLYLLFCARTGILGTALSLLIRA 
ELGQPGNLLGNDHIYNGIGTTQALVRIL 
FIVIPIITGGFGN*LIPLIIGAPNMAVP 
RINNISF*L*PPSLLLLVAYGIVEAGAR 
TG * TGYP PLAGNYSHPGACVDLTM 


3376 


17277 


A 


3399 


3 


396 


KD I GTL YLS F LS * AGVLGAALRLL I RAK 
LGQPRNLLRNDHI YNVIVTAHAFVI I FF 
IVI P I I IGGFGN* LLTLI IGAPDMALPR 
INNI S F * LLP PS LLLLLAYG I VEAGART 
G*TAYPPLSGNYSHPGASV 


3377 


17278 


A 


3400 


49 


350 


KSNSHTEVQAGQEQKSFFRSKHKAGRAG 
SBPAPYVLAGCLCRSISSFLFFFFSLFF 
FFFFKRDGV*QGYSSGSNTPGVKQSSCL 
GLPKYWDYRREPLHPA 


3378 


17279 


A 


3401 


1 


398 


KPPAITQYQTPLFV*SVLITAVLLVLSL 
PGLGAGI TILLTDRSLNATFFDPAGGGD 
PILYQHLF*FFGHPEAYILILPGFGIIS 
HIETYYSGKKEPFGYIGiyiG*AMISIGLL 
GFIA*AHHIFTG* IDVDTRS 


3379 


17280 


A 


3402 


388 


42 


TPLFNPPGAKCVNPFFPKKLKKKNPSRA 
MGGPPLFPPLWGGKPKGFP*NRKSGAPV 
THFAPPAPPFGGKREPPPPKKKKKKKRK 
E EKE SNRAVQ I KKRGNKKGRL P VLD PAE 
ESQ 


3380 


17281 


A 


3403 


413 


2 


3PHPLKNFFLPPKPPNSGGGLAPFVPPQ 
K*GS LPK I PRGRYKS PPFKE KTNAL P PR 
GKPGPPRASSKRPPLFFFFFRGRVPLCH 
PG * S TVQCVQAQCTLELLGS SNFLTSS S 
P VTWGHRQGPPRP INFFI FFRGRVP 


3381 


17282 


A 


3404 


433 


274 


LGWAHACNPSTLEG* GGRTAWS PGVQD 
QPGQHSETPISTNFFWYFRI SYFHI 


3382 


17283 


A 


3405 


131 


427 


GPPPQPFFFWGKRGGPNPFFFFFSEKRD 
VF*QFFLRRRGPQKFFKPPFFCLQKTSL 
FSEKKKKKGFGPPLFPQKKKGCGGGPHY 
KKEG FTLNS FLFQGDGVSS VAQAEVQ WR 
NISSLQALPPGFT 


3383 


17284 


A 


3406 


413 


137 


SPPPPPQGGIFFFFKPR*ETFSPPPNLG 
FFSPPSPLKFFFFPKGFIFLGGGGPKFP 
PPKKRFFFQNSPGGFFFPPF*KKNFFFL 
PPVILGPPRVFF+GPPPPKK 


3384 


17285 


A 


3407 


319 


540 


IVFKDQCEESFTTLNVDIRNHQNLLDSL 
EQHG KGDLL * G ANAYHCE KCN KKVRAVQ 
LCLILITFNGFRKNRFFW 


3385 


17286 


A 


3408 


2 


403 


SLHLGGGSCI VGAI YFITTI INIKPAGI 
TQYQTPLFV* SGL I TG I LLLLS LP VLAA 
G I T I LLTDRNLFTTFLNPGGGGDPI L YQ 
HLL* LLSRPEVY I LI LPGFGI I SHIGTY 
YSGKKEAFVYIGW*AIISIGC 


3386 


17287 


A 


3409 


2 


142 


KKEN YRP VSI-NFJDAKS LNQ I QLH I * K 1 1 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=G!utamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Metbionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
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X=Unknovvn, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














HCNQVGFI PGMQGWFNIL 


3387 


17288 


A 


3410 


407 


48 


PPGGKIFFYKNPEKKIFPTQQKKGVFPP 
FPLKNFFFPPRGKI FGGGGGQNAPPPKK 
GFFPKI PQGVF* SPPKKKKKNFFPPGKI 
WAPQGVFLKGPPPFFFFFFFFFFFFFFF 
FFFLGVR 


3388 


17289 


A 


3411 


3 


402 


SYRLE*QLEDKAA?PQDFTQQLRWPEKQ 
HRMVWPD KEDMHKQLVEAS ETLKSQAKE 
L * DAHQQQKIALQ E F LE FN ELMAEL Y SH 
NHKVWDKEEEMEVAMHKADMMWQEI *RS 
KKLTKRMLFTQMRR I TP VI PS 


3389 


17290 


A 


3412 


1 


224 


LliFQLLRRLRQENHLNLEGRSCSELKMH 
HWTPPWGTE*DCL*LKKKQSAYLPNNTI 
KKLTGSRTIEDI CTEYNK 


3390 


17291 


A 


3413 


202 


1 


IVMSWPGWTKA* I PSTLGGQGGWITRS 
GVRDPLASAFQSAGITSVSHCSQLIFVT 
VFYLSPLSHLY 


3391 


17292 


A 


3414 


407 


309 


PS FKAS KDRLTLLLGANAAGDLKFTPM F 
IYHSENPRALKNYTKSTLPVPVLCIRNI 
KAWM I AHLFTAWLNEHF KP I VE THC S G K 
KISFKILLLINKAPGHRRAQMERHKMNV 
V FMLANTTP I LKLMDQGVF 1 * TF I LCLS I 


3392 


17293 


A 


3415 


404 


46 


LKKPPFQKFLKILKXKKKGGGGGPPP*T 
PPFGGGRGGVPPRAGVSKNPVPQGETLF 
FFKNPKFPGVGGPAP* FPLLRGVNKKNF 
FNLGGGGFKKRKFS PPPPPRGKKFFFPK 
KKKKKKE 


3393 


17294 


A 


3416 


23 


404 


S VL I TGALLLLSM P VLGAGI TI LLTDRN 
LKTTFFDPAGGGDPILYQKLL* IFGHPE 
DNILILPGFGIGSHIVTYYSGKKEPFGY 
IGMA*AMKPIGFLGDIAGAHHVFTG*ID 
VNTRSYFTSDTINMG 


3394 


17295 


A 


3417 


38 


469 


SGTTHASAL5DPAGG * DFI LYQHLF * I F 
GDPEVYILILPGFRI ISHIVTDYSGKKE 
PFGYIGMA*AMRS IGFLGFIG*AHHIFT 
V* I DVDTRAYFTS AT IIIAI PTGVKVFT 
* LATLHGSNMK* SAAVL*ALRFI FLFTV 
GGLT 


3395 


17296 


A 


3418 


1 


419 


FSTNHKD I GTLYLLFGA* AG VLGTGLS L 
LIRAELGQPGNLLGNDHIYNGIGTAHAF 
VIIFFIVIPIIIGGFGN*LDPLIIGAPD 
MAFPRINNISF*LLPPSLLLLLASAIGE 
AGAGTG* TVYPPLGGNYSHPGASVDLT 


3396 


17297 


A 


3419 


397 


107 


KNSAPPFSPIFLP*GPKFAFFPPGKKVG 
KKIF* KFPAPGPQG VYLREGGGS PSPPQ 
TPNWPNKKAAFVLPPPPFFVFGNPKKKK 
KKAVAFSSPNRR 


3397 


17298 


A 


3420 


3 


253 


RGCREMRLCHCSSAWAIAGVSEEKKKKK 
KNWQ* CLLGICS YLDLWLPGLPKWLSQ I 
GRIVELQLLELKQYLRGITKQTLKILH 


3398 


17299 


A 


3421 


1 


393 


RTRTI KTPAITQYQTPLFVGCALITAAL 
LLLS L P VliAAG I T I LLTNR YLNTT FFDP 
VGGGDPILYQHLF*I.FGYPGDYIVILFG 
FGI I SHMVAYYSGKKEPFGY I RMD * AM I 
SIDFLRIIV*AHHIFTVEN 


3399 


17300 


A 


3422 


142 


377 


LSKNCYKPSYVGFVCQCKFFNQLSKFLE 
VKIiLYCMVKLC* * EG * MAQPL * KQFVL V 
AIjKXlNI VI.PYNKRT SLPGIYSTD* KTYT 
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Predict- 
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residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylaianine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L^Leucine, M-Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














DTQNLHTTVYSSSLI I IKN* KQQRCPSV 
GE* INKCFI YTTEYYSLTKRNKIiSSHMK 
A 


3400 


17301 


A 


3423 


2 


435 


LILPGFGI I SK I VTY YSGKKEP FGY IGM 
G*AMISIGFLGFIV*AHKIFTVGIDVYT 
RAYFTSATI 1 1 AT PTGVKVFR* LATLHG 
SNMK* SAAVL* ALRFIFLFTVGGLTGIV 
LANS * VDIVLKDTYYVGAHLHYVLS IRA 
VFAI 


3401 


17302 


A 


3424 


2 


436 


LNTTFFD P AGGG DPI L YQH L F * F FGH P E 
VYILILPGFGI ISHI VTYYSGKKEPFGY 
IGMG*AMISIGFLGFIV*AHHIFTVGID 
VDTRAY FT SAT 1 1 IAIPTGVKVFS*LAT 
LHGSNMK* SAAVL* ALRFI FLFTVGGLT 
GIVLS 


3402 


17303 


A 


3425 


1 


193 


PTRPP7R?ACSELRSRHCTPAWVTE*DS 
VS K KKKKK KF S PARG AHVCTPT F LGGQR 
GKKFCFAP 


3403 


17304 


A 


3426 


3 


446 


HKKFLFPKSFFNRVFPPTKKKNPRGPGF 
FFFFLKKI FFFP PGGI * SGGFGSLQLFP 
PRVKKFFSPPPPEKGGFKAPPPGPGNFF 
FFLKKRG?SFFGGGFLK*NPGFPPPPPF 
KKWGVPKGGAPPPPPFF 


3404 


17305 


A 


3427 


202 


1 


FS PGRMLGNT I KLPYTHSFILMNSQNNL 
E K Y KVGG I PLS N F KTN YKATA I KTMW YW 
HMNR YMD * WNR 


3405 


17306 


A 


3428 


159 


1 


NPPHEKNFCS FFQMESHSADQAGVQWCD 
LSSLQPSLRGFKRFS*VSLVSSWDH 


3406 


17307 


A 


3429 


22 


298 


ESYASTAKATERDMSDRMLSASGMPRRY 
SDYPDAYTT*NILSSVGSFISLAEGILI 
IFMI*EAFASKRKVLIVEEPSINLE*LY 
GCPRLRGR 


3407 


17308 


A 


3430 


267 


26 


GI TFFFFFWLE I * LYYWLFLANKLYFHE 
QIWFGLVFFFGLLLFDFSFYSDYFPLLL 
CLGS FSNF* CLIHCFFSVI i vllf 


3408 


17309 


A 


3431 


274 


376 


NKG* I SCS I SPPQHTVRRGLTLS PRLEC 
SGVISAHCRLCLPGSSDPPASAS 


3409 


17310 


A 


3432 


177 


47 


fyfffffffffffffffsllffffffff 
ffffffffffffqinrsfiasfkyhk*v 
lifkkkkn 


3410 


173 11 


A 


3433 


379 


83 


FKRHFTKEDLRMVTEHIKSCSSSIiATRA 
MQIETTMAHCYTFMRLAKIKKYQYTKCC 
*R*RVTGTLTYCWWECKMTQPFWXTIVR 
RFLKQYIYHVIGYV 


3411 


17312 


A 


3434 


214 


3 


AASTFFFFTRSTYYVCIFLKSKRMINK* 
NY I KLKS FCTTTTKI NKMKRQPTDWEKM 
FANHINHHEGLILK 


3412 


17313 


A 


3435 


1 


353 


GGQGQ PGQ * SETLSQ * KVS WAW * LQLHR 
RMRQDNLLS PGDQG * S EPC * CY YTP AWA 
RVRLCQNRS VNEKSIiHDI CNDKK* QNI * 
I I SVPEGKQRMKGLENLFNE ILDENYPS 
LARDL 


3413 


17314 


A 


3436 


29 


394 


VFMEVDFFFFPFFFFFFFGKRNFFFPPR 
GGGGGPF* I FGPPPPRGKGNFPPPPPGK 
GGKKGPPPPPGFFFFF*KKGGSPPGGGG 
FLTPPLWDPPPWPPQRGGFGV*TPWPGP 
GSDFLNFFFF 


3414 


17315 


A 


3437 


108 


2 


PVAVAHSCNPNTSGGF.GG Wis *GKEFQT 
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residue of 

peptide 

sequence 


K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P^Proline, 
Q=Glutamine, R=Arginine, S^Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














SliANTVK | 


3415 


17316 


A 


3438 


215 


3 


LLLLREFFLPKHREVFIFVFYFMRQGLT 
LTEAGVQWRSHGSLWPQTPGLK* SSCLS 
LPSIWEYSAYHHIR 


3416 


17317 


A 


3439 


1 


186 


QTS S FS FRRS V I TNF FKLKDD V * TH * KE 
AKNLE KRLDEW LIRIKSVE KTLNDLME L 
KTSTRP 


3417 


17318 


A 


3440 


290 


3 


PGLGGGPRYSRFSGGLGPKI PLNPEGEG 
S I KLNPPPAPPPGGPN *DS FLPPPPKKK 
KKKTHVLFAGSGNLLQPLQPGAFTTEVN 
GGLAQQSPKVL 


3418 


17319 


A 


3441 


163 


2 


PAGLPVKPPRFQDKLNFPFGPFFFFFFF 
* I *DKVLLCHPGESATVHSSLGDRA 


3419 


17320 


A 


3442 


131 


1 


RPRRPLTIKSIGHQWY*TYEYTDYGGLI 
FNSYILPPLFLEPGS 


3420 


17321 


A 


3443 


162 


2 


PGAAHSTGHLPLRFHLRLGAVTHAGNPR 
TLGGQGRW I T+ GQE FKTSLAKRVR P 


3421 


17322 


A 


3444 


2 


130 


WEKI FANHVSDKGLI YRI YNELQQLYK* 
KTNGPIKKRPKSTRP 


3422 


17323 


A 


3445 


2 


117 


LAIiLGLKPGLK*ATCLGLPRYWDYRHQP 
PHPACFSKGI 


3423 


17324 


A 


3446 


350 


101 


KAFSSSLGSSRKKGDSDAGFCIHFKTVP 
FGFFWTLELKKGIPFFLNFFFFPFLRVL 
KGSKNPF*KRGPFFFFFFFFDNFYIWN 


3424 


17325 


A 


3447 


106 


1 


LLQPNGQLGPGTVAHGCNPSTLGG*GER 
ITTSGVR 


3425 


17326 


A 


3448 


28 


333 


GQQEQNS ISXKKKKKKKKKTPFKKGKRK 
I PGGKKKPLGKSRAPPPLGGGEKNPPPG 
KRAPEKI FKKKPRGNWETFKRMEXGFLK 
GQ* PGNP * KVKPLWGGKK 


3426 


17327 


A 


3449 


81 


339 


P I VSS CPGVAGALKQAMTLE FKVYQHHV 
VANCRALSEALTELGYKIGTGRDTDGVQ 
QALLWGLRLAS * P I VSSCPGVAGALKQ 
AMTLE FKVYQHHWANCRALS EALTELG 
YKIGTGRDTDGV QQALLWGLRLASQFV 
QPGCGPGSAAAAAMM 


3427 


17328 


A 


3450 


166 


322 


RDFSTTVLITDRTSRQKIPTDVSQLNLI 
DIYKTL*PTIGKYTFFSSTYRSLR 


3428 


17329 


A 


3451 


257 


354 


KFLSFFRDRVLLCCPGWSAW* S * FTAA 
LSSW 


3429 


17330 


A 


3452 


286 


3 


TDPFPIKRAKEREMLQITKIRSIRENFI 
TDQQKTKEL* EEQLYAHRVDNLDEMDKF 
LEGYK*PSMTHGKIENLCRPITSKDIES 
VIKNLLTDPP 


3430 


17331 


A 


3453 


358 


353 


R* *KLLMMPC* IETNKT*NTCTVIKEEK 
GQPGT VAHACNPS TLGG * GGRIT 


3431 


17332 


A 


3454 


108 


353 


THLGGNPVRLYAWLLLHYVASYRI FS VS 
HCKDQTYLPILL*NFLMELGTVAHTYNP 
SILGGQGRRIT*VQEFETSLANMVKC 


3432 


17333 


A 


3455 


342 


50 


GPVTPNIPPVPPLGGQKELFPPKKKKF* 
KFWGSPPPLFFKKPKNPRETPPESCPKP 
KKTP I PVKV * GKKTP * T FSKKKKT? PIP 
ELFIiLKKKKKLIN 


3433 


17334 


A 


3456 


343 


2 


KAPFFFFFKKFWSQKKRGLFFFFFKNFF 
IPGFPGPTFFFWGFFFFKKGPPVF*LKK 
FFGGPLPPLKDPPGVFPFSPF* FWDFPP 
PPPFFFFFFFFFETVSLCRPGWSAYSRD 
C 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=G I utnmic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L^Leucine, M-Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
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X=Un known, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 


3434 


17335 


A 


3457 


2 


259 


LTKPTESGTEDKKGRDSKESEGKTERTE 
SLRSQKGGENSV* SCPSTSSTAALNTAA 
AANPLALNPQVSLSSLSCCPISIKGSFV 
P 


3435 


17336 


A 


3458 


186 


334 


AGGNKKMEKRNM * NKE I F * WFKKNWPGA 
GAHACNPTTLGGRGRRIMRSG 


3436 


17337 


A 


3459 


207 


I 


YFKPTLKINDKG YLNETIDKFDPRD INR 
TSNLTL * EHAL FP S TCG I FMRRD Y I EGH 
EVNLNQFQRIRTQ 


3437 


17338 


A 


3460 


268 


3 


PLNPGPKQSS P PGLPKPWPFRGEPLPLG 
PWFFFVKNRGRP*AFFGTNPLAFFLFFL 
* DRVL FCHPGWS VWQSQLTVQ PP PLRL 
KQFS 


3438 


17339 


A 


3461 


350 


2 


AETAHNINNAFGPGSANKCTVRWWFK*L 
CKGVKRLEDKALGGWPSKVAINQLRAI I 
KGAPLKPTRKVAKELNIEPSAVIQHLKR 
IGKVKKLHKWVPHELHENKRNRFEVWSS 
LIIP 


3439 


17340 


A 


3462 


312 


1 


HIVTYYSGKKEPFGYIGMV*AMISIGFL 
GFIV*AHHIFTVGIDVDTRAYFTSVPII 
IAIP PGVKVFS * liATLHGSNMK * SAAVL 
*ALGFIFLLPVGGLTRIVLA 


3440 


17341 


A 


3463 


181 


3 


FLPLLAAFSGLLLFSGFLGVFLFCFFET 
ELPRLECCGTISAHCSL*LLGSSDSCAS 
GTH 


3441 


17342 


A 


3464 


176 


263 


VGKDAHNPSILGGPGGWIT* GQEFKVDA 
A 


3442 


17343 


A 


3465 


239 


336 


EPGEVAYSCNPSTLGGRSGWII *GQEFE 
TSLT 


3443 


17344 


A 


3466 


88 


326 


QTSLLNTI * * PLH* KNIPLPPPPLPPTL 
TPPFSPPPPPPPPKKKKRKRK 


3444 


17345 


A 


3467 


4 


345 


EEERRRERKKKRRKKEEKKKEEREREEK 
ERKKKKKKKEKKKRRGRQEKRKGTG* IE 
RKQ 


3445 


17346 


A 


3468 


265 


335 


QMNPDRHLQF * DKVLLCHPG WGTVARS * 
LTATSRSSCLSPPSGWDCRRVPPCLASF 


3446 


17347 


A 


3469 


115 


342 


INKQINKIKDPTS YNILLSQSLI * FKAL 
TIiLN S VKAQRGEKATEEKFEGS IS* FMR 
FKQINHLYNIKVQGEKPSTD 


3447 


17348 


A 


3470 


56 


328 


RTYPTKPQKMRDPEDPTSRPRYIPRHLY 
I TDE VNDPSLT I KG IGHQWD * TYE YTD Y 
GGLTFNSYILPPLFLEPSDLRLLDVDNR 
VEPATRA 


3448 


17349 


A 


3471 


3 


329 


RIGARDESSLAVTHKPRKIHRKLTLWVF 
VNDSLVMPRPDKNHQWAFNKNCFPLVDV 
GIDPYLVYHLRPHQAQGI IFLYECVMGM 
RMNGRCGAI LAD * NGLGKTLQC I S L 


3449 


17350 


A 


3472 


154 


2 


PNLLYPKSYWKKKDI WTPMF I VALVTI V 
KIQKQPRCPLMDKSIR*L*YIH 


3450 


17351 


A 


3473 


132 


340 


ALKTENITTLCLDF * L IETVR* * MCVFK 
KKKKKKKKKKKKKKKKKKKKDPPKKKKK 
IK 


3451 ; 


17352 


A 


3474 


3 


345 


VIYSSIFAGTLITALSSH*FFT*VGLEI 
NMLAFIPVLTKKINPRSTEADIKYFLTQ 
ATAS I FLL I AI LFNN I LSGL * T I TNTTN 
QYSSLIIIMAIAI KLGIAPFRRRGREFQ 
IY 


3452 


17353 


A 


3475 


i 


254 


FKKV GQAG FTFDLR - STRLGLPKCWOYK 



761 



WO 01/64835 



PCT/US01/04927 



SEQ II> 
NO: of 
nucleotide 
sequence 


SEQ 10 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenyla»anine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, LHLeucine, {^Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unkno\vn, *=Stop codon,/=possible 
nucleotide deletion, V=possib)e 
nucleotide insertion 














REALCLVHILLYLPMIGPFRIQKLLKAA 
VFVFLQLHQAFIQSLLTLFYFISFPCNS 


3453 


17354 


A 


3476 


351 


186 


SVLSADNTEMGFHHVGQAGLELLSSSDP 
LAP AS + SAG I RGVRHNTW P VLT I TLN 


3454 


17355 


A 


3477 


330 


3 


GEKVFE I PLTRDNVS EILRCFLMAYGVE 
PALCDRLRTQPFQAQPHQQKAAVIiAFLV 
HELKGSTLI INEMDKTLESMSRYRKYEW 
NVKERLRRLKTARA* RLGRSQ VKK 


3455 


17356 


A 


3478 


373 


248 


NRLNPGGGGWSEPRSQHSTPTWVTEYDS 
VSKK* INKNKNWK 


3456 


17357 


A 


3479 


3 


92 


EGQGCSEP*SRYCSPAWVTE*DCVSKNF 
F 


3457 


17358 


A 


3480 


380 


2 


VEVPLLNTSVFFAS GVS I T* AHHSL I EN 
NRNQ 1 1 QALLITNLGGLYFTLLQASEYL 
ESP FT I S AS I YGS T F FVATGFHGLHV 1 1 
GSTFLTMCFIRQI I FHFTS KHHFGLEAA 
A*CWHFVDWCIFL 


3458 


17359 


A 


3481 


225 


1 


ERAER WEQVM IALFS PNYIRLS FPISP 
TPWRHKFRFHLKGYIWPGTVAHTCNPS 
TLGS QGGQI TRSGD * DHPG 


3459 


17360 


A 


3482 


332 


1 


IKYFLTQATASIILLIAILFNNILSGQ* 
TITNTTNQYSSLI I IMAIAIN*GI APFH 
F*VPEVTQGTPLTS 


3460 


17361 


A 


3483 


277 


1 


LSMS SFSISSI PSAS INS * ALLFS SAES 
NSLECTFSSSFSPNSSESFSSSFPSCSC 
AS * FSNSQSLI KASSSTEAS PLNS S AS C 
FLPLPLVG 


3461 


17362 


A 


3484 


404 


209 


AL»SVQPLWRPRQENGVNPGGKASR* PRF 
PPCG PAWATKGELVP KKKKKKI S FLGHD 
QSLELLLQR 


3462 


17363 


A 


3485 


31 


347 


FCILERKAFLYW* CWKFFFFFFFFFLE 
KGFFFLPLtLVFFVJCYYHLKGASLP*VFL 
FFYLHCYSSLFTLFFIFLQSLTPHFFLV 
* TFFYYDPLMFFFLTLFLELV 


3463 


17364 


A 


3486 


327 


161 


QKKVRDPYEQL*ANKFKN*EEMPRFLEK 
HKLPKLTQEEIKNQ 


3464 


17365 


A 


3487 


311 


3 


PRRGRFQLTQNFALPFNDGNKTKIPFKK 
KKKKTEPLCPCTCVCAAFVCLCFHLCT* 
PCDCAHDCDNI CVCVNICVYVCWTHLH 
TGLCSCVDCDHICVGTCC 


3465 


17366 


A 


3488 


245 


1 


RPRRLLALLLI I ITF*LPQLNGYIEKST 
PYECGFDPISPARVPLSIKFFLVAITFL 
LFDLE I ALLLPLP * ALQATNLPLI V 


3466 


17367 


A 


3489 


2 


400 


AAP I LGMLGLRQRGKYKRQRQLLCLWS A 
I LLFL Y FLNKLL * LTKKKKKKKKKKKKK 
KKKKKKKKKKKKGGPRGGNPQILKGKIG 
KPPGGFFKGGGGPKKI FFFKNRGALFFK 
KKPPQKKNLGPGPPGbKNFLKEKKKA 


3467 


17368 


A 


3490 


57 


366 


INVFAGKDQLQPLCACPSSPIISVESDR 
PDMYQCGIiNHFIYLI IYLRWSFALVAQA 
GV*WHHLGSLHPLPPEPKCFSC 


3468 


17369 


A 


3491 


25 


384 


IRGTF* FFWGGLGGLSKNNPL* KKNFSP 
PKLGVLPPPLFFPGGGFFFFFPGAPPGF 
KKKKPPPKKFFF*KKGAPPFFFFFFFFF 
FF 


3469 


17370 


A 


3492 


170 


41 


AQPDQHGETTFLLKIQKLTRHGGICL*S 
QLIGGLSFITQKITC 


3470 


17371 


A 


3493 




240 


VliSPADKT^v'TCAAVJGKVGAIIAGEYGAEA 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, 0=Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, LHLeucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=G)utamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X= Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














LERMFLS FPTTPS * FLDFFFFLEKIGFH 
HVGQSSLGFLTSRDPSAYNSLLGS 


3471 


17372 


A 


3494 


359 


204 


TQWLMPVILAPWEAEVGR* PEVRSLRPA 
LNSPFFXRLWIMMYIMNI I ILS 


3472 


17373 


A 


3495 


1 


178 


DRLS PGV* GS S EL * LCHCT PG WATE PDH 
VSRKKKKRKKVGLGKNKAKQKHPRGYYR 
AEL 


3473 


17374 


A 


3496 


361 


1 


LGPKLLKKPFWIS IGIKGVAGLPGFFPT 
PPQ * G *NFGLKRGGFFFYFFEAKGWFNT 
PGGKQRGPFRSLEPFPFRVKLFFPPNLP 
NN WGHKGAPP KP P FFFFFLR W S FALVAQ 
AGVQWCDL 


3474 


17375 


A 


3497 


276 


79 


AEVY K I KCELGAVAHACNP S TLGGQGGR 
ITRS * VSASFTKKYWQLALMKELLNE 


3475 


17376 


A 


3498 


3 


476 


FGTSMSI ITLITPRLARGNNNLFSCFPT 
FTFDHLATPLI* I LTT* LLPLTSMASQRH 
LFREPLSRFCKLYLS ILNSLQSSLI ITFT 
GTEL 1 1 FY I FFETTLI PTLAI I TR*GNQ 
PERLNAGTYLLFYTLVGSLPLLIALIYT 
HNTLGSLNILLLTVSAQE 


3476 


17377 


A 


3499 


581 


2 


AHACNLNTLGGRGRWIT*GQEFDTSLAN 
K 


3477 


17378 


A 


3500 


380 


135 


RHQSGQIGKTPSLVKIPKLTPPKGGVL* 
SHLTKRARQEKGLNPKSKGFN* PKLRPC 
PPT I VAKKQNP VS KKKKQTRQE PSA 


3478 


17379 


A 


3501 


1 


386 


GSR *GNHPERLNAGAYFLFYTL*GSLPL 
L I RL I Y TDNTLCS LN I LLLTLTGQELS N 
S* AKNLI *LAYTIAFIVKIPLYRLHL*L 
PKAHAQAPIAGSIVjJAAVLLKLCGYGI I 
RLTLILNPLKKHIDYP 


3479 


17380 


A 


3502 


2 


389 


EKQh* KDKQVYRATHRLLVLGAGESGKS 
TIAKHMRILHVNGFNGEGGEEDPHAARS 
Y I DG E KAT KVQDI LNNLKEG I E T I GAVM 
SNLVP PVELANPENQFI VDY I LSAMNVP 
DFDFPSEFYEHAKALWE 


3480 


17381 


A 


3503 


2 


379 


PFGYIGMV*AMISIGFLGFIG*AHHIFT 
VG I D VYTRAY FTCAT 1 1 1 AI PTGVKVFS 
* LATLHGSNMK* SAAEL* ALGFIFLFTV 
SGLTGIGLTNSSLDIRLHDTYYWAHFH 
YVLS IGAVFAI IGG 


3481 


17382 


A 


3504 


281 


68 


FKMRFGWGHSQTVS ENYKPLLKE I RENP 
NK*KNIPCSWIGRISIVKMPTLPKVIYR 
FNAFPIKLPSPFFTL 


3482 


17383 


A 


3505 


139 


263 


ETGSHYVPQAGL*LLASSDPPATASQNI 
GITHVSYYVQPL*PIFWEAVAGGSLEAR 
S 


3483 


17384 


A 


3506 


2 


427 


LKTSMTM I STTLLT * LRPAWH * AQ YQTP 
F F V * SAL I TAGLL FL FLP VLAAG I TVLL 
TDRNLNTTFFDPAGGGDPILYQHLF* FF 
GHPEGYILILPGFGIISHIGTYYSGKKE 
PFGYIGMV*AMISTGFLGVIG*AHHVFT 
GG 


3484 


17385 


A 


3507 


343 


3 


GWHSRSSGGASSPSSPSHRSRLRVSSGS 
LGRLfWSKSSKTSVSCGWARDTEKPPL 
SKIEASSSDSCESVMAKANAVFFTSTTG 
EXPASSASLSLSVWASGSSPPGAANSSS 

s 


3485 


17386 


A 


3508 


3 


363 


GP^nKA ^RRIAT EW -D IT* KWGMRTHKC 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Tsoleuclne, 
K=Lysine, LHLeucine, M=Methionine, 
N=Asparagine, P^Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W^Tryptophan. Y=Tyrosine, 
X=Un known, *=Stop cod on, /^possible 
nucleotide deletion, Y=possible 
nucleotide insertion 














AKQLLS KQPF * S S RTGDLKLE LQI EL VR 
ETKRKC* RVL* IDRALTSHLYSLLQTQD 
ALGDASADLIQKTS *LQEEFGYYAKTQK 
LLCRNGET 


3486 


17387 


A 


3509 


3 


434 


GPGDHVARGISGKKYDIGQKWGMGTYKC 
TKELLSQRFGLGS P TVDLEL * LQI ELLR 
ETKRKYESVLHLGRALTAHLYSLLQTQH 
ALGDAFADLRQKS PQLQEEFG CYAQTQK 
LLCKNGETLLGAANLFVSS I YTLGTKTM 
EDTL 


3487 


17388 


A 


3510 


1 


401 


GTRKNS FLHDS QTS FCFSDS I PTPSNME 
ETQHKSNLELLRISLLLIEPWLEPVRFL 
KS MFANNL VYD TS DS DD YHLL KDL* EG I 
PTLMGRLEDGNRRTGQILKQTYSKFDTN 
SHNKDALLKNYGLLYCFRKDMN 


3488 


17389 


A 


3511 


117 


2 


FKFI FSEMESHSVA*AGVQWRNLGSLQP 
LPLWFKQFSC 


3489 


17390 


A 


3512 


491 


90 


KKVKI VLAGFFFKPPPPKN * KKKGLK* R 
KGVG PRGYTVNPH P CGYKP FGGPGVQNP 
PGQKGETPFFQKKNKKPGGGGGALI PPP 
KGGGRGVPLSPEKFGLKFPPGPPP*GKK 
KNRFQKKKKKKRQN* PQKSQC 


3490 


17391 


A 


3513 


2 


31 


GKGAPTTSLINVAGTKIIAKWEDNKLP 
GAI CS LT CGGAH I G T PMARDERVNLLS F 
TGSTQGGKQVGLMVHERFGRRLVELGGN 
NAI I AFEDAHLNLYVPSTLFAAVGTAGP 
KCTTARRQLIHESIHDEA*NHFPH 


3491 


17392 


A 


3514 


2 


390 


KEFHKTHPtiEKITLAPDPEGPDGLPPVA 
YNPWMDIRQREDGQTLNIREPYGPIPVB 
F*RKIRQSYFASGA*LDTQDGRLLSALD 
DLQLANSTI IAFTSDHGWALGEHGEWA* 
YNNFDDDTHDPLI FYDT 


3492 


17393 


A 


3515 


3 


410 


RGTGNNWAQGHYTKGAELVDSVLDWRK 
ES*SCDCLQGFQLTHSLGGGTGFGMGTL 
LITKIREEYPNRIMNTFRVMPLPKV*DT 
EGEP YNATLLVKQLVGNTDET YS I DNEA 
LYDICFRTLKLTTPTYGDRKHLVS " 


3493 


17394 


A 


3516 


1 


396 


GEDAANI YARGH YT IGMEN I DLGLiDR I R 
KTiADPCTGLQGFLVFHSFGGGTGSGFTS 
LLMERISCDYGKKNKLEVSIYPAPQVCT 
AVFEPYNY I LS THTT LEHS D * AFMVDNE 
AIYDICRINLHIERSTYTNL 


3494 


17395 


A 


3517 


2 


399 


EKIGYNPDTVAFVP I SGWNGDNMLEPNA 
NMPWFKGWKVTRKDGNASGTTLLEAIiEC 
ILPPTRSTDKPLRLPLQDVYKNGGIGTV 
PAGRVETGVLKPGMEVTFAPVNGTTEV* 
S AERHHE P FREALPGDNVGF 


3495 


17396 


A 


3518 


1 


402 


GEDAANNYARGHYTIGTEI IDLLLDRIR 
KLADQRTGLQGFLVFHSFGGGTGSGITS 
LLMERLSIDYGTKSKLEFSIYPAPQVST 
ACS EPYNS I LTTHTTLEHSDCAFMVHNE 
AI YDI * R INLDIERST YTNLNR 


3496 


17397 


A 


3519 


1 


408 


GS CliGQKL YNSTTKTAT* WSSNHTKKNP 
FSKFPKLQNWTHPESHRDWTAPTRLYW 
ICGHRPYVKLPNQWAGSCIIGTIKPSFF 
LLPIKTGELLiGFPVYASHAKRS IAIGNW 
KDNEWPPERI IQYYEPATWAQDGL 


3497 


17398 


A 


3520 


3 


162 


ETVLLCCPGWL*TLGLRQSSCLSLPKCS 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
VSSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glulamic Acid, F-Phenylalanine, 
G=Glycine, H-Histidine, I=lso!eucine, 
K=Lysine, L^Leucine, M=Methionine, 
N=Asparagine> P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Ty rosin e, 
X=Unkno\vn, *=Stop codon,/=possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














DYRHATVPGQKYRFPNRYYYGSYS 


3498 


17399 


A 


3521 


184 


354 


ETGSSLLP *GSSELPVSHPYFFNTHSLS 
LSHTHTHTHTHTNSS ISNYHCPLYTNMS 
CQRLVAVYRQKGSFKDQCTDI 


3499 


17400 


A 


3522 


2 


313 


YYLSPPMGDQPGQHGETLPLQKLAG*GG 
ARLRSQLLRRLRQENCLTPGIQACRELL 
GWYKSNCAFCQQKPQLLCVNLTLYSSLD 
DKARDSVSTKTKREGRGEK 


3500 


17401 


A 


3523 


13 


127 


LGLQARTTSMPSS*GYRRAPPPC PALLY 
F*VETGFHHVGQAGLELLTSTRP 


3501 


17402 


A 


3524 


148 


1 


VCMFLEESCSVAQAGVQWYNHSSLQP * T 
PGLKRSF*LSLPSSRDHRLMT 


3502 


17403 


A 


3525 


I 


98 


VWPGGEGCS E PRSHHCTTAWVTE * DS I S 
KINK 


3503 


17404 


A 


3526 


138 


3 


AASTHDLG* LQPPPPGLTPFSCLRLLSS 
WDYRPVPLRPANFLYF 1 


3504 


17405 


A 


3527 


10 


206 


PTTSLISFTSYSPCLKCSNVYEQVNRMR 
VLFYKSKQRLGMVAHAYDPSTLGGRGSK 
IT*GQGRRG 


3505 


17406 


A 


3528 


345 


3 


VTELIQSNKAKLKKKSKNKNEQSLQKI * 
DYAKQPNLRISGVLEEEEKSKSLENIFE 
KIIEENFPGLTRDLEIQIQKAQRPGKFI 
AKRLSPRHTVIRLSKVKTKGRILRAERQ 
K 


3506 


17407 


A 


3529 


3 


90 


AVL * ALGF I FLFTVGGLTG I VLANS VDA 
A 


3507 


17408 


A 


3530 


3 


90 


AVL*ALGF I FLFTVGGLTG I VLANS VDA 
A 


3508 


17409 


A 


3531 


354 


3 


AVI NSLPTKS PGPDGFTAKS YQR YKEEL 
VLFLLKPFQR I E KEG ILPNSFYESSVIL 
I PKPHRGTPKES FRP I SLMNLNAKI LNK 
RLAS * NHQK I KKVI HHGQ VGFI PRTQG W 
FNKQ 


3509 


17410 


A 


3532 


2 


357 


FTRPR FS FT FFVL I RKYALS CGKI S I QH 
S Y * KKKS KG KLGG L * G IN FCF FER KFHS 
VTQVGGQKHDPG+LKFPPPGFRGFSRLS 
LPSSWNHKRGTPRLANFCTPNKNGGRRG 
RKFQIY 


3510 


17411 


A 


3533 


76 


365 


KKKKKKKKKKKKKKKKKGGAGPEI PKLK 
MGKSEKPPGVFLRGVGBKI KFFFLKKGG 
PFF* KKKPLKNKNWARBPSFKKI FKKKK 
KKAHFFFGPKILKK 


3511 


17412 


A 


3534 


7 


353 


FFGFGGVFLVFLFFFF*KYPFFFPPPPL 
FFFPFFKWAPRRPPPPFFSSFFKNFFFF 
FPPPPPFFFFFPQKNFPLFFSPLFFFFS 
PKFFFFFFPLTKKKKKKNKKKKTKTPKI 
HPQTQRK 


3512 


17413 


A 


3535 


354 


41 


AGGVPPGIPPPWGGRAGGSPKGRGFGPP 
WPKRGTPPFPF*KKKKKKKKPGGPAIPA 
TREGEGGKSP*PWKPRVQPT*NAPPPSP 
PGEGAKPPFQKKKKSMFHKR 


3513 


17414 


A 


3536 


174 


364 


TFNSLAFTFFFFF*RETGSHFFLEAGVQ 
GNHS S LN P * P PGLRGS S CLTL? KC * D YR 
REPLPPA 


3514 


17415 


A 


3537 


38 


368 


PGRLRLQEAVNHCTLEPGQQS V* KKKKK 
KRGGAF FKAKLTLAG WKG E KKKKTG V F F 
*TLKIRPKEIRNQGGSQQRKKAELGKLQ 
QTYP VLN FKATF YG EQGD Y Y * T FF KP 
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SEQ IB 


SEQ 10 


M 


SEQ ID 


Predicted 


Predict- 


Amino acid sequence (A=Alanine 


NO: of 


NO: of 


eth 


NO: in 


beginning 


ed end 


C=Cysteine, D=Aspartic Acid, 


nucleotide 


peptide 


od 


USSN 


nucleotide 


nucle- 


E=Glutamic Acid, F=Phenylafanine, 


sequence 


sequence 




09/515,1 


location 


otide 


G=Glycine, H=Histidine, I^Isoleucine, 








26 


correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y~Tyro$ine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


3515 


17416 


A 


3538 


107 


349 


SVTFPFPFFFLKQTGSHFW*AGGQGGN 
FD*LEPKPPWLRESSLLTLPSSLDYGPP 
PTGPSNFGI FFEKRGFCLVPQASFE 


3516 


17417 


A 


3539 


274 


3 


PRRWSLPLTQMGPLPSRMGPKTKLCPKK 
KKKKIPPRKILEPDVFTGQFYLMFKEIT 
S I FLKL FQQCEEEGTFSNS F * EAR I ALK 
TK*DKN ■ 


3517 


17418 


A 


3540 


I 


374 


ACALVRNDHIYYWGPTYALGELFFTET 
P 1 1 1 *G* DN* LGPLVIGAHWALTLINY 
ISL*L* PPS VLLLLACAI PEARAGTGCT 
AYPPIiAGDYSHRAAYVNLTILSUiLARV 
SSILWDTDSITT 


3518 


17419 


A 


3541 


427 


76 


RGSLS PGV * K* PGGQNQTFPKAPKLRPG 
APRCFPAGKGS I KPTPGSPPGGRRGTPC 
SKKKKNFRETPPSFFWGGPPFLKNRGGF 
MFWASKTKNGLNKTLGPNPGPPLGGLRS 
PFWVG 


3519 


17420 


A 


3542 


234 


88 


FFFFFFPFFFFFFFFFFFSLFFFFFFFF 
FFFFFFFFFFFFFW* * ILYFV 


3520 


17421 


A 


3543 


30 


419 


TFSFFPFLFFFGFIjVLGGEKQGKWDPKP 
PGAEKPPTFSPPKI*KKGAPPPPRVNFK 
KF*KKGPLPFSPGGKNLKS FWGIWSIGN 
PLWVTPPKNPQKKPPFFKGPKKNFFFKG 
PKPGGKKTQ I FFFLWEKK 


3521 


17422 


A 


3544 


159 


411 


VSRDISPVRLSPSSCFH*PFLHISSSVE 
GRFSSFAQFLLSCLNFCTFPLLFLLLRF 
*DRVSLCHPGWSAVA*SQLTATSTFWVK 


3522 


17423 


A 


3545 


2 


351 


CLAHTSGVES * ASLPCGQNS YIHYLKLL 
CMEELSFPPYLLIYSISYLY*HTIMVIY 
FILVL * SILLNL I YLFVTQ I FPALASGS 
SFlWLLCLFlvTTIiAVHVCVCVCVCVCVR 
VCAC 


3523 


17424 


A 


3546 


397 


1 


KPPPPCSYGKKKKIFLLVFFRTLPTKPP 
GAGRKIFPRGPGFFLKNPKAL*TTFAPP 
PFIQKKGPFLLQQA* IPFPVFFLFKKKP 
FLCSP*GGIGVSLFFFFFSETESCSVTQ 
AGVQSCDLGSLQPLPPGLKR 


3524 


17425 


A 


3547 


76 


354 


FLFNFMGKIFKNFRGGVFFFFFWGGYNI 
FLVFLKI FFFFFGGFFLPFFFFLKKFFF 
FLGFFFGKISQKI FFFFVN* I FLLXXGA 
PPLFFFFFL 


3525 


17426 


A 


3548 


197 


1 


INKMKQQRMD*EKIFAKHVSHKGLISQI 
CK*LIHLNSEKTNSPVRKWAEDIiNRHFS 
KRDTKMVNM 


3526 


17427 


A 


3549 


175 


309 


KSTLKVMLIS I S I FNFYFF*DRVLLCHP 
GWSAVA* SLLTAASTRP 


3527 


17428 


A 


3550 


3 


196 


GFERVSQDGLDLLTS * SACVS FPKCWD Y 
SHE PPRPAHKTSLKKNKMRVR 1 1 IMMTI 
TYPAPWC 


3528 


17429 


A 


3551 


3 


360 


RAVEIFCYDVCVESGCADIQLLLLCPTS 
IKVCFSSLSLDIVTVIPDIYICIYIYII 
FFFFFKQARS*CQCLNDKE*YCSLWPTP 
PGFKKLSCVSLPS I WDLRCAPSHPPNFC 
VFGKNGV 


3529 


17430 


A 


3552 


326 


1 


KDNKR I LKLMLAKSGKVRH YVFDFGKQ I 
SKMSEKNIGVKPPIGKEKTKNPKKKPSR 
TEDPKICTYVL* PDKKKWRRFLNT*GIQ 
T.»G* KEIKLSLFADDMI VYLENP IV 
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SEQ ID 


SEQ ID 


M 


SEQ ID 


Predicted 


Predict- 


Amino acid sequence (A=Alanine 


NO: of 


NO: of 


eth 


NO: in 


beginning 


ed end 


C=Cysteine, D=Aspartic Acid, 


nucleotide 
sequence 


peptide 
sequence 


od 


USSN 

09/535,1 

26 


nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


nucle- 
otide 
location 
correspon 
ding to 
fast amino 
acid 

residue of 

peptide 

sequence 


E=Glutamic Acid, ^Phenylalanine, 
G=G)ycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, V=possibIe 
nucleotide insertion 


3530 


17431 


A 


3553 


186 


1 


RPRRLP I KLPMTFFTEIiEKTTFKFI *NQ 
KRVRIAKSILSQKNEVG3ITPPDFKL*Y 
KDTVTN 


3531 


17432 


A 


3554 


3 


366 


KTS LGNM VKP YNF FV * TFY I * * KKE I KK 
NSIGVL*PGTEQSKHEFKKILPSKRNKI 
ASKRIKYL * INLT * EGQDMYTENYKTLL 
KEI I EDLNKYKDI PCS * I IRLDLVMMAV 
LTKLIYRFN 


3532 


17433 


A 


3555 


239 


1 


GRGKMLPVS LLN I TRVTFAPCSGS 1 1 LV 
LIALPSLRILYITDEVNDPSLTIKSIGH 
QWY*TYEYTDYGGLIFNSYILPP 


3533 


17434 


A 


3556 


3 


123 


QAGLEV*TSSDLPTSVSQSAGITGGSHR 
ARPANIINSVP 


3534 


17435 


A 


3557 


237 


2 


KGFSPPTQKFGFLGVFPPQKKKTKKNPR 
PPKGLTPVS*KGMLTEPLEPFFFFFLRQ 
SRSVAQAGVQWGNLGSLQALPP 


3535 


17436 


A 


3558 


87 


379 


KKKKKKKFTLQKPVGFLNPPNGEAKSQI 
INELLLKIATKKKKNLKLRLKKKGKDLF 
RGNYNPLFKKIKDDPTNGKNFPCS*MGK 
ISFEKMTLLPKAI 


3536 


17437 


A 


3559 


390 


181 


KKXKKNPGGGGPPFFPTFRGG*GGGPPQ 
PGGGGAPKV* ISPPPPPPGGKKSPPQKK 
KKKKKKKGPFLPHL 


3537 


17438 


A 


3560 


2 


101 


VIVTAHAFVIIFFIVIPIIIGGFGN*LV 
PVDAA 


3538 


17439 


A 


3561 


2 


126 


INTLLALLLI I ITF* LPQLNGY I EKSTP 
YSCGFDPISRRRG 


3539 


17440 


A 


3562 


16 


349 


FAFQKITLAMEKATEAQTGGAKKACQGT 
NNIMKFLKKKCIPTTIFWISFFSFLFFF 
FETKPHFFNWPEGKGPNLG* LKPLPLGL 
KQFCLTLPRKGNYRHVLPPPVNGFFFF 


3540 


17441 


A 


3563 


6 


130 


TLRRILEDHFSLRS*GCGEPCLRHCMPA 
WL>TE*DPVSKNIY 


3541 


17442 


A 


3564 


3 


152 


GFHCVRQDLDLLTS * SAHLSLPKCWDYR 
RKPPHPAGKGTYFYIDY3TIB 


3542 


17443 


A 


3565 


241 


3 


NLVSHVISMRKVINLVEQNVTSSNRSNH 
TWLSFIKLYFI 1 1 1 S IVFGEQWFGYVA 
KFFSDDF*DFGVPITQTVYTAPN 


3543 


17444 


A 


3566 


71 


256 


SVFLLTVFLYPLATSFFFLNRVSLCCPG 
WL*TPGLKQFSCLSLPKRWNYRHEPLTP 
GSNFF 


3544 


17445 


A 


3567 


1 


392 


AGAGTSGTDYPPLSGNYSHPGACVDLTI 
FSLHLAGVASMLGAISVMTTVMNIEPPA 
ITQYQTPIiFV*SGRNTAGLLLLCVPVXiA 
AG I TI LLTDRNLNTTFFDP AGGGDP I LY 
QHLKGKRGI IICSHRDRI 


3545 


17446 


A 


3568 


334 


409 


LGTVAHACNPSTLGG*GGHITRSGV 


3546 


17447 


A 


3569 


3 


467 


PKNPPGGFFSPL*EKKYILPPPP*IWPP 
PGFF*KAPPPFFFFFFFFFFFFFPTFQI 
YFITF 


3547 


17448 


A 


3570 


3 


420 


FFDPAGGGDPILYQHLF*FFGHPEEYML 
ILPGFGILSHIATYYSGKKEPFGYIGMV 
*AMISIGFLGI I V* AHHI FTG* IDVDTR 
AYFTSATI I IAI PTGVEVFS *LATLHGS 
NMK* SAAVLS ALGFI FLFTRRGLTS I V 


3548 


17449 


A 


3571 




418 


NLINHSFIDLPTPSNIFA**NFGSLLGA 
CLI LQI TTGLFLAMHY* PDASTAFSS IG 
HITRDVHYG* IXRYLKAKGASIFFICLF 
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LHIGRGLYYGAFLY*ET*NIGI ILLLAT 
IATAFIGYVLPGGQISF*GATVITNLL 


3549 


17450 


A 


3572 


3 


414 


S SGLAM * FH FHS I T LL I LGLLTNTLT I Y 
Q*WRDVTRESTYQGHHTPPVQKGLRYGI 
ILFITSEVFFFAGFF*AFYHSSLAPTPQ 
LGGHV7PPTG I TPLNPLE VPLLNTS VLLA 
SGVS I T * AHHSL I ENNRNQ I IQALL 


3550 


17451 


A 


3573 


2 


423 


GTL Y LL FG A * AGVLGTAL I LL I RAE LGQ 
PGNLVGNDHI YNGI VTAHAFVI IFFI VI 
PIIIGGFGN*LVPLIIGAPEMAFPRINN 
ISL* LLPPSLLLLLASAI VEAGAGTG* T 
VYPPLAGNYSHPGAYVDLT I FCLHLAGV 


3551 


17452 


A 


3574 


411 


117 


DGGPLLQLLRRARQENCFNPRGGGCS* P 
RLCPCPPAWGAKLNSLSGKKKKKKRHVE 
KYAQHDY I INVLCW CVH I KWLSLYMYS 
I TKS IDKI CKKAS 


3552 


17453 


A 


3575 


2 


394 


ALANMWEQIRSLHQYAVHRI ISLFSLLS 
KKHDRVLEQATQYLRGSLNTNDVPLPDY 
AQDLT VI EEL I PMMLEI INS * LTNSLHH 
I PNLVYALLY * RDLFEQFRTHPS FQD I M 
QNIDLVS VNEDI Y YDS FLS 


3553 


17454 


A 


3576 


262 


418 


G WRE WTEL 1 1 FY I F FETTL I PTLAI I TR 
*GNQPERLNAGTYFLFYTLVGSLP 


3554 


17455 


A 


3577 


232 


443 


PS VQTFPCCPLSEEGPWLLCVAMTPS PG 
VPVP TERAL YSME CAFH PLFSLTS GACR 
LDYRTPDNR*VQLP 


3555 


17456 


A 


3578 


48 


410 


GGLLLHRAGCWCGHADRRADQPDSSDPH 
SLHAACL? PPGAARAAGAPS PA*TPAAA 
CPAPVPAPCSEHRC*PPGLSQRPCLPGL 
PDARPPGAAAHGGWSLPTRLVLHDRSHP 
EEAPRRHED 


3556 


17457 


A 


357? 


422 


186 


VYSKFL* SQLLRRLRQENRLNPGSRGCR 
EPRS*HCIPAWVTQQDSLSISSSTSHEI 
CGSHCSPGYLTVHI CDQPVFSL 


3557 


17458 


A 


3580 


442 


3 


GSAISSRCTQESTYQGPHTPPVKKGLRY 
GI ILFITSEVFFLAGFL*AFSHSTLAPT 
SQLGGHWPPTGITPLNPLEVPLLNTSVL 
LASG VS I T * AHHS L I ENNRNQ 1 1 Q ALL I 
TI LLGLYFTLLQASE* LES PFT I SDGMY 
CGRSRG 


3558 


17459 


A 


3581 


428 


1 


IRDIFMASSLGM* FHFTPITLLILGLLT 
TPLT I YQ* WRDVSRESTYQGHLTPPVQK 
GLRYGIILFITSEVFFFAGFF*AFYHSS 
LAPTPQLGGHWP PTG I TPLNPLE VPLLN 
TSALLASGVS IT* AHHSLIENNRCI AAA 
LE 


3559 


17460 


A 


3582 


160 


415 


RCLSHCLAFPMLSILFIPSQLLSLPFHV 
KWIFFFFFLNKKFNCVPQAGGQGKNLI * 
LHPPPRGLKKFSCLRITSS*NYGREPQP 
S 


3560 


17461 


A 


3583 


399 


2 


STHLGLPKCWDYRCEPPRPALLAIVKLL 
CKTI HVR INNVCCLPLLTGAAS V IGLSA 
SLLFSVSPGTSLQLEAR*TFPWPHMSST 
CVWRFI FTS LGRAQPQHS DLQTCPRRDV 
DLAEPGLLFNGQMLVPPGRS 


3561 


17462 


A 


3584 


215 


1 


KS PGLLNAFSRGSQI YQASRAMGA * LNQ 
RFWPGTVAHACNPSTLGG*GARIMRSGD 
GDRNS KDGVS P CGRV 
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3562 


17463 


A 


3585 


138 


19 


LNQRFWPGTVAHACNPRPLGG*GARIMR 
SGDVYRNSKDEV 


3563 


17464 


A 


3586 


262 


3 


GRPEVFYKPPPRGEFFFFFFFLFFFFFF 
FFFFFFVLFFFFFFFFFFFFFFFFFF*Q 
VRLYLLEKYELFF FFLFYPQE I SLCCPG 
WS 


3564 


17465 

} 


A 


3587 


381 


1 


KKFTPPPPRKMGPPPGFF* RAPPFFFFL 
GGGGGPLFFLGGPPPPGGGFFFSPPPPV 
FF*KSPPLFSFF*KKRGPPFFKRGGSPK 
TPPCFFFKKKNPPKKKKKKKKKKKKKKK 
GRP SSKS D AWVRTRG 


3565 


17466 


A 


3588 


198 


1 


PKS PPRPLARGKEPNFFSKKKKKKS I *N 
LNTNSWLDMVAHICMPSTSGGRGG* PHL 
RSGVRDQPGQ 


3566 


17467 


A 


3589 


373 


1 


GIFFPPPKRGGFPPPPPKKFFFFPGGFI 
FGGGGGPFWPPPRKGFFLKKKKGVF*SP 
P*KGKKFFFFPGGGVGPPRGFLKGGPPI 
IFFFFFFFFFFFFFFFFFPATI FLITYK 
DLTCHPGTHFTH 


3567 


17468 


A 


3590 


506 


98 


HTSRGPPPPPKKKIPPTRPFFFCCFWW 
FFPPKKKFFFFCS PRSF IHS PPKXKKKN 
FFFPKGGGPPLSPFLTPPPPLFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFF*LFVSLFSI 
FVLL I FFLFYNAI TT I 


3568 


17469 


A 


3591 


407 


1 


LVRGQ I TNKKNLFNFLGEELREWR WFF 
LPQQRLFFFKL*LQ*KEIYLFI*KMTCS 
RGGAGFKSSLYFFFRSPPQWLFFFFFF 
I FFYFF* FFI FFFFFFFFFFFFCNDFMI 
YLYISPPLPILEKKKGGRSRSRS 


3569 


17470 


A 


3592 


380 


68 


FRRGVGPQWPPPQKKGFSQKPPGGFKRP 
PLKGKKITFPPPGKFGPPKGPFKRPPPF 
FFLEG KTL FLGP D P * * NLG WGG * RGHG K 
*GPANPRVPPVFFFFEEYSY 


3570 


17471 


A 


3593 


394 


45 


ALMFSTGQEGRQEKHPPPHQKKKASRGE 
SPRGGVRKNYPPPKKKSFPKKNPPGVTP 
PKKKKKAKLPPRPL*GGPQRRPQKTAPP 
PLNFYPPKEPTKVLSIFFSFFFFFSAND 
VELY 


3571 


17472 


A 


3594 


402 


62 


AMIVLLHSSLGNKARPCLFKKKWAKDLN 
KHFSKEDSQMANKYMKRCLI SLVIRNMQ 
MKITVRCHFILTRMAKIKKMVNSKCRKE 
VQKI PTLKHCWWECKI V* ILWTNLAARV 
G 


3572 


17473 


A 


3595 


220 


425 


FI SGFLNLQEERE I AFFFKVN I RLGAW 
HTCNPSTLGGRGRRIT*SPGV*DQPGQH 
GEPCLYLNLQKN 


3573 


17474 


A 


3596 


13 


412 


AEIAPLRSIHRARNCLKKKGRKGRKNPG 
GDGGNPQIGPAFFGGGNGKTPGQKPHQP 
QTWGQGRHLKGGGGGPKRKGVP PXKKT P 
T I WGGT P LA FG VE PNW W3HTGN PG GAD P 
KPPGREK * SGGAQI TGPHTGP 


3574 


17475 


A 


3597 


2 


293 


CQLS P * ELGTPRERERERERERERERER 
ERERLPPRDLREKRAQDVFATHPNYEFV 
CVI RHYRGCWYTPLTTAHS IDAHKMCV 
CVKKRPLYKKKKK 


3575 


17476 


A 


3598 


243 


9 


ILLRQEKQCWSKGTRFQLQDEQVLNT*D 
RTWEVMWPGAVTHACNPSTLGGRGGRIM 
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C=Cysteine, D=Aspa rtic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
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Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
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RSRDRDS CR I HQGGSKP KS PR 


3576 


17477 


A 


3599 


1 


329 


PSARRDWKNIPKLPPPRGKNPTGFFFLK 
PGGPQRGPGKKGGKNF*SFGKPF*KGGG 
PPQFCPPPPRGGFFFFFPGGPPGEKKGG 
VKGPPPPFFFFWPPPPPNLFFPPGA 


3577 


17478 


A 


3600 


206 


1 


LRALG I MERAAHTE VRLVF I Q I LGS PL * 
YIF*RQDLSPSPRLECSGTIAARCSLDL 
LSASNSPASASQ 


3578 


17479 


A 


3601 


322 


99 


KRTGAFGFFFPPLFNKKKKVFLGPEGPK 
I VKKKKKKEKKEKQNKNRKQTKKK* KTN 
NKQNKKRETDIFGVEDET 


3579 


17480 


A 


3602 


410 


25 


LFFFKKKTGGGGGGPPF FFHPFGGVNGG 
VIRVGELSPPCPPGGTRFFFKKTKNYPG 
G*GGPFFPPPSGGGLGPKKTLSPRGGAS 
L*PNFGPSLPPKGKGGFPFSKKKKKKKK 
QSRPRDS*SLRRGIRK 


3580 


17481 


A 


3603 


27 


349 


RSSIQCGGILLFYFLYFMVFNDIFVVL* 
GFSAITSlLISMLFSCI*YYFFKMYCIiL 
FNFRCHWI FYILF * MKT IAQ IGN* I FFR 
DAYLGYFFLFLFFFFFFGGKIiYL 


3581 


17482 


A 


3604 


88 


435 


RAISTCLQNEQYKWLTTITSVEKSLNDL 
TELKTMV*ELHDKCTSFSSRFIQLEERI 
SVTEDQMNEMNNPPLIHGFI FYGSSYSW 
STMVR KQDKLRVKDAG I KHRNQNKG FTA 
EKSA 


3582 


17483 


A 


3605 


331 


428 


YGHYGQIEATFLPAITTEKETLKTRKLT 
* SSI NLLK* T * NVRPGTVGHACNLSTLG 
GRGGQIMRSGIRDQPNST 


3583 


17484 


A 


3606 


3 


294 


GG*GCSEL*SCHCTPSSLG*NTVRKKER 
KKGKREREKERKREIIPGFGPCAHHKGA 
VTLGNLFPRVQVSQLVQRRKI IITVIPD 
RLGE I VLVSQSL 


3584 


17485 


A 


3607 


263 


426 


RKLAGTFPFLWKSGPFQTILAAAQRMGG 
NDPKETRVFHAGLSSSLGENK*VWKI 


3585 


17486 


A 


3608 


213 


439 


LSWACAAEWTVTQPANFPLDYILPFFLL 
IRESSKF*RHMATQLETAASRPGVWHA 
CNP I TLGGRGGR I TRSGVH 


3586 


17487 


A 


3609 


2 


441 


MTDLESFPLESELSANSPVQCSGLCKPV 
LTI FQVI SHKGHCHSKYYLLNANYNVFL 
HLSKSVLPRLLSGQRTETQKHFYTAVLS 
VCLS I YLSRDRVLLCYPGWS AVM* F YSL 
*PQIHGLKQSSCLSLPKC*DYQHGISPP 
WCN 


3587 


17488 


A 


3610 


102 


354 


E PGAVJADRC WLGRVLLTHLLW P CCLY F F 
YFIIFILNSGDRVSLCCPRWSCLK*SSC 
LSLPECWDYRCQPLHPACAVFIENSKR 


3588 


17489 


A 


3611 


462 


371 


GLNFLTL* SARLGLPKCWDYRHEPPRPA 
WS 


3589 


17490 


A 


3612 


463 


319 


RNSHDGLSLLTL* SAHLGLPKCWDYRCE 
PPRPVLC 


3590 


1749 J 


A 


36)3 


49 


476 


PRN S I TLL I LGLLTNTLTLYQGW FDVTR 
ESTYQGHHTPPVQKGLRYGI ILFITSEV 
FFFAGLF* 3FYHSRLAPTPQLGGHWPPT 
GITPLNPLEVPLLNTSVLLASGVSIT*A 
HUSLI ENNRNQLMQALLITILLGLYFTL 
LQ 


3591 


17492 


A 


3614 


223 


467 


TPS * PLAI I * FI STliAETNRTPFDLAER 

ESKIiVSGFN I E YAAGPFALFF I AE YTNI 
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MI INTLTTTI FLGTTYDALS PELYTTYF 
VTKTLLLTCLFL* IRTAYPRFRYDQLIH 
LL* ENFLPLTLALLI * YVS IPITISSIP 
P 


3592 


17493 


A 


3615 


2 


405 


ARGLQEAGAVQNDPVSRLFDHAMLQGHR 
AHQMAIVTYQEFGETYIPKDQKY*FLHE 
SQTYFCFLNS I PTPS SMEETQHRSSLEL 
LRISLLLIESWLEPVRFIjRSMNANNLVY 
DTSDRDDYHLLKDLEEGIQTLM 


3593 


17494 


A 


3616 


328 


63 


PSKF*KPCLISPPPLGPGPPN*TPGFLK 
KPKFPPRGGAHP * S PLLFRVRPGDPLNP 
RGQGFP*PKLPPGPPTRATKPNFVSQKK 
KKKE 


3594 


17495 


A 


3617 


1 


379 


GTSVGIPTGGQEFS * FAALHGSNVK* SA 
AAL*ALRIMILFTVRGLTGIGLANSSLD 
I VLHDTDYVEAHFHYVLS I GAVFAI IGG 
FIH*FPLFLGYTLDQTYAKSHFTIIFIG 
VSVTLFPQHFLGLC 


3595 


17496 


A 


3618 


494 


17 


GSANPRNPRRSGSKTRETS PRNVMSLMS 
PMSPTLPMRPTSPGTPMHLTNSMGPMQP 
PNSSRPVNPASSATPVSPASPARPRKPS 
CFPAPQ* * SPPL* S PEPRHGLSCLSWVK 
KECETWHWSMHIVEDRHSKPYHGPGMV 
AHACNPSTLGG 


3596 


17497 


A 


3619 


350 


480 


LGS WDYVCKPPHLAVLLL* RQSFVLLPR 
LVTNS *AQVIHLPWP 


3597 


17498 


A 


3620 


2 


400 


AAAE PHGC YAKVLDS FKPLG EE P * NL I K 
QNCELFEQLGEYKFQNALLGRYTKKVPQ 
VSTPTLVEVS *NLGKVGSKCCKHPEAKR 
MP*AEDYLSWLNQLCVMHEKTPVSDRV 
TKLLTESLVNRRPCFSALEVK 


3598 


17499 


A 


3621 


3 


476 


LGYSVHPPQPLAQViiSPPLPGASRAGWL 
LRMRGPPSPHPPRTPAGLQAPHAALVPA 
RASPSTPPCKLREWAPALASPERGSHSA 
VGG*RAPQMPPKWEPRQGRCQEQARALR 
TASMLSPLSFTEPVTLSLLNFPTSLPPR 
N5PSWPWSVR 


3599 


17500 


A 


3622 


40 


390 


MKGALTQPGRAGGLGSQSQMDMASTPKP 
SPDDQGCSIHLWGFPFLVSLLPPSLSPS 
PPSPPRHSVGTHCAP*VLPRPLGALRVP 
HSSVPRPHAVPWGTTYRYSRGAWPLSL 
DFRGL 


3600 


17501 


A 


3623 


52 


475 


AVEFHLLFLQPLETTNLLSVCMDLPILA 
NSPK* NQ 1 1 CDLFVF FHLA * CF * DLFML 
EHVSVLHFMAE*FFFFFF*KGVSLFSPG 
WRAGAQTOLTKTPPHSPARLKQFSCLTL 
PGSWNYRRAPPSPANFFVFSGKRGFTIL 
P 


3601 


17502 


A 


3624 


333 


466 


SIIGKIRLWKMRPWPGAVAQACDPGTLR 
G*GKRITRSGVRDRPG 


3602 


17503 


A 


3625 


2 


138 


LRRGNRLNPGGGGCGEP* SRHCTPASST 
E * DS VS KNQKTKNS QWH 


3603 


17504 


A 


3626 


3 


337 


LSLRPPALPPCPPRRLPTPGQGHPGSAG 
FPGRPTL* RLPPLPDPP PS LGLS PGDDP 
GIAREELKSPCPWGPHPLCHLPFP* CWA 
S SQPS INAYEWRKKKKKKKKKKKKKKK 


3604 


17505 


A 


3627 


2 


393 


LGTDhSLhlRFELGQPGNLLGNDHIYNV 
I VTAHAFV 1 1 FF I V I P 1 1 1 GGFGN * LFP 
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LIIGAPDMALPRINNISL*LLPPSLLLL 
I ASAI VEAGAGTG * TGYPPLAGNYSHPG 
ALVDLTI FYLHLARVCS I 


3605 


17506 


A 


3628 


2 


379 


QDATYPI I EELI TFHEHALI IICLI CFL 
ALDALFLTLTTKLTNTN I SDAQE I ETV* 
TMLPAIILILIALPCLRILYITDEGNDP 
SLTIK*IGHQWY*TYEYTDYGGLIFNSY 
I LP PLFLKPGDLRL 


3606 


17507 


A 


3629 


1 


400 • 


PTR P PTRPPTRPMTNTLNS IHPPLPR * K 
KKKKKKKKKKKKKKKKKKKKKTAAGGGA 
* KKKNRGAKTHRGSKKINFFFKKRNKKN 
PPVKIEKKTFFLGGKKKNKPPQKNNALK 
KKKNYFEGEGESLLLL 


3607 


17508 


A 


3630 ' 


416 


1 


SQLGYSGVRDPLEEATCLFSDLKLHAGR 
TTPLFQAVRQGHLILQRFLQPFA*LCPA 
PRGGVYRGRQASLSCGGLHPVRASWPLC 
LPTQFS AMEGAPPLALLPRSSS I SDYCA 
SNEQGS VG IGP SE PCTGYNLL VCHLL 


3608 


17509 


A 


3631 


275 


83 


RWGSCYWLAGLQLLGSSDSPALASQSA 
GITGVSHCTQPKNVI * KI SHLTVAGPAA 
DKTPQTPR 


3609 


17510 


A 


3632 


186 


3 


FFFFFFLFFFFFFFFFFFFFFFFLFFFF 
F FFP F F P FFFLVCQVPLLCNL * HLVYLF 
YMVL 


3610 


17511 


A 


3633 


3 


235 


GI ISFLLIR+ * YARADANTAAIRAILYH 
RIGDIGFILALA* FILHSNS *DPQQIAI 
LPHPSDSKKYLYLSHIQLKKA 


3611 


17512 


A 


3634 


1 


428 


RTLKTTLFNPDGG * DPIL YQHLF* FFGH 
PEVYIIILPRFR* ISHIVTYYSGKNEPF 
GYISMV*AMISLGCLGVIA*AHHIFTV* 
I D VDTRAY LT FAT I MI AI P TAVRALS * L 
APLHRSN I K* SAAKL* ALRLI FLFTVGG 
QT 


3612 


17513 


A 


3635 


2 


129 


PVIYSTI FAGTLITALSSH* FFT*VGLE 
INMLAFIPVLGFWD 


3613 


17514 


A 


3636 


2 


422 


DR*LFSTNHKDIGTLYLLYGA*AWLGT 
ALNLLIRAELGQPGNLLGNDHIYNVIVT 
AHAFG 1 1 L FI VI P 1 1 1 GGFGN * LDPL 1 1 
GAPDMAFPRINNISC*LLPPSLLLLLAS 
AIMEAGARTG*TAYPPLAGNYSHPGASV 


3614 


17515 


A 


3637 


14 


479 


QNCKINNFPMYFLPHQSEFPRKSCKQQN 
T I CFRLSNQNTMKKTTENEALLW I RKWQ 
T* KQQCKSNSGYTPSTPSRAKKKKKKKK 
KKKKKKKKKKKKKKKKKKKKKKKPGGGV 
LKKL 


3615 


17516 


A 


3638 


2 


449 


III ICFVGLCALVLTLTTKLTNANILDA 
QEVETG*AILPAIMLVIIALPCLRILYI 
TDEVNDPSLTIK* IGHQWY*AYEYTDYG 
GLMFNAYILPPLFLKPGDLRLLDGDNRV 
VLPNEAPIRI VMTSQDVMHS * AVRTLSL 
RTDAIPGRL 


3616 


17517 


A 


3639 


347 


98 


HFFLGGGKGGGFS PFP PQKFFFFLRGFF 
FGGGGGPNFPPPKKGFFSKKPQGVF*TP 
PKKKKKFFFFPRGFWGPPGI FFKGPPP 


3617 


17518 


A 


3640 


2 


426 


DR*LFSTNHKDIVTLYLLFGA*AGGLCT 
ALSLLIRAELGQPGNLLGNDHIYNVIVT 
AHAFVI I FFIG I P 1 1 IGGCGN* LVPLI I 
GAPDMAFPRINN I S F * LLP P SLLLLLAF 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Pro!ine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Ty rosin e, 
X=Unknown, *=Stop codon, /=possib!e 
nucleotide deletion, \=possible 
nucleotide insertion 














AI VEAGAGTG* TVYPPLAGNYSHPGACA 
D 


3618 


17519 


A 


3641 


94 


294 


GRGGGGVAGGRGAMWSKGRQRGRNGPVR 
GREGGRKSHFRKEG*KEGRKEPPPPPKK 
KSRGEKNQREG 


3619 


17520 


A 


3642 


1 


405 


RTRGFDLERFVLFDVAQQMVESGYVCEG 
DHKTMANAI IDRVSLIKRKREQRQLVRE 
EQEKKKQEE S S LKQQVEQS S AS QTG I KQ 
LPFADTGIPTASTTSASVSTQVEPEEPE 
AD*HEQLQYQQPSLSVLSDGTVD 


3620 


17521 


A 


3643 


34 


405 


HLLGRKQNKAI FLEAEKKKKKKKKKKKG 
GPLKKNPGGAKNNRGGEKKIFSP*GGVK 
KTPLG I FEKKPY FGGGKKWANP PKKI KT 
LKE KKKF * RG KGGKKPEN P WG E KDE S S S 
PK 


3621 


17522 


A 


3644 


430 


81 


EFAQLSPPQIKGFPSPFPNFLPPSLFGF 
FSPFSP*KFFFSLKGFIFVGGFFPFFSP 
PKKSFFSKNPQLVFISPPFKKKI FLFPP 
PLNFGPPRVFFKRPPPFFFFFFF 


3622 


17523 


A 


3645 


399 


2 


AFFFTKKKKEFFPPFPFKNFFFFSRVFF 
FLGGF FH FF P PQ KK I FFLK I PRGFFLTP 
PKKKKI FFFFPR* FLAFPGFFL*APPPF 
FFFFFFFFFFFFFFFFFFS PCSWSCKQV 
FPHASNRIHNPSNSYPLQQY 


3623 


17524 


A 


3646 


49 


326 


KKKKKKKKKKRGGPFKKTPGGAQKNPGG 
EKKF F FL * GGHKKP PRG I FKKK PFFGGG 
KNWTTPPQKNKAFGGKKKFLGGKGEKPP 
PKPRVKKN 


3624 


17525 


A 


3647 


327 


94 


QKKKKKF FP P P P P KNFF F S PRG FFFGGG 
GGPFSPPQKKGFFSKNPGGVFLPPPKKK 
KIFFFPPGGFGAPPGFF* RGPP 


3625 


17526 


A 


3648 


2 


301 


TSSVASTFIMSLFPTTIFMCLDQEGI IW 
N*H*ATTQTTQLYLSFKLDYFSIIFIPV 
ALFVTWPIIEVSL*YINSDPNINQFFKY 
LLIFLITILILVTGC 


3626 


17527 


A 


3649 


2 


406 


STNHKD I GTL YLLFGA* AG I LGTALNLL 
IRAELGQPGNLVGNDHIYNVIVTAHAFG 
IIFFIVIPIIIGGFGN*LVPLIIGAPDM 
EFARINNISF* LLPPSLLLLLGCAIVEA 
GAGTG * TVYP PLAGNYSHPGACV 


3627 


17528 


A 


3650 


3 


418 


HAYADAWVHFKS ITLLILGLLTNTLTI Y 
Q*WRDVSRESTYQGHHTPPVQKGLRYGI 
ILFITSEVFFFAGFF*AFYHSSLAPTPQ 
LGGHWPPTG I TPLWTLEVPLLWTS VLIiA 
SGVS IT*AHHSLIENNRNQI IQALLI 


3628 


17529 


A 


3651 


526 


2 


' WGGGEKGRAGGAGEETPHQESQRQGCAL 
PQEPPLYPPCPAIHPPPLPLGGLLLFPS 
LPPLP*AAWFPHPGQPGRNLNGESPTTR 
EKRLWGPPLPTPLPWPPTQPTPLSRRTG 
CPPCPREVPTPSPPWAEPPPLPGPPSEI 
AHLTTVRRG AALANVNHEN I S DTDENKL 
NAFVTA 


3629 


17530 


A 


3652 


464 


297 


RNSFHRVSQDDLDLLTL*SPCbGLPKCW 
DYRCEPLLPASYIDVS 


3630 


17531 


A 


3653 


162 


467 


NSKKQQFVLLSE ITSYVYWLHKRDLDSH 
TI IMGDFNTPLSTLDRSARQKVNKDIQE 
LN* ALHQADL I D I YRTLHP KS TE YTFFS 
APHCTYNFLFQSYFSSQF 
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nucle- 
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ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
OGtycine, H-Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, MHMethionine, 
N=Asparagine, P^Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


3631 


17532 


A 


3654 


372 


1 


LMMLSIFSCVHLAFVCFLWRREMSGVSL 
CCPDYSGAAAVHRCNHTALQPQTPGLK* 
SSCLCKRHLQLRPQVYTTMLGYLFLFLF 
LRQS CSVAQAGVQWHDHSS LQP * TPGLK 
RSNS PTLV * DLD 


3632 


17533 


A 


3655 


444 


341 


VGQDGLLDLLTS * SACLGLPKCWDYRRE 
PPRLAY 


3633 


17534 


A 


3656 


466 


315 


VGHTGLELPTSSELPA*ASESAGITGVS 
HCTQQSGCAFC YSTMI PTKPYTC 


3634 


17535 


A 


3657 


40 


276 


EFDTAHAFVIIFFIVIPIIIGGFGN*LV 
PLIIGAPDMAFPRINNISF*LLPPSLLL 
LLASAIVEKKKNEKKKKTGAPLV 


3635 


17536 


A 


3658 


351 


470 


FTTLPVTSASLPLKTLAWPGMVAHACNP 
STLGG*GGRIT 


3636 


17537 


A 


3659 


40 


182 


GIRKENHLNLGGRGRIEPGSHHCTPAWA 
TK*DSVKIKKTRIQKTPLI 


3637 


17538 


A 


3660 


160 


461 


INGKDTDTPGMVAHAYNTSTLGGGGGGI 
A*GQEFQPSLRN 


3638 


17539 


A 


3661 


145 


437 


YFSLSLCLLELMNSVILFFFLVLERGFY 
PPPLVEMQGPNFG* LNPPPPGLKQFFCL 
TLPKSWWNRPPPPRGGNFLDF* KKRGFN 
YVGRVDLKLRTSG 


3639 


17540 


A 


3662 


255 


451 


LIiSFNPFFFFFFFGKGGPPSLP PGGKGG 
GLNFI*TLGPQGKRNSPPQPPGEPG*RG 
PTPHPGLFF 


3640 


17541 


A 


3663 


185 


66 


P£jEQPIiRK*LYLFRDGVIjLCHPDWSAMM 
QSRLMATSAEF 


3641 


17542 


A 


3664 


42 


193 


FQLLTRLRQENHLNAGGTSCSEPRSHHC 
TPA+VTETQSQKQKVKVLGFSL 


3642 


17543 


A 


3665 


216 


2 


KS KGKCS FLHSRVNRGNSKELLFLRDG V 
WRYCPAWS * TPGLKQS PHFNIP KCWDYR 
REEFHHPGLWYEFE 


3643 


17544 


A 


3666 


43 


267 


GRLWSAMTPGKLKTLCKIDWPALEVGWP 
LEGSLDRSLVSKVWHKVTYKPRNPDQFP 
YRDT * LELVLDP PP P THS G 


3644 


17545 


A 


3667 


212 


2 


SQLLGRAEARGWLEPRSLRKINKTSQVW 
WYTP VLPS S CDYRHTPPRPANFC I FSRD . 
GVSPC*PGWSRTPD 


3645 


17546 


A 


3668 


104 


334 


TVLFPPCFLLHIDFCMVLDFFSNFFVLN 
YF*NYRRVQKMYEEFPFFLFFFLI FSFS 
FLFFLSTFNFFLCPLCDLWYH 


3646 


17547 


A 


3669 


369 


260 


DGLDLLTLi* STHLSLPKCWDYRHEP PCP 
ANMPSFKN 


3647 


17548 


A 


3670 


111 


342 


NGVSFFPPKKGGIWANGNFRFPPKPGG* 
RNFLA* P PQKFGMAGPPP PPQP VG I AGP 
PPHARBILEFFFFFFKKXGV 


3648 


17549 


A 


3671 


377 


3 


FAIIGGFIH* FPVFPGYALAQTYAKIHF 
TIIFIGVNLAFFPQHFLGLSGMRRRYSD 
YPDAFTT*NILSSVGSFISLTAVILIIF 
MI * EAFASKRKVLI VEAPS INLE* LYGC 
PPPSHTLEDPVY 


3649 


17550 


A 


3672 


39 


348 


AKIAPLNSSLDERMRVSKKKKKGHGLTP 
KP FLKKKLGKNFPLS QLEGVGKKNQKPF 
GFKKGLELAPFNGKGNGENPNTWRWG*T 
MSFFFFF*DSHPFIQAGVQWRDLGSLQP 
RPPRFKRF 


3650 


17551 


A 


3673 


327 


3 


SKDPVFFKVFKKNPPTPNIGVFPFALPL 
KRGQFQPLFRPKGFFFFFFKP VQLGRG K 
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Amino acid sequence (A-Alanine 
C^Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Pheny la Ian ine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 














ISPQFFFKKRFWG*TKSFFFFF*DSHPF 
IQAGVQWRDLGSLQPRPPRFKRF 


3651 


17552 


A 


3674 


379 


267 


EPRLSP CTPAWATE* DSISKQNKTKKPK 
LALNKKKMF 


3652 


17553 


A 


3675 


2 


270 


FCRDRVSLYCPVRS*TPGFKGSSCLDLP 
SSWDYRHAPPGLAKIGGLHVFCLPCCGS 
LLVCWS SHCSLAHQELLLENKHTDVWMG 
GWMDR 


3653 


17554 


A 


3676 


3 


54 


P VYKNKNKRPDAVAHTCNPS TLGGRGGQ 
ITRSGDRDHPG*HL 


3654 


17555 


A 


3677 


291 


30 


YSLVLLI FFFYCFS I S YFI SLMNFCHNL 
Y Y FLLLD FGLVCS YF P S I LWHNE I Y F I G 
SMFFIFSVIV*IFFCWEMSCNIYKLHQE 
MF 


3655 


17556 


A 


3678 


1 


298 


RDFSASASQVAGTTGMHRHAQLI FVFRV 
CVCVCVCVCERERERERDGVLIiCRPDCC 
SVAKT*VQGRDLGSLQPPPPGFKRFFCL 
KLPS S WGHRCWDAP 


3656 


17557 


A 


3679 


20 


376 


VPPPHLANFLNFILLFVDTGSYYVAWAG 
FKLLAPCNLSASNS * SVGITGVSHHALL 
LLALKCNQLiFQFVIFILLHAVCIYDFIM 
VRFSLQCFPQFYDFPFCFILFVSSKAQF 
CFLRLPY 


3657 


17558 


A 


3680 


110 


334 


NTTDYFKLP* ILSGTR*DMYKQPL*HLN 
T VRE YTS K YG WRPG WAHACNPS TUGGQ 
GERITRSRDQDQPCQHGET 


3658 


17559 


A 


3681 


326 


531 


LTMMLQMKXSVICCLLRNIFWPGALAHA 
CNP S TLGS * GGR I ARSRDLENS PSLLGS 
QPSS 


3659 


17560 


A 


3682 


448 


248 


AKDLNTFFTKE*TPLANKHMKKGSARFT 
YRNVQIQTT*RCLGTPTRVTGV* MRNSA 
RHR*GCGGLV 


3660 


17561 


A 


3683 


486 


176 


PAPINPPQLRSLVQRSGIRFC*AGQVGF 
KLLVSSDPPTLASQSAGITGVSHQTWP* 
NYS*NSGMPTHPRQASSWRLREQWYHVS 
LLPCAYYKNGRQNSLLI 


3661 


17562 


A 


3684 


238 


27 


NKHILCYVGQEMGMGWGSHSHGWKPHFK 
S FQVCPSAVAHACNPSTLGG*GGQITIiV 
QPGKHGKRRIPALF 


3662 


17563 


A 


3685 


1 


299 


LITQTCHVGCLKIKCI *TSKKKKKKKKK 
KKKKKARGPLKKKGPLKTPEKPPGGPFK 
APPFWGGPPPRFFFGGGGAPPPVFLKKK 
KKNS PLWGGGGFFFR 


3663 


17564 


A 


3686 


494 


91 


PLGKRFPGGDI ALTKLSRPWAWAPPGI P 
PPIiGGPGGGVP*VTPFKPTPFRPGEPPL 
PLKNQTFFGVPGPPPYFPPLGGLGPEIP 
FTPGGPGSIKLAPPPSPPPGAPKRNFFP 
KKKKEVMKMLCFHFNLLVQNI F 


3664 


17565 


A 


3687 


17 


249 


YSRFGRHFGHDPGLYFQI LRRLRWENHL 
NPGGRGCS KL* SKHCTPAWATE * DP VS * 
KKNVTTIAKRWKQLTCPLADG 


3665 


17566 


A 


3688 


296 


461 


FWLWVSSVLKHMNWPGWTHACNPNTLG 
GQGE * SA* ARV* DQPG*HGETCSAKEK 


3666 


17567 


A 


3689 


149 


422 


ASGRQGPDRGCWGGQAPRHPARGAKPAA 
DMGMSGQGYTKQAAP * GDPSQRAGVRLP 
REPTSRKASGGSGGHSRASGGAVPGQRV 
AQPSPAR 


3667 


17568 


A 


3690 


1 


415 


GDPAGGGDPILYQHLF*?FGHPEVYILI 
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C=Cysteine, D=Aspartic Acid, 
E=G\utnmic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, LF=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R^Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LPGFGI ISHIVTYYSGKKEPFGYIGMV* 
AMISIGFLRFIE*AHHIFTVGIDVDTRA 
YFTS AT IIIAI PTGVKVFS * LATLHGSN 
MK*SAAVL*AL*FIFLFTVSGLTGIV 


3668 


17569 


A 


3691 




393 


GTRKYS FLHDSQTS FCLSNS I PTPSNME 
ETQQKSNLELLRISLLLIESWLEPVRFL 
KSMFANNL VYDTSDSDDYHLLKDL * EG I 
QTLMGRL.EDGTRRTGHILKQTYSKFDTN 
S HNHDALVKN YGLL YCFR K 


3669 


17570 


A 


3692 


402 


2 


FLTQRWGAPLPI PPFWEAPFGPSP*APG 
FKPPLPPRGNPPFS*KTQIYPALGARPV 
IPPSLGGGGGESFLPPGGGFP * PRLFPC 
PPPRGAKPNFFFQKKKKKCLFCKAPQAQ 
HRSACLHECLPLGNLLKLCPR 


3670 


17571 


A 


3693 


3 


434 


I FYI FFGTTLI PTLAI ITR* GNQPERLN 
AGTYFLFYTLGGSLTLLIALIYTHNTLG 
SLNILLLTLTGQELSNS*ANWLI *LAYT 
IAFIVKIPLYGIiHL*LPKAHGEAPIAGS 
IVLAAVLLKLGGYGIIRLTLILNPLTKH 
IAYP 


3671 


17572 


A 


3694 


452 


208 


INKSARHGSMNL * SQLLRRLREVCLS PG 
VQGCIEPWSHHCTPAWETE*DKTLSQKQ 
TOKKTCQHLYDHRPSALEATFRLSH 


3672 


17573 


A 


3695 


464 


146 


LKQTRAT KGNL F L P * KPKLT P PGG FR P P 
FQPFWKPRPLGPPKSGVLDPPGPQGKTP 
FVLKKPNLLGGVI PTLFFPLSGGLGRKI 
PS PRGGGGAKKRK P PRQKKKKK 


3673 


17574 


A 


3696 


2 


465 


VTRLRERPALLVSSTSWTEDEDFSILLA 
ALEKFEQLTLDGHNLPSLVCVI TGKGPL 
REY YSRL I HQKHFQHI QVCTPWLEAEDY 
PLLLGSAEPGVCLHTSSSGLDLPMKWD 
MFGCRLPVCAVNFKCLHELVKHDENGLV 
FEDS * ELAAHLEKL 


3674 


17575 


A 


3697 


373 


51 


TFISWFGEQWFGCMEKFFSGDF*DFG 
APITQG*SFLSI 


3675 


1757 '6 


A 


3698 


380 


26 


KQRTNFFLFFFKGTGAFWFFFPPFFNKK 
KKVFLGPGGPKIVKKKKKKEKKENQNKN 
RKQTKKK* XTNNKQNKKRETD I FGVEDE 
TWRQRWGLGGGRVANSSAEMGKGNSTTR 
PSWNS 


3676 


17577 


A 


3699 


11 


414 


RL WTAI L I QTP * S FTGAV I L 1 1 AHGLT 
SSLLFCLANSQQKKKKKK 


3677 


17578 


A 


3700 


1 


401 


SSSLTLSNNNKSFLNQFVTFNEMWILYD 
NW* * PAQWLDQEAPKHFPKPNLDQKKVM 
FAVWWSAAGLIHYCFLTPGETITSEKYA 
QQFKEMNRKLQPLQSAMVNRKG P I LHDN 
A*SHITQLKLQKLN* LGYEVL 


3678 


17579 


A 


3701 


395 


140 


TPPKKKKKI FPPPPKI FPPPKKYKKPPP 
PFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFSIYIDFNLCICDNL*LQKTLVFL 
E 


3679 


17580 


A 


3702 


407 


371 


SFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFLWYKDI I IKGKNEQNV*CS 


3680 


17581 


A 


3703 


36 


409 


LLKCEGIWKKEGKICKKKRREKKGGPFFR 
VLLGGPTFRGGAHKKSLLFRGVFFNLGG 
FFFEGPFFLGGALLGPPPPPIFTPLGKK 
KI FKGKRGPPPLI FLFF* IRGPPPPPSP 
PSTKPPPPPPPTP 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine. 
G=Glycine, H=Histidine, I^soleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Pro)ine, 
Q=Glutamine, R^Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unkno\vn, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


3681 


17582 


A 


3704 


56 


415 


QAPWGKKKKKGKEKKKKKKKKXKKKKRG 
GPLKKNPGGGPTMGGGEKKYFS PWGGEK 
KKPGGNLGKKTPLGGGGNWAPPPPKKKG 
AGG KK K I * RGGGG KTPL I GGG KKNGG P P 
RTSPR 


3682 


17583 


A 


3705 


108 


1 


NLAK*IQ**IQTIMHHDQVSFIPGIQGW 
LNILKSIN 


3683 


17584 


A 


3706 


425 


52 


PFPH* * * b* KVDPI PLPPQNFLHPSNNI 
YFSPLSPKKFFFSLQPFFFFGE IFPIFP 
PPKKNFFS*NPNLVFISPPLKKKIFFSP 
PPPNFAPPKNLFKSPPPFFFFFFFFFFF 
FSGKNTLYWVDK 


3684 


17585 


A 


3707 


117 


421 


VLIiVIQAEYLLPEVFGTRSVSDFGFFRF 
WNICIILSG*ASLI*KSEI*NALKNISS 
FKKKFI FI LEPRSHSVTQAGVQWQDHGS 
LQPQTSGLKLFSHLSLP 


3685 


17586 


A 


3708 


308 


3 


RSPPWFFFGFFFFVLFFFFFFVFQLFFS 
KYN I TLY I KKKTET FTETNDNGNAT CQN 
LWDTAKAI PRE KF I AI STY I * KVGKHQ I 
NNLMKHLKEIEKQKQTK 


3686 


17587 


A 


3709 


94 


420 


KKKKKKKKKGGGAFKKKPGGAKI *RGKE 
KKNFFLKGGGKKKKVGNFGKKPYFGGGK 
KREKTPKKN*SLKGXKKFLKGKGGKTPP 
KPWA*KKWSS 


3687 


17588 


A 


3710 


413 


184 


LGECLSLRSKKLWSGPAGGGVCL* FQQV 
RRR KWEDHLSPG I RG* RALSSHRS I PAW 
TAEQDPVSRGLWGLPGGSW 


3688 


175S9 


A 


3711 


3 


104 


GGCSELRSCHCTPAWRQS * TSSQRKKKK 
KKKMY 


3689 


17590 


A 


3712 


2 


393 


GLFSTNHKDIGTLYLLFGA*AGVLGTAL 
SLLIRAELGQPGSLLGNDKIYNVIVTAH 
AFVIIFFIGIPIIIGGCGN*LVPIjIIGA 
PDMAF PRINN I SL* LLPPSLLLLLASAI 
VEAG AGTG * TVY P P LAGN 


3690 


17591 


A 


3713 


424 


1 


FFFFFFLRRSLTWQAGVQWRNLGSL*A 
RPRV 


3691 


17592 


A 


3714 


2 


130 


GRVGGRVG*NLKLYSQRLKKKKKKKKKK 
KKKKKKKKKKGGGPF 


3692 


17593 


A 


3715 


3 


197 


FHH VGQDGLNLLTL * S TRLG FP XC * D YR 
CEPPRPAAFFFFFLKKKKTLAPKKFKTL 
DLTTRERS 


3693 


17594 


A 


3716 


3 


497 


RSPPSGFAWPGPAGPWEAGAAAAGQPLG 
ARPHSPCTGGCRSAGTPP* PAGPAGPPP 
HAPAAAERKPRGAEQSQRRVQETGPPPT 
RAESNPSGLRGREVPGSAGCWSGSQESD 
FGAGCPAVT*GTYPYRIAGQVSSRAPHP 
AESRGCTAGLS IGPDGRQ 


3694 


17595 


A 


3717 


40 


379 


AITS PRS P PAVHGS CLARTHPKS PQCLP 
LSGTLLLPSSSLSLSHPATAAGE* IWDA 
EKKKKKKKKKKKKKKKKKKKKKKKKKKK 
SPGGALKKKNPGGPNPPGGGKNISSPPR 
G 


3695 


17596 


A 


3718 


445 


286 


KKKK*METRSCYVGQSGLELLASSDPAT 
LASQSVG I TGVSHQAQPVNKLPRVQ 


3696 


17597 


A 


3719 


15 


383 


GYFLEASCLLHPLLHPMYQNCGHVACLA 
AKENGKFMQAACGQLNS PALS LKNGRLS 
AAYGGSDCVYLFPMSKMVLGTQ*ALNKY 
TLNGPGA VARACNPS TLGG * GGRNLRSG 
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VGDQPGQHGET 


3697 


17598 


A 


3720 


14 


472 


RSCARNPGLTHASAQYQTFLFV* SVLIT 
AVLLLLSLPAIAAGITILLTDRNLNTTF 
FDPAGGGDPILYQHLF* FFGHPEAYILI 
LPGCAI I IHIVTYYSGKKEPFGYIGRV* 
AMIS IGYLG FIVRAHH I FTVG IDRDTRA 
YFTSATItlAIPT 


3698 


17599 


A 


3721 


3 


298 


eaai kyfltqatas 1 1 lliai lfnnils 
gq*titwt™qyssliiimaiaiklgia 
pfhf* vpevtqgtpltsgllllt* qkla 
pisiiyqispslnv 


3699 


17600 


A 


3722 


86 


505 


AGAVTYIVTYYSL* KKDSFWIHSVWC*A 
MRVNWES *GLSV* AHH I FTVGIDVDTRA 
YFTSATI I IAI PTGVKVFS * LATLHGSN 
MK*SAAVL*ALGFIFLFTAGGLTGIVLA 
NSSLDI VLHDTYYWAHFHYVLS IGAVL 


3700 


17601 


A 


3723 


432 


48 


FKTSFLPFLLPSFCPSFPSFSLFPFSFS 
CFLSCPSSLSFLFFFLLSFHPFFFPTLL 
ISF*VFLF 


3701 


17602 


A 


3724 


2 


406 


HH I FTER I DVDTRAYFTS AT 1 1 IAI PTG 
VKVFS * IiATLHGSNMK* SAAAL* ALGI I 
FLFTVSGLTGIVLANSSLDIGLHDTHYV 
VAHLHYVLS IGAAFAI IGGFIH* FPLFS 
S YTLDQT Y AI IHFTMIFI GVNLT 


3702 


17603 


A 


3725 


174 


158 


KFFGGGFPHFFPPQKKGFFPKNPPRGFF 
TPPLLGKNFFFPPPVNLGPPRGFF*GP 


3703 


17604 


A 


3726 . 


2 


392 


LFSTNHNDMGTLYLLFGA*AGGLGTALS 
LLIRAELGQPGSLLGNDHMYNVIVTGHA 
FGIIFFIVIPIIIGGFGMRLGPLIIGAP 
DMARPRINNISC*VLPPSILLLLASAIA 
EGGAGTG * TAYP PLAGNY 


3704 


17605 


A 


3727 


48 


474 


KKKKKKKKKT KKK KKKKKKKKKTRG PP P 
KKTPAGPPPPPPREKKNPPPEGGQKKPP 
PAH*KKPPPPRGGQQPPPPPPQKNPPPE 
KQKTWGAEGGTP P PKP PRKKNGSQQ I AD 
PPSNAAKNRHPTQEPPPQSQCIPPPPTP 
PP 


3705 


17606 


A 


3728 


387 


53 


PPPKKGVLPSPPNFYTP?K*GVFSPLPP 
* KFFFS PKGL I FWGGGG PNFPPPKKKFF 
SKKPRGGFFFPP* KKKIIjISPPREKLGP 
PRIFLKGPPPFFFFFFFFFFFF 


3706 


17607 


A 


3729 


1 


397 


PTRPLQPLSPEFKRFSCRTSPDVAW*VS 
LPS PPPSGRS FFVFL * VESFCVWI KKRF 
TMLARLVLDS * PSDPPAWASQSAG I TGA 
S LRASLLYI Y * CFS I QLGLLPEFL IQ YV 
QLL I VY PRI HHLYLS VPHLH 


3707 


17608 


A 


3730 


407 


0 


TLKHTKKKKKKECKKKRGGLLKKTLGGPK 
*NGGRKKKI FFLKGGKKKQTGE I LKKKL 
I LGGEKKGKNPPKKKKPLGEKKNFKGEK 
GKKNLKMFG VI KLGTS S KKI IF 


3708 


17609 


A 


3731 


3 


200 


GGCSELRSCRCTLAWVTE*DSVSLKKKK 
KKKGPFKVLRGKGPSGNKREGAKKKI QG 
KPCPFWESF 


3709 


17610 


A 


3732 


3 


380 


TNHKDIGTLFLLFGA*AGVLGTALSLL»I 
RAELGQPGNLLGNDHIYNGIVTAHAFVI 
IFFIVIPIIIGGFGN*LVPLIIGAPDMA 
FPRINNISF*LLPPSLLLLLASAIVEAG 
AGTG*TSYPPLAGN 
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N=Asparagine, P=Pro!ine, 
Q=Glutamine, R^Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
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3710 


176J1 


A 


3733 


6 


394 


AMISIGFIX3VIG*AHHIFTV* IDVNARA 
YCTSAT 1 1 1 AI PTGGKVFS * LATLHG SN 
MK*SAAVL*ALRFIFLFTVSGLTGIVLA 
NSSLDIVLHDTYYVRAHFHYVLSIGAGF 
AIIGGFIH*FPLFSGYT 


3711 


17612 


A 


3734 


27 


238 


GI PGS SHASAHIG * I IAVLPHYLNI TI L 
NLTIDI ILTTTAFLLLKLNSSTTTLLLS 
RT*NKKKKKRGGPF 


3712 


17613 


A 


3735 


402 


249 


ASAHAS APLFV * S VLITAVLLLLSLPVL 
AAGI T I LLTDRNLNTTFFDP 


3713 


17614 


A 


3736 


100 


430 


VT W P KSDS YEE VKI KLE PRQTDLRALG F 
CLS FTGKKKK* KKKKKKKKKKKKKKKKD 
KRGG 


3714 


17615 


A 


3737 


158 


306 


FLSSVFSLVQGVLFSSLCLDRNLPDNMQ 
LWS E I FNK* LYLQELS I LG VY 


3715 


17616 


A 


3738 


279 


3 


PGHCFYNFNLNFYFQIFFQNVLANSGSL 
LFLYQTIiXLSFNIFYILYLIIQISIVFQ 
SVIL* FI FSNS YSWWYGFLVCLMTFEGK 
KQTCAHV 


3716 


17617 


A 


3739 


268 


388 


HFFFFEMEADS VAKAGV * WYDVGLQQPP 
PPGVKRYS*LSL 


3717 


17618 


A 


3740 


395 


153 


GWLMRFKERCGLQNIKV*GEAASADAEA 
AAS * PEDLAKIADEGS YT*QWIFYVGET 
AFYWKKMPSRTFIAREKSMPGFKL 


3718 


17619 


A 


3741 


271 


386 


TCPPPLKLWVYELGLb*RKNMRQGAVAH 
ACNP STLKG * GGR I S RSGDR 


3719 


17620 


A 


3742 


406 


190 


QLLKRLRQENPWRWGG*GCSKPRSPPRP 
QAWVTERDFVSKSKTKTNQPKQNGERKV 
FFWFFFLRKPKVQFEI 


3720 


17621 


A 


3743 


272 


408 


MGLQACANSPL+YVDIAIPCNNKGAHSV 
GLMWWMLAREVLRMRGT 


3721 


17622 


A 


3744 


211 


407 


KGKRNILFGPVKNIFCDTQFTSCVNFFR 
EEVSLDCPGWSQTPALK*SSCLGLPTCW 
D*RREQPRL 


3722 


17623 


A 


3745 


405 


189 


CVSQDGLDLLTS * STRLGLPKCWDYRKC 
WTNQLFVNGQKS LNS FVCYS FMVLDTTG 
FKEIVMLWHLVPIiF 


3723 


17624 


A 


3746 


120 


362 


LKA* E I S WLiNGT I QTHQEKPE VQNQVI L 
EENIVFLGPQDRIFECQAVTAELEPEKK 
AIVAGCSDSHLYPSTLGGQDGWIT 


3724 


17625 


A 


3747 


401 


233 


N SGGRGCS E PR S L P CTPAW VTDRDP I S K 
NKNKTNHQ*QKKCKR*TGKVI FMTRSIL 


3725 


17626 


A 


3748 


353 


109 


LGLANFCI FSRDGVFPCWPGWSRTPDLR 
* SAHLSLPKCWEYRA* ATMPGLLMCFLI 
KAliDFQGCVGRWGGGSWTDI KLCQM 


3726 


17627 


A 


3749 


412 


221 


HGLKECLVYPKWCQNSNISLQGIHFSEV 
AM*QRAHNHGIHWFYHDSFPRKIWINWA 
RRGGSCL 


3727 


17628 


A 


3750 


1 


355 


FHHVSQDGLDLLTS * S PRLGLPKCWDYR 
REPLRPAENTVYVHFSSWSFTNHSGKIN 
* TLDQWKEQLPGWL * ILRGVFFFNPSIW 
NS * QKKAS I YVFPLCWKRDI FLYHFPVS 
GTFRGF 


3728 


17629 


A 


3751 


423 


125 


QVCEKVLISLITEMQIKTTVRYHLTPVK 
MAFRQKTDDNKCWRGYGEKGT PVHCWWK 
CKWQPL*RTAWRFFK*RKIELPYDPAT 
PLLGI * PKERKSGQV 


3729 


17630 


A 


3752 


3 


171 


FQPIAFLYANSEQSGKE33TSIYSSYEE 
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G=Glycine, H=Histidine, ]=]so!eucine, 
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YLGINLTKEMKDL*KENYKTLMEKIELD 


3730 


17631 


A 


3753 


2 


247 


FHHVS * DGLNLLTS * SAACLGLPKCWDY 
RHE P RRPALTR FLFVFL YLQRLDHL I R I 
NSFFIMSVNGMFTFWLKLFNLIHA 


3731 


17632 


A 


3754 


3 


197 


GLELLTL*SACLGLPKCWDYRHDPLCPA 
QYLSFLYAINKLFSSTYLEIHNRLVCCI 
VTLLIYKP 


3732 


17633 


A 


3755 


2 


103 


VSQDGLHLLTL* STLLSLPKCWDYSCEP 
PRPAW 


3733 


17634 


A 


3756 


103 


2 


FRPGMVAHACNLSTLGG* GGGITRSGVQ 
DWPGQ 


3734 


17635 


A 


3757 


394 


244' 


ASTHPANFFCRNGVLLCCPGWSQMPGLK 
QSS*LSLPKC*DYRCEPPHSAS 


3735 


17636 


A 


3758 


341 


2 


AE IAPLHS SLGMKRDS VSKKKELKKLNK 
QKPNNPLKK*AKDMNRHFAKGIQGANEH 
KIKCSTSTNLREMQIKTTVRSLLI + VRM 
AIIKKSTNISaSf*ARREKGILIYTWWEC 
K 


3736 


17637 


A 


3759 


2 


322 


PPRPTRPLNNTINQMETICM YTKCYLI I 
AE * TLFPS AHG TL I K I GH I LGHKAS LN K 
FKR I K I I QS AF FKHNG I KLEMDKRK I TG 
KHPNTWKLRKHTSKPMSQEMFF 


3737 


17638 


A 


3760 


444 


422 


VRLPHRPSPPPDPRVRFHHVGQACLKLL 
TSSDPPTSASQSAGITVPSLQ*QP*SVR 
LP 


3738 


17639 


A 


3761 


266 


1 


GGGVPLGPVIPPLRPPQRVGFQALAPQP 
GPGSKIF* IFFGPQFKNRVFFKGRNLVF 
FPPLFSLLPFFFFF*DGVSLCHPGWSLE 
CSGM 


3739 


17640 


A 


3762 


70 


381 


HSITSYI*AFIWGFFLFCFVFFLIKEME 
SHYVAQADHELPASWAP PTS ASQYAG I S 
GGSHPTQHYFI FYWIFYFTVFCGGGFLR 
RSLAPS FFAQAGAQWRDLGS 


3740 


17641 


A 


3763 


229 


390 


DI P YS FCFFFFF WKGS FVFVPQAGGKGR 
KLG*LKPPPPGLRGFSCLTLPGSWEN 


3741 


17642 


A 


3764 


119 


1 


QTS CRPGAVAHT YISTPSTLGG * GVDHLRS 
GVRNQPDQHGE 


3742 


17643 


A 


3765 


400 


200 


ITAYCSLDLLQPHFLFFSRDKVSLCCPV 
WSRTPELK*SPCLSLPBCWD*S*ATASG 
LKHFIKSLPL 


3743 


17644 


A 


3766 


376 


44 


TQP I FLGPG FQAP PGPMG * PNFLF* EKK 
LVFFPKKPKTAPAGGGGPKFPLPRKPKQ 
KKGPHPGSQNFQKPRPKFPPRPPTWGKK 
REFFSQKKKKKKRERDVKITDLNTST 


3744 


17645 


A 


3767 


403 


3 


DHFFFKGPPSQKGEFPLFLKTFPSPLGG 
FG*ISLFPPPINFPPPPKKRIFPGPLSF 
FFFKKKKTYPAGKGLLFFPFPKKPGGGF 
KRWPGPPKKFFFPPPGFWACSPPFFFFF 
FFGAKVSLVAQAGARSRSRTS 


3745 


17646 


A 


3768 


407 


377 


FHHVS QDGLDLL TS * S AR LS LP KC * D YR 
HDPPRLARCIKNFCRGKRCNILRKSPLS 
LPPSAGELRAPFLDFLEDAQ 


3746 


17647 


A 


3769 


292 


391 


RPGWAHTCNPSTLGG*GGQITRSGVQD 
QTGQY 


3747 


17648 


A 


3770 


395 


198 


LDMGFHHVGQAGPEIiLTSSDLPASASQS 
AAITGMSHRAQPY*VISMCQQCYKHFIR 
IPFNLHNNV 


3748 


17649 


A 


377! 


1 


241 


RTRGRTRGRTRG LLS AIITGRYQKKPR IR 
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& 








TPQIFSTGTTKAEAGELLEPGGGACSEL 
RS CHCS PARATRVKLHVKKFKKI K* KHP 
R I RTPQ I FS TGTTKAEAGELLEPGGGAC 
SELRS CHCS PARATRVKLHVKKFKKI K 


3749 


17650 


A 


3772 


112 


406 


FFFFFFFFFFFFFFQGEGVPPI *KPPPP 
K*FMFPKKKLPPPPPPALKLFFFSPPPP 
FFFFF 


3750 


17651 


A 


3773 


24 


197 


I FS VEMGFCHVGQCDLDLLGSSNPPASV 
S * S AGVIGVSHHAWPLVKLFVKRNKLS Y 
V 


3751 


17652 


A 


3774 


85 


201 


VNFFHLI * FVNKRI KLEKKKKKKKKKKK 
KKKKINWKKKKK 


3752 


17653 


A 


3775 


1 


382 


WENHLSPGSRGSSEP*SHYCTPA*VTE* 
DS VS KKKKPNLQEAEKRLT LLLi I KKAI F 
KNRGS PCFSI KWAGFKKI KQGWFFVATR 
MGDRSRGTPNLGENFLKSFLSFSIKIQR 
FYYLGLKNSPSRNIY 


3753 


17654 


A 


3776 


1 


181 


IKMATVTKATYRFCVIPSKLPMLLFTKL 
ETKILKLI WNLERA* IPKAILS * KNKMG 
LSNS 


3754 


17655 


A 


3777 


70 


385 


LNKMQTKTFDSDTHTSICFTLHFINTKL 
QFIFLLAIYLLFVNFPFIFINFFEEPGF 
CFDDLSSCFLFY*CIP*FYYFFSAFFRL 
HL F FF YRFLS LLTLDLFS FL I 


3755 


17656 


A 


3778 


2 


406 


KDDRNTEHQVHPNASDQEDEAFDPYENF 
FEDSDSPTKSSSTEPSPHIHPVDIQMTI 
FHCADNFARQ Y I LAKLAKKKKKKKKKKK 
KKKKKDSSSPGGGPKKGAPPPPPRGEKN 
FFFFGPPLIFSGGGF*NGGGGGP 


3756 


17657 


A 


3779 


403 


176 


FLFPILFFFLRQSFTLVTQTGGQQRDLG 
SMQPPPPRFK* FNEGKGKRKLWGGA 


3757 


17658 


A 


3780 


3 


248 


GLNQTQLRKILAYS S ITHIG * I IAVLPY 
NPN I T I LNLT I Y I I LTTTACKKKKK KKK 
KKKKKKKKKKKKKKKKKKKKGGGGA 


3758 


17659 


A 


3781 


3 


330 


VDRLRTGVQDKPGKHSESPSLPKKKKKR 
* KI LP KRGGVYLNPHFFERLRGENNLTP 
GGEGQRETESPPCLPAGGKNKIFV*KIQ 
KKKRGGPFKGNKFFPRGGGKKI FF 


3759 


17660 


A 


3782 


399 


187 


FPPPAKGDFLPSHLFGLPPGFSPPPVFK 
PRPRNL I LGP P * KKFYLPPPRP * TWFL * 
NGPPPFFFFFFFLDF 


3760 


17661 


A 


3783 


392 


11 


KKLKVNLPDHPAVPPLGIFPKEMKAGI* 
RAICTLMFIAALLTIEK*WKQPNWLLVD 
K 


3761 


17662 


A 


3784 


324 


80 


PGPFFFFPPPAKRGFFPPPFFLVPPGFF 
PPPLF*TPPPDFFFLAPLKKFFFSPPPA 
LNFFFFKGPPPFFFFFFFFFFFFFF 


3762 


17663 


A 


3785 


354 


24 


PPNKIFFSPPPQKKYFPYKTPPPTFFFL 
SSFFFFFFFFFFFFFFFFFFFFLC*MVH 
FIVKYRKCKRGTRI I LHTPTLRQLVLCC 
LSDVLLGCC I F I YVTMCDLCAPKLG 


3763 


17664 


A 


3786 


3 


369 


S CRGLVWVY GEL LLHP CRGNQ I S RVMQK 
KKKKKFGAS PPREGAP PGEPP* RG P* TQ 
KEPPPLKK*APPPGPGGGRPPPMGLGGG 
PAPPPFFWPKAFFKKGPFSGQEFGFPLK 
GLFFWKFPF 


3764 


17665 


A 


3787 


1 


125 

i 


FRHRFSRDGLDLLIF*SARLGLPKCWDY 
RP.EPPRPAHS III 
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3765 


17666 


A 


3788 


•427 


251 


LTPKGRGCSEPR* CHCTPAWVTEGDSVS 
KKKQAKNRCSLRMKYERNEFTSSPC 


3766 


17667 


A 


3789 


250 


2 


GFPPVEKIKKNPFPIF*KKPGFFFFPNP 
FF* ISGGKGKIFFLGPPPFSFFSPFFF* 
DRVLLCHSGRSAVAQSQLTATSTFRR 


3767 


17668 


A 


3790 


412 


3 


GSTPI TTPFWGVGGALPPGGGVP PPPAP 
PGGTPFFFKKKKKKKFQNFPRPRCSRYN 
GGLGRKNFFTLEAKGSINPNFSPAPPPG 
GQNKTPFPKKKKKSR*MTLQSFPRPGTA 
PGAAM*PVSPPEPCGPPAPAI>STK 


3768 


17669 


A 


3791 


10 


187 


GCSEL*LIHCTPAWHSETLSQKKKGRG 
KGGFKKNGI PCFGKNKGGKRFGKSFGHF 
STG 


3769 


17670 


A 


3792 


1 


331 


FRVPLGRLSRGEHQQQ*QQQPPPPPPPP 
GPLRPLAGPSRKGSFKIRLSRLFRTKSC 
NGG S GGGDGTGKR P SG ELAAS AAS LTDM 
GGS AGRELDAGR* DRPGAGRQTS FSW 


3770 


17671 


A 


3793 


1 


431 


FRKGS ARCRAP KP AGADGVH WGHHRAS P 
QTAQAPPVTCSARGSWMDFSHASGPQDR 
DLHS PS PVCPRGWAPGTHS PTGASGA* G 
RTPVGGGK*VAGVQQRIHPGLAPG*LVL 
PGIAVHITKKPGSKGTITPGTVAHACNP 
NTLG 


3771 


17672 


A 


3794 


342 


48 


KRENMPGVAYNPSTLGGCNPS I VGGFCN 
RVCNPSALGGCNPSFLG3FCNHVCNPSP 
LGG*GGRISRSAAQDHPRQHGEKMSPQK 
IQKI S QACFARVG 


3772 


17673 


A 


3795 


2 


189 


LNPGGRGCSEPRSCHCTQAWATE*DSVS 
KKKKKKGSRFQNTI PSALPGFPVSGELI 
LGGPNL 


3773 


17674 


A 


3796 


1 


401 


LNPGCKGCSEPRSCHCTPVWVSE*DPIS 
QKKKKKKKKK 


3774 


17675 


A 


3797 


279 


3 


ILAKVYPMVNSFLEKIHYFLLVELCCFL 
QHVL YYLTFKTTETG SRSVAQAGVQWHD 
RSSL* PLPHGLKRSFFLSLPRSWDYMKT 
PPCLATE 


3775 


17676 


A 


3798 


229 


394 


VFVKSLLCQPGMVAHACNPSTLGG* GGR 
ITMSGD*DHPGKHGEVSTYNTKKKKGG 


3776 


17677 


A 


3799 


380 


206 


AGHM * S * LLRRLRQED P LS PG VQG C S E P 
*S*PCPPAQVTVRISLKKKRLHERDKSS 
AS 


3777 


17678 


A 


3800 


450 


3 


IYLLRGRHSLHYSRPSFYFSNYRGACSY 
MRPPQQATLLGRLNQGITREGPAIAHSP 
AVRPRLSLAHAHVRRLCLPRPGAAPEFP 
AFFLPCLLVAQAGVRWRDLGSLQPPPPR 
FKRFSCLSLPSSWDDRRPPPLSS * FAFL 
VEMGFRRV 


3778 


17679 , 


A 


3801 


301 


10 


KKQPPKKKKRCFPPLFPPPKGFKGSPPF 
FFLTPPPLKNLPWTLKALPQIFLRTFFF 
FFFERGS PS VTQAGVLWHDLGS LQP * TP 
GFKHMPSCPVLLC 


3779 


17680 


A 


3802 


420 


51 


SASSPPASFFFFPPPKKKIFSHPFFFFF 
PPFFFPPPFF*PPPPFF?FFPPKKKKIF 
PPPPQKI FFFLKPPPPFFFFFFFFFFFF 
FFFCGFVNCQFS VL I 


3780 


17681 


A 


3803 


253 


73 


QTTPS KR YS YVLNCCNN * I TTCKR I XLD 
PFLTLDLKIN*KSFADI»NVRAKTIDTIK 
KVKG 
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E=Glutamic Acid, F=Phenyla)anine, 
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3781 


17682 


A 


3804 


385 


55 


AGTRPVFAALAAAAPAPRLLPLAPHGSA 
ATGFP PP PCPKGEGLRGAPAAS SS P PLA 
HRSQTPNWVDCGESPIPDPQFPLL*NGP 
PALCLPQGLETVNVESLALCFTSHGN 


3782 


17683 


A 


3805 


191 


2 


QKHLKALNFLPRCVCVCVCVCVCVCVNV 
TF*ACKICTCSDFPGSPRKKYLEFGTKI 
INIVDFN 


3783 


17684 


A 


3806 


1 


367 


KLRQVI YLEDLPT I I DEGSYTKQW I FNV 
N*TAFYWKKMPLKTSVTREEKSMPGFKA 
SKDRLTVFLGANWSDFKLKPMPIYNSE 
NPKAIQNDAKSTLPMLYKRNNKACIAAH 
LFTAWFTEYF 


3784 


17685 


A 


3807 


3 


145 


LGPGGGGCSE PRS CHCTPAWATRGKLCL 
QEKKKKR* RLDPE AEVAVS RGRAI ALQ P 
GQQEGNSVSRKKKKKGRDKVSLCCPAGF 


3785 


17686 


A 


3808 


362 


86 


NPPFICSPPFFPPLWGPKQWPPWCEG* 
TPPGPPCSIPLPPKKKSKSPPGVWGPPF 
FPPPWGGGAPKFFLPPKWGFPQTQICPL 
PFGLGEKN 


3786 


17687 


A 


-3809 


245 


377 


FGYMPSFFETQSRSVTQAGVQWHDLGSL 
QPLPPGFK*FSCLGLP 


3787 


17688 


A 


3810 




268 


KRNFLNLIKRRYEKPIANFVFNSKIVNT 
FHLRSSTS*GYPLLPLLLKILKVLPNEI 
RQQKE IKGRKNKGKKTISVYS IMLSANT 
EKPK 


3788 


17689 


A 


3811 


2 


388 


SAENCLNPGGRGCGDLRWCHCTPAWVTE 
RDCVSKTNKKETEKYMRN*TGNMLTI ID 
AGRRYMRCHQTIFTFFFFWRGSLIFLAP 
PPSQGEGI FLPHPPEKWKKKVRITTPTY 
FCFLR7DGFLPCGPGGF 


3789 


17690 


A 


3812 


292 


136 


PPT V IARXGKPR PKLS SEPWQFS AEQLG 
LPSFLCFFV-CVRCCCLLLFLLLFCRDEV 
S I CCPGWS * TPGLKQSSHLS FPNCWDCR 
CEPLCRAI F* FFCLCTLLLFAFVS AAVL 


3790 


17691 


A 


3813 


359 


52 | 


KKGRFFFFEKNPFGDLVPKKKKKKTPPP 
PNPKKKKRGAPPFFFFFFFFRH*VPLCC 
LGWFRTPGLKGSSCVGLPKPWDYRHKPL 
PLTKDDYYVYGLPFVSLQ 


3791 


17692 


A 


3814 


382 


1 j 


GGPGMEAPLGYHI KTPFPLKPQKNNPGG 
GAKP WPP PPGG * AGKMGLPPRQR FPLT 
QKGPPPPPLGGKREPPPQKKKKKRVTAM 
GDP*NSECR*GCHLRVQKEMRLPGLRNS 
PPHYIPNRNTDMCSC 


3792 


17693 


A 


3815 


472 


340 


TGFHRVSQDGLDLLNS * SAGLGLPKCWD 
YRRGLFI I I FNMGEWS 


3793 


17694 


A 


3816 


247 


378 


PPPPGGGXGXNLGLLKPLPPG*REIPPP 
PPRGGGNPGGAPPPPG 


3794 


17695 


A 


3817 


25 


465 


TPPLGG*GKRFPRGPGFSPPPPPKGNPP 
FF * KFKKKPRPGGGAP * FPPPGGGRPKN 
PLPPGPKGSH* PKFPPPPPPGETKRKPP 
PPKKKKKKRKKE KKKE I * AY KAY ICFDC 
PLTGPGDLKSDKTLAFPSVIGLPRPCTC 
LSPR 


3795 


17696 


A 


3818 


475 


147 


PPKG * RHPPGPPRFRRGPGNPPPHRFFF 
VLRTPG YS FP F* KKRGP PPTF FFTLPG P 
PGFLKGPPSPGGGGQKIPFPRGGGGPLK 
*IPPFPPPPGKKKDFFFLTKKKKKKP 


3796 


17697 


A 


3819 


461 


57 


TNQKKAA P P P LL KKK F F F FF K KMTGGAF 
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PGEKKKRGGGEKKKRGVYRKGPRAPSRG 
CAPPLYYRGGLFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFH VKNI LFVF* DY I VI IADL 
IQSLRHKDFQELLRR | 


3797 


17698 


A 


3820 


1 


418 


HAYH I * KPSP * PLAGALSTLRMTSALAM 
* FH FHS I TLL I LGIiLTTTLT I YQ * W CDG 
TPESTYQGHHTPPVQKGLRYGI ILFI7S 
QAFFFAGFF * AFYHSS LAPTPQLGGHWP 
STGITPLNPLEVPLLNTSVLLASGVSI 


3798 


17699 


A 


3821 


385 


1 


LQFPPFSPLLFGVQRPFFSSFPSCPWWG 
PVPSKLLPLLQILGRGFYPQKSLLVSLG 
FFFFFFLI IKXHRNEVLLCCPGWSRTPE 
LK*SSHLGTPKCSDYRCEPPHPPDPWDI 
CSYVPFLQSTHFGLNY 


3799 


17700 


A 


3822 


72 


424 


KKKKKKKKKFFPRKKGAPPAPPLFGKAG 
GEKLFPPRGLIFKKPKKAPGEKKKNLFK 
KKKKRGGP* KKKKLKAGGGEKKI FFKGA 
QKKNPGAGFKKGGRGKNRGAPQKKGGEG 
SSS 


3800 


17701 


A 


3823 


154 


2 


GHHTQHTEFQRLNNSYRYAELKHLSNTK 
NNPI*KWAKDMNRHFSKEDIQM 


3801 


17702 


A 


3824 


422 


212 


MGSSNPSPTASWVDEPTGICQHT*FYVE 
MGFCHVSKAGLGLLGLRDPPPSASQSAG 
VAGVSHCTRLWIIL 


3802 


17703 


A 


3825 


2 


417 


KRVNTRGFPPPPFFKTPPPKIFMGPQKK 
KNFSPPPPIKIFFF*KAPPSP 


3803 


17704 


A 


3826 


423 


65 


ACGRGKYFKSGGRVKPFCAVDFPPSGKK 
ANQPTPKNPRKKQXPSGRGMSPMGKPPG 
CPCPMPKASPGPVSFNKGGVAPATKDFC 
PIGKTA* IPS KKKKKKSDPG * VKKGKPN 
LYSIYNK 


3804 


17705 


A 


3827 


285 


136 


FFFLFFXFLFFFFFFFFFFFFFFFFFFF 
FFFFFFFFSGA*KIFIIATIHY 


3805 


17706 


A 


3828 


425 


316 


QDGLDLLTS*SARLSLPKSWDYRHEPPH 
PAQTFFKQ 


3806 


17707 


A 


3829 


3 


169 


LSNS * ANNLI * LAYTIAFI VKI PLYGLH 
L*LPKAHVEAP I AGS I VLAAVLLKLGG 


3807 


17708 


A 


3830 
• 


6 


452 


IGFCQGFAKDFCSKKFQRNVRVSPRGGE 
GRKFLLFKGGGFQKFPPLPPGWGAKTPF 
LKPPPKKKKKKERKSKKEKEKKRNPGPP 
VSPSSWSPVP*APSEQRSGPQLPPSRGD 
*WRLPSPPRPFRDCVRRSATRG^SPPC 
RGSQGP 


3808 


17709 


A 


3831 


45 


387 


NFGVKVFS*LATLHGSNMK*SAAVL*AL 
RYIFVFTVGKKKKNKKKKKKKKKKRGGT 
PPGGEKRKFFS PKKKKKKGFLKNPQPKK 
NFFWGKPQKKKGGKTPFWGGGKKAPKKT 
P 


3809 


17710 


A 


3832 


463 


288 


DGLDLLTS *SVRLGLPKCWDYRH2PPRP 
APCILMATFFQFEVLFSEFLETIVKSVIi 
LH 


3810 


17711 


A 


3833 


294 


90 


FFFLFFFFFFLFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFF*AGIYFI 
TCAPPPAYPRS 


3811 


17712 


A 


3834 


427 


37 


SYHFGRLRRADHEVRRL*NRKGRPGAVT 
HACNPITLGG*GGRIMRSED*DH 


3812 


17713 


A 


3835 


499 


53 


AKLMYFQS RExwVAKRS YRRI^QKASMVHP 
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K=Lysine, D=Leucine, M=Methionine, 
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NQEPAVIAGQGTIALEVLNQVX* LSS YH 
CRLLVGMYHGCLLVSSVTPFCNIFSGLK 
VH 


3813 


17714 


A 


3836 


304 


3 


RLKI TKTNNNQTE INS * Y I KDLNARAET 
IRLLEENIGGKPHDIRFAMIFWI*HLKK 
TKNIYKLDYTKILDFCISKDTINKVKR* 
PLEWEKIFANKISDKG 


3814 


17715 


A 


3837 


475 


135 


MEGLF C PAPMVH P F I PPPLGG * GGGS PG 
PGIQTPPGYPGEPPFFFTLKKLPRRGAV 
FFFPPPLGGLGPKKVFTPGAGASLEPGW 
APSPPPGGPP*NPFPKKKKKKRLKIGWH 
F 


3815 


17716 


A 


3838 


415 


245 


ETGFHHVGQDGLDLLTS * SACLSLPKCW 
DHRREPPHPDPQVSFIRALFPFMRAPLS 


3816 


17717 


A 


3839 


460 


413 


WDYI KLTGFCTAKETRVKRQPTHWDKTS 
AKYI SDKGLKTRLHKEQ* KMTNLI *NWV 
K*LGRYSH 


3817 


17718 


A 


3840 


2 


470 


GGDAPH FPDGVAARRRGS S L LRWGG CQA 
QGLLTSQTGWPGRDAPHLPDGVTAGPRR 
SPHLRRWAAGQTRSSLPRCDGGREEALL 
TS*VGWRPGRDAPHFPDWAARQRGSSHP 
RRWAARQRRS S L PR RGSG RAE AAI SNS A 
SLQRVSKVSVSCRPGG 


3818 


17719 


A 


3841 


54 


332 


QFGFHHVDQAAPELLTSDHLPSSASQSA 
GITGVSHRARPSLAHSFTKSLAI«*PTPA 
HSLIHWLTHSLTHSLSFPCSFSSPPSHT 
HSLARPLGV 


3819 


J 7720 


A 


3842 


208 


468 


ENPTPPPLWCFALGVL*EDGFSI>YSLSA 
HVCE I VFLYCPVLE SARPLMFLWNLAHV 
CYYEWHLFSVSLEFHTVS 


3820 


17721 


A 


3843 


406 


2 


PSSSSSSSSSPPAAFFLGCPGVFPSPPF 
liKPAPGFFFWGPKKKNFFSPPRGVKFFF 
FKGAPPLFFFFFFFFFF*DGVLliCHFGW 
SAAHAS 


3821 


17722 


A 


3844 


370 


253 


VGLLE PRGQGCS ELRSHHS T PAWATE * D 
PILKQNETEQK 


3822 


17723 


A 


3845 


369 


73 


LRQDNGMN PGGRAC S 2 PR S HHCT PAWAT 
E*DSISKKRKSQNSVFTESYFASEHCYS 
LTGGGVAILMEGPTGPDGAI FMCHLRNL 
CRCSPILPLALQCDH 


3823 


17724 


A 


3846 


198 


388 


FRKS INVRYHVKR I NLKNHT 1 1 P I DAQK 
P * EKLQQHLTMKYS 1 1 MNSRKLSQPALR 
NLF*KPS 


3824 


17725 


A 


3847 


391 


20 


GFPPPRVF*HGPANFFLGPPKKKFILPP 
PR**TWFLLKGPPLFFFFFFF*DGVSLC 
RPGWSAVSKN*QTNKKSI ILPIRGKLNL 
FVHSMLIKGFTCDLLIFKNRKVKESTKT 
IDHIFSRQLMA 


3825 


17726 


A 


3848 


227 


2 


KRRNPFHKAIAAIDSSDGPR*SKLKNFW 
RGFSIADAI KIFRDSWEVKI S I LTGFWK 
KWILTLMDDFEGFNRTRGR 


3826 


17727 


A 


3849 


373 


2 


SSPQKYWGGPGFFPPPGFKTPPPPFFFW 
APTKKKIFFGAGRENFFFLRGPPPPIIF 
FFFFFFLFFFFFFFFFFFFFFFK* YNSF 
I * DI KTHVFLLHS VG FRVLVYKTRLYFT 
LYHSFLKNELG 


3827 


17728 


A 


3850 


394 


2 


EACS VTQAGVQWRNLGSLQAPRRQS FCL 
RGS LLS PQH * EDRT * H I GGTQQTS MNE * 
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MNLYLQWLCPLLYQLCSTS YT* EVIQLN 
IAFCVCIjVGFFCFFVFLFFETASQSFA* 
AGVQWCHHHS LQPQLPKL 


3828 ' 


17729 


A 


3851 


3 


164 


NHLNLKGRGCSDLRLCHKTLAWATEGDS 
ISKTKTVQ+RKLQNNIFKEHTCKNL 


3829 


17730 


A 


3852 


35 


390 


NFVSNSFFFFFFGNGAPRGPRGGGQGPN 
LG*GEPPPPG* KQFSFLTQPRGGNNGGG 
PPPRWFGFLRKNGVPPGGPGGPQPPDL 
GTGPPPPPEGSPPPGPQFSKTKRTAPGQ 
PKGGPF 


3830 


17731 


A 


3853 


6 


437 


PNK* KTPGLPPPPFFYFFPPPPPFLSF1* 
KPFSFPPPPPPPLFFFLPPP 


3831 


17732 


A 


3854 


I 


236 


PTRPLRWLRQENRLNPEGQGCGEPRSCY 
YTPAWATE * DS I SKKKKGGGRLKGPTLT 
SGGGQGLFFLRGPQKSTSGAAF 


3832 


17733 


A 


3855 


52 


351 


TVGIMEPQTTDVYVKLHKCTDFLLPDYI 
FNKQHY PKKKKKKKKKKKKKKKKKKKKK 
KKKKKKKRTKQKKKKREKKKKRGGGAPQ 
*LTLSPPPPPLYYFFF 


3833 


17734 


A 


3856 


72 


420 


NT YG KKKKKKG KKKKKKKRGGRE KE K LW 
EAGGGGTTRGL* KENSGGNMGKPRIKKK 
GGKARGAPLGVSSTQLL*RGEGEKCLSP 
GGGGCSEHKTTPLAYPGLGKKKETQSGK 
EKK 


3834 


17735 


A 


3857 


410 


107 


SSSSSPLARYFLGVAPGFPLPTLFKTGP 
RIFFLGPP*KTFSLPTPGR*T*SP*KGP 
PFFFFCRDRVLLW* PGWSEMCGLKQSSC 
LCLQKCWRYRYKPPQLA 


3835 


17736 


A 


3858 


419 


239 


TTRSQLLLFLVEMG FHHAGQAGLKLLTL 
R* SApLPKCWDYRRESLYLADA* GFYNL 
EDPL 


3836 


17737 


A 


3859 


262 


1 


TAAQEFNFGGPIKKIFRCQPGEVKLDPF 
KRPPFFFFFFLFFFFLFFF*RRVLPCCP 
GWS * TPEFK*STCLGLPK*WDYRHEPPN 
PAS 


3837 


17738 


A 


3860 


357 


133 


NTPPPGAGRNKENPLFPKKKPRGVGFLF 
PPPRKAQTEEWIiYPGPK*FFFFPPPPPG 
GKNTKKKPPPQKKKKKKSL 


3838 


17739 


A 


3861 


253 


375 


TLSPQG*GGPPPPPPRGGGLEAPPPPPG 
LFFVFWGKRGVSP 


3839 


17740 


A 


3862 


3 


482 


YITDEVNDPSLTIKSIGHQWY*TYEYTD 
YGGLI FNS YILPPLFLEPGDVRFL 


3840 


17741 


A 


3863 


I 


129 


LANSNYERTHSR 1 1 ILSQGLQTLLPLIA 
F**LLASIiANLAKEG 


3841 


17742 


A 


3864 


289 


372 


NHLVKKW AKDLNRH FS * ED I Q I AHRHMR 


3842 


17743 


A 


3865 


266 


31 


KICFIFSPPQKKSVFFPKVRFPQKRKPP 
PALLFSAYFFFEAESSSVTQAGWQ* CN 
LGSPQPPPPGFKRFSCLSTLGG 


3843 


17744 


A 


3866 


1 


395 


NTSDFCIKPRAYNESEHHWDMVRRHL*G 
KEDNLTLDI SKLKEQI FETS KAQLNLVS 
E TEAMVKAVDSLTNLNP VTWVKT IGNST 
IANFVLILGCLASLLWYRYIQQLRRDS 
DQREGAMMTMAVLS KRKGG 


3844 


17745 


A 


3867 


137 


2 


APLIKKHECLRAVAPACNFSTLGGRGGE 
ITR*GV*DQPDQHGETP 


3845 


17746 


A 


3868 


1 

=6. 


108 


ENRLNPGGRRCS E PRS C YCT PAWATERD 
SMS*INNK 


3846 


17747 


A 


3869 


i r\o 


302 


LLiNINHNYGGV * GCSEh* SKHCTLAWAA 
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ASLHPKKKKKKRKKKSVKSQGAFYSHYA 
MAILNYXREKKLFLKWK*LLNINHNYGG 
VLEKKKP * S YTYNLKKSQNLPQLL 


3847 


17748 


A 


3870 


237 


419 


KE I QS KQGLRAI C t PMF I AAS LMIDKG * 
RQPKCPWREE* INKLWYKHLMAYHSALK 
RNEIL 


3848 


17749 


A 


3871 


239 


232 


KAFLHTRVKRGI QPGAVAHACNPSAL.GG 
RGGR I TRSGDRDNPA*HGFS VIGKCLG I E 
LVGHMFNFLRNCQTVFQSGYI I YILPMP 
LDS 


3849 


17750 


A 


3872 


417 


219 


CHVAQADLKLLSSSDPPASASQSTGIAG 
I SHRAWPRGTNP IHKGSTLMT * LPPQRP 
HLLTLSPRGF 


3850 


17751 


A 


3873 


45 


414 


NILFFFFFWKKSNLVPQGEGQGQSLG* W 
NPPPRGLRGFFNLTPPRTGNKGGDPPPQ 
ENFGF+ RKGGVTQGGGGGS KT PALGGPP 
GLTPQRGGNNGQDPRPPAKKFGLGKKKK 
NRQLPFLGFFR 


3851 


17752 


A 


3874 


278 


2 


LISIYVSIIYLYHSGFQLSKISPPQGQL 
AYLSIYLSVCHFLIYEIGVITMPASKID 
K*DLIKIKSFCTAKETIIRANR* PTEWE 
XIFAIYSS 


3852 


17753 


A 


3875 


233 


379 


QCDYFNELLRME I LGPGAVAHAC3STPNTL 
GGRSG*ITRSGV*DQPGQHGE 


3853 


17754 


A 


3876 


6 


247 


MDITSASYPEDLAKXIDEGGYNKSQIFN 
VDETALN * KKTPS KTFTARDKSMPGFKA 
S KBR L TLLLGAN F KL» KPMNDL I L I 


3854 


17755 


A 


3877 


411 


3 


AKI P PLTS SQGHRTRVCQKKRERERE KE 
KEEGRKERERERKKGRKKERKEERKGS * 
VRMLFLDKVDFRSQKI ITDKEGHYIMIR 
E * MHQ * D I ILNMY I SlSnSJRALKYMKQKLT 
ELIGRVGRPGRSGTVPGRRNGVQ 


3855 


17756 


A 


3878 


174 


375 


SFQQ*TAELEIKNTTSFTLTPPKLKYLG 
INLTKYVQDLYK 


3856 


17757 


A 


3879 


I 


106 


FHRVSRDGLNLLTS * SARLGLPKCWDYR 
RSPPSPA 


3857 


17758 


A 


3880 ! 


383 


24 


FALNNFPQGGAFSGPLTLS PQVGLFFAP 
GPPPPGGGFSCPLAQSFFRIFPQKSGPP 
ALSFSPLFKKNPWFFLRTQPLLPLFPQ 
PRVFFFPPFFFFF* DRVSLCCPG * SAVA 
QYGSLQP 


3858 


17759 


A 


3881 


400 


71 


RIRQENRLNPGGGCYSEPRSRPRTTAWV 
TG*DSISKNNWKNK*NLSFIYFYYYFLR 
WSLTLSPKL 


3859 


17760 


A 


3882 


282 


3 


IFNFCGYIVSVYIYEVQEIF*YKHAMCN 
NHIVENGVSIPSSIYSLCYKQSNYILLV 
ILKYAGWAQWLTPVIPALWEAEAGGSRS 
QEMETILA 


3860 


37761 


A 


3883 


1 


175 


LYKNFFKLAEHGGVYQILRRLRQEDCLS 
PGG*GCSEFKSCHCTPAWVTQQDPVSKK 
FF 


3861 


17762 


A 


3884 


415 


233 


SQDGLDLLTP* SARLGLPKCWHYRHEPP 
CPAANFFLKARYFWLMVARYFWLWCNYS 
ALPS 


3862 


17763 


A 


3885 


237 


367 


GP VLGPGMVARTCNPR I LGGRGR W I T* G 
REFETNLAMMAKPCL 


3863 


17764 


A 


3886 


359 


108 


TGQGGGGPPPPPLGRLKKKNGVNPEGKN 
SNKPKLAPSPPPGGKRKPPSKKKKKKKG 



787 



WO 01/64835 



PCT/US01/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
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G*HLPMLPRLGSNSWAHDMILPPYPPK 


3864 


17765 


A 


3887 


245 


377 


QF I KN I S WPGAVAY ACNPTTLRG * GG W I 
KRS RV* DHPDMVKPHL 


3865 


17766 


A 


3888 


406 


2 


SLNI FFFFFFFFFFFFFFFFFKPGRVFP 
LKNFFFFPFKGCRIPFPPIFPLWLKVFF 
GPTPL*NPRG* KEGFCPPPFFFFFFFFF 
*GRVLLCRPGWNAVAQSRLTATSAHAS 


3866 


17767 


A 


3889 


3 


114 


EFHCVSQDGLDLPTS*SAHLGLPKCWDY 
RRKPPCPA 


3867 


17768 


A 


3890 


395 


1 


MVLIS*PCDPPTSASQSAETTGISHRAR 
QFPMLLSQ* LVCHSVTEVFLIFLLKLFY 
IYDILLYIIYNYIIjLIIYYLFLFLLFFE 
TGSCSVIQTGVSSTITAHYSLXSHPSGL 
KRSSHLSLPSSQDYRCAPT 


3868 


17769 


A 


3891 


2 


189 


GRVGERGGTLL* SQLLRR*ENHLNLGGK 
GCSEPRLHHCTPSWATEQDSISKKKRSF 
IKALHM 


3869 


17770 


A 


3892 


410 


258 


GC*GCSEI*S RP CTPAKATE * E P VAQ KE 
KKKKKR KEKKKHVLAT LAI SSLICL 


3870 


17771 


A 


3893 


244 


412 


GIEEACGNFRVIVLKSSLALEGEEPSGR 
DSS * EAS GGRAR WLTP V I P ALWEAEAGG 


3871 


17772 


A 


3894 


208 


2 


RISPPITPFPPWVTKKTKG* PGFFFQKN 
*KSFFFFLRWSLALVPQAGAQ*LDLGSL 
LPPPPGYKPFSC 


3872 


17773 


A 


3895 


407 


18 


PLLGHPPKKAGAPLIFNLPRRPK*KPEF 
PPPPLSPTQPPPPPPPPICPAWLPPPGV 
PSSFKG*MGGPP*RPEV*GPINPFWAPS 
LPVWPQNPIPFPKKKKWGEIHCMVCPLS 
FNETFSFFKMESLRDGK 


3873 


17774 


A 


3896 


82 


401 


KGFLKNSPKGEGLFLFSFPFFFLFLATW 
LWPRLKCSGAI IARCSL*LLGSSNPSDP 
PTSTFGEAGITAAGHAQFPPPDS PPPKT 
RAHHVTQPGLKLPS * SSPPAS A 


3874 


17775 


A 


3897 


263 


396 


VWWPGGLG*Y*TPPLLGGQGGRITRSGD 
QDHPG * RGKPVSTKNT 


3875 


17776 


A 


3898 


2 


386 


KTGLELLGLSNPPASASQSAGI IGMSHC 
AWPNLFLYWKHHLYPHKK*VWPGTVAHA 
YNPSTLGG*GRRIA*AQEFKTSL 


3876 


17777 


A 


3899 


140 


33 


IFFFHL*GEIGFTMNMLDIGGGFTGTEF 
QLEEVNFSVNDINKI IAFFVT*GVDV* S 
R I FTMAKTKP YN I S LKLCS * KTF 


3877 


17778 


A 


3900 


377 


2 


TPPKENGGLPKI KVPYFRGNKFKKFPLC 
FSKVPPISTMWVLKNQFQPPIPFLKKNF 
FKTPKKGVSHYNFPPQPKS*GVFFFSKK 
KNFYPFFWEEQKKQRGPNAFLFFLKKKK 
RAAARPTRPPTRP 


3878 


17779 


A 


3901 


2 


123 


RVSQDGLNLLTS * STCLGLLKCWDYRHE 
PPCPALRCVFYG 


3879 


17780 


A 


3902 


188 


3 


LRCTWCPGTDGWVSHLAHL* FKKGRPGA 
VAHACSPS ILRGQGSWISRSGVQNQLGQ 
HVETP 


3880 


17781 


A 


3903 


422 


238 


I LHVQL P S S WDS KRKP FF FFL * R * GFTM 
LPKLLLNS*GSSDPLASACQRLGITGMS 
HYTWP 


3881 


17782 


A 


3904 


414 


1 


S THLGLP KCWD Y RHE P LHP AS M S YL * * I 
FFIYLFTHWSVSLTARMQFAEDRDLIFL 
LHCYISSDSNSTSWL*YILVFVHSPWL 
\^IALJLGFWFVFVFLFFFEtGSRSV 
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AQAGVQW*GLSSLQPS IPPGFK* FSC 


3882 


17783 


A 


3905 


396 


236 


DRVSLLSPRLECNGTISAHCNLCLLGSS 
NSPGL*NRGHQIFSIEGQIFLQAMSLSQ 
LLCSALWQKQPYTMYKGMEMAVCL 


3883 


17784 


A 


3906 


377 


57 


RGLVFFFQNPQA* XNLGRGRVFFPSGAQ 
KNPQKGGWAFFGRPPPRFGGRPRGPGAR 
CPLNLTPGGPKGKALGPPPWGWGSKFQK 
GVPLQWVGGNSGLFFSKKKHKL 


3884 


17785 


A 


3907 


341 


3 


TRFFFFSPLGKKGFFSKGFKLGTPRVFP 
LPPFLNPGPGIYFWGPIKKKKTFPPPGV 
KFGSFKRAFPFFFFLMFIAALFMWKTW 
KQPRRPSVSE * *NKLSYL+ TTEYFFS I K 


3885 


17786 


A 


3908 


3 


162 


MMRKAIRGHL*NWPAIiEKLLPHIRGNVG 
FVFTKEDMAPFQKPALLQHHSQYI 


3886 


17787 


A 


3909 


1 


421 


PTRPPTRTRWGFTMLVRLLLN* KHSQAG 
AVAHACNPSTLGGQGRQI TRTGVQEQPD 
QHGETPSRTRGRTRG 


3887 


17788 


A 


3910 


2 


424 


PRVRTSPGLALSGLTGLKFSKNQKKKGK 
KPQTTKTQKKKKSPKKKKTPKKKKGGAP 
LKKKNFNARGGENFFFF*GPKKKNPGAR 
FKKGGGGKTRGT PKKKAVG KT P FFRGGE 
KKKES PSQFYKKI KKKVFGGKKI FP 


3888 


17789 


A 


3911 


3 


236 


GFCHVGQADLELLATDDLPAS APQSAG I 
SGLSHCAWPV* PHVMSSLKSLLATRPSL 
HRVKLQKNGLRKTR3LYMKNS 


3889 


17790 


A 


3912 


304 


2 ' 


VAKKGKSKPGGLKKNPPKIGLGYPQGGP 
QNQIWEKFFFLTPKNRWGAPL*GGGGPP 
SLPKKRGPPFPGWSNPPPPFFFFFFLRQ 
SCSVTQTGVQWRDLEL 


3890 


17791 


A 


3913 


91 


344 


HAAPVCGRRTLKVHPHAPSWPPSSWAHP 
PPTWPEPSGTTWAPEGTWTLGPPQELS 
LRRGVSYHCPPP* PL I WGRKWPPPLFCL 


3891 


17792 


A 


3914 


328 


402 


KSWGGWPXAVAHVCH P 5TLGG * GGR 


3892 


17793 


A 


3915 


16 


179 


GGGCSELR*RHCTSAWVTRARLSQKKEK 
KKKGPRKGEVATGSVWTRGELKSWAL 


3893 


17794 


A 


3916 


229 


46 


AGPELLASSDPLTSASQSSRITGIPFLT 
GLYSKDHI IETANI SYTNA*AL* ITLRT 
RGSTR 


3894 


17795 


A 


3917 


42 


383 


AFPCFVIGRVISKGKECTLYFFLFLFFF 
FFFEKKFFFFSQGGKKGGEFGSMEPFPS 
GLKETPLPHLLEEGKFWAPPPP PVYFFV 
F* *KRDFTLVARVVSNRLPSREKGLDYR 
P 


3895 


17796 


A 


3918 


402 


286 


PPPPF*TPPPPKIFGAKKKKKNSPPPPG 
ESFSF*KRPPP 


3896 


17797 


A 


3919 


349 


380 


KKSPFPQ*RNFGSLQPPFPGLKPSSPLS 
FPSS WVPR * TPPNPANLFFFKKMRI SHV 
AQ 


3897 


17798 


A 


3920 


28 


327 


KTTLLNQATI IQAKAGTWGKPP PLS PRE 
}>JKQNQKKKKKKKKKKKKKKKKKKKKKKK 
XKKKKKKKKNGGGGP * KKKNFS GGGGEK 
FFFFWGAKKNFGGGGF 


3898 


17799 


A 


3921 


414 


1 


NFPKVSG PKLMS PS PPHFFFPKKS FPPP 
PLFLEKKNGRFSPPFPEKKV*F*KKPPF 
FFSPKKKPRFPPKVLKVGGKKRVLKNLG 
PGLLS FCKGRGTP P P P PL FFFFFF* DGVS 
LCLPGWSAWRSQLTVTSASQVQVQE 


3899 


17800 


A 


3922 


i 


169 


LRS CKCTP S WATE '» DS VSKKKKKKTLKN 
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LSPQKLNRAGKI FLRVFPLEKKSQTQLL 
*NRAGKI FLRVFPLEKKGQTQLL 


3900 


17801 


A 


3923 


263 


74 


LLSVWRIPCAQKKERKKRYECECFLIVF 
VFDLSEENSSYI SFKPILCVCVCVCVCV 
CVCVW* VKNFRKS* *SSDRSKTKTIKKH 
SHSYLFFLSFFCAQGILQTERSHLLPHL 
EVSLFKE 


3901 


17802 


A 


3924 


3 


129 


TGFHHIGQAGLKLLTIi*SACLGLPKWWD 
YRHEPPHPAPFCD 


3902 


17803 


A 


3925 


281 


10 


NLPLPGATDYPASGGGGCNEMRSCHCTP 
AWVTGAKPHLRKK* KKKRQMLKTISKEM 
GNFNMVYILDKITK* FLYFKQLILSFLM 
GELGVK 


3903 


17804 • 


A 


3926 


110 


1 


QLGAVAHACNPST* GGQGQAEHLRSG IQ 
DQPGKHGK 


3904 


17805 


A 


3927 


180 


368 


EINF I TNYYI I LFFNRQGLALS PRL * SG 
GAI ITHGTLKKLDSSEPPTSDS * VAGTS 
GTYHHA 


3905 


17806 


A 


3928 


92 


2 


VRPGAVAHACNPSTLGGGEG* ITRSGDR 
DH 


3906 


17807 


A 


3929 


2 


146 


HLNLGGGSCNELRLHCCTPAWATE*DPA 
SKKKKSFKITSAFTCQIVML 


3907 


17808 


A 


3930 


221 


3 


KERAIWGTELNMVSKIFPI *LKQRKNL* 
PPNIKCYNNLLRP+ LGPGTVAHACMPST 
LGSRGGRITRSGDQDH 


3908 


17809 


A 


3931 




102 


GGRGCSEPSSHNCTPA*VTKRDSISKNI 
NNKIVK 


3909 


17810 


A 


3932 


351 


69 


GQGGGFRFIQPPSSRGKRFFPPPPPPIW 
G*RIGPPPRANFFFLFF*KRGFFLFGRE 
IFFFPPPGVPPPWPPQNFGFQGGAPPPP 
PPFFFFFFF 


3910 


17811 


A 


3933 


401 


94 


RGPPPLN?PLWEKNHAGPPKVGG*KPPP 
PTKKKPLFS*KTKIFPPGGPPPLIPPPR 
GGKPKKFLYPPKFRFREPKKGPFPPPRG 
TKKKTPPPKKKKKKKKVS 


3911 


17812 


A 


3934 


184 


2 


LKE FFPS RNSRS VLHTSNF * HLF I *L 1 1 
YFFEIESYSVTLAGVQWSDLSSLQPRPP 
GLKQ 


3912 


17813 


A 


3935 


2 


393 


KIL I KVE I EGTYRKI TNAI CEKPTAN IV 
PNR KKLKAFPLRNGTR * GCP VS P VLFN I 
TLEVLARVIYSEK* I XGTKI QI S IATME 
NSKKFPQKTENKTT I CSSNSTTGCISKR 
KEMH I S KRHLHS PVYCNT 


3913 


17814 


A 


3936 


291 


2 


FPPPPQTRGPKSSSRAHSPPGPGPGSTT 
GAR* PFRPLLLVRASLRPPLGLPTPTGP 
TGGMQRP EA * RT P ATD KPQN S RRG P * G A 
WGRPGCRCRGGA 


3914 


17815 


A 


3937 


382 


1 


QWTAQTWSVQNELLLFCLKPHPPPSLLP 
PSSCPGPHLGTPWTPAFPSHPTSTLKAH 
PIGSGPSSPLLLTSPRMPPEIQTTATG* 
PASSLTPTVCSQHGNRFFKKGVRAGRSG 
SHLESQHFEGPEAGG 


3915 


17816 


A 


3938 


398 


2 


PPPLLLNKIGAKKKAFFCPS PFGPRKI P 
CGQKTKKSVFPFRVP * DKSTFDCFLCP I 
VYVKKQ I WLAPLHC V FSGRVAQG S KKS P 
LCLSLPFFFFEMESHPVTQAGVQWRDFR 
SPQPPPPGPKPFSCLGLPSS 


3916 


17S17 


A 


3939 


240 


385 


KLLFL FLF I Y FF * DG VS LCC PG VIS A WQ 
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TLLTEPSAI PGAGGTP VS P I 


3917 


17818 


A 


3940 


230 


381 


LISSSYREYIiAKDLRFSSKQS*WLCSKE 
FRGLGAVAHVCNPS TLGGQGRW 


3918 


17819 


A 


3941 


1 


107 


CWPVWS*TPDLKQSTCLGLPKCWDYRQA 
QPHRNLL 


3919 


17820 


A 


3942 


2 


95 


DLLTS *SACLGLPKCWDYRRKPPRPALF 
AFS 


3920 


17821 


A 


3943 


163 


404 


CRHKAQWLPQLPGHLLLSEKTLKYSSPK 
GLRSQH* KLLFLFCFG*TESRS VTQAGV 
QWRNLGSLQPLAAGSKRFSCLSLL 


3921 


17822 


A 


3944 


3 


200 


GLDLLTS * SAHLGLPKCWD YRREPFRLA 
YFTFIKKKKKDGCELLNLFPNSCMDTAH 
VHPAHLTGG 


3922 


17823 


A 


3945 


423 


2 


LFFFFSEV* SHSVAQAGVQW YNLGSLHS 
S 


3923 


17824 


A 


3946. 


209 


413 


TTC I EQCNKTRRLGAVAYACNPSTLRG * 
GRWITRSGVRDRPDQHGETTKNTNISTT 
KKSTKNTKISQA 


3924 


17825 


A 


3947 


425 


3 


SPSQGLRDKTP I SLGQSTWGKGQMWAQR 
QQT*TSWPDGSEESSRSPSTFEQQISER 
SSSAKGQTNSSSGSLNSVSPDWETPPSR 
SQQTPHTGELQLASGRCS SGMNLPEEGT 
DSNLCCSATHAS 


3925 


17826 


A 


3948 


299 


3 


ASQTTFREGCFLFFSFFISVCLFVCLFE 
MEFHS CRPGWME CNGDRARLHIiKTTI TT 
TIEIVKDDTNKWKHIH*RLPNIKMSILiP 
KVIDILFNAIPIKI 


3926 


17827 


A 


3949 


331 


374 


NAPKSERASKAF*AILEENKEKLPLDIN 
PWHPHGH I FKCVAPPHCLGWDLSAVPA 
SHALGFPQPQNSAWHVARSKSAASCLGT 
AVTSAGSGVLGTPNKMPQKVRERARLFW 
R 


3927 


17828 


A 


3950 


439 


69 


TNQKVWVPPKEKGLFKKKQLRPRVGGPA 
FHSPPLEGQGS* FP*GQRFKPRPPPWVK 
PLFSKNPKNYPGGGAPPFFPPSPGG*GG 
KFPPTPKAKVPFIQISPPPAQAGGPTQV 
PFPKKKKKKSH 


3928 


17829 


A 


3951 


422 


3 


CFPALGIGPPHPPKRVG* KGGTPPPGLF 
FGKFLKEKNFGKNPGPGQGGPFFPPKKQ 
RGPFNNGRPGKVKKKGAPGPGVPKKKPP 
RVPPPPGFFGNPPGAPPKFFFSPRGGED 
QKRKNFFFFFFLRWSFTNEAQAGVQWR 


3929 


17830 


A 


3952 


223 


3 


ALFFWGRGTLPRVKGLNPKGGGGKFFFF 
FFF*ETESHSVAQAGLQWHDLG * LQPPP 
SRFKPFSCLRLPGSSLS 


3930 


17831 


A 


3953 


1 


388 


E KGVS PGGRTCS EQRWCHCT P VWATERD 
SVSKKKGMSNNFSLILRSWNNKGQGGTY 
FQQKFIYPCFLKIKGHSYINCKIFKK** 
NLSLHVHVTDYNVR 1 1 LQKLDCQIHWVD 
FNFELNILQCQGLGGCK 


3931 


17832 


A 


3954 


1 


396 


KLRPKDQDTWPTQQSAIWEFVPPSVYF 
VKSISEILFFFFLKKNFFLGPQLKGRGQ 
ILVKGNPCPGGKKKFPAPPPQRGGKKGG 
PPPPQF I FVFLKKTGV* KIGPGGV* TPA 
LG I APP FFPKGGE * RGGPPP 


3932 


17833 


A 


3955 


2 


199 


PGGGGCGEPKTRGCTPAWATERDSVSKI 
NK*KLKLKNNSWLYAKSLASSPTGR*EE 
WGCFTNEHI 
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Predict- 
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ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leu cine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=G)utamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /-possible 
nucleotide deletion, \=possible 
nucleotide insertion 


3933 


17834 


A 


3956 


393 


2 


KEFPLFPPGAPPWPIiSGPFSVIiEKPICCG 
GGWPGINTPPGGGGFPQGSLWAGDFP* 
PLFS PQ I PF FKKQKFF FFLPRGGLWAP F 
FPP FGGPLFSQNF* KTGF* GGFFFFFFG 
DGVSLCYPGWSAVAPHAS 


3934 


17835 


A 


3957 


278 


390 


LVY I FKI RTCW PGAVAKACNPGTLGG * G 
GRVTRSGDQ 


3935 


17836 


A 


3958 


385 


311 


GPKNPPAS ASQEGETTEV * HHAWL I FFL 
FFVELWSSHVAQAGLKLPASSDQS * TL Y 
I 


3936 


17837 


A 


3959 


410 


119 


SPPPRFKTPPPEFIFGAPKKKFFSPPPR 
PLNLILLKGPPLFFFFLPKAFKPHINFV 
FPPQKKKNV* INLYFPPPKKENESS *GK 
HFALHCNPSCISK 


3937 


17838 


A 


3960 


77 


400 


TALPLKPKKKKKKKKKKKKRGGPLKKKK 
FKAPGEEKKNFFKGAPKKKFRGRV*KTG 
ERKKPGGNKKKTFG KKP P F PRGEKKKKP 
QA 


3938 


17839 


A 


3961 


2 


194 


CL*SQLLGRLRWEDHLSSGN*GCSEP*S 
PHCTPAWVTE*DPVSRKRKKYPWNQTL 
RGFLSPAH 


3939 


17840 


A 


3962 


52 


409 


NSKTSSQEKKRRVIRLSLRSRVGDDTKV 
NTRRWGVFGWHLKNLPTTLVLKKMEKIL 
ETYFMTPDLHDPHTTS YFNGCQLQAI KV 
GGTITDLIVF* IVSKPNAAGLSLNENVN 
AKFNDLI 


3940 


17841 


A 


3963 


382 


96 


LSSQLLREAGERNE*TGWLQQPLQMQIH 
KNPCIEPRSRYCTPAWATEQDPVSKNKQ 
TNKQTKNSLHL 


3941 


17842 


A 


3965 


3 


169 


DAWELRSHRCPPAW*QSETPSQKKKKRG 
KKKI FRFYLFFNKRKKFLNGGNLGVI P 


3942 


17843 


A 


3966 


1 


121 


AGFHRVSSDGLDIiLTS*SARLGIiPKCWD 
YWCEPPHIiAENS 


3943 


17844 


A 


3967 


200 


2 


PPFRIYWGEVLTPLFLRVKPKFFKPWGF 
PFCLFFFFFFEMESRSVAQA* MQWRDLD 
SLHPPPPAFK 


3944 


17845 


A 


3968 


471 


440 


MELRFYHVGQTGLKLLTS * APPVMASQS 
AG I LG VSHST W * EAPNSLAITFLSSHLS 
PRQP*THLGC 


3945 


17846 


A 


3969 


214 


2 


EGQPECQLQAGDWVSLWNLVCSGLEVRA 
WREAPRLEWSFCIiFFVSEM* SHSVI *DR 
VQWHNIGSLQPLPP 


3946 


17847 


A 


3970 


53 


398 


GDLPKKQQQQKPLNFCPQLLLYARLFGP 
I FLV* HLATYNNKHLLECVGKVM I NCQA 
DILNKME I SKVGAQKKKKKKKKKKKRGG 
PLKKKKFIPRGGEENFFFLGAPKKKTGG 
GFK 


3947 


17848 


A 


3971 


396 


160 


KRERI KEKEDNKEEKDKEKKASGKKERK 
KEKEIRDRESKEESSNLLLQEEIiSLCCP 
SPGHPPNSSL*DPYFATGNSLRC 


3948 


17849 


A 


3972 


416 


3 


F P F PR V PEGV PGGEGVFP P G F FG KSGVP 
FPRPRGGGAP I * KPPPFPGGPPNFFF * K 
KTGPPFSPPQGAPPAKRAPPPFFGGGGG 
KIFTPLKKGGLFFFGPPKKGVFFGPKKT 
LFFFQKKKKKKKGRPPRSHASAHAS 


3949 


17850 


A 


3973 


232 


381 


YKTFSLIF*LFFFRDRVSFCHPDRSAW 
LS * FTVAS I S WAQGI LLAQPLE 


3950 


17851 


A 


3974 


396 


141 


CFFPPPPKRGFFPTPIFGGPPGFFFPPF 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E>=Glutamic Acid, F~PhenyIalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Metbionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














F*NPPPKI FFGAPKKKKI FPP PGGKNFF 
FLKGPPPFFFFFFFFFFFFFFFFFFEDG 


3951 


17852 


A 


3975 


286 


422 


RKQLISPHSQDNSKQKEQSWRHHATCLQ 
TILQGYSNQNS* *APIAKTILSKKNKAG 
GITLPVFKLYYKATVTKTAWYWDQK 


3952 


17853 


A 


3976 


3 


116 


GFHRVGQNGFDLLTS * SACLGLPKCWDY 
RHEPPRPAR 


3953 


17854 


A 


3977 


223 


399 


EGPRTQSFFDTEVHSVTQDGVRRRNLGS 
LLPPPPGIK* FFYLSLPSNWDYRRLPPR 
PVN 


3954 


17855 


A 


3978 


227 


422 


SFPS PC I FFS LVDLGVSL'YCLGWSQTLD 
LKGSSLFSFPKCWDYRQ*ATTPGLPVYF 
FPFPVFTFF 


3955 


17856 


A 


3979 


519 


313 


KKNLLNPGDGGCRE I KSSHCAPAWVTEK 
DCLKKKKKKTSVCVCVCVCVCVCV*NHL 
SSLGLFPEPVGLN 


3956 


17857 


A 


3980 


217 


434 


IYIFCGDGVSLCCPGWLKQSFCLSLPKC 
*DYRHVPPRPTSFHLLTNIGVLQFLKLQ 
ATLSFTLFL.L 


3957 


17858 


A 


3981 


2 


409 


KLTFAV* DY * EKLLLSLASLTSQAAFLF 
PEHPYTVISNPCSPANYIFTLSSGTTFH 
QFYAKS * CVTYE * KKKNLL S P P TGDKPN 
*GRSQTKKIQTKY*SEICFFVHLFETGS 
HHVAQAGLELLSSRDPPTSASQST 


3958 


17859 


A 


3982 


234 


416 


SVLPSTPLLNKL* KSKKKKKKKKKKKKK 
KKKKKKKKKKKKKKKKSRGGAPKKK 


3959 


17860 


A 


3983 


414 


291 


GGRACSEPRSCPCTPAWATERDSV* KNK 
QPNKQKNKKDLL 


3960 


17861 


A 


3984 


192 


415 


TOANKLDNLDEMDRFLETQNLLRLNNEE 
TEYQNRPVTREEIE * VIKNLLTEKKNPG 
PDGFTGEFYQTFKEKLIP 


3961 


17862 


A 


3985 


403 


145 


AFFFFFFPPGEKGDFSPTVLFGGPPGFS 
PPPVFKTRPRNFFLGPPKKKYSFPPPGP 
KNWFP*KGPPPFFFFFFFFFFWKLGIEV 
IS 


3962 


17863 


A 


3986 


255 


425 


GGRFKGSNFTSAGLQGFXXXXGPPKFIS 
RPGV*QRGEWKNPGVPNLFALSTSPFGR 
W 


3963 


17864 


A 


3987 


247 


2 


EEAVRVSNIiYTKLDEEQ*WKKQLNYVE 
RLKK*ELFLQDLYRKVPLKI * IF*VKNL 
NSFWPDAVAHACNPSTLGGRGGWIT 


3964 


17865 


A 


39S8 


1 


394 


HHHTWI IFKFFCKDGVSLCCLSWARTPG 
LKRSSCFRIiTKCWDYRHEQLCPAFKHFF 
LY*HNFMSTEKMQE*HKNS*YPSPTVPK 
GVTFYYISYICFIILILSAITYKFVHTY 
IVSLNHVCYSNEAPLSLNI 


3965 


17866 


A 


3989 


208 


402 


ETDRRLKSGGSLSKTLI KCMTCFWFRD 
GVSLCCPGWIQTPGLK*SSCISLPSSWD 
YRHMPPCPA 


3966 


17867 


A 


3990 


2 


408 


THRPYHATPTYLPFYTNNLIKGGKKKKK 
KGGKKKKKGGPPFKKKHFSPPGGEKNFF 
FYGPQKKKSRGRV* KRGEGKNPGAPQNK 
PLGKNPLFPGGEKKKKPRFAS* PPPPPF 
SFGKKRAPPTKYPGAGLHP 


3967 


17868 


A 


3991 


81 


406 


KGGLFLLPRGKGKGKIWVKGNLGLKNKK 
NFPA* PSEEGEKKGGPHQAGKILKFGGK 
RGLNKGGKTGQKLKPGGI PPPGP * KVGE 
KKEKPPAPGQKLFFKKRKGQKKNF 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L~Leucine, M=Methionine, 
N-Asparagine, P=Proline, 
Q=Glutamine> R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, V=possibIe 
nucleotide insertion 


3968 


17869 


A 


3992 


396 


64 


YSSPPPTKKTDDSDASRGPFFFFPPPEK 
RGFFPNDFFWVPPGFFPPPGF*TPPPVF 
FFWAP*KKIFFPPPRE*KFFFLKGPPPF 
FFFFFFFFFEHLQVLLIC 


3969 


17870 


A 


3993 


367 


193 


KLWE KKKI LWKKG *GEI KS CHCTPAWAT 
RVKLCLTKKKKKKKKXGGEGE I FKKKSF 
SFLGKQKKKRGPAFF 


3970 


17871 


A 


3994 


188 


2 


KRANTLPD FKTY YKAI VLKTLW YWRKDK 
HRSMKQN*ESNEVELIiYHNIHRNKLKMD 
HRLKCQ 


3971 


17872 


A 


3995 


2 


147 


QENHLNPGG* GCS E LRS * HY I P AWATE * 
DSVSKKNYI * +TSHI FSGDF 


3972 


17873 


A 


3996 


23 


329 


RNSFIRSAFSNVLNKWLFLTSEKKKKK 
KKKKKKKKGGRKREYLTRRGAGENNKYF 
YK* KKKKNTCG * KKKGWGRKREREKKRL 
GGKKKFPPFEEEGKKKPG 


3973 


17874 


A 


3997 


164 


392 


KPLGRAGLVPCLCSSFTGFLM* KTEPCS 
WQAGVQRGHLCSLQPQSPMFKRFSLLS 
LPSSWDYRVLLRHPSWSAW 


3974 


17875 


A 


3998 


396 


3 


FFFFFPPGGKWGI SDS PQLFFSGVKNFS 
PPPPPVNWGLRVWPPPQKNFFFFF*KKV 
FSFFWGGPPNPPFFFSLLLGGPPPPPIF 
SGEKIPPSLFFFFFFFEMESCSVAQAGV 
R*RHFGPLQPPPPWLKQF 


3975 


17876 


A 


3999 


383 


2 


VPSPEKVGIKKIQPLDPFF*FLGDCQRK 
KKSPS*RPPPPPEGKKLN* IRLMGPPFN 
SKEKFF* KF* AFKPFFFPPKGEGI *GFF 
SRAHS FPNQRALFWFFFFEMESHS I TQA 
SMQWHDLRSLQPPPL 


3976 


17877 


A 


4000 


213 


1 


RGRQNKTLFLTKPNQTKPNQTKPNQTKP 
TT FL* S VI E I RVKKKYKTTRKTTKNPTT 
SLTRSLGRYKSPTRP 


3977 


17878 


A 


4001 


412 


129 


GSLSSPRLECSGTIPAHCSLNLPGSSSP 
CTSAPRVAGPE*MGQACMPPCLPWFFF 
WGGGGI LVETRS C * VAQADLELLS FSDP 
PASASKVLEL 


3978 


17879 


A 


4002 


411 


293 


RDTRCLPP* LDNFVFLVEMGFHHVGQAG 
LELLTSSEBIA 


3979 


17880 


A 


4003 


46 


392 


QDFFFFFFFLGKGAPPPPGKKGGTPFPK 
KKTKGGGPRGPPLEPPPLGGPGPPEPGG 
GDLKKPWPPQGTPPL* KKPKMGGGGGGP 
P * TPGTGGGGEKNPLTPGGGPKNGPP PP 
PPG 


3980 


17881 


A 


4004 


360 


1 


KCLGFQFFFRFWNICIIFISQLSIASQV 
QKSEIQNAPMSIFFEHHVSFKQFQILEH 
FGFWIFILGMLSTYKTLILLKLN*RFNA 
MPINIPWI FMEIDKLILKFTWKYXEPR 
RVKAILRN 


3981 


17882 


A 


4005 


152 


408 


NKFVSHFYWMGNVLSRY?QFT*QFSFCH 
VRLPSVF* YLYTRNKIL I FFAHKAFFLD 
AKYVCVCVCVCVCVCVCVYFGIGEGNLP 
S 


3982 


17883 


A 


4006 


411 


67 


YLPPPSSPTPGVFFFFPPPKKEFFPNPF 
I LGTPRFFPFPPF * KPPPKFFFLGPKKK 
K*FSPPRGKKIFFFKRAPPFFFFFFFFF 
FFFFIiLGGSLS FM YFH I KTYK 


3983 


17884 


A 


4007 


217 


2 


PLFFFFKHLKAYYSDVIHI SFAI LRYLL 
F * DRVS L CHSG WS AVAR S WIfPVAGQK TF 
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N=Asparagine, P^Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, V=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














ITRQEAPYLLAI PRV 


3984 


17885 


A 


4008 


235 


406 


GRHRVSSNPISI *RRRGLSLLPKLECSG 
VS I AHCSFDLPG* SDPSASASRLLGRLR 
Q 


3985 


17886 


A 


4009 


833 


237 


SLDNRARPCLVS FI FKKGRS TAERCGAP 
GTTSPPFPRLSERGHPI IPS *RGWERKS 
QLCSHLQSSRDPSGLGRAGCGGGGDLRE 
SPGTFPGKGGSDLVPVPTAPIPIEPQIC 
QATCAVMKLSFDEEYRRAMNELGECPRS 
SGKPSSSPRTGGGSSAAKGTQAGSSELG 
LGIGARRGTDSLESHSTSHEWTVTLGPL 
SP 


3986 


17887 


A 


4010 


276 


2 


VEVKGNNS FSEGS ITL I PNPDKQPKETT 
DQPLMSTYAKIESPGKNPSHDG*LIFDK 
GAKI I Q* VKNS I FNKWCWDN WDFHVKTM 
NPHFTLI 


3987 


17888 


A 


4011 


188 


27 


REKFQ I YTI KNDXGS I TTDTTER * KI I R 
DYYEHLYAHKLENLEEMHKLIGIPP 


3988 


17889 


A 


4012 


1 . 


202 


CLHQKVIISN*L*ATTQTTQLSLSFKLD 
YFSI I IIPVALWTWAI IKFSL* YINSP 
PKKKEAEKDLN 


.3989 


17890 


A 


4013 


1 


403 


CLDQEVI ISN*H * ATTQTTQLSLS FK1»D 
YFSIIFIPVALFVTWSIIEFSL*YINSP 
PKKKKKKKKKKKKKKRGGAPLKNSRGGP 
NFWGGGKNI FFFFGGGDKKPPRAFWKKP 
L FLGGGNLGP PP PQKFTP WG KK 


3990 


17891 


A 


4014 


67 


439 


TEIiI I FYI FFETTLIPTIxAI ITR*GNQP 
ERLNAGTYFL F Y TLVG S L PLL IAL I YTH 
NTLGSLNILLLTLTAQELSNS *ANNLI * 
LAYTIAFMRKKTKKKKGGGLLKDPWGGQ 
ILAGREKIKFFP 


3991 


17892 


A 


4015 


3 


260 


L I VP Til LL P LT * LS KKH 1 1 * I INTTTHS 
LI I S 1 1 PLLFFNQINNNIiFSCSPTFSSD 
HLiSHP I LLKKKKKPCKKKKKKKKKKGGGA 
F 


3992 


17893 


A 


4016 


258 


3 


LKKIFSPPTINLFFPPFPLKKFFFPLSL 
* FFLGVFSF FFPPPKKGFFPKI PPGVFF 
SPPFKKKKFFFPPPFYFAPPRVFFKGPP 


3993 


17894 


A 


4017 


287 


2 


KKQKNFIFFFKNIiSFFGRGAFFPPLFPP 
ILRGKGGDFFYPRGLKPALGNQ*NPFSR 
F* I FFFFFFFLRRSLAPVTQARVQWHDP 
GSLQPSPSGFK 


3994 


17895 


A 


4018 


228 


487 


SYDVKQEKPNGI *ASKYWAYVFHSFFLS 
FFFFFFKKKNFFFCPQGGGQGPNLSLRE 
PSPPGVKKLFGLNLSKSWE*QNCPPPPI 
IF 


3995 


17896 


A 


4019 


35 


228 


EELNHIiNQGGEGCSEQRSHHCTPAWATE 
*DSSQKKNKFWYIHPCNRIVTTTNTPMT 
ESPKHYAE 


3996 


17897 


A 


4020 


2 


402 


ARGNLNTTFFDPAGGGDP ILYQHLF* FF 
GHPEVYILILPGFGI ISHIGTYYCGKKE 
PFGYIGMG*AMISIGFLGFIV*AHHIFT 
VG I D VDTRA YFTS AT 1 1 1 A I P TGV * VFS 
* LATLHGSNMK * S AAVL * ALG 


3997 


17898 


A 


4021 


1 


420 


TRGATELI ILDI LVETTL I PTLAI ITR * 
GGQ P * RLNAGTY FLFYTLVGS LSLLMGL 
I YTHKTLG* VN I LLLTLTAQEL * NS * AN 
NLI * IAYTIAFIGKIPFYGLQI.*L,PKAH 
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N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W==Tryptophan, Y=Tyrosine, 
X=Unknown, -=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














VEAPIAGSIGLAAGLLKLSGYGIIRLTL 


3998 


17899 


A 


4022 


409 


3 


LEMASCLIICNNNWPFFYQIVTCDEKWI 
LYENW * L PAQWLH * E EASKHS PRPNLHE 
KKIMVPVW*SADGLIHYSFLNPCETITS 
E*YAQQINVMH*KLQGLQLTFINRKGPI 
FLHNNTKLHVAQPMLQKLNELA 


3999 


17900 


A 


4023 


464 


374 


HLPPWEDKARRHLPDARTFILAFLASRN 
VRNKFFSLQATQFLVLYASSTKITKPLK 
TNQKKRLKKSEQSLRNMWTPIKRTNICI 
VGI IEGKEGEKGAERIFEEIVPQTF*NL 
VKDIK*KVQ 


4000 


17901 


A 


4024 


58 


323 


RHKLLCEHVKKCS AVLI IREMQTKTTLK 
YHFLLVT IAKI FNL IHF*RSVGETLI CQ 
WAGRNAKS * P FWRR I SQYLTKLYVCVCN 
YTHS 


4001 


17902 


A 


4025 


480 


150 


RNSLLRRLRQEDGMSFGCQGCSEL*LHH 
CTPAWVTEQDPVS KKKKKKKAYPDP I PK 
KTDKSNGKKGPDFSRKNPPKNSPPPPWQ 
RSTAFRGRAAKSCPTH 


4002 


17903 


A 


4026 


204 


1 


KKKFGALFFFFMQGIjTLLPRLEYR*WCD 
HGSL*P*PPGSSDPPTSVSQIAGTMGVH 
HEFLRSGWLSK 


4003 


17904 


A 


4027 


116 


472 


MPPLPKPVSGPRPHSCSPLGGPQLSTPL 
TGPRPASLSLPISEGICGCITAILWAVW 
YLVLQMLGLSLFMA* RGAGSSCSAHKKW 
WLGLASPSC*TFFKNRTKGRARWLTPVI 
PALWEAE 


4004 


17905 


A 


4028 


355 


444 


VSYKKK* SRPGAVAHACNPSTLGG*GGR 
IT 


4005 


17906 


A 


4029 


410 


181 


ASIILLIAVFFNNILSGQ*TITNTTKQY 
SSLI I IMAIAIKLGIAPFHF* VPEVTQG 
TPLTSGLLLLT*QK1APCF 


4006 


17907 


A 


4030 


49 


474 


PXRXXFCHIVTYYSGKKEPFGYIGMV*G 
MISIGFLGFIV*AHHIFTVGIDVDTRAY 
FTSAT IIIAI PTGVKVFS * LATLHGSNM 
K*SAAVL*ALGFI FLFTVGGLTG I VLXN 
SSLD I VLHDT YYVXAHFHYVLS I GAVFA 
II 


4007 


17908 


A 


4031 


274 


427 


FFF* EMESHSVTQAGLLGRNHSLLVPQT 
PGLKRSSLLSLLSNQYYRSGPHT 


4008 


17909 


A 


4032 


205 


393 


TWGGFKNPSPGEG* IKKMHDFNEMFLQN 
CWKI IiFFGGGFFTGQ I FPRAAPP INKN I 
LVWGGGK 


4009 


17910 


A 


4033 


2 


386 


SEPLSRKKLYLSILISLQISLIITFTAT 
ELIIFYIFFETTLIPTLiAIITR*GNQPE 
R LNAGTY FLFYTL VGS LPLL I AL I YTHN 
TLGSLNI LLLTLTAQELKKKKKKKKKKK 
KNIGGPPLRAPSGAPN 


4010 


17911 


A 


4034 


3 


169 


GLFLiAMHYSPDASTAFSSIAHITRDVNY 
G * I IRYLHANGAS I FFI CLFLH I GRGL 


4011 


17912 


A 


4035 


3 


317 


HASADAWAFIGYVLP*GQISF*GATVIT 
NLLSAIPYIGTDLVQ*I*GGYSVDSPTL 
TRFFTFHFILPFI IAAIAALHLLFLHET 
GSNNPLGITSQKKKKKGGPF 


4012 


17913 


A 


4036 


446 


194 


KKKNQPGGPPPPPAFPVGGLIKKRFPPT 
PGKKFKFR*KFFPPKGGFFVRDPPPKPG 
CWEREKKKKPLLNTDKWVILEGSEAGLF 


4013 


17914 


A 


4037 


213 


1 


GMAKVKAR FL PQ KRG QRVLN P P W WG KG F 
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PNSGTKETPFFFF*DRVLLCQPGWSAVA 
QS * LTTASTFWAQAI 


4014 


17915 


A 


4038 


307 


36 


RKGFFFSXCGEFKKLPGPPLGGWYLKLI 
FCQKKFFHPKFPGFPG* IFLPPPKIFFI 
FSFEMEFCSCCPGWSAMGRPRLTATSAC 
WAQANS 


4015 


17916 


A 


4039 


2 


372 


NDTS LTVKS IGHQWE * TYE YTD YRGLVF 
NCYILPPLFLEPGDLRLLDVDNRVILPI 
EAPI RI INTSQDVLHS* AVPTLC* KTDA 
I PGRLNQTTFTGTRPE VYYGQ CS E I CGA 
HRRSMLIVLELI 


4016 


17917 . 


A 


4040 


107 


279 


LLLLFFFFWRKSLALAPRWECRGKI *GH 
CKLRLPS PCHS PA* AS PEAGTTGWTYGC 
F 


4017 


17918 

f 


A 


4041 


3 


351 


GSHYIAQAGLKNPGSSHLPASAAHSAGI 
TGVSHHAWLKI FFKESVSDYSPHLNSRK 
NAVYVSFFFFWKKS FVFVPQAGGKGLNL 
GSLKFSPLRLKQFSCLTLPRS *EYGLAP 
PPPV 


4018 


17919 


A 


4042 


1 


385 


R PRRPD I E * QERRTQEVLQAVAKKVKE E 
SQLPGTGGPKNVLQPVPRAKAERPKKQA 
EASGLKKE TD WLKVDAQEAKTE PFTQG 
KGGGQTTPESFEKAPQVTES I * S SELVT 
TCQAETLAGVKSQEMV 


4019 


17920 


A 


4043 


3 


282 


TAI L I QT P * S FTGAV I L I IAHGLTS SLL 
FCLANSNYERTHSRI I I LSQGLQTLLPL 
IAF* *LLASLANLALPPTINLLGELSVL 
VTTFSVDAA 


4020 


17921 


A 


4044 


2 


407 


GTRLE IMSRNHG I FPFTLE I FKDNEFEE 
PYREALPTLKLRDSLYFGIEPEEHVSG* 
ESLEESCFATPTSKIDEVLKYYLIRDG* 
VSDDSVKQYTSRDHLAKHFQVPDFKFEG 
KDHKEVILH*RDLVCGVMDERSR 


4021 


17922 


A 


4045 


4 


380 


ENWLTI IRQAWKEADRNLNTTFFDPAGG 
GDP I LYQHLF* FLGHPEGY I LI LPGFGI 
ISHIVTYYSGKKEPFGYIGMG*AMISIG 
FLGFIV*AHHIFTVGIDVDTRAYFTSAT 
1 1 1 A I PTGVKVFS * LATLHGSWMK* SAT 
IIIAIPTGVKVFS 


4022 


17923 


A 


4046 


2 


94 


ADRNLNTTFFYPAGGGDPILYQHLF*FF 
GHPEGYI L I LPGFG I I SHIATYYSGKKE 
?FGYIGMV*AMIS IGFLGLI V*AHHIFT 
VGIDVDTRAYFTSAT INIAI PTGVKVFS 
* LATLHGSNMK+ S I PTPILIFRPP 


4023 


17924 


A 


4047 


376 


138 


LLARQRQENGVNPGGGACGEPRSRHCTP 
AWATE * DS VFLLWREAL LTQTAPFNRLQ 
LLALPSEFFHFRILIVLQPHHSQ 


4024 


17925 


A 


4048 


145 


226 


EPIQACSSRRVCEPIQACSSRRVCVCVT 
PSRHALPGCVCV*EPIQACSSRRVCVCV 
RAHPGMLF?ACVCV*EPIQACSSRYVCV 
RAHPGMLFPACV*AHPGMLFPACVCVCD 
PIQACSSRLCVCV*ERAHPRLLF*ACVC 
VCESPSRHALPGVCVCVRAHPGMLFPVC 
VCESPSRHALPGVCVSPSRHALPGVCVC 
V 


4025 


17926 


A 


4049 


2 


414 


ADRNLNTTFFDPA*GVDPILYQHLL*FF 
GHPEVYIIilLPGLGI ISHIVTYYSGKKE 
PFGYIGMV*AMISIGFLRVIV*AHHIFT 
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VGI DVDTRAYFTSATI I I AI PTGVKVFS 














* LATLHGSNMK* SAAVL * TLGFI FL 


4026 


17927 


A 


4050 


472 


228 


LTRDGGACPFSKLFRRWQKNPLVLGE* 
SCSEPKLCPCPPAKKTKDYPVSKKKKKF 
IWI INRTKIPRKRKKNKAYHKVKYF 


4027 


17928 


A 


4051 


238 


3 


RRHGSFAPLPPGLPLCISYIWIiFITILC 
NILYLYFETGSRYVAQVGLKLLCSSEPP 
PSTS * STEITGMSHGARPHN I P 


4028 


17929 


A 


4052 


225 


172 


IARYPFHIQSCTNMPSFNKS ITSKEEDN 
GPGAVAHACNPSTLGGQGGQI TKSGVRD 
HPG*HGLV 


4029 


17930 


A 


4053 


471 


30 


GPGGIKI SGALFPLNKVKRVP WFPRPP 
L I FPQKGVFNQNGGDPGGGFWPGETPGR 
VPPPPGVFFVKSPFLSLPFPPPPV*KTA 
VKTGFPYRPCGVPPRGLVFPKKKKSVLN 
RPDLSMGPQPE*SLPLPGEPSVSHAGCL 
PQAWRG 


4030 


17931 


A 


4054 


425 


41 


NWGSGTLCPQKLGVFPPPPPGEGGVPGA 
PPPARLFFFFF* KKGFSPVGRGGFKFLP 
PKNPPPLPPQKVGIPEGGPRAGPIFSPP 
QLFFFFLTWSLTPSPRLECSGAILAHCT 
PAWETEQDS ISKQTKK 


4031 


17932 


A 


4055 


312 


1 


I KNARAHLLPQGS PPPL IPIWIIETIS 
LLIQPIALTVRLAANITAGHLIMHLIGS 
AALAISTIILP*TRIMFTIIIERTILEX 
AVAL I QAYVFTL WS LYLHA 


4032 


17933 


A 


4056 


648 


123 


DPDPVPSARGCVPSGAPGRGAQLKKAWD 
GALA* LPPCLCAQELPSFS LGTGGPAVG 
LSVNSERDGRLGEVSINAEFTVAAFLKG 
TQSRRGINGMPLPQREPSSRYCLGLGGE 
SSAQGTSGLTGQRV*QNSDLPELASGDD 
KGQRSSQGGAVSQSSPRGRQIPPSSPPA 
ALFNFL 


4033 


17934 


A 


4057 


135 


464 


QHNSRLMQQAKKGVTVLARVIDLDYEDE 
I SLLLHNGGKEE * AWNTGEP LG CLL VL P 
CPVINWGKLQRHNPGRTTNGPDTSGMK 
VWVTPAGTKPQPAEVLAEGKENKECG 


4034 


17935 


A 


4058 


3 


575 


LRSRPLHIILSCGLLVTPRSLSPPPPPQ 
RLRLCRPSRGAAEFFFSLRTKLHFATLP 
LRIEGSKTL*NCC*LYHGC*IS*IHHGG 
LFDVAAKNI IHEEVEKYDKQYRGKELLG 
FATYKTFEI IVHQYIQ*LVEPALSMLQK 
AMETI QQAFINVANKHFGEFFNLNQTVQ 
STIEDIKVKHTAQAEYMIQLQVR 


4035 


17936 


A 


4059 


389 


3 


TFKGPKVFRYPTFFLCPPPRGFFSKNIW 
TPPPPPFFGVFPEREKIFFFLKGALPLF 
SFRVTPFFGLKRGFFLSPTPPFFFPPFF 
WPPPFFFFFFFFFPLRRSFALVTQAGVQ 
WRDLG * LQ PLPPGFKR 


4036 


17937 


A 


4060 


397 


1 ! 


KRRGNFFFKKGISLFI*TPPRGPLWVFF 
F* KAPKGFCAKFFPFFYPPLPKENGKTH 
RQKTPFPHFCQI FKNGFWVLHAFSLGDF 
PARWGPPFKRFFKGIFKEFPPYGSPPP 
KGPKKKKRTAARDLELADAW 


4037 


17938 


A 


4061 


38 


410 


AWIS I * ERPFTLVRVFKNFLAI SGLLFS 
YFRINFLYVQIHIHTFFRKLFVLSYVFR 
FTDVKFFFYILPFFPPHFLFGWICSYAA 
L*FFIYFCFLPFFFNGPTSP*IGT..FbLI 
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FFFKKPTFGFGG 


4038 


17939 


A 


4062 


405 


81 


GGANVFPQKS P P FGNKKGGH FFS YKI LN 
NPGHPPFLKKKKNAPGGWGNPWFSPPWK 
GAL* PKKKNPPKKKKKSPPPKGGKKKNP 
FPPKKKKKKKKA * GDNFDMGTGN 


4039 


17940 


A 


4063 


415 


239 


RPSFFFFPPPKKSCFSTFFFFLSPGFFF 
PPPFFFTPPPFFFFFPPKKKKYFFPPPG 
KKFFFF* PPPPSLFFFFFFFFFVFFFFF 
FFFFFFFFF FFFF* GLTS PHKQIGLVLA 
FLLALS KITHAS I P VP VS S P S KK KKKKK 
KKKKKKKKKKK 


4040 


17941 


A 


4064 


95 


241 


SPCRSPHRWVNS*ANNLI *]LAYTIAFIV 
KI PLYGLHL * LPKAHVEAP I AGS IVLAA 
VLLKLGGYGI IRLTLILNPLTKHIAYPF 
LVLSL 


4041 


17942 


A 


4065 


28 


288 


YFDIFVEARSPYVFQVGLELLGSSNPPA 
SASQSVGITGGSHCAQP*VTILFTTCTL 
HVNG*I*CIKFLIHNIPILVISLQYIFG 
FFL 


4042 


17943 


A 


4066 


338 


134 


PGGGCS EPRSGHYS HCTPAWQ * SETP S K 
KNKKKKITYFSVKNWGIRVFIBIILNSF 
TTVRP 


4043 


17944 


A 


4067 


26 


365 


WVSLMTSALAMRVDFYFITLLILGLLTS 
TLTIYQ*WRDVTRESTYQGHHTPPVQKG 
LRYG I ILFI TSNDCFFAG FF * AFYKS S L 
APTPQLGGHWPPTGITPLNPLEVPLLNT 
S 


4044 


17945 


A 


4068 


491 


174 


TQLKTH * GSTTDNRTEVRVE PR VRTNY K 
DLLKFL*SKGYDFESETETETIAKLVKY 
MYDNRESQDTSFTTLVERVIQQLVIiSHI 
FKDNYANINAEKLSGN 


4045 


17946 


A 


4069 


210 


402 


NVSKGLRKVPST.Q*VINISCRPGWARA 
CGPSTLGG*GG* ITRSGVRDQPGQKGET 
PSLGPGAV 


4046 


17947 


A 


4070 


202 


I 


DSAIALQPRQ*EQNSVSKKKIYRTIDLiR 
SEYFARWSFTLAARAGVQWRNLGSLQPP 
PPGFKRFSHAS 


4047 


17948 


A 


4071 


412 


143 


FLGAGV * PRLG P KG * TP FLL KNQKL I GH 
GGGALYSQYFGGWGRRNSFNPGGKGFTN 
QNFPPS LQTWGKKGVS FS KKKKKERKTR 
KWGGE 


4048 


17949 


A 


4072 


2 


324 


RRGNFCMF*SDRVSSCCPSWS*SPGLKR 
SSCLSLPKCWDYRFEPLYLARLVLMRCY 
STHNTYIMYCQMQRLFQGHRYLSVSTSS 
ATRPYLDNFFFFCKNKVMPCCQG 


4049 


17950 


A 


4073 


2 


326 


RRGRMLPRTPGRPGNDAQQPQKHSLVD* 
LFIYLFYFLWTGSHSVAQAGGQWRNEGS 
LQP* PPSLKGSSRQSAGRVKGVSHCAWL 
LLDFLSFGKPYTYKKVDKWSFEA 


4050 


17951 


A 


4074 


422 


116 


EIKQEKNPGPFFFFPPPAKRGFFPTPLI 
WGPPGFPPSPFLKRRQGGFFLGPL*KGN 
PPVFPPPKVFFFWAGPPFFFSFKKKFFF 
LVPQSTFIHFIFHICYLL 


4051 


17952 


A 


4075 


3 


294 


VFCHVGQAGLELLDSSNLPS VASQSAG I 
TDTSHLT * PDLS FCCKTKFD 


4052 


17953 


A 


4076 


60 


389 


PSAQLGSTKHIHMAVQP*PPPSPELLHL 
PKLMLCPHSAHTPRPPPNPGTVCTYSPR 
SSPAFPQEGPSSGEGLGGT.VSLFSSPPN 
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HVHPAPTSPESGRGRGGRVPREGAYV 


4053 


17954 


A 


4077 


93 


385 


KPLFAPGVLFGGPGEPKREKKGGAQGGP 
KKFSPPPPPFSFPPPKTLWVAPLFQETQ 
GLRLCPPFFF*RGKGALF*KRGGGRFKP 
PPPPPGLLGPPTP 


4054 


17955 


A 


4078 


2 


365 


RKVEGRVS * DEDLKLTELLRYYMLNI EA 
AKDLLYRRTKALIDYDNSNKALDKARFK 
SKDLKSDCAHPRDC* RA* AP YFLFAKNE 
LWPTGEQRHFLQDVPLIVQRTLTLRTP 
DHTSLPLSL 


4055 


17956 


A 


4079 


6 


298 


PLAIMGGFFTLAETNRTPFDLAEGESEL 
VSGFNIEYAGGPFALFFIAEYTNII I IN 
TLTTTI FLGTTYDAPPRP * LSPSLFYYE 
PPSPYPTPWSTST 


4056 


17957 


A 


4080 


414 


154 


PI PATREGKAGWSLNPEGEGCNKRRSRH 
CTPAWQEGKTPSQKNKKVLLLALKV*S I 
FKLMES * KYLKPGFSLDL I TVI PTYFDL 
FRC 


4057 


17958 


A 


4081 


347 


1 


VLKPRPGNIIFSPKKKKNLPPPPGRNYF 
FFFPPPPFLFFFFFFFFFFFFFFFFFFF 
FFFFFFFF*RGGLIiLFRLKlQLPPTPHP 
YLSNTLRGWNPGEGSCWRSEGGRGEDCL 
QDH 


4058 


17959 


A 


4082 


I 


354 


STFIISLFPTTIFMCLDQEVIISN*H*A 
TTQTTQLSLSFKLDYFSI IKKKKKKKKK 
KKKKKKKKRGGGGLKKKKFNGGGGGKKF 
FFWGGKKKNWGGGVKKPGGGKKRGGKKK 
KGGGEK 


4059 


17960 


A 


4083 


384 


85 


FFFPKEFFGCPRFFFSPPGFLTPPPLQV 
LEFPIKKKLSPPPPPLKNFFFYTPPPFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFLLM*AKKFISWR 


4060 


17961 


A 


4084 


1 


122 


FHRFGRDGLDLLTS * SARLGLPKCWDYR 
REPPRPAFLPQP 


4061 


17962 


A 


40S5 


223 


401 


NGKLVSVFFGFCFCFFEM*SRCVAQARV 
QWRI FSSLHFLPPGFERISCLSLLSRWD 
YRC 


4062 


17963 


A 


4086 


160 


408 


ALPPRRGGGGAPSFNFCLLPPGP*GKKG 
GYGDAPS CRSGLGQHGETAAPLKAQKS F 
GHGGRGYQVRRRVG P PGPGGRRQKLNEG 
APPPPRLGGKAAP 


4063 


17964 


A 


4087 


261 


81 


YDIDGRI FCQKLESSHYSPGAVAHACNP 
SPLGG*GGQITRSGVQAIHPSWPPEVWG 
LQA 


4064 


17965 


A 


4088 


428 


140 


Q PLKNPPLGGRAP P FLRGKKLP P P PPQG 
EPPPF* KKKKKKRGGGGPPFFP PPPRGE 
PKKSL*PQKGGGKKKKKSPPPPPQKKKK 
NPPPKKKKKKKK 


4065 


17966 


A 


4089 


1 


205 


FRLRQENCLNPGGGGCSELRSHYCTPAW 
VTEQETLSQKKNFFN*KTFRKAKSTGNF 
HMPNTS I QQLP I 


4066 


17967 


A 


4090 


154 


410 


ITGCSII K YFYRAMREREKS EVGYFPS S 
SINTPLRHRLSDGVLDSRITSLSTHSPA 
P FIKS KNR PGTVAHACNPSTLGG * GGRL 
TK 


4067 


17968 


A 


4091 


428 


67 


LPHPPPPPPPLSSSSSSSSSSSSSSFPP 
PPPLLPGQGRTLTRPLPAPAGQLGSRES 
CGAPPSS LGASLHVA*NADGRQ * RAALS 
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TPHGPLTGSPAAWRGRREQDLLLLLCS 


4068 


17969 


A 


4092 


431 


3 


APTSNKGSTIPILQIRKLRATPCSSIGP 
SPSTLPVKLEL* SVLLPGQWLNFSFCHF 
FPRPKCSNRPRYTLPVPKLPSSGLPPRP 
SPSQQVTAFPPLTSRBCEPSSPSTPASDA 
FPFPPAPVSARRQEVLCES I PS VTMHE 


4069 


17970 


A 


4093 


151 


1 


S IMPMWMDAVAHACNPCTLGG* GGWI I K 
SGVRDPPDQHGETSSPPKTQKL 


4070 


17971 


A 


4094 


35 


363 


LQSKIYCTFFFFLERGFFFSPRVGRPGA 
H PGLGAKTTGVKGNP P P * P P KGGD PGGV 
PPPPGCFFWFFLGKGGFFLLPGGG*NFG 
KKKKPLVW PPKGGE LKGGTPGP P P Y 


4071 


17972 


A 


4095 


376 


1 


LGKSPPFLGAPFFLKKGSLLGPLLGKGF 
TPP*LGGPHPLEKGPLLGQWNPNQKVWK 
KRGFPQKGGENPPFFFFFFFFRDGASLC 
CPGWPQTPGLKQS SNLSLPS SWDYRCLS 
KSLFKENTETLAS 


4072 


17973 


A 


4096 


65 


446 


PFFLFFLNKGPLFTPKRQGKPRNFPT*T 
PCPRGKKNPPPPPP*GGEKRGGPPPPGK 
F FFF FGERG FPGGGVL FLDLGT P PLGPL 
KGGEKRGRPPGPGPKKPFL* KRKKNSPP 
TGGERGRGGKKHCLP 


4073 


17974 


A 


4097 


34 


387 


I KEADEYVLTRLACVFVGDMEPLSSPHR 
LHTRIGSSPSTTCCTPACPCSSW*HGAS 
VIPSQTAYEDWFITLYNVLYTSLPVLLM 
GLLDQVGASHKQGH FWTDENALEKSQAN 
VSMKSSGVYFFLIKPYYNSWRHQGSWL 
CFLFCNKERGF 


4074 


17975 


A 


4098 


80 


422 


ILHVLYTCI ILCFVLQS* ILWFVLQNYI 
LHVLYTCI ILCLHVLYTCII YMCYIHV* 
FCILALFAGLS * DS P YHESL * * S FFMEA 
DISLCRCVITDQLNFICSLLQH+ELRMF 
QF 


4075 


17976 


A 


4099 


74 


402 


I YLL S THLH YQ FS W I S I TFD I S FFFFFF 
LKKESLFFPQRGGGGGNU5LLKPPPPGE 
SPFPPPIFRGGGKKGPPPPPWVIFFFF* 
KKGGSPIYPGGG*IFGPGDPPPPPP 


4076 


17977 


A 


4100 


3 


335 


DAWASAWVTE YDS VS KKKKKKRGGGGKK 
KQFPKKIGPRGSLFKPRGGEKPLSIiKTP 
PPPFF* F*KPPFKKGGAEPWEPFLKPPF 
CLKDRGGAPKIYFFPNPPGAPRGAAFIK 
RGGGK* *GPPGGLLL 


4077 


17978 


A 


4101 


406 


1 


RFPKPFPPKPKMSKWALFK+KGFFPPRE 
LLIGPPPPPPKKKKFFPTLFFAKVLPPR 
FF*NFNQVSKKRPTFPIiKVGQ* PPPMLV 
QKKPTRGFPALFKKKKRSSLVIREIKRK 
PTMRYHLTPIRMAVFHKSKNNRC 


4078 


17979 


A 


4102 


2 


183 


VN PGGGACS E PR * CLCTPAWVTERDSVS 
KKRADNDKQGALRS FFLGFPFFLPLHPT 
SFAF 


4079 


17980 


A 


4103 


119 


391 


DNLQCFFFFFWKRGLLFLPRGGGFKSTG 
PFSFGVPGTPPPPPPGEGGLTPKPPPPG 
* ILFFLEKGDFPLLARW*NRQKKKNPP 
PSPPKGG 


4080. 


17981 


A 


4104 


413 


64 


GFKPPPPKDPPPPPPQKGGFMGWRPPPR 
P FAPLFS P P P PRGGGG I FP *GGGGGAKK 
FHIWGPFG* XKPPPGGG*NSGGQKNPPG 
GGGKNP PQKKKKKKQPTTAN I WFLCLE I 
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LSGM 


4081 


17982 


A 


4105 


2 


196 


RGRVGKVLI SLFNTDLLWLLC I LI CFHK 
TQ*LCNG*CKSRMKCVCVCVCARVCVCV 
CVCERERD 


4082 


17983 


A 


4106 


1 


335 


FLVET* FHPVGIiAGLELLTLGDPPR* ST 
LGDPKCWDYRCEPLDPASVSFFLCLKLF 
IFVLLKLFFFFKPPLYSKLTRGRDNNYN 
YFPPPNPGPLPHWRADKSIIGGNRRLL 


4083 


17984 


A 


4107 


1 


148 


GERGCSELRS CHCTATWAAE * DS IYKKK 
KKKEPPKFFLAGFSGGGKKTP 


4084 


17985 


A 


4108 


331 


41 


GLNFLAQKFSGRFSPPPPGRVPLGPKPF 
FFLGKPPILEGFPGP*KVFFSPILPPKE 
VLTFFSPPGEKIGPPFSGGACSGFPPKG 
AKKKKKKRPWSN 


4085 


17986 


A 


4109 


137 


3 


TKKERNINRGPGAVAHTCNPSTLGG*SG 
RITRSGDRDHPGQHGE 


4086 


17987 


A 


4110 


3 


269 


GFHRVSQDGL YLLTS * SACLRLPKC WD Y 
RHEPPHPGFLGFFCFFFFFRDSVQPGQQ 
SENPSHKKKKDEGPFWKGAGKIFLFKPN 
SSAL 


4087 


17988 


A 


4111 


196 


330 


• GAIARGRSNCDCI *GSVGHVL*NDPTLN 
CILDQDTLYGSIMINEYYERASDAWALL 
PDEEQLPEGDQTVIA* * A*DMSGGQQHR 
VSLARAVYSGADVYLLDDP 


4088 


17989 


A 


4112 


426 


241 


LLKRVRHEILFTIRGKGFSEPRSPPCTP 
AWATER I S VSH KEKKRKVLVRP * VDLSN 
SSLSVL 


4089 


17990 


A 


4113 


394 


1 


GPGVFSPSGCLYHARAQYFWPHKKKKYF 
PHPGQKMSVFLKGRPLPGFGFIjLIFFFF 
LVEMGSCCIAQAGLELPTSNNPLTSASQ 
RAE I KDVSHRSQ * FFVFLFETGS CSVTQ 
A*VQWCNHSSPRTPGLKQF • 


4090 


17991 


A 


4114 


328 


2 


KKRAH P I W PDTG I E VPNS PS PN I RFLGA 
PESFFFKRNLPQGDFLEKPNPCYFFFQG 
VTTPPFPLFPPPPKIFFFFF*DGVSLCR 
PGWSAVARSQLTATTHLPGSSDSP 


4091 


17992 


A 


4115 


220 


3 


FKKGEPGDPPPVWFGIiGWVWVWVGFFFF 
ERHS FTLVTQVG VQWCNFSSLQP PPRGL 
KSFSHLSLPSS*NYRH 


4092 


17993 


A 


4116 


342 


2 


PINFLYLSFNPNQV*RSPNFSKILHFCF 
KILSFFYTLAFLPFSSQLIRQSFFRVYL 
INTICFLQDLYYLYILNLYLILFLFFHL 
ASRLLDFCLFFVFETVLLCKPGWSTVAQ 
S 


4093 


17994 


A 


4117 


124 


352 


TSGFT*TRMKSKIPT*RSFI IDLLEAKM 
KEKFLKAARETQLITYRGTSIQMPVDFS 
WRKTEAKAQWNLMFEVLKEKNCETRSLH 
PATMSFRNEGKTKMFSDERKLRDSVTLP 
LKD 


4094 


17995 


A 


4118 


11 


387 


KTGKLLATSAPGDGIMVMVETFCLWSF 
FLSVSKKKKKNPLPQIXNPLPKPCWVFK 
RAPTPNH*KAFPGPPKTPP*KQPKKQIP 
CRPPKKGPFCI*PNPGINPSPLFFEKKN 
KGKAPPFKNSKQK 


4095 


17996 


A 


4119 


2 


407 


NT FQDQSGS S SNRE PLLRCRDARRDLEL 
AIGGVLRAEQQIKDNLREVKAQIHSCIS 
RHLECLRSREVWLYEQVDLIYQLKEETL 
QQQAQQLYS LLGQFN * LTHQLE CTQNKB 
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LANQVSVCLERLGSLTLKPEDST j 


4096 


17997 


A 


4120 


157 


2 


LTGSHSVT*AGVQWCKHGSLLPRPLGSK 
RSSSLNFLYSWDHRCGPPHSANF 


4097 


17998 


A 


4121 


3 


168 


IALKSGRKVD I RELGHWC PG WS * TPEL 
NQSTCLGLPRCWDYRREPPWIiALNMP 


4098 


17999 


A 


4122 


2 


376 


ETGFHC VSQDGLCLLTS * STRLGLPKCW 
DYKMREPPRPADGFINIRDFNPLLRTST 
CCCHVKKDI FASPFTMIVSFLRPPQPCR 
TDP I QNTLPLVI MS PVALLGCDS FSAF P 
VLMNLPFSSDCTK 


4099 


18000 


A 


4123 


11 


221 


GTQLLLRLRGEDLLNLGRGGCTEPRLHH 
CAPFWETEPDPV*EKKQAAKAKNSLLTA 
ALLKTQVKKDSVAW 


4100 


18001 


A 


4124 


2 


125 


AIIKKTKSNRWW*GCR*RGMLVHCFWKC 
KLVQPL* KAAWRL 


4101 


18002 


A 


4125 


425 


152 


TP VI LALWE AKAGRSLE PR S * RSAWATW 
QNP I STKKYKK* GGRITSAWGG*GCNGL 
*WCRSTPASATETPSQRKKKK* EKNLFT 
LILKVIK 


4102 


18003 


A 


4126 


3 


99 


CQAGLELLTL * SDS LRL PKCVJDYRCE P P 
RPA 


4103 


18004 


A 


4127 


396 


58 


ENCNPLS WSGGI ISGLLLVSGYSNKFPS 
FLVCLLPVIGPPENTAGPSLVPGTKWHD 
HSTL* TSQTPGPSLVPGTKWHDHSAL * T 
SQTPGPKLSSRLSLPSSWDYRHKTPYPA 
H 


4104 


18005 


A 


4128 


51 


549 


LGQS YLLLLRKCFS FNFQVGDLD I S Y IN 
IEGITATTSPESRGCTLWPQSSKHTLPT 
ETSPSVYPLSENVEGTAPP*AHQSFMSP 
PS WGGS PLNLFFGGGGFEKEQS PLKKKS 
FTLYPLGPPSEGEGHF*PSPLCFPPGFQ 
SPPKTGIPSGDELGFF*TTPGPDKKK 


4105 


18006 


A 


4129 


97 


362 


RK*SACLSLPKRWDYRHEPPRPACSC 


4106 


18007 


A 


4130 


376 


279 


DQDDLDLLTS * STHLGISKCWDYRHEPP 
HRAS 


4107 


18008 


A 


4131 


133 


344 


IIFFFF*NFVLQAGGQGGNLGSLKPPPP 
GLKQFLGITFGRSWDHGPTSFTRANFCI 
FRKKR VLi LCCPGW S 


4108 


18009 


A 


4132 


164 


3 


MNEMSYESSLLDHLLKRQ*NQCCLGSSE 
PSPPRLKQFSKLSLPSSWNYRKTSK 


4109 


18010 


A 


4133 


355 


2 


GKKQNRKTGNF KTH S AS P P P P P KERS S S 
RATEQSWMENDFDEMREEGFRRSNYPEL 
RED I QT KG KE VANF E KNLEE C I TRI PN T 
EKCLKELMELKTKARELREECRSLRS* C 
DQLEE 


4110 


18011 


A 


4134 


193 


357 


HHCNPS LCQNDLFWHLVLS P PTGVQWYN 
HGSLQP*PPGLE*SSCLSLPGS*EYRH 


4111 


18012 


A 


4135 


357 


104 


LRRL*YENHLNPEGGGCSEPNLLRCTPA 
WATEQDS ISKI III I *CNFLH*QNMQKI 
KQNPKTNSSKILTEIKISLKGNLRELRE 


4112 


18013 


A 


4136 


179 


381 


DTSLHIIQNRLSISKKKL+TLYFIY*DR 
VPLCCPGWSAWQS * FTAALTS LKRS S C 
LSLLSNWDCRC 


4113 


18014 


A 


4137 


89 


318 


CLE I S CT K I GQDL YT ENY KTLLG KVCKC 
PNK*NTTFMDQAEVSILKISFLPRLIYR 
LRI I PVR I PAG I FVE I LKFG 


4114 


18015 


A 


4138 


315 


60 

1 


KKLPAEIHAPLAEEATD*AELKPAGETH 
AEVQAPTEQTFAAE AATT IAEASV KVQP 
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PPAEEAS LAKFPAE IQPPADQERGSED I 


4115 


18016 


A 


4139 


377 


51 


GFTSPFPED*RIPGLtFSCIGWTLQSFHF 
RNIGSVPFSPPGFFFFPQEMRVLLCCPG 
WSQTLGLKPSSCLSLSNSWVYRCAPLVP 
GMPIF*LSCFLAIGFLEFVIYIGY 


4116 


18017 


A 


4140 


413 


175 


NFGIFCRDGVLLCCPGWS*TPRLKPSSC 
LPKCWDYRREPRTVPGLPFLFLNLRKDG 
NSVLE I YLEYKI TQ I I VQRLLVL 


4117 


18018 


A 


4141 


1 


394 


EI F VTKRWSGLG WAHASNPS TLGG * GG 
QITRSGV* EQPDQHGSPERRGLKERRGL 
MVMKATFLGLRHHLEHIVGHRELRDLHN 
NSKKKISSHPLSTPHVPGTVLKPCRLIL 
CFKHHQKPVRKP PNS PS YR 


4118 


18019 


A 


4142 


22 


376 


LVNKGKTI FFFLKRGWLPPKGKGRGIN 
TVNGSLNFRGQGNPLA*PPKKPGTKGGG 
HNPGEI * * FFGKKEA* QCGPGGSGI PG P 
RRPSGLTLQKGGTYGREPSPLPGPTKEN 
PRLTKT - 


4119 


18020 


A 


4143 


161 


1 


P I PAKYEHFCFF VS LF* DRVS LCHPG * S 
AW* SQLTTASTFQGSSDPLTSATP 


4120 


18021 


A 


4144 


18 


159 


KHLD PGGGGCRE P R S CHCT PAW VTTAKL 
HLKK*K*VELKSPSVIHT 


4121 


18022 


A 


4145 


383 


109 


LFFYKIQNYTGHGGPCLLTPFLQRVKQK 
NCFNPGGGGCR* PKLVFCPPTWGKKQGF 
VFQKRLKKPKPKPKPKPNWRGLFLPPFS 
KRPKPKI 


4122 


18023 


A 


4146 


374 


164 


QCQLLRSLRWEDCLSWGG*GCSEP*SCP 
CI PAWVTARSCLQ INKSLPAE I VI PSLY 
NQIARG 


4123 


18024 


A 


4147 


25 


375 


RKKKALFFCQGGSQPPPSNLMDPPPPGE 
KKISWLNIiFPKKEKKKF*KKKKFF*KVK 
KNPFLKPKGRPPPGEKKKGKKGKKPTPQ 
KGKGFLFAPPPKKPIKKSPHFKIPKKQK 
NFWQR 


4124 


18025 


A 


4148 


376 


202 


HYNSKVFPGGPKRSFLFFLKALSFFFAP 
APIPFWLHSKIFFFFF*DRVSLCHPGWK 
LR 


4125 


18026 


A 


4149 


3 


144 


IiFCPDWTPTPGLKQ S SHLGL PQ CWD YRH 
*VTVPGLPISFVFFFFPP 


4126 


18027 


A 


4150 


3 


188 


QLQSQLLSSLRPEDHMNLGVEGCGKLSL 
HHCTLA* MTE *DS IS* KKILKKKCF* KK 
HSFLY 


4127 


18028 


A 


4151 


83 


358 


GWARWLTPGIPALWEAKAIKSLEDLNA* 
A* QTYAPS * EESLHP I MNTL I RDP VWT 
SFRVTVDTSTIARHWLFCWDLLQAS IGH 
ILILGPG 


4128 


18029 


A 


4152 


162 


1 


TVWYWHKNKLSNQWNKI * FLELDPI TYK 
HLVYDKTGISNHWVKDALVKTQCRAG 


4129 


18030 


A 


4153 


369 


20 


FQLLARQRQENGVNPGRGAFG * PKSRP C 
P PAWGTK * DS VS KKKR KKKS CE F VHR I L 
KLEEAYHSLGSNTAQRLFVSCFHKKILC 
LSISIPDPYLQFKMWPVPPSKPLYQIDW 
VGLL 


4130 


18031 


A 


4154 


117 


2 


QIFFFFFFFKMESHSIAQAGVQWRHLGS 
L*PLPPRFKR 


4131 


18032 


A 


4155 


369 


217 


FFSPPPFFKPPPPPKKIFPP*KKKNPPP 
PKKKKI FFFFFFFFFFFFFFQTP 


4132 


18033 


A 


4156 


214 


283 


ASPSEGQGLSSGP*GLIN*SDKPSVTKS 
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PKPHCETA*GGTSFLSSSGSSPQAPKSP- 
HHGPDSPLSDSIPWVISPPEPPPLRAKG 
CPQGPPGASWWGTT 


4133 


18034 


A 


4157 


319 


712 


QDFVS KQHSY CLKNLKSS S L I QVAMNRA 
QVCLISSSKSGERHLYLIKVSRDKISDS 
NDQESANCDAKGNNFICIMLYKIWLN*K 
RLMK* HWTAI * LK I YFSNKMKFSNLNLS 
SVINSFDVKKERLNANTLN 


4134 


18035 


A 


4158 


183 


369 


LKRKQSQRGEVNFLKVTQPECGKAGI LF 
IYYFFEMESHSVGQTGVQWCNLG*LQPL 
PPESSD 


4135 


18036 


A 


4159 


190 


336 


DPISIKNFWPGTVYFLIETGFLHVG*AG 
LELPTSGDLSTLASQSAGITG 


4136 


18037 


A . 


4160 


10 


390 


QGILLPCFSMSSEERGRIYSNTFSFFFF 
FWKGSPINPQGGSQAKKFR*REPSPSGL 
TPQRPGNGGPPPPPGQNFFFKKKGG* PG 
GPGRA*TPGPRGTPPLGPPKAGNKRGGP 
RGRAKNFLKNQNGPT 


4137 


18038 


A 


4161 


53 


376 


YLFAFFFFSKREAWQGGQPGMEGAKKKF 
MAP* PSKGGGGEFSGLTPPNQWNPRAQP 
PPRGELILILEKKRPAPNAQPGPENLGP 
REPSPLAPQKGGNKSENPWPPPP 


4138 


18039 


A 


4162 


1 


356 


GVFILVSFNFWAFFLSFFFLGKQILVLM 
PQPESQGQNLAPQNPPPPG*RGFPALTP 
P RTGNKGMAPQAPQ F FGFL KKGG FSHGG 
QGGFKTPTLGNLPPLAPQGLGNNGLGPW 
PPPFFF 


4139 


18040 


A 


4163 


345 


26 


APGFKPALGPPGGPFFSKKKSKNL»PGLR 
GSTLQAVGPTFLGG * GKRS LKPPGG* AS 
RGPGWGPSPPPGGPN* I PVSE KKEREGK 
KKKGREKKKETFS SNLLI FLSF 


4140 


18041 


A 


4164 


18 


284 


TLQGTSGIFEGNFFFFLGTGISLYCPGW 
RAWGGDHGSLQL* PPGSSSPASTPLGRS 
WDYRHVPPGWAS FFFFKTLPNGVTQAGL 
ELWG 


4141 


18042 


A 


4165 


377 


30 


FSGKKKFFQLGWGCGPPPLIPPLWGV* P 
GQPPRVGGFNPPLPPWKNPVFFKKPKQP 
GVGGASPYSPFLGKFRPRIPFTLGPEGS 
G*PNFPPFPPPWAPKKNFSPKKKKKFYN 
CLW 


4142 


18043 


A 


4166 


305 


1 


ESRQVLTSRLTLLSIHEFNYLLNNSTYK 
H I TSLP INI RQ YGQDLKTLKKETKDLNK 
*GNIPCLGIGRINIVKMSLLSKLIYKFK 
AMPVKIPGELFLRNQQA 


4143 


.18044 


A 


4167 


394 


78 


EGKKQGRKKGRREGS * E3RREREXTR* K 
AEKEGMKEGGREGKKEGRKEGRRKGRKE 
GRKEEGQKGGRTDGGWWVLRSLDTSQLI 
SFASGPKWREASSLVWASLCL 


4144 


18045 


A 


4168 


1 


431 


CNTCVR* CAYWCV*MCGERLCWCVCSCN 
RVCCV*VERVCVLVCMSVCWCVCCCARQ 
CVSVCHSRIESSRPSSGPVAPSKCQTHL 
YLPKLSAPPAAGLWDLPSEATASLSGSL 
CQPSCPRTRSFATLVSSMYHRPVTGELQ 
SIT 


4145 


18046 


A 


4169 


3 


423 


YNARE I EAAAGRDHATALQ PG YR VR P CL 
KKKVLKLGQAWWLA* F*NFFFETGSHSV 
PRLECSGVISACCSLNLPGIV 


4146 


18047 


A 


t 4170 


J IS 


402 


OICTCTPKPKMKVKKITMLS^TWRYLYI 
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HLTQDI *DLDIDL*NTADMNLKDLNK*R 
EIHCLRI * RLNI I KTS I FLKLP YRVHAI 
PFKI SARSFVB 


4147 


J 8048 


A 


4171 


404 


285 


RGCGEPW * HHCTPAWVTE *DLVSKNKMK 
QNDNLSHLSLA 


4148 


18049 


A 


4172 


274 


431 


ESLVGQRNWVAKYFYRI TAPGRAQRLMP 
AI PAPWEAEVGGSQGLE I ETLL* NNS SR 
PGAEAHACNPSTLGGRGGRITRSRD*DP 
PGQH 


4149 


18050 


A 


4173 


24 


366 


FLCLFHTCVLAFPVQQLCSVGWGCKVEN 
ADREEKQPCDSQAREEPRLCRRGFNRAP 
NINTRHEKLFFVCLF*TGSHSVTQAGVQ 
GCGHSSPQPQLPGPKRSSHLSLRSSWDT 
GM 


4150 


18051 


A 


4174 


397 


1 


TEPVLANETWGPWDRGPLGLGGLGPENT 
DNSHAVGADQLLKEPPFPLSTRSIMBPM 
SLNAWLDSPHREMQAGTPLSLCGDTYET 
QVM* S WG I RG I LPQQPWKMGASLSLTAS 
LSLSFAHTLKHKHTHTHTLY 


4151 


18052 


A 


4175 


90 


387 


KGRWLFFPQGGERGHNG*TAPPQTGGKE 
NPPPPPPKDRGKKANAPPPGGIWYLKKN 
GFFPIGQGGPEPPPPRGPPPPAPPKGGE 
KRGGPPHRPGKKLLK 


4152 


18053 


A 


4176 


398 


280 


RRLKWEDPLNPRV*GCSKP*SHRCTPAW 
VTE*DPVSNFF 


4153 


18054 


A 


4177 


253 


395 


LFNFFFLVKMGSRYVAQAGLKPIiA*GNP 
PASAS PRAGITGGSHHTQP 


4154 


18055 


A 


4178 


366 


48 


P WAS PG I S LS F S F LRTKPT VKVR* YRVG 
PQRQPCPSRWAPPACPFSMFPPAWVSPP 
GSAASRPPRHAGSQVGATASPPPAQGLG 
APAWTAGLGEKQKLAARMRGT 


4155 


18056 


A 


4179 


44 


413 


GDGVNSAFFFFFFLEKKVWFIPPGGGPK 
PEFGFRAPPPPGVKKIFGFTPLRTGE*R 
PLPFPPGKGWFFKKNGGSPMWPGGV*PS 
DPKGAPPPGPPKGGE*RGEPPWVGWNFA 
LLrKERKPLFKE 


4156 


18057 


A 


4180 


396 


235 


HEVSLCCPDWS*TPGLKYSSCLGLPNCW 
DHRCRNHGPGKTQIiEQGHRQVAGLSR 


4157 


18058 


A 


4181 


85 


408 


YTCSLRITQDHFFFFFFFLGKKIFFFPP 
GWGAGGQI LTSPKTPPGVKGI PWLNPPR 
GGGPGGPPPPPTLFLGF*KKRGFPGGGQ 
KPRPKGDPPPLPPKGGGLKGGPTP 


4158 


18059 


A 


4182 


1 


149 


NHLNL * GRGCSEPRLHHCTPAWVTE * DS 
VLKKKSYIHLGQNPTVRKVLL 


4159 


18060 


A 


4183 


310 


3 


ATAPSQFI RIN * RNHCHTY IEVQS EAAS A 
NLETAVNNPENLGKIMDEGG*MKKQIWK 
EMTSRIFIVEKSMPGFKTSKAQALILGA 
NIAVKFRLKPMLIYHLEN 


4160 


18061 


A 


4184 


1 


359 


PTRPLVLDRDRPPFFFFLGSGKKKNPGV 
FLYFGGGWTPPKKKKNSLGWGGGGLKKG 
GGGVFFKPLPFLGGGPFLKNSGGVDKKK 
RAPPFFTKTPL* KKPPPPFPPKI FKARG 
FW*KMGEPFFFCPPPPSFLKKGPPPKKE 
GV*KKPPPPPFLTPPPPNQGNFFFFWGG 
STPPQNI KKPLGFFSSQTPKKKKRAADR 
DLELADAW 


4161 


18062 


A 


4185 


1 


362 


HFTKYLF I S C* K* * LLCWMQDCCKKG I P 
MNTNVIQEKATSLYGSLKKKEGEGPKAG 
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SEQH> 
NO: of 
nucleotide 
sequence 


SEQ IP 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=lsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknovvn, *=Stop codon, /=possible 
nucleotide deletion, \=possibIe 
nucleotide insertion 














EFNAS KGWLDNFRKSFGFKNVRITGEAA 
SANQKAADKFPDAIKKIIEEKGYLPEQI 
FNAGESAL 


4162 


18063 


A 


4186 


366 


47 


AALFLVDPPGRGGVKGPPPPPGPFFVFF 
QRRGVPPLSPGGFFFRPPNSPPPGPPKG 
GGSGGGPRGPA* THG VFLKKNP PNCGAG 
KKKGFFLKKKKKKPVFWFSVK 


4163 


18064 


A 


4187 


394 


112 


KNTENLKI S WAWWHVSMVSAPNEAETGG 
STEGTGCSKL * S CHCTAWVTVRETLS PR 
KFFKKRANYLTFRI *KVCSLYHRKTKNS 
FKKHTQGRKF 


4164 


18065 


A 


4188 


230 


372 


KPM FL P FNPAI LLLG I YLKE YKS F * HKD 
TCMRMFITALFTITKTWNQ 


4165 


18066 


A 


4189 


149 


2 


NRSLRFMRFKEGSHLPPI*VQGKAASAD 
VEAAASYPEGLGKTINKGGST 


4166 


18067 


A 


4190 


2 


159 


MHQPWLLRRPKQENCLNPGGGGCSEPRS 
HHC I PA * AIERDSLS KKKKKGGAF 


4167 


18068 


A 


4191 


180 


415 


LGFLQLLLFVGRSMFIFALEIiSHGWLGS 
SLSF* PVCVKQSSHhShSSS WDHKHMSP 
YPANFCI FCRDKTLPCCLGWAQ 


4168 


18069 


A 


4192 


424 


41 


PPLLRAPPKGLL* PKNSFPPGAPP* IPF 
PPKKI KI FKFTPGGGGAPLFLLPRQVKA 
EGSLPPRVFGPPPFFFPPLPSPLGAKPN 
PFFFFKKKKKKEWLHKSSLFYSAHFVLC 
LKYSLSFFYQSVKXK 


4169 


18070 


A 


4193 


105 


424 


ELKRLVIKLIRGIPEKGKAQCKEIQKIiA 
QEVKGE I FME I GSLKKKQ * K I QETLDTX* 
LKMQNALESLSNRIEQVEERNSELEDKV 
FKLTQSNKDPSQ I KXKILYNV 


4170 


18071 


A 


4194 


3 


240 


IiCLQSQLLGRLRQENPLNLRI *GCNEP* 
LHHCTPAQVTERDPVSKENKGSFIPMKI 
GKLVSLTLNRMCNCKNSEKENV 


4171 


18072 


A 


4195 


385 


1 


KMII LTKKMEI KHKDEKELQKTEVDLKI 
ENNTTRARLTNNIKKKPPESEKTNTKNH 
TKKS L* DQDTKKETRPQKKKNP FKSR * V 
RHWRNILNLESRNRGKKNPSGVQKRRDS 
VPTLITQTQETWWWCF 


4172 


18073 


A 


4196 


3 


272 


LFYL I MALKHKS SDVGNSNI YAKEKLES 
ASFKERI *KSYVEVAKIYGKNEYCIYEI 
VKKEK* IMHS I YRVQYYLQ FQTS LEVLE 
RIPHG 


4173 


18074 


A 


4197 


236 


379 


GGR FKG S N FTS PG LQGNS F FMG P P KLNS 
RAGV* QRREGKNPGVPQFNR 


4174 


18075 


A 


4198 


375 


1 


NFKIPAPPPQNKGPLRVGTPPFLiIFSPP 
GSPPKSFPKAKWARRDRPF*NPGETLQE 
G*VFFSGRFFFFFFFRDRVLLCHPGWSA 
VAQSRLTTSSDSRVHTILLVGRGCSEPR 
SHHCNPAWVTRVK 


4175 


18076 


A 


4199 


1 


270 


P SRTGPQ I PRRPTRSNCPNQFLGF * GCG 
EPRSRLCTPAWATEQDSISKKERHGLLV 
FLKGFIHYIRLIilNLFAARGRKRVLFAL 
GGREMW 


4176 


18077 


A 


4200 


388 


1 


P?GIINLFDPGL*PPPPWVKGPPPS+KL 
ISPQKKKNKVPPPVPPINIFSGPPPPTL 
VFWVLFFVSLKRQLIL* * KSGVSAASLV 
RHDGAPRLTS SAGKGHAEPTFQPR PTHH 
RNACSFVLIHIQSYKVF 


4177 


18078 


A 


4201 


298 


158 


FKI V INFQFLTP I FAPSEGGRGG * GRGE 
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SEQ IV 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ JO 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=GIycine, H=Histid ine, l=Iso leucine, 
K=Lysine, l^=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glu tarn ine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LRSCHCTPAWVTE* EPI * KRKI KGKKRT 
EQESRRQNLA 


4178 


18079 


A 


4202 


128 


358 


KLMIPPSPGQPPRPGGRWGGHPPARPAA 
PSGR * GAPLPGRP Y * EKRI PSARTPPRL 
GGWPSMSLRTGHDDNGGVWE 


4179 


18080 


A 


4203 


406 


1 


FF FFFLRQS FTLVAQAGVQ* RNLGSQQP 
LLPGF 


4180 


18081 


A 


4204 


322 


2 


KPRPRKTPGGPFPI *GGGGAPPPRGGAP 
KGGSPPPPPFPPPGGKKGGKKKNNPPGG 
* KP WGGFFFFP PPF* TPPKKRGLFLKKK 
KTQNPKKKKKGGRSRYRTSPRV 


4181 


18082 


A 


4205 


3 


127 


GFHRVSQNGLYLLTL*STRLHLPKC*DY 
RSEPLCLACFLFL 


4182 


18083 


A 


4206 


1 


236 


GFRRVGRDGLDLLTS*SACLGLPKCWDC 
GR*PPRPAHLGGNSNAKEPGLPACPLLS 
GHMHRAVWVRWATATL 


4183 


18084 


A 


4207 


427 


0 


LFFTTNFFFWARVFFLPPFF*NPPPGFF 
FLPHKKKKNFPPPPGFFFFFFYAPPPFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
PWLKTWSC 


4184 


18085 


A 


4208 


2 


404 


PRVRLGIWCKERLFLCYFFFFLERGFCF 
LACGGGPGGNFG*RAPPPPGEKKIPGPP 
PRGGGEK6PREPGRKIFGFLKKKGGGPF 
GPGGVGPPKKGPPPPKPPKGWGYGGKPP 
AQKRGGFFFGKKRAPFFNWVNG 


4185 


18086 


A 


4209 


40 


412 


PLFCLSEKPRYFE YAI YLMLSLKFLES C 
IMSVNSSAVSASVYY*W*KIEREPLTSW 
TLEKLDPSVFHQKFAFT*RAKAILRKKN 
KFEGLTLSGFKTHYKTTIITTVWY*HKD 
TQIDQWNRIESS 


4186 


18087 


A 


4210 


219 


392 


HFFFLNYNFLGKGVS FCPQAEGQGRNLG 
* LHP PPPGLKQFF CLTLLRN WNHRL VP P 
PP 


4187 


18088 


A 


4211 


409 


2 


LERKTVFFSPPFIFFFAPVFFLSPFFYT 
PPPLYIFCPPKKKKIFPPPPGKIFFFFK 
GPPPIFFFFFFFFFFFFFFWS*FFIMYQ 
IVFNISYNLIKVDSHLYLFKDKKILLCT 
ISSDAW 


4)88 


18089 


A 


4212 


24 


399 


ADAFSTTNLHGLGPDLFTPTTQLYINLG 
MAI LL * AGAVN I G YRS KI KNALAHFLPQ 
GTPTPLI PILVI IETINLLILATALWH 
LTAIITAGHLLMHLIGSASLAGSTISLP 
S TLT VFT I L I LLT 


4189 


18090 


A 


4213 


411 


197 


SWLTAPSFKRFSCLSLLSS*DYRLPVPH 
PDNFCIGSRDGARMVS IS* HHDPPAS AS 
QSVGFTAPKVLGLQT 


4190 


18091 


A 


4214 


218 


126 


KEPEPLLFRSSWSVGAIDLKNVMRRVLI 
KSCTPGWAHTCNPSTLGGRGGQITRSG 
DRDHPG*HKSALGSCASCFIYLWDLLLH 
IVFE 


4191 


18092 


A 


4215 


1 


332 


MAPSLCLSNKKRGFIGPGFAGAPKHPGR 
GQGHPREKLAFFKKGPPKPGLKSFLVLK 
TPPPKPFFFLSPKGGALKTEEEP* TKKK 
GGAEKKPPPGEKRPLGPFCPKKPNWG 


4192 


18093 


A 


4216 


1 


185 


KLYLSILISLQISLI ITFTATELI IFYI 
FFETTLI PTItAI I TR* GNQPANFLY3JLV 
SFHEN 


4193 


18094 


A 


4217 




388 


I.RFCWETLFPLPYNPFSFHFSFLFFGFS 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
etb 
od 


SEQ1D 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
OGlycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 














FFFFLGGKFCFCPQPGGGGPNLNLRDPP 
PRG*KEF*APPPPRGGENQGSPPPPPNG 
GGF* KKGGLNHGGGGGFKPPPQKKPPPL 
ASQRGRNKGGAPPLLKF 


4194 


18095 


A 


4218 


3 


195 


VI YST I FAGTL I TALS SH* F FT * VGLE I 
NMLAFIPVLTKKINPRSTEAAIKYFLTQ 
ATASHPA 


4195 


18096 


A 


4219 


261 


1 


E PRRRERKSRKEKKKGATPSR I CMKRER 
SERFS *ERRERGEEGERGEKEERERGER 
ERERSEEERERERERPKN*MRLNGRTRR 
TRG 


4196 


18097 


A 


4220 


5 


375 


DEMLHLKFTYILN*TLKDTIIPKVNENL 
YN*DFLNSKVQGTPPPPRPSRPPSSPSP 
DPP PG PP ? AG AGRRARRPGRRG S ? PGG P 
PAPPPAPPPRAGRAAGGRGRPPARPGGR 
GPAARPAGGGG 


4197 


18098 


A 


4221 


258 


267 


GGGALKKKI YCGGGGGNFFF *SPPPSFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFL 
IYFIFILIF 


4198 


18099 


A 


4222 


127 


330 


KKKKKKKKKKKKKKKGGGPFKKKKFLPR 
GGGKNFFF*GGQKKKLGG3VKKKGGGKK 
PGAKKKKKRGEK 


4199 


18100 


A 


4223 


1 


377 


RRFHLRRENLQEKQTWMGI FFFFGVAGP 
PGPGGKSFSPPSFFF*GKNTFFTPPPGG 
GGQNLVGENSNSNPPPPRDPPPSGPSEK 
I * KKTPP P FLVGPNTKNKKHFGGGGS P K 
KKTLLPPPPLDPF 


4200 


18101 


A 


4224 


223 


431 


IRKTGPLGFGGQNNPLPKV*PPPGLTFF 
QXGPPPPPFPPFQKPRFLKFYPKREGPP 
PPKVGLTPPPGLL 


4201 


18102 


A 


4225 


173 


2 


RCKLHCLFPTLAI ITR+GNQPERLNAGT 
YFLFYTLVGSLPLLIAI IYTHNTLGPTR 
P 


4202 


18103 


A 


4226 


11 


377 


FLTWFI CISLFGRLMGI *FFVNFFCEL 
FFHAFFS LFLKTD 1 1 FS I DLLE III YAH 
VCHI CCNL I FSL YFVF YFNGSLKY I KC* 
NLFAI ILDFADVS FLI FMFRTGKRQNDN 
QFSLKILIGF 


4203 


18104 


A 


4227 


1 


412 


KNSKVKNATDLLKNASESPHSRIDEAEE 
RI SELEDRLFENTKSEETK* KRI KK* SM 
PTDLENSLKRANLRVIGLKEKVGKEL*V 
GSFFKDI I * QNVPNLEKDNIQAQEAYTT 
LSRFNPKTTSRHLAIKLSKVKDKER 


4204 


18105 


A 


4228 


373 


163 


IGVFLGGAPFFFFFFFFFFPQGPKRGGG 
KGGASGKTG*GGGGLKLGGGKQGWPPFF 
FLRTWPTFTRAPGL 


4205 


18106 


A | 


4229 


392 


3 


PPC*NRAPGFNFGGPFKKIYSSPPPRGK 
FGSLKGPPPFFFSAPPGGPKFSKGGGPL 
FPGAPDF* GGFGGKNYFWGQGFP * PHSP 
PPPFSWGPKRRAPFSKKKKKKRRRGWGE 
GEGQGSNHLLKHGYFQI 


4206 


18107 


A 


4230 


211 


400 


YALIWLYCHjNNYALHSYFFWCCFCFI/I 
FKF I YVFLNLLFLFAF * LVLCLLKVQNH 
SGSLSIVSYFPLLEACRIFPLFPLF*NL 
* M I CFNMWLYFKQLCSALS VFLVLFLF 
FNF* IYLCIFKFIVFVCFLRRNPTLAQA 
GVQWHDLGA 


4207 


18108 


A j 


4231 


69 


386 


KRI FFFLGPRGGGKGKMWVNGNPPPKGK 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alnnine 
OCysteine, D=Aspartfc Acid, 
E=G!utamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S^Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














GNFSA* PPQRGGPKGPGPKNKVI FGFKK 
KRGFYGGGQPPNLGKELAGPPKGGEKRG 
KTH PGGKF YAFLGGGLFS KKRN 


4208 


18109 


A 


4232 


394 


287 


FKEKKKPPPPPLFSPPFF*KRGGGDPLF 
SPRGEKK 


4209 


18110 


A 


4233 


442 


112 


LLGRLRWEDGLS PQGRSCSEP * SCHCTP 
Ato'ATQ * DP VSNTNNKNPNPQNACSS LNP 
SLPWTAAVWRTLGEHCNGREALPARAKN 
ATKAP VLTTKHI CQTVRTI S CALLPH 


4210 


18111 


A 


4234 


415 


58 


SLQKGHFPPFQELDPQNPGRSGWSPVLP 
TTGLFFSLDFIRFT*FMAGVLVGYKHCW 
EWTAALVLLLFLRDRVSLCFPGLIHTLG 
LKRSSCLSLLSSWSYRHTIPHPAST*FF 
G 


421 J 


18112 


A 


4235 


406 


168 


HGETPSFLKIQKLAGRGGTLL*SQLLGR 
WRQGIvTHL^GGRSSVYIYTEKHTVHTQQ 
SVTTPFiMSMSQTLAYS PLSLKSS 


4212 


18113 


A 


4236 


247 


397 


TVCFSVYI YTELLPPQVK* FPCLTLPSS 
WDYRSVPPLPAYFCI FSRDGVS 


4213 


18114 


A 


4237 


36 


405 


RLYCFIKGLNVKNKTFRIVFFFFLETNF 
PFGPQGGGEGANSGFPBPLALGVKKGPR 
PPPPGGGE*GAGPPGPGNFGLLKKKGVP 
RGGGGGPKPPTQRFPPGRPPQKKKNFFG 
PALGKKKVFFF 


4214 


18115 


A 


4238 


407 


3 


KKKKKLTRPGGGGPFSPLPKRVRQKKGG 
NPGGGF S KKKKS P P P P P P RGKKKKP F S K 
KKKRFLSGNNTGKNRLGKDPQS * PAGNA 
RKKPGWERALARGRAPGMTDRKPQKSGL 
DGSQQSGTGQEPGNSGPVPKVH 


4215 


18116 


A 


4239 


144 


405 


PTVKLVLLYLVLLKVAVS KNLR* *GGLT 
V* QQRQ I LW KR I LSNTE FL YLRQS KMMH 
LVL * P WNTLKSLKSGQARWLTPVI PAL 
WEA 


4216 


18117 . 


A 


4240 


369 


40 


PLLQGTPASWPRWGRAGDRPRKEG* LHP 
RE TR V P PALKHQ PAP AG V S LTS PG P P FW 
LSLQRLRGHS S S DS LPAVCQ YSGS WREE 
KAAAEAPALT PA I CTC T I CSVLTWL 


4217 • 


18118 


A 


4241 


397 


1 


ISKSLFFPIPFWKKKKKGFSPPLNFLGG 
VFFFFFLKI FE I FFLKRFFFP I FLGAPG 
FKIFKKYFPFPRGFKTPPKKKIFLNPFF 
KKKGGGGGALFFFFFFETGSRSVTQAGV 
KWHDHGSLQP*PPRLK*SSDP 


4218 


18119 


A 


4242 


230 


393 


ADRPGS PLPS APETGSHS I AQTGVQ * HN 
YGS P* PLTPGLKLSS CLSLPNCWDYR 


4219 


18120 


A 


4243 


434 


104 


KQPTFQSTTGGSPSAGRPPPPRAKKKKP 
PFF*KKKKKSGNIGGPLFPPPPGGEEKK 
NLFPRKGEGSHKTDSSPPH*RGGKKKNL 
SPKKKKKKKAWCWDYRRKPLCPATYF 


4220 


18121 


A 


4244 


2 


425 


FVFKIILKNLSEIQENLQFSKIKKTIHD 
LNKKFNKETGNITKNQTEILELKNSMSE 
IKNTVKSFNNRLNQAEERISELEDRSFE 
MTQADKKRKKGEESLQDVLYTMK*TNIL 
WS FQ KE KRKE KGLQN I LNE I V AENF P S L 


4221 


18122 


A 


4245 


12 


424 


I YYKTLNCT* IF* FGGALACFHFSYPFY 
IHYVGFVPSPVYLFSHVIISVWTSGYLL 
YSLGCNP I LSLFI FLLKLFHFG YLTI QV 
GI YWILFVCLFLSNSFINI * FTYHAI P 
IPIYIPLKVYNSMFYSIFTTK 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
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SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=G1utamic Acid, F=Phenylaianine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W^Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possibie 
nucleotide deletion, \=possible 
nucleotide insertion 


4222 


18123 


A 


4246 


215 


425 


LSIQISQIRNLIYLFINIMGKILRAKRK 
*GTSDAWNQT I FLRPRTVAHS CNP STL 
GGRGGRITKSGDRE 


4223 


18124 


A 


4247 


413 


2 


WEVESFSLGKAFMWGCATENPGPPHPKF 
LLFIMKAPPPWRVGFQDCFNPKDGRFI* 
PRTP P CPRARGAKKNS VS KKKKI KI W KR 
WPPTPLPPPGNGGTRKFPSSPPPPPPPT 
TPKITPPGGERQL*PQRAEGAGDTK 


4224 


18125 


A 


4248 


61 


281 


ITWKDFDSFSVLRVSKAFHLALPAFIGN 
CIKCHIKMGIYPEEKFKNIHXYLYLEID 
VCGH*TWKKIFFLFFFLRQRFTLFAQAG 
GQWLDLSLLQPPPPGFKRFSGLSFLSK* 
DFDSFSVLRVSKAFHLALPAFIGNCIKC 
HI KMG I YPEE KFKNI HKYLYLE I DVCGH 
IRLGKKFFFCFFF 


4225 


18126 


A 


4249 


23 


13 


RVR P FAS PGGRE VTVCGGLGREGQERRL 
RWHRKPFLPLAATSCAWPRSVTPSVTS 
GSADGDGQSIAVGAGTLATVGGLELLNS 
NDPPASASQGAG IAGVSHRAWP*T 


4226 


18127 


A 


4250 


182 


342 


KGFFFLPPGGGGGGEF* FNEPPPPRVKG 
IFPPPPPGKGEKPPPPPFPGYIFVF 


4227 


18128 


A 


4251 


2 


219 


PSLRKMQKLARHGGTLLWSQLFGRLRRE 
DCLSSGGRGSQTHGSEL* SYHCIPAWAT 
DGDP VS KNTKFFFFFF 


4228 


18129 


A 


4252 


223 


324 


LGAVAHI CNPSTLGGRGRR I TR* GVRDQ 
PGQHGE 


4229 


18130 


A 


4253 


165 


336 


ES DAQGHQVAARVS HVLE KDALLVFR S L 
CKLAMKPLGEGPPDPK*ADSSSWPPSVQ 


4230 


18131 


A 


4254 


3 


233 


ETAFCHVGQ AGLE LLT £ S D P PASGLQHA 
. G I TGP ATKS AP * WS * VSG PHLGAFDS LA 
MLLAQDWRALLESAKFFPLL 


4231 


18132 


A 


4255 


394 


44 


AKIFPPGVKKFFASTPPGGGKKRGPPPP- 
PVNFFPFKKGGGFPPWPGGV*NPAP1NP 
RPWPPKRWGFKGEAPPFPPKLNLFFPQG 
GGG KN F FGKKKG AP TGG PPFFFFEMEFH 
CRPD 


4232 


18133 


A 


4256 


152 


2 


YWSPFYCRSCCIRQGTVAHTCNPSTLVG 
QGARFTRSGVRN * HGQRGES PR 


4233 


18134 


A 


4257 


404 


190 


AQL I RGWRQ * DHL S LGGGGCN E PS S CH C 
TPAW ITE PNQSLS QKTQKQKML I PN * YG 
*TVSPPKSHFELQFS 


4234 


18135 


A 


4258 


288 


407 


ENCRPRAVAHACN P S * ENCRPRAVAHAC 
NPSAFGGRGEQIPRSGEQEQPVQHG 


4235 


18136 


A 


4259 


406 


163 


GWGVQTHPGHHGETLFFLKKKK*VGGGG 
PTRYSPI FGGVGPQNGLNPQGGGSKNPK 
LNPLLPPWGQKGVFFSKKKKKKERD 


4236 


18137 


A 


4260 


101 


276 


LTSIVP S L WEAEG AKLRG LG R P ACRE S S 
LCHCTLPWVTG*NLVSKKKKKRGGALGF 
FF 


4237 


18138 


A 


4261 


2 


414 


WVAATPNLVFVGRGNCQDWFTGPFLTRG 
GWGGVLFPPVKRVIXGLVAPFSGPIPIR 
KNHQKVGGFLE PNKTNFKE * NF VENKNT 
RRGI KKNVPEKKKPQKGI PLPPQI FNEN 
QTWGDFDFFFFAKKKKI INS FLGWA 


4238 


18139 


A 


4262 


337 


382 


RDARFLHCSLV*LIT*LQKEFFFVGQAG 
VQWHYFGFLQPLPSGFNQFFCLSLLSRW 
HYRGPPPSLGKFLVGWLVGWLVFETKFW 
FCWPGWRAI TGFl^.rPATYTFRVQAI FL 
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SEQ ID 
NO: of 
nucleotide 
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SEQID 
NO: of 
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SEQID 
NO: in 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, LHLeucine, M=Methionine, 
N=Asparagine, IMProline, 
Q=G 1 utam in e, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, V=possibIe 
nucleotide insertion 


4239 


18140 


A 


4263 


406 


184 


QLLGRLRQENCHLGGEGCS *LRSPPCIS 
AWATKLDPVSEKPKTKNQKPTGLGGLNQ 
GPASLLKKIRKLGKKRPG 


4240 


18141 


A 


4264 


233 


440 


LPVHHGWGS LRKLT I LGKGEANPS FFTW 
GQQGKAKQKGQKQPGAGAHACNSSPLGG 
QGR* ITRSGVQDQ 


4241 


18142 


A 


4265 


2 


230 


GARL* SHLLVRLRKENHLNLKG KNCREL 
KWPHCTPAWGTKKDSAAKNKNQPLPPPK 
GVKSPPQREHGLIVTCMFVQ 


4242 


18143 


A 


4266 


1 


349 


HKTNIHFKLVMVSNVFTFFPFGKGFSFC 
APGGRAGPLFGLVETPPGREKAFFGPNP 
PRRGE*RAMAPCPGKILVF*KKTGFPRG 
LNFWPGGPPPPGLQRGGNNGGNPWPGPE 
FFWF 


4243 


18144 


A 


4267 


2 


375 


RSEQPAGSASSGNEGLSTRASGCGGCTG 
SPNSASPPALCSISRRALAAFPRGRARD 
LQPAMPEPPTHSVGSCAAKPPR+APPPA 
PRRPVPSTTQGLRSAGAQRGTGRHLHLQ 
PQCRIHWVKPAG 


4244 


18145 


A 


4268 


377 


3 


TP AWMTERDC I WRRRTS APGG S W P SG P V 
PSPGAQ*RPPSQGLGLWWAAAAAPRC*T 
APGPRPPPHGPGSPQGASPPTRPPRCRP 
HPRAGSAGPTGATPPGSTQGQRRRHSKQ 
LPGHPGHRVALG 


4245 


18146 


A 


4269 


1 


294 


LED WGGRG ARAH YDG FS LE PE S DH YRLR 
LGQYHGDAGDSLS WHNDKPFS TVDRDRD 
S YSGNWALYQRGGCWTHACAQ * I LDAVY 
ATGP KYQVHCEALH 


4246 


18147 


A 


4270 


2 


218 


TGRI I ILSQGLQTLLPLIAF* *LLASLA 
NLALPPTINLLGELSVLVTTFS *SNITL 
LQKKKKKKKKKKKI FF 


4247 


18148 


A 


4271 


32 


443 


LHSDVDQLAGLVF PGC P WPLAS PARRAP 
AGPWPRRAAAPPS * DAPAPRLAVS AGS P 
AWPPPST*GLPAPAAWASPASASPTSA 
RSRS*ASPTARRCRPGPGTAAPTSASRQ 
CQWRP * PCQRSPPSGTCSSAAAAPT 


4248 


18149 


A 


4272 


434 


57 


HLSFPPASAAPKPLPQA^SLWSSPPSIV 
LAQVPPMNTDPCPQPSA*PASELSPEMP 
PARPQAPPENWSHP *GCGAQTVAPLVPG 
ATPAHRPAS PSPVTLPVLTAGGHCF YLY 
T YQ I FLKFH YI KR 


4249 


18150 


A 


4273 


288 


424 


GLSLSVAQAWQWCDLV3L* PPPPRVKQ 
FSCLSPPRIWDY*HPPP t 


4250 


18151 


A 


4274 


1 


429 


NTRGAAGPPQMPHP PRASAFPENPCGRK 
N * GQVSGP * ASGSS PVKWAGPAGTWRKG 
GLGWGPTRVRGRGPRRRPGASSGYARQQ 
GPGHPGFPPSPRRRLSVPACALCPGQTS 
GTRAGLAVWPQIGLCLKAQGARSCPRDH 
SSD 


4251 


18152 


A 


4275 


253 


3 


PHQFNAVIYFPSPVSRGCPNNALYSPFV 
IWLFCFVKTGVSVTQAGV*WHDHSSLQ 
P*TAGLKRFSSLTILSSWDYKSTPPCI 


4252 


18153 


A 


4276 


406 


3 


PMV SQGG VCAK* QARAI ?REG I FRRSAD 
TQWREVQQKYNMGLPVDFDQYNELHLP 
AVILKTFLRELREPLLTFDLYPHWGFL 
NIDESQRVPATLQVLQTLPEESYQVLRF 
QTAFLVQI SAHSDQNKMTNTCI 


4253 


18154 


A 


4277 


373 


3 


DGVSLCPPGWSAVAOSWIiTAVL^ALGFI 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
&=Glutamic Acid, F^Phenylalan ine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K-Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X-Unknown, *=Stop codon, /^possible 
nucleotide deletion, \-possible 
nucleotide insertion 














FLFPVGGLTGIVLANSSLDIVLHDTYYV 
VAPFPYVLS IGAVFAI IGGPIH* FPLFS 
GYTLGQTYAKIHFPIIFIGVNLSFFPQH 
NLGLSGMRRCI 


4254 


18155 


A 


4278 


71 


176 


AGGRGCNESRSHHCTPAWATE * DSVKEK 
KKGLFFW 


4255 


18156 


A 


4279 


2 


379 


SPFVPLQSSLGNKSETPSHGRKKKRRKR 
KRTDENQP * KHPLRARPGPGAPRAQGGA 
QGSGGGRRAGPCRKLVHRGHTGNPKREP 
GLIPQGEGRSLGIHPSNGCHKPCPSRGR 
PPPRPKQGRGRMQN 


4256 


18157 


A 


4280 


2 


338 


CDMPRRKLDPLSGRNTLGFVCWWAQTS 
GLK * S S CLCLPKC WD YRHE P LRPVWVYF 
XLWHPSPWKPGFYFSVFFPFYLPLCTTA 
SLFCSCLPPCCSRVTLVGSESSCLWPAG 


4257 


18158 


A 


4281 


407 


1 


E ERG VATGHTAERGRADNEE ERGEVANK 
RGGREVQPEARRMTTTATGTTAATRGAR 
T * TAATTATAP VTRTAAPATTAS STRTL 
RTPRRARTASVLWARCRSRWSATWRASR 
TCDCRTASHSRPTATRSRTARMA 


4258 


18159 


A 


4282 


367 


225 


PCDSACLGLPKCWDYRREPPRPAGVLFL 
* AS FS +QS KEVYVCKLTHA 


4259 


18160 


A 


4283 


452 


1 


NTCGLQSVCRGTQRGGPKRCPCHPRERW 
DLGDEQES SKEGRPGREGVQGQGLLGAT 
PGAQGSH * GL * AEAAMSSGKALG PGQVP 
LTPLSPSPPNPHRRPRAGRASRQSREST 
EAQRAVP SQGAAPG WETD WGS SHQWQPC 
QAQQEGRTGR 


4260 


18161 


A 


4284 


436 


22 


CHVTGTQPIKVSWAKDSREIRSGGKYQI 
S YLENSAHLTVLKVDKGDSGQ YTCYAVN 
EVGKDSCTAQLNI KERLIPPSFTKRLSE 
TVEETQGNSFKLEGRVPGSQPITVAWYK 
NNIEIQPTYN*EITFKNNCIAAARRI 


4261 


18162 


A 


4285 


301 


403 


LSIS* PCDLPALVS QS AG I TGMSHHARP 
RVKSLI 


4262 


18163 


A 


4286 


380 


2 


AQVLYSSREQERRQDLPEQVIQAEGE*V 
KASACQLTFEDEEAMESGPAALDKDFQC 
TRKHHFAEVQGS PRCMSSRYLVDGCPKT 
FAEAQNVCS * CCEANLVS IHAFTFILR I 
QWCTSTVWQAQVCI 


4263 


18164 


A 


4287 


453 


3 


Y I YEGS I MEEPRKP VPKGPLGLHCPGKF 
QG* RNS YNHHAVRVGTRCAPEGVKDLTS 
SLQSVITKPEQNIQELMKHFKKEKSEAE 
NHIRTLKAESLEEKNMAKIHRGQLEKLK 
SQCDRLTEELTQNENENKKLKLKYQCLK 
DQLEEREDV 


4264 


18165 


A 


4288 


52 


400 


LDLYFFYRQCLALSPRLQRSSAILAHCN 
LKLLGSGDPPTPASQSKEITSMRYHI*P 
NL*NSMC*NCTFDTTHRIGENIWESYI* 
* E IN I QNACRI PKTQQQKTKFQNKQGLH 
SNLN \ 


4265 


18166 


A 


4289 


2 


263 


I HGAHLW S QLPGR L KWSDRR S PGSRGCS 
EL* SHHCTPVWVTEGDPVSKKRINICKQ 
NLNEKTNLTVDTPRHS FTHKI KSLNHFQ 
MFL 


4266 


18167 


A 


4290 


3 


142 


YLGVHRLS IDGLDLLT* *SARLGLPKCW 
DYWRDPPRPANFYFILFK 


4267 


18168 


A 


4291 


324 


!9 


EMS KRHE Q KFHRRE LQMAN KHY KRTS LA 
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nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, l=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R^Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Ty rosin e, 
X*=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














I REMKI ETTMKYYHTP I QMA* VRNNDTS 
QVQWHTLQVASPbVAEVGGWNEARNLRL 
QIVKMVPMNSHCTPAWVT 


4268 


18169 


A 


4292 


2 


204 


IHFCRDRDSLCCPGWPRTAELKQSALLG 
LPKCWDY RH * ATKPGLQSVFLLGLKI FL 
ILLVRGVDRYL 


4269 


18170 


A 


4293 


427 


3 


RRKTVTVPRTGERGI ISNEDNGLCKAVG 
N I PGS PLCNVS KLNIMKHAPGGHAARFC 
I WTE S AFRKLDELYGTWRRAASLKS S YTJ 
LPMHKMINPDLSRILKSPEIQRALRAPR 
KKI HNRVLKKN P VLRPL * R I KIiTSRHKR 
R 


4270 


18171 


A 


4294 


24 


372 


FICPLQDYVICSDVTSCWKKILATTGR 
RLRDDSVQFSFFFFGKGGSLLAPRVKEQ 
GGDLG * WNFRP PRLRE F PGLAL PRC WNW 
GLAPPPPLILVFLEKRGFPLAGKMGLNL 
LHSR 


4271 


18172 


A 


4295 


1 


332 


IHS S YFFPQS * FFGT INTSDKPLRI L I K 
EKKREGTN *H I MHE PWKFG IDPEDIIKV 
KKKYYKQLCTHKFDNLEEMVHFLKKKTI 
HL I * NR * FE * LYKYYRN 


4272 


18173 


A 


4296 


407 


30 


WSVIYGEN*ESLGWGAHPSHVANIRVTG 
LRYLFSYGRHALDMLESSQDNMRSWWVS 
QMS S E I DVDNLGH I SLCNAVQ * IRNLNP 
GLKTS KI ELKFKELHKS KDKAGSEVTKK 
EFIEVYHELYAVG 


4273 


18174 


A 


4297 


350 


1 


YKTWMNKEKW * YVGYN IEQEQKLALKT 
TVLDEWYTLPDGRI IKVGGERFEAPEAL 
FQPRL INVEGVG VDELL YSTIQAADVGT 
RSEFYKHIVLSGGSTMYPGLPSRMEREL 
KQHV 


4274 


18175 


A 


4298 


623 


1 


SRRGCAATCDGS ITAWFQRQAAQGKPS V 
HS KLEAEAKPTPGDHAAS ES TGFSCLGP 
GGVHKTAHAHARNMPGDSNTGSGQSPAG 
RRWEARGSAPRHHPTQPDWTHPDALRQA 
MARNPASSF*SF*RCCTAASPPVPTPPS 
PVLMRGPVPGGGGGQGK I RPLQEEAP PP 
SSPWLSR*PQAGTPSSPAVSSI,YHGGL 
SPTGRQDRWGR 


4275 


18176 


A 


4299 


469 


3 


PVRNCLGARFRVSGRAAHHAL*QSASTL 
QGDPRTKRQAISA* SSAFDVQDVSHVTL 
P FYP KRAQ S KDL I KE AI LDNDFMKNLEL 
SQIQE IVDCMYPVEYGKDSRIVKEVDVG 
SCVYVMEDGKVGVTKQGVKLCPVGPGKV 
YEEIiAILYICRRPCI 


4276 


18177 


A 


4300 


506 


321 


KPLSLLKI QKIiAGHGGSCJj * LQLLRR VR 
QENCLNLGGEGCSGLPKFWDYRCEPPCP 
DCWPS 


4277 


18178 


A 


4301 


454 


1 


NGEKPTYSGKKYVFLF*NTRRPTLFTWG 
KTN*GERPRSSVYSCSSWRLSSSSLGRS 
*NSVGSLNI ILATLTAQELSNS* ANNLI 
* LAYT IAFI VKI PLYGLHL* LHKAHVEV 
PIAGS IVIiAPVLLKLGGYG I IRLTLILN 
PLTKHIAYPLY 


4278 


18179 


A 


4302 


3 


463 


AVSLPLPLPRFGRLPPLRVGWCPLRCH 
I LRTQRLEGAPSR* QTGAKQAGVWEELR 
SRLSLGPELGRIRPYSGDPGQVTDSSGA 
LASSLERGCYPAALLVGEGEGEMYLGPY 
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ing to first 

amino ncid 

residue of 
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sequence 


Predict- 
ed end 
nucle- 
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location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenyla!anine, 
G=Glycine, H=Histidine, I=1soleucine T 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V= Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














HPT*DGLKRLSLAKPPGGETRLTRSGHT 
FWGEGIPPSAVCL 


4279 


18180 


A 


4303 


469 


2 


FPFFFSETESCSVTQAGV*WHDLGSLY 


4280 


18181 


A 


4304 


194 


3 


FFFFFSETESCSVTQAGV* WHDLGSLY 


4281 


18182 


A 


4306 


517 


254 


VVEETAFRLIGKTGIGLVTSGDLAASAS 
QRGGMTG VSHCNRKEFVLMN 1 1 VMSSGh 
TFLPFSPDEV* GS * GLS FLPS PLALNTG 
LRN 


4282 


18183 


A 


4307 


276 


3 


KI KMEGI PLH I PNPLVNLNLGLLF I LAT 
SSLAVYSIL*SG*ASNSNYALIGALRAV 
AQT I S YEVTLAI I LLSTL I ISGSFNLST 
LITTLV 


4283 


18184 


A 


4308 


234 


398 


KKFFFFPPLEGGGPITTIWSP?LPG*RE 
SPAPPPPRGGIKGLAPPPNLFFFLDKR 


4284 


18185 


A 


4309 


403 


100 


SGGQKAVGP P WAAPPGYKQNTNKLGPLP 
QPSTEGGAFWPTQGPKPTGLPPLPG*LN 
PRNPTPP*WPPPPPPFKTRSKGPHTPSP 
TPLCGPPPKNPRLFFFF 


4285 


18186 


A 


4310 


399 


85 


WKPIPLAPEWRGVIRFRALPPCRPLHCP 
TLKLTAPSVIHQRT*VHWGFLVGVSLSV 
FLERESCSFTQVECSGTKLALRSLELLG 
VSDPSISAFQRAGIAGVSHHA 


4286 


18187 


A 


4311 


230 


2 


WTEEDTRRCLVLFS FFLS FFLFCFVLRQ 
RPACCPGWSAVTQSQLTVASTSLAQAKR 
SSHLS PPQRS * DYRHTPQCI 


4287 


ism 


A 


4312 


1 


420 


NTS WG VGELSL I VI VINMLLP YVWLPKG 
KFFFPQNSRFPSAPPHSSPPGLRSDFSH 
SGGL F FHLE VLWGLPLP P P P P P ASTHVR 
RPLGTQGCP * LAW Y I HL I S AS YQKANAA 
PQLSCILQDCIRSKGDILIFTFFTLCLS 


4288 


18189 


A 


4313 


403 


207 


ESSEG * LNPLAHIiAMRYKGCPFKDVREK 
SEFILKS IQVRKS I LNTRMSLLQLDFTC 
RYSRCVCFY 


4289 


18190 


A 


4314 


469 


328 


TSEGGGFNELRLCHCTPVWLTK* DSVS I 
SQRIRKKNLKTQSLWRCP 


4290 


18191 


A 


4315 


1 


212 


NTLPGVEVLKRLRWEDSLNP*GQGCREP 
* S CHCTTAWATEKDP VSKKKKKKKNPGG 
FRPHSLSQPFGPPK 


4291 


18192 


A 


4316 


63 


559 


SNLTFWQCAVPAVETTPSSLCGPVPSPT 
SSATPKP I PS * AACPPPDCALAAEVRAL 
PAAGRPRFSEACLTPQPNPCVLWPCLWD 
HSPTSCPTGSS * APHSCAFLPVPRLGGG 
MTTVHPTVHLPIHSSLTFAWE* PSPPAC 
LTAPRPTGHCLLSVPPSAYSSSSPS 


4292 


1S193 


A 


4317 


135 


405 


PAPPSPHPKPDSVSCVIPPS*PAPPSPH 
PKPDSVSCVIPPSHVAGPSGLPEMTLLE 
PRGPCPPEVPPTSPFVG*AWVIPHPPARR 
ASGRMDGRTBGRGRHLDLRSTHS FLTP 


4293 


18194 


A 


4318 


413 


1 


LYLVGRA*SFHRSFPGQERIHPGGKPYD 
CKECGETFISLVS IGRHMLTHRGGVPYK 
CKVCGKAFDYRSLFRIHERSHPGEKPYE 
CKQCGKAFSCSSYIRIHERTHTGDEPYE 
CKQCGKAFSCCKYIRINERTHTGGV 


4294 


18195 


A 


4319 


479 


342 


GVGVDNDG ILVLGATNI PWVLDSAIRRR 
* DNTVN I LFLTADVTSR 


4295 


18196 


A 


4320 


400 


26 


HS EAVTTVR SHH S P VGH I H S LNA P P TVA 
LTP* YNKPLNIFV*ARAQKIQGQFLTHT 
I PH I LLLV I TRDTS PJICVC VCVCVCVC V 
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Predict- 
ed end 
nucle- 
otide 
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correspon 
ding to 
last amino 
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peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
5>=Glutamic Acid, F-Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, Lr=Leucine, MNMethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V^Valine, 
W=Tryptophan, Y=Tyrosine, 
. X=Unknown, *=Stop cod on, /=possib!e 
nucleotide deletion, V=possibIe 
nucleotide insertion 














CVYAY I W VCMC VY I YVCVY I Y I CNY T F I 
PTLVHLP INKMY 


4296 


18197 


A 


4321 


370 


1 


SDRGQGKAGQATRLGSRQTGQDEDKGTE 
KSIPSWDKGPVNNEFGKSVWVSSNLVTQ 
E PS PE ETSTKRS I KQNSNPVKKEKSCKC 
NECGKAFS YCS AL I RHQRTHTGE KP Y * C 
NECEKAFSRNV 


4297 


18198 


A 


4322 


342 


82 


GLGLTTNWLLTGKSAGRGCAPSSGGLWIi 
AVGDS VYYHRETEGGKPSPGRPCC* PS * 
AGLAPPSPPPPGCPEAPGSCCLHYRCLL 
TAH 


4298 


18199 


A 


4323 


284 


3 


I TKKFPRVIGGGCFS PFFMGLS KKNGLF 
PEGALSISLIFGPAIPPGGAQVISSAKE 
KKKVLSRPGWAHACNPSTLGG *GGR I K 
RSRVPDQPV 


4299 


18200 


A 


4324 


365 • 


3 


QVEVYWENHPFRLEESIYQ* CRLEGAT 
SVAGEQISEYNISMRASDGGSPPLSTET 
HITLHVIDINDNPPTFPHIiSYSAYIFEN 
NPRGASI FSVTAQDPDSNNNAR ITYALT 
EDTLQGVY 


4300 


18201 


A 


4325 


253 


3 


PSFLSRD*SHIHKRIiECSGMISADYNLH 
LPGSSYPPTSPSQVAGTAGPSMSLQEGR 
ASHDITSSSRS IGAKELLRPADHPQCI 


4301 


18202 


A 


4326 


307 


95 


FLFLLKDKILLSPRLECSGTI IAHCSLK 
LLDSS YPPAVAS * VAGTSGMCH YTWLRW 
KNHLSPGIQGCSEI 


4302 


18203 


A 


4327 


371 


2 


WAPNHISPTPRSGSTTRIWAPTPPSNSP 
RPCQDPGHRTDPWYPEEEFLLTNPDPPR 
APASWSFPFQEKRLHFPALPCP*HLDSS 
LGPTTLFS FSFPPTS I RPSQTNHSKGTP 
PPLSYAQHDCI 


4303 


18204 


A 


4328 


409 


1 


RIiLEARQPELEMAALIFFTLKYKHVERE 
QKYHQLQDE YFTS AWLTL I LAALFGLV 
YLLI FPQSVWLLLLVLCICFLVACVLY 
LHITRVQCFPGCLTIQIRTVLCIFIWL 
IYSVAQGCWG*LPWAWSSKPNLY 


4304 


18205 


A 


4329 


262 


2 


LHGAYLVLDI IGAQNRKPRYS FKWRVGQ 
FFFLVFSPRDSLTLLPRLECSGTVMAHC 
HLDLLGSGDPPTSAS* I*GTTGAYHHTQ 
HV 


4305 


18206 


A 


4330 


1 


399 


NTF* FLKGMGLVNHVFTEDNLKKLYVSN 
LG IGHTR YATTGNV 


4306 


18207 


A 


4331 


408 


3 


SSVGIHVRTHTGEKPYECKHCGKAFSCK 
SSLREHVRTHSGEKPYECNQCGKAFSHA 
QYFQKHVRSHSGVKPYECTECGKAYSCS 
SSLRVHVRTHTGERPYECKQCGKTFRYL 
ASLQAHVRTHAGA* I YKYSGKV 


4307 


18208 


A 


4332 


110 


426 


GLSPGTWSDMTGGPAVTAFPCTLTGTHA 
GHI I IDGIDX AKLPLHTLRSRLS I ILQD 
PVLFSGTIR*APPPLRPTQPQAGSVPSD 
LEHKEEGVGGDQDPRGVSCSC 


4308 


18209 


A 


4333 


133 


1 


EPCCPSALAAPEVLGPEKYDKSCDMWSL 
GVFMYIL*VPSPPPLY 


4309 


18210 


A 


4334 


165 


1 


1 1 LTFFKKNERKWKAVQYW* KNR * VDQ 
WNRIASPE IDPHE * SQLI SDKGAKECI 


; 4310 


18211 


A 


4335 


42 


440 


S ARRAGDPARGAPS RIWAS L PD PRE LTG 
RPAGGLQ*GNCPAPPTPQLPSPSVSRPV 
S GRGPP P P PS FGD P RANR PQNP KG AKW 
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Amino acid sequence (A =AJa nine 
C=Cysteine, D=Aspartic Acid, 
E=G)utnmic Acid, F=Phenyla)anine, 
G^GIycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /= possible 
nucleotide deletion, \= possible 
nucleotide insertion 














LTVQMG E PGE I RRAACPQLS WAMT F PG S 
LPEPKRTGPQIQKSPPSRRGG 


4311 


18212 


A 


4336 


1 


160 


NTCGGGAQL* SQLLRRLMWEDLNSGSQG 
CSELRSHHCTAA* ETKGDPLS INEY 


4312 


18213 


A 


4337 


236 




QRSLLCYIME I RTVAVRIVAI KGVESEV 
YLAMSEEGTVYAKKECNEYGIF*ELViiE 
SHYYTYAAAVLRPVCCIESKVS 


4333 


18214 


A 


4338 


408 


3 


CQSSVSKKERTNGAQNPGAAKQGNNELR 
DSTEQFQEYYRQRLRYHQHLEQKEQQRH 
I *QQMLLEGGVNQEDGRDQQQNRNEQFL 
NRSIQKLGELNIGMDGLGNEVSALIQQC 
NGSKGNGSNGL*VNSFDTPPHV 


4314 


18215 


A 


4339 


363 


1 


RSQSSLPKSFKRKI S WSATKG VPAGTS 
DT* GVQPGWQ* RWGASTATTQKKPS I S I 
ATE SLKSLI PD I KP LAGQEAWDLHADD 
SRISEDETERNGDDKTHDKGLKICRTW 
RARYSINEV 


4315 


18216 


A 


4340 


1 


353 


DVFLDTLARFLRHSCNFFLCYLL*DFFK 
FIPQFFYWMFYFDYIFISRRNFYFLLLT 
HFFTLPLFLVIRIQCLHLFLFFSFFFFF 
FGKGNPFLPPGWRARAQFWVNGSPPLRV 
NALLP 


4316 


18217 


A 


4341 


376 


3 


LELREGGFIiPK I ADS VREERS PALLDDR 
AGRCQGQPRIQVCLTPKSMLFLAFHM*T 
CEHCLEQCFSTSL*PIEIRTLH*D*GCV 
CMCAYVCI CVCVHMCMCAY I C I CVARMA 
KRPLESIHTSCT 


4317 


18218 


A 


4342 


158 


2 


LAFFFFCETQSCCQWHDLSSLQPLPQRF 
R*FSCLSLPSSWDKKHTSPHPTCI 


4318 


18219 


A 


4343 


145 


2 


IFGEQVVFDYMNKFFSGDF+DFGASLTQ 
AVHTVPNV*SVIPCHPPRV 


4319 


18220 


A 


4344 


1 


434 


RSLIFRATAYEYRYIDQADLKLPASKRS 
SCLGLPKGWDYRHEPPCLARFI FWQIFI 
AYKLR* KHYKSRQRSLPSRGSC* CACDT 
FVSDPSHWGTMAQ I LGGW ITLGQVQRS C 
LADQPLLPLLPPTLAPCLP *NDVLLFFF 
TQSQ 


4320 


18221 


A 


4345 


4 


475 


KHSCRSLSHLDRPLPPALQETCPVRAEP 
LLLVRINASGGLILRMGAINRCLKHPLA 
RDTPVCLliAVLGEQHSGKSFLLNHLiOG 
LPGLVRAGRGRREAGSKDGGSCLGEAGS 
GIPVCGDKEPTN* CSPFSPLQESGEGGR 
PRGGEASLQGCRWRANG 


4321 


18222 


A 


4346 


313 


442 


DSTLNFFFFF*TESYFVAQAGVQGHNLS 
SLKPPPPGFKRFSCL 


4322 


18223 


A 


4347 


437 


338 


AEVQEFYGDYIAVNPHLFSLNILGCCQG 
RNWDPAQLSRTTQGLTALLLSLKXCPM I 
RYQLSSEAAKRLAECVKQVI TKEYELFE 
FRRTEVP PLLL I LDRCDDAI TPLLNQWT 
YQAMVHELLGINNWRIDLSRVPGCMRPL 
*RIHPGTLERSIRLLFMPSSSWTMA 


4323 


18224 


A 


4348 


409 


118 


GLHDAS FVAA IPS TQ WYF F FE ETASE F 
DFFERLHTSRVARVCKNDVGGEKLLQKK 
WTTFLKAQLLCTQPGQLPFNVIRHAVLL 
PADSPTAPHIYAVFTSQW*AAGPSGRWS 
TSSAAAFRRPRHSCRL 


4324 


18225 


A 


4349 


2 


418 


YTLPDFPHHPHRLSHCRLLQSRTASTYQ 
SPGACTPISQGHPGQRSSPCSSSSNPAC 
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peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenylaIanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M^Methionine, 
N^Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tbreonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=poss*bJe 
nucleotide insertion 














*SSSSCGSTGFRCPGSAHPGCCSYVL*M 
GR* RSSQAPRAS PIAQGPRSAPAGRAGH 
PALPSPLP*GAPPPPSAASSPGL*SPA 


4325 


18226 


A 


4350 


401 


276 


DPLNVGGGG YS * PRLCHCS PTWVTEGDL 
VSKTNTHTNKKHL 


4326 


18227 


A 


4351 


446 


277 


AQDFVLI KDGEVIGGICFRMFPSQGFTE 
IVFCAVT*NEQVKVSAAPRPQPQSTAHS 


4327 


J 8228 


A 


4352 


408 


3 


PKSSLFTSRPNGKSWSTKRPAISLAYSN 
ENAQGSWNGDQDAGPLLVENPRDYSTEL 
S VT I AVGASL LFLNVLAFAAL Y S RKDTR 
RQ*PLCQLSPHRGIGASELGHAPQ*ELP 
ALLLGPIRHECEAGPPHDTLPV 


4328 


18229 


A 


4353 


62 


280 


VRIGCLTSVPEASSCGLPTTPGCC*PLG 
TLP CRSRSRTRS FRSRSRTQSRSRPKTY 
SPGRRRRSRSRSPTPP 


4329 


18230 


A 


4354 


364 


3 


PVGEEGKSLVPKS P VEEKGKS PVSQS P V 
EEKAQYRVPKS PVEEAKS KAEVGKGDQK 
EE * EKE VKQAPKEDKVEKKEE KPKHVPE 
KKKAES PVKEEAVAEVVTI AKSVKVQLE 
KETNEECI 


4330 


18231 


A 


4355 


3 


558 


IiGPTLLWRRGKVP KANS P I PTS bRRTLP 
RAS PGPSRGPLFMRQAGSTPMTIiPRSTP 
SMSPML*MVWPPTAVPVP*KPLMWAPPA 
PLVLLVTILTEIQEPATPAPLTQF*KPT 
SLMVS R PVC P WQG P RTTRS TLCATT I A 
PSHATIiRPGLSTTTSPLWQTLSLLLEGQ 
ASLPKG*NTSITLPSVS 


4331 


18232 


A 


4357 


258 


1 


LG FHDPGP AG W SQ PHGGRS AQ AQ* P P Q P 
PTVPRTS LADVSDAPLP PCGHSRPQHH P 
FRLPHTS FST P GL S AVCLI VH P AARV S R 
CI 


4332 


18233 


A 


4358 


97 


343 


DRDLGDEYGWKQVHGDVFRPSSHPLI FS 
SLIGSGCQIFAVSLIVIIVAMIEHLYTE 
*VPTLNCIMSLLFLNSFNHWNLLSQS 


4333 


18234 


A 


4359 


182 


2 


KDYFGSSVENGGEEELKQEDQLGGYSVI 
*ARDDEDEDDDDDDDDEDEEEDDEDEDD 
DDCI 


4334 


18235 


A 


4360 


88 


2 


RGSRTDTG*RGSDRGTG*RGSHTGTR*R 
GSRTDTG* RGSRTDTGGRGSRTGTGGRG 
SRTGTGGV 


4335 


18236 


A 


4361 


2 


250 


ELRDEGKASSAKQRLKCAGLQKFGERAF 
KAWAVARLSQRFPKAEFAEVS KLVTDLT 
KVHTECCHGDLFGCDEESAGIiAKYIM 


4336 


18237 


A 


4362 


169 


456 


EQLLCAGQGQIiSCEPLHLPSCPVGPQTP 
AHRPGRMAPGPPSFLSPGSPCGLGLAIP 
CSCQPWVAGVPDHHLFPSPMGRAGLSPS 
ERSAGEGVIPWP 


4337 


18238 


A 


4363 


1 


419 


PEFKLQKLKRSQNSAFLDIGDENEIQLS 
KSD WLS FTLE I VI MEVQGLKS VAPNR I 
VYCTMEVEGEKLQTDQAEASRPQWGTQG 
DFTTTHPRPWKVKLFTESTGVLALEDK 
ELGRVILYPTSNSS KSAELHRMWPKN 


4338 


18239 


A 


4364 


375 


448 


. PGAVAHACNPSTLGGRSGRIMRAG 


4339 


18240 


A 


4365 


1 


444 


DFLTNHYLHFLRIAGSQLTGLGTAVQLY 
SAYEENNRTFLLAAVKRNHNQYVNPSGV 
ATFFESI KEILLRQSGVKVESVDHDSCV 
HGPCQWGRSCLRKLAVSSVLKSRESLPV 
I IVANEPLQPFLCKCLPGYSCSWCEIDI 
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nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylatanine, 
G=Glycine, H=Histidine, I=lsofeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutaroine, R^Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,A=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














DEWLPFPW 


4340 


18241 


A 


4366 


52 


402 


CPQEVWFHSFCHCTSAFMKDLYFLCLVR 
FGDLKEDKVTRHDGASSDGHLAHI FRHA 
AKELFNEDVEEVTYRALRCGAVSTACLC 
LWLAR VHGVPVW I CACGGSMDGVYTSQN 
QKDFF 


4341 


18242 


A 


4367 


119 


247 


WVLLS FQLHGVAVLGIALI AMGESIGAE 
MALRTFGHLLRYGEP 


4342 


18243 


A 


4368 


2 


430 


GGT WDGQPG FHGDS LSKAPGMNSLEQG 
MVGIiNIGDVSSSAVKTVGSWSSVALTG 
VLSGNGGTNVNMPVSKPTSWAAIASKPA 
KPQPIMKTKSGLPPSPIKHimDIGTWDN 
KGPVPKAPVPQQAPSTQACPQSQQVGQT 
LP 


4343 


18244 


A 


4369 


1 


406 


ETSTPEGEAGPIQRLDIPVENPVESKNI 
FLGAPLIICHVIDKRSPLYDISATDLAN 
QDLEVIVILEGLGETTGISTRARTFYIA 
EEIQWGHRFVSIETEEQRPDSVDYSKFG 
KTDLVTTSRCNARELDEKPSILI 


4344 


18245 


A 


4370 


2 


284 


GTGTLCDLTAQLSARYDGVRTCILPCWK 
TISTI PVALL I H YHQNQAS CGKRAI I LE 
TRQHRLFCAD PKEQ WVKDAKPHLDRQAA 
ALTRNGGTFE 


4345 


18246 


A 


4371 


3 


184 


EDYNI LLS IMKGTTRP INKE I EALNDTV 
DQLELTDLCRTLTPTIAEYSFFSSAPKV 
LIQP 


4346 


18247 


A 


4372 


44 


304 


GLPRS I GCQGL PRLPEWPTWVGGCHRGL 
IMPSVPTQVSERPLMFLLDTPGVLAPRI 
ESVETGLKIiALCGE PGLGLGLG PGPLPP 
HL 


4347 


18248 


A 


4373 


1 


422 


LHLFNPAASAPSRSLFSGPILDPLSRAL 
GVGPGICGLASSPGISEGWDQIRSWTHP 
PDPDHLSGFCRSQVYMHPSLSPPSTMIL 
SGGTALKPPYSAFPGMQPLEMVKPQSGS 
PYQ PMSGNQALVYEGQLSQAAGLSASQM 


4348 


18249 


A 


4374 


2 


423 


NSGTHTPGLLEDLS KNGRLPE IKLPVNG 
CSDLEDSFTILQSKDLKQEPLDDPTCID 
TSETSLSNQNKLFSDINLNDQEWQELID 
ELANTVPEDDIQDLFNEDFEE KKE PEFS 
QPATETPLSQESASVKSDPSHSPFAHVS 


4349 


18250 


A 


4375 


292 


429 


LKISVFPSSTEISLQQRLEYIARAILTA 
KS STG IAS I S ADGES LRE 


4350 


18251 


A 


4376 


544 


32 


ALHACACLGLS PEACFLHLLQLTGTSPG 
PVDGQGLDNQGFRLLKAI LGVS PAPEPV 
HGAS EHNADTGVSVARACVLPTS PTLAR 
TAAAKDESPPLAGliTMGPSREPGGSCLP 
LPSGLLS S PTRLALPRR PPS VS VWQDAG 
SSGSMPTEPLVQCGHRFYGTLVSWAVMV 
LK 


4351 


18252 


A 


4377 


378 


232 


PAKRAEEELLLHDTRCWLNGGAMPEARH 
PRTGAS ALHVAAAKG Y I EVKR 


4352 


18253 


A 


4378 


291 


1 


QMLSVDVASRYRAPSTYVLNSLKEGMDG 
LHGESCSSFLLGPSVAMNMQTAGLEMGI 
CDGH FRQNVGCG YVLKP DFLRD IQSSLH 
PEKP ISP FKGQSL 


4353 


38254 


A 


4379 


171 


458 


RGPLFLTLHIVNSLQGNNFRREYIVTQG 
PLPGTKDDFWKMVWEQNVHNIVMVTQCV 
EKGRVSKQLSWHQS FAFCHLDRI LPKKQ 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=G)utamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidihe, I=Isoleucine, 
K=Lysine, Ij=Leucine, M=Methionine, 
N=Asparagine, P^Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, ^possible 
nucleotide insertion 














SETPQNNVGIM 


4354 


18255 


A 


4380 


214 


410 


LDHLPTHPSPTGE I EVKWFEAVQTGLPM 
CILGAF FGP I RLGAQSLQ VLD S EL I PWA 
VQNGRIAPC 


4355 


18256 


A 


4381 


2 


74 


IGDSGVGKSNLLSRLTRNEFNLGL 


4356 


18257 


A 


4382 


290 


119 


IATVSLNIVKMPRLPKAIYRFNAIPIKI 
PMTFFTE I D KT I LKCM WNHK I IPGNSPA 
H 


4357 


18258 


A 


4383 


2 


423 


LERVCWI KDI WAVTGENME VMKS 1 1 QK 
YQHKRI SLVEAGVTRHRS IFNGLKALAE 
DQINSKLSKPEWI IHDAVRPFVEEGVL 
LKWTAAKEHGAAGAIRPLVSTWSPSA 
DGCLDYS LERARHRASEMPQAFLFDV I Y 


4358 


18259 


A 


4384 


356 


1 


WYPCCLLASLI C VFRGLF YLSLCLVLRV 
PVPLCLQFPGSLLGSLLLFSLLRQGLAL 
WPRLE CS S AI RAHCSLELGSNDSPASAS 
QSAGITGMSQCAWPLSLLSVSPRVPGLS 
SVPLIF 


4359 


18260 


A 


4385 


1 


125 


SGSKKTLRSSIiEEPTILDIEKFHRESFF 
YTHLINFSGKRYC 


4360 


18261 


A 


4386 


53 


215 


WQLYVJTTWCQKISRJSTQGLAHVPRMECSS 
MIIAHCWLKLLGSSDPPASASQIDPS 


4361 


18262 


A 


4387 


2 


390 


PSLAARVLARGYGNFSVFVLWLRKNDRM 
HGECAPNVSVAVSTSHTTI IGGGIRGGG 
GGGYGSGGSSYGSGGGSYGSGGGDGGGR 
GSYGSGGGNYGSGCAGSGHGSYCSGSSS 
GGYRRGSGGFGSCNSGG 


4362 


18263 


A 


4388 


2 


441 


I KTR I S V I HKEE FGLTP I EGAI EDMKNK 
TLQIiAFAINQEPSDAKMLQMVLKGSEGA 
TVNQGPLE VAQVF1AEI PADPKLYRHHN 
KLRLCFNEFIMRCGEAVEKNKRLITVDQ 
MEYQQELKKNYNKLKENLRPMIERKIPE 
LVKPIF 


4363 


18264 


A 


4390 


242 


24 


KLGNFLGGCVKESPFVPQGGMQWPDFGS 
LQHPPSRFKQFSCPTLPGNWNFRNAPLR 
SGKFFFS PVFLVETGY 


4364 


18265 


A 


4391 


1 


244 


VDQQMRQNLFTTGAHHLQQANIQFRTDI 
ARTEYLSNADERLRWQASSLPADDLCTE 
DAIML.KRFTRYELGCQGEAWEECGW 


4365 


18266 


A 


4392 


2 


146 


LDLLSQPCRAVYI FAKKNDI PFELRIVD 
L I KGRS S LGFGE PKSQEGDR 


4366 


18267 


A 


4393 


204 


489 


GAASEHPIPKGESHGTSKASSLCADAGP 
AGHFASVYNPLAWTVTTIVTIiTVGFPGV 
RVTDEAGHPVPSQVSGIQRCLQGMKAPP 
VTVTVSGEKGS 


4367 


18268 


A 


4394 


2 


171 


LSDGFQPSRSI I FASWSAGDFGSVGATE 
WLEVFFI I PSHIGHELVGLGCCPEVMFL 


4368 


18269 


A 


4395 


2 


197 


WLELLKFYTLDCALEEYVICVRIQDILT 
RENNNV7 P KRR IAI EGEMI CC I S FVEYF I 
CATHYLIQC 


4369 


18270 


A 


4396 


2 


285 


TVDFPFQPYKCQQEYMTKVLECLQQKVN 
G I LES PTGTGKTLCLLCTTLAWREHLRD 
G I S ARK I AERAQGELFPDRALS SWGNAA 
AAAGDPIGPS 


4370 


18271 


A 


4397 


366 


461 


TSNFS CSLVLEEEASDYLELDTI KNLVK 
KYSQ 


4371 


18272 


A 


4399 


357 


3 


DVLRNNFARSAVLYLF I DRVTLWNAPRL 
ECSGAI IAHCSLKLLGSNDPPTS 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P-Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /-possible 
nucleotide deletion, \=possible 
nucleotide insertion 


4372 


18273 


A 


4400 


2 


343 


DIDFKYHFDFFSVNFNEELVALYGGSLQ 
KQTKFVHE CI KT I LKLYKGQE FAPKSVA 
I IGKSMGGLVARALLTLKNFKHDLINLL 
ITQATPHVAPVMPLDRFITGSTGWLKDH 
G 


4373 


18274 


A 


4401 


214 


430 


ELGNILNYVVRWEAEAELSNRWFNGQAV 
HGELSPVTDFRESCCRQYEMGECTRGGF 
CNFMHLRP I SQNLQRQ 


4374 


18275 


A 


4402 


3 


227 


LTQVS PQMTGHAG LNTAQAGGf4AKV S EL 
KHFQYFLPNRGLSLRVMRKMGPRGPLLF 
FLLGSPRATHAWKRQISF 


4375 


18276 


A 


4403 


158 


313 


NRDEFCHVAQAGLELLGSSS PPAATSQS 
ACITGVSYHVRPTLSNKFGCSSL 


4376 


18277 


A 


4404 


3 


628 


HCIREGGQDVPSNKDVTSLDWWSEGTLL 
ATGS YDGFAR I WTKDGNLAS TLGQHKG P 
I FALKWN KXGN F I L S AG VDKTT I I WD AH 
TGEAKQQFPFHSAPALDVDWQSNNTFAS 
CSTDMCIHVCKLGQDRPIKTFQGHTNEV 
NAIKWDPTGNLLAS CSDDMTLKI WSMKQ 
DNCVHDLQAHNKEIYTIKWSPTGPGTNN 
PNANLMLASASL 


4377 


18278 


A 


4405 


35 


180 


MCVDYHYFFCLLGPNNNNPQTSAVRTPT 
QTNGSNVPFKPRGREFSFGK 


4378 


18279 


A 


4406 


1 


438 


DFQRPRDDHGDVDWEKLVELLTDCSNLQ 
DQTY I LY I !>YV I KG PS WBTNLSGQNGVT 
VQNLLGELYGKTGLNQEWGLIPYISSLF 
RKKVDVLAEACTDLLSDQKQLTVGLSDE 
PREKI IYAPL.PPKELTKLI YEASGQDI I 
IGVLTQ 


4379 


18280 


A 


4407 


1 


428 


TESVNAYFKGADPTKCIVKITGDMTMSF 
PSGII KVFTSNPTPAVLCFRVKN I SRI»E 
QILPNAQLVFSDPSQCDSNTKDFWMNMQ 
AVTVYLKKLSEQNPAASYYNVDVLKYQV 
SLNGIQSTPLNLATYWKCSASTTDLRVD 
YK 


4380 


18281 


A 


4408 


174 


383 


KN S W PGAVAHACN P S TLGGKGRQ I TRS L 
YRDHPVQHGEI PLYLKLTALAVTFLGLL 
TGLHDLYLTTNLT 


4381 


18282 


A 


4409 


1 


288 


LMAEKDS LDPS FTHAMQLLTAG KGASRV 
PLGRPAIPGMSGPGFVPLASRPLGTEAV 
GSSGLILI FGSLDSTFPPQPGVRGRISR 
LQWRPEGDFAIP 


4382 


18283 


A 


4410 


292 


422 


LVHSSCFQVSSKILEIARKQRMNTDIRR 
NI FCTIMTSEDFLDA 


4383 


18284 


A 


4411 


412 


238 


FFSRHRVSQWDQAGLELLTLSDLSALAS 
HSAEITGLNHHAQPLTTNFNKYCWISLQ 
VH 


4384 


18285 


A 


4412 


1 


405 


VTS SDKS LKVLLDAEDKVFNE I RNEHFS 
NVFGFLS QKARNLQAQYDRRRGMD I KQM 
KNFVSQELKGLKQEHRLLSLHIGACESI 
MKKKTKQD FQEL I KTEHALLEGFN I RES 
TSYIEEH I DRQVS P I ESLRLMCL 


4385 


18286 


A 


4414 


2 


149 | 


GFHHVGQAGLEVLTSDNLPVSASQSAGI 
TGVSHCAQLRSDGFKKRS SPA 


4386 


18287 


A 


4415 


1 


366 


DI CHRKLGVECPSHAS INRLWQWSS I 
TASLRFEGPLNVDLIEFQTNLVPYPRIH 
FPMTAFAP I VSADKAYHEQFSVSDITTA 
CFESSNQLVKCDPRLGKYMACCLLYRGE 
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NO: of 
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sequence 
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beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=G!ycine, HHHistidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine t S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop eodon,/=possible 
nucleotide deletion, Y=possib!e 
nucleotide insertion 














WPKEVNAAI 


4387 


18288 


A 


4416 


1 


164 


RRRCEMSTMFADTLLIVFISVCTALLAE 
GITWVLVYRTDPFGTLAGSEGPGGRY 


4388 


18289 


A 


4417 


2 


141 


DI AI PCNNKGAHS VGLMWWMLAREVLRM 
RGTISREHPWDLTSSVMG 


4389 


18290 ' 


A 


4418 


136 


1 


EQTKAFGQLHKVLGiVDPLPSKMPKKPKN 
ENPVDYTVQI PPRDPMQ 


4390 


18291 


A 


4419 


I 


280 


GNYCKYYGYRGPSCEDGRLRVLKPEWFR 
GRDVLYLGCNVGHLTLS I ACKRGPSRMV 
GLDIDSRLIHSARQNIRHYLSSELRLLiP 
QTLEGDPGA 


4391 


18292 


A 


4420 


3 


446 


TVSTSQVRQNYHRDSQAAINRQISLELY 
ASYVYLSMPYYFDRDDVALKNFAKYFLH 
QSKEDRDHADKLMKLQNQRGGRSLLQDM 
RKPDCDDWES AliNAM E W ALHL D KNVNQ A 
UjELHKLTTDKNDPDLS D Y I ETHYLNEQ 
VTAINELG 


4392 


18293 


A 


4421 


6 


182 


DPSFCSEPRSCHCTPTWRTEQDSISEKK 
KKRN I LKNWAKNY ITLESEKERKSKQRS 
RR 


4393 


18294 


A 


4422 


1 


456 


GPISYVWMPKKRQALVEFEDVLGACNA 
VNYAADNQIYIAGHPAFVNYSTSQKISR 
LGDSDDSRSVNSVLLFTILNPIYSITTD 
VLYTI CNPCX5PVQRI VIFRKNGVQAMVE 
FDSVQSAQRAKASLNGADIYSGCCTLKI 
EYAKPTRLNGSK 


4394 


18295 


A 


4423 


2 


257 


FHHVGQAGLKLL I S VDPPALDSQSAE I T 
GVSHCAQPVLCILNQLCAKAVLSPDSHP 
YPPKSGRHEGQVREQKMSPCSYIYFKRT 
V 


4395 


18296 


A 


4424 


106 


259 


CCCFKYFFSNVKI CFYRDLKPENILLNE 
DMKIQITDFGTAKVLSPESKQGV 


4396 


18297 


A 


4425 


3 


364 


MDEIEKYQEVEEDQDPSCPRLSRELLDE 
KEPEVLQDSLDRCYSTPSGYLELPDLGQ 
PYSSAVYSLEEQYLGLALDVDSEYLTVK 
VISLHLVFQIGVIFLiFQVPLTDPRDVIA 
AGRTYGRI 


4397 


18298 


A 


4426 


2 


241 


HHVWLI CFVFFVCVETGFCHVAQVGLER 
LGSSDLPTSVSLSAG ITDVNHWAQSKPS 
PSKGTGMNRP I GLRVGKGVWSRL 


4398 


18299 


A 


4427 


2 


438 


TRLIGPNCPGVMNPGKCKMGIMPGHIHK 
KGR I G I VSRSGTLrT YEAVHQTTQVGIiGQ 
SLCVG IGGDPFNGTDFIDCLE I FLNDS A 
TEGI ILIAEIGGNAQENAAEFLKQHNSS 
PNS KPVES F IAALTS S TGRKMGHAGQLL 
AEEKV 


4399 


18300 


A 


4428 


34 


427 


AGMLPAVGSADEEEDPAEEDCPELVPME 
TrHSEEEEKSGLGAKIPVTIITGYLSAG 
KTTIiLNYILTEQHSKRVAVIIiNEFGEGS 
ALEKSLAVSPGGELYEERLELINACLCC 
S VKGNGL I AI ENLMRQKGE 


4400 | 


18301 


A 


4429 


330 


8 


YSDRISHWIDMCPSKYIFRGPVHPPHIS 
TFLKTAHNLVARGNYENMSPHDEEKGAY 
QRKEKCCDDPMAGAARIHFLCFFHMGFR 
SVSQAGVPWPHITSMQPQPPR I K 


4401 


18302 


A 


4430 


446 


222 


LTSSDPPASAPQSAGITGVSHRPQPIWG 
VSIQLYFSGCPDTPGQGRLTREHCDGQD 
GEQGG PCS E S VAQFCHDWS 
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SEQU> 
NO: of 
nucleotide 
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SEQ ID 
NO: of 
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sequence 
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NO: in 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
£=Glutamic Acid, F=Phenyla)anine, 
G=Glycine, H=Histidine, l=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


4402 


18303 


A 


4431 


3 


243 


AREDSADFVSFFPDFVWTLRDFSLDLEA 
DGQPLTPGTSQKDNNFILPRLCIRKFTS 
LIRTSRSYQSPDTVSVASSRGAR 


4403 


18304 


A 


4432 


433 


0 


PPPSPPPPPPPPPPPPPPPPPPPPPPPP 
PPPPPPPPPPPPPPPPPFPPPPPPPPPP 
PPPPPPPPPPPPPPPSPSSPSSSS 


4404 


18305 


A 


4433 


25 


185 


ARWVACSELRLRCCTPAWATEPEPVFKK 
KKKVQQQGGWPGGLTPVSYRFGRLT 


4405 


18306 


A 


4434 


2 


329 


VRD I KEKLCYVAVDFEQEMATVE FSSSL 
SKSYELPDGQVITIGNERFRCPEALIQP 
SFLGMESCGMHETTLNSIMKCDVDIRKD 
LYANTVLTGGTTMY PGI ADRMHNE I 


4406 


18307 


A 


4435 


2 


374 


WVTFISLLFLFSSADSRGGFRRDAHTSE 
VAHRLKDLPEENFKALLLIAFAHYLHQC 
PFEDHVELLNEVTEFAKTCVADESAENC 
DQSLHTLFGDTLCTVATLRETYGDMADC 
CAQQEPERNECF 


4407 


18308 


A 


4436 


2 


352 


KVRRIDS RLAELDRE I RNPQ I S CQLI YS 
PSLFLGAFFLFFFFGFFFPPPYLIAPKN 
QFNPPPLCTGGPVPRPKPPIPPWEPPRP 
APPLFSPQWDLGFQIAPFLWVINGGGPS 
GGSRA 


4408 


18309 


A 


4437 


2 


369 


QSCETLFHSWKDVEVCSSALSCLSQVSV 
HLQGLESLFILPGMEVEQRDSQMALVES 
LE YVRGE I S KAMADFTTWKTHLLTS DS Q 
GGNQMLDEGFREDFSEQME I AI RAI LCA 
IQNLEERKNE 


4409 


18310 


A 


4438 


2 


231 


SDTSRPQLPCPAARERFPDGPLSLRPLP 
FFLSQVYNEQIHDLLEPKGPLAIREDPD 
KGVWQGL S FHQVWDWARVG 


4410 


1S3U 


A 


4439 


1 


365 


MAHAMEEVKKCLGPDMMDDI CHEQFLEL 
S Y LKG VPE P SRGRGVP VRGRRAAP P P P P 
VPRGRGVGPPRWALVRGTPVRGAITRGA 
TVTRGVPPPPTVRGAPAPRALTAVMQR I 
PLSPPPATK 


4411 


18312 


A 


4440 


92 


510 


LAALPDGGQELHIPRSRAGPAGELSWAV 
VGTGALSSL>S ILFQMARCRP WPPLRPNS 
PLQGWIFNWPWLVAIPASLFSGFLSDH 
LINQG YRAI TVRKLMQGMGLGIiSS VFAL 
CLGHTSSFCESWFASASIGLQTFNHR 


4412 


18313 


A 


4441 


2 


266 


FPPHLPELGKWECPWHQCDECSSAAVSF 
CEFS PHS FCKDHEKGALVPSALEGRLCC 
SEHDPMAPVSPEYWSKIKCKWESQDHGE 
EVKE 


4413 


18314 


A 


4442 


2 


279 


KGAELVFLPSNTRSLMYPUDQGWRTFK 
AHYAGYSMERIVSAMEENfPGREKIMKKL 
LKLSSPKE I PAGEKLCPNDAKDFTELDD 
RVIHENHE 


4414 


18315 


A 


4443 


2 


371 


KWVTFISLLFLFSATDSRGVSRLDAHTS 
EVAHRFKDLGEENFKALEAIAFAQYLQP 
W PFEHHX^ELGNDVTES AKTCGADE SADN 
CDKSLHTLFGDTLCTVATLRETYGEMAD 
CCSQPEPCEKG 


4415 


18316 


A 


4444 


2 


372 


WVTFISLLYLFSSGYSRGVFRRDAHKSE 
VAHRFKDLGQENFKALVIj IAFAQ YLQQG 
SFEDHVKLMNEVTEFAETCVADESSDDC 
DKSLHTLFGDTLCTVATLRETYGEMADC 
CAOOEPERNEC 
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SEQ ID 
NO; of 
nucleotide 
sequence 


SEQ JD 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 

NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 
C=Cysreine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M-Methionine, 
N~Asparagine, P=Proline, 
Q^GJutamine, R-Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 


4416 


18317 


A 


4445 


1 


283 


FRKNAEDILTMEGLRATMKHELEAAQKK 
HSLWELLRIPNICKRIWILSFVRSAGVC 
WRSTPDPVCLG I TSGGCRTAQ I P ACS FL 
WKLHSIGAPA 


4437 


18318 


A 


4446 


173 


2 


KKKKCVLYPEGHGVFHALVCVCVCVCVC 
VCVCIQVNFRNFCFISWINLSVQVSLLD 
Y 


4418 


18319 


A 


4447 


104 


221 


S CHLH YS LGDKSE TLS KNHH YHHHHHNH 
HHHPAGYFKMK 


4419 


18320 


A 


4448 


241 


1 


WVS WDKAPS WVRGLGSGSRGGVWGGGS P 
PHRVTDVSCSFLPDGRHVWEMEAKTDRD 
LCKPVSAFGAAGATVGPQGLGGRC 


4420 


18321 


A 


4449 


229 


349 


WKRSHS FGLDWAHACNPSTFGGQGGRI 
MRSGVQDQPGQY 


4421 


18322 


A 


4450 


76 


189 


TESKPAATTRSSGGGGGGGGKRGGKKDD 
SHWWSRFQKV 


4422 


18323 


A 


4451 


40 


359 


SGEPLFPFFFSLARLIPGVCFVEMHTRV 
RLLIGLKIWEKKVPQVSTPTLVEVSRNL 
RKVGSKCCKHPEAKRMPCAEHYLSWLN 
QLCVLHEKTPVSDRVTKCCTES 


4423 


18324 


A 


4452 


2 


371 


KFQNALLVRYTKKVPQGSTPT I VKVSRM 
LGKVG I KCCKHSEAKGMPCAEDYLS WL 
NQVCVLHEKTPASDRVTKCCTESLVNRR 
PCFSALEDYETYVPKECNAETFTFHADI 
CTGYENERLFQ 


4424 


18325 


A 


4453 


118 


369 


GRY I LLL KTKKQTANNN I KT PQ YL S NM S 
KKFRHSE F F FFYLKWS LALQRCNLG S L Q 
PTPPRFNQLSCLSLPNSWDYRCVPPNLV 


4425 


18326 


A 


4454 


364 


3 


GHLSLQRLLLPFVWLCPAPRGGAYRGRQ 
ASLSCGGLKPVRASRLLCLFNQACTMAG 
APPPASLLPCSLISDCCASNQRDSVGVG 
PS EP CAGYNLLVCRFLS PSEKRS PSAGV 
MRFSRCRJj 


4426 


18327 


A 


4455 


1 


379 


AAFTECCQAADRAACLLPKLDELRDEGK 
ASSARQRLKCASLQKFGERAFKAWAVAR 
LSQRFPKAEFAEVAKLVTDLTKVHTECC 
HGDLLECADDRSDLAKYICENQDS I SSK 
LKECCEKPLLEKSH 


4427 


18328 




4456 


1 


362 


LRPARSLVFPWFAPGGSGLRGLKLLEAK 
CQGDGVSYEETTIPRPSAYHNLFGLPLI 
SRRDAEWLTSRELDSLALNQSTGLPTL 
TLPRGTTCLPPALLP YLEQFRR IVFWLG 
DDLRSWEA 


4428 


18329 


A 


4457 


250 


330 


EWW THLWLN EGFAS W I E YVC VDH C F P E 


4429 


18330 


A 


4459 


96 


319 


YLT I PFS QAMENCKDARLTTS I GAYNFN 
HRLLEM I LNKPGLTYKP VCNQVS ALS LL 
S FLFFMPLFLS YCQVS IH 


4430 


38331 


A 


4460 


50 


339 


NSQTLKLSNVQCLKTRALFFFPKGI YPQ 
I KTLPPCPTSGPDP YVRVNLLPKGKGPC 
L KKI S GNRKTLKP L FD KTLKFLVPMS KV 
PKSPLNVPVKNI 


4431 


18332 


A 


4461 


2 


357 


G YDGVKRWTRDGDI FNKSLLLI PIHLEV 
HWSLISVDVRRRTITYFDSQRTLNRRCP 
KHIAKYLQAEAVKKDRLDFHQGWKGYFK 
M YCKHLALSQPFS FTQQDMPKLRRQ I YK 
ELCHCKF 


4432 


18333 


A 


4462 


182 


325 


IRLSSWLCDCASCQVYEFRVKESS 1 1 AP 
APAEDVDTPPRKKKRKHRL 
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SEQ ID 
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nucleotide 
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SEQ 10 
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peptide 
sequence 


M 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cystei ne, 0=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K^JLysine, l>=JLeucine, M=Meth ion ine, . 
N=Asparagine> P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


4433 


18334 


A 


4463 


1 


338 


SRHS PGPTPQPDCSLRTGQRS VQVSDTS 
SCSQLSSSSGSSSSSVAPAAGTWVLQAS 
QCSLTKACRQPPIVFLPKLVYDMVESTD 
SSGLPKAASLLPSPSVMWASSFRPLLSK 


4434 


18335 


A 


4464 


2 


343 


I IDFFLGAYLKDEVLKIMPVQKQTRAGQ 
RTRFKAFVAIGDYNGHVGLGVKCSKEVA 
TAI RGAI I LAKLS I VPVRRGYWGNKI GK 
PKTVPCKVTGRCGSVLVRLIPAPRGTGI 
VS 


4435 


18336 


A 


4465 


1 


371 


INSTLKMSFVGENSGVKMGSEDWEKDEP 
QCCLEEIFFAliASSLSSASAEKGSPILIi 
GVSKGEFCLYCDKDKGQSHPSLQLKKEK 
LMKLAAQKESARRPFI FYRAQVGSWNM1. 
ESAAHPGWFIC 


4436 


18337 


A 


4466 


246 


12 


LCFTPLPSYILFFFETKSHSASPKLECS 
GVITATSEVILLGGGGCSELRSCYCTPT 
WVTKLDSITKKRRKRKRKDLY 


4437 


18338 


A 


4467 


2 


355 


WVTFISLLFLFSSAYSRGVFRRDAHTSE 
VAHRFKDLGEENFKALALIAFAQYLQQC 
PFEDHVKLANEVTEFAKTCVADESAENC 
DKSLHTLFGDKLCTVATLRETYGEMADC 
CAKQEP 


4438 


18339 


A 


4468 


2 


361 


EDHVKLVNEVTEFGKTCVADESAENCDK 
SLHTLFGDKLCSVATIRETYGEMADCCA 
KHEPERNECFLQHKDDNPNLPRLVRPEV 
DEMCTAFHDTDETFLKKYLYEIARRHPY 
FYAPELLL 


4439 


18340 


A 


4469 


1 


373 


SSQPSSPSESVSGTVSVSPSSLPSSPCIi 
SLSVCLLVCVSLSMFFSLPVCLSVPLCV 
SPRPHPSVSHAHPHRASALVNCPWGPAP 
TSAPVS FYKPQS PALKTGQAS PHPHPHP 
FPTPSPMQSQKK 


4440 


18341 


A 


4470 


236 


371 


KVLGAVLKDKGGLFCFVFGDRVSLCHPG 
WSAWQSQLTAAVTSQA 


4441 


18342 


A 


4471 


345 


201 


AWMTAHLFMAWFTEYFKSTAQKKKFVFK 
TLLL I DNFDPLWDS KEGKL 


4442 


18343 


A 


4472 


1 


144 


AGFHHVGEAGLGLLTSRDPPASPCKRAG 
ITGLNHHTWPFSVILTYTYY 


4443 


18344 


A 


4473 


3 


333 


KNHLEASIGEHPFFPDHSPSCPHLLSSS 
NYKSLSCFLGGFFSETGSAPSPRLECSG 
VIAAHCSLKLLGSTSPLTSASWVAGITG 
MCWLYSPAVSPRPSPDLTCTSQLENI 


4444 


18345 


A 


4474 


288 


1 


TSAVNRISIKSQADQEATEQKLRDYSHQ 
GIQILAWQKFEYKPGKVTMDPEDCILQL 
AKKKEEEEEEEEEEEEEEEEEEEEEEED 
QEEEEEEGEEEE 


4445 


18346 


A 


4475 


3 


124 


PRLECSGMIMAHCSLDLLDSSDPPTLAS 
QVAGTTKKI SLH 


4446 


18347 


A 


4476 


196 


2 


SCYEHPGRIVYGFLFLRQDLAVTQAGVP 
WHDHGSLQPLPPELKRFSGTSHLLSSQG 
YRRAQPRR 


4447 


18348 


A 


4477 


3 


162 


LLERNKELEGSLQQMYSTNEEQVQEIEV 
RALHVLAHTPRGPLGLCVPLCNLE 


4448 


18349 


A 


4478 


167 


2 


ACGISPFSLSSSCFGHVRHASLPLAFHH 
DCGFPEAS PAMLPVQPSQLNLFI KYPV 


4449 


18350 


A 


4479 


195 


26 


WVSMLDIHDNGPHEVAVICFVCFLSQSL 
ALLPRLECSGVISAHCNLQPGRQSQTPS 


4450 


18351 


A 


4480 


175 


1 


NRSATKCDWGEKRTSQELNLFFWFFVCL 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histtdine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N^Asparagine, P=Pro!ine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan. Y=Tyrosine, 
X-Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














FEMESHSVAQAGVQWRNPGSLKPLPPRF 
KR 


4451 


18352 


A 


4481 


122 


357 


YHAFLYQWYNLKREHKRFRCS LLII FLG 
GERKLQSIQLWNISHYIQIRRYVLEIPS 
GLGAVAHACNPSTLGGQGEWIT 


4452 


18353 


A 


4482 


137 


2 


FYS YLKI LEEMGPHYVFQAGLELLGSRD 
PPDSAYQSTEI IGMSHK 


4453 


18354 


A 


4483 


347 


208 


GQAGLE LLT SSL PAS AS QS AG I TGAGHH 
TRPRIYFLKKYLAFVLGS 


4454 


18355 


A 


4484 


411 


268 


RLNRRGGGCSEPRSCRCTPAWATEQNSI 
SKKNQQKTTKLPLCCRLILL 


4455 


18356 


A 


4485 


2 . 


102 


PCLLKMQQLAGHGGMHLCSQLLRRLRCF 
LHSYP 


4456 


18357 


A 


4486 


1 


113 


HSAHSTHAGHAGHTSLPKCWDYRSEPPR 
LAKFLNTRY 


4457 


18358 


A 


4487 


163 


353 


QVAATLNNLAVLYGKRGKYKEAEPLCKR 
AL EIRE KVQ KKGAWFFE I F VLC V CG FF 
FFFKRDP 


4458 


18359 


A 


4488 


408 


307 


RGCSEPRSHDCSAAWLTERDSVSKKNKN 
KSPKKQ 


4459 


18360 


A 


4489 


337 


179 


YS INFALI L I FLFYLGVLVWAAI INYHR 
LHGLNNKHLLIMIWMVGSPRLSCG 


4460 


18361 


A 


4490 


229 


345 


LENETLGRARWLT? VI FALWEAEASRSR 
GQEIETILANM 


4461 


18362. 


A 


4491 


180 


365 


ENNLKTLVALDMSFYFILFLFYFILFYF 
ILFYFLRWSFALVAQAGVQWRYLGSLQP 
LPPRFK 


4462 


18363 


A 


4492 


269 


385 


GNLQNLEKVFYKIQHPFMIKTLNKLVID 
RTYLNIIKAIY 


4463 


18364 . 


A 


4493 


103 


336 


QLLVS L INLFLQAFKLQ I DCGHTCMHF W 
RQNLCLLPPTLPTPRRNLTLSPSLECSG 
T I WAHRKLRL PGS HHS PASATR 


4464 


18365 


A 


4494 


305 


379 


YIKIENFCLGWAHTCNPSTLGGRG 


4465 


18366 


A 


4495 


47 


224 


ETATKTQRRWETALVPQHPHRPLAPVTW 
VKTTGNSTIANFLLILVYLSFLFLVYSC 
IQQ 


4466 


18367 


A 


4496 


3 


129 


ITLNPGGRGCSELRSCHCTPAWATERDS 
VSKKKKKFPPPPF 


4467 


18368 


A 


4497 


396 


112 


LPELEFEAAVSHDCASELQLGHQSKTLF 
QKKINRIINNNmSHIAISLLGIYPKEF 
KAGTQRD I CTPM I TASLFT I VKRQKQPK 
CLATNELVYRM 


4468 


18369 


A 


4498 


1 


152 


LSLPGSQTWWRALIVPATWEAEVGGSPE 
PRRSRLQVAMITPLHSSCGRRG 


4469 


18370 


A 


4499 


92 


2 


KRPGWTHACKPSTLGGRGGP ITKS WR 
DQ 


4470 


18371 


A 


4500 


116 


1 


AGI AGMCLHAQLI FLYFLVETRFCHLGQ 
AGLELLTLWS 


4471 


18372 


A 


4501 


99 


1 


QGT PVHRWWKCKIjVQPLWRTVW S FLRKV 
KIDNS 


4472 


18373 


A 


4502 


140 


288 


RQFHYVAQAGLELLGFSNLPALTSQSAG 
ITGASHHNQPTSRFLKEKKIG 


4473 


18374 


A 


4503 


330 


181 


FFFFLFFFFFFFFFFFFXXFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFF 


4474 


18375 


A 


4504 


171 


2 


VETVSKESSLRICLQDIRYFFFETKPHS 
IAQAAVQWHYQGSLQPQPLRLKQSSRLS 


4475 


18376 


A 


4505 


345 


74 


S VERTCHS PKPLMLFLPLVFQARDDILN 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
JE=Glutamic Acid, F-Phenylalanine, 
G=Glycine, H=Histidine, I=Jsoleucine, 
K=Lysine, L=Leucine, M=Methionine T 
N=Asparagine, P-Proline, 
Q=Glutamine, R-Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














GSHP VS FE KACE FGGFQAQ I QFGPHVEH 
KHKPGFLEACKPYQTAEPASALTLDFPA 
SRTVQK 


4476 


18377 


A 


4506 


268 


102 


KNIRELWPGAVAHACNPSTPGGRGRWIT 
RSGBWLNLRTCEHVILQQKGKFRFLIS 


4477 


18378 


A 


4507 


1 


67 


GLPKCWDYRREPPHPARNCKYF 


4478 


18379 


A 


4508 


339 


180 


ANFFVFFEEMGFHRVAQTGLEVLGSSDL 
PILASKSAGITGI SYSVRPQTNT2 


4479 


18380 


A 


4509 


174 


3 


NEPKWIYIIPGDPIiFFLLLLFFETESH 
YVTQAGVQWHDLGSLQPLPPRLQRFSCV 


4480 


18381 


A 


4510 


84 


3 


AHHIFTVGIDVDTRAYFTSATI 1 1 AI 


4481 


18382 


A 


4511 


127 


2 


hhl K I H C W PGWAHACNPS TLGVRGR W I 
TRSGVQDQPEQYG 


4482 


18383 


A 


4512 


109 


345 


SI FL I QGREGS CLCFLFGS FMCFVATGF 
PHVTWLILTLSNLPTS SYSRAGI TGVSH 
HARANNLFCFGSMPSHGITGLNF 


4483 


18384 


A 


4513 


290 


13 


RDLRPTPQKGPKNHTAPNPLQRAQTHVK 
KGRGPTPPTPTPSKDPGPHPPLRKPKPW 
GRVWGEKKKKNKNGPKDKKGKKKXKKKE 
RKIQHSRF 


4484 


mss 


A 


4514 


234 


362 


DGDVLSKFVNFFSFKWEGTPLIDGRRL 
KYRLNGDFLFLRLKL 


4485 


18386 


A 


4515 


190 


330 


CLALQSECFRRWS FALI AQAGVQWRDLS 
SPQPLPPVFKRFSSLHPFE 


4486 


18387 


A 


4516 


2 


199 


.CVCVCVCVCVCVCI FFFFLSKKWGPPPL 
CPPGGPKTLGPKES P PLAPKNRGNSGLT 
PPPPPTFFF 


4487 


18388 


A 


4517 


94 


339 


SAYLGLKFFLFFFFFFFFFFFGVFPPFF 
FPNKKKLKPPQI FFPPQKRPPI FFFPPP 
PGGKNFFFFGKFFFNPREKGPPKNLG 


4488 


18389 


A 


4518 


95 


3 


LYLGPGAVAHACNPS TLGGRGGQ I TRSG 
VQ 


4489 


18390 


A 


4519 


14 


121 


LIFFGRNRISLCCPWSQTPGLKPSSWL 
SLPKCSD 


4490 


18391 


A 


4520 


207 


2 


I SREKLCTVRI PDMHV IEDS ILFYYLFY 
SLALPPRLECNGTIMAHCSLHLPGSSNP 
PASAS QVAGTTS 


4491 


18392 


A 


4521 


169 


1 


ANLTQVGSYYSLSLHTLIQVTFFKNIRA 
VRPGTVAHACNPRTLGGPDGWITRSGDR 


4492 


18393 j 


A 


4522 


2 


82 


GFILCVCVCVCVCVCVCVCVLYFI FF 


4493 


18394 


A 


4523 


300 


3 


LFLPPSPLPSQI PGGQKPLPLI FPPRVP 
KKFIPPNFWAPPLISPRPFWVRFLGPPT 
QGFNYPLGPGPPPPPPF3LGFKKSPNPF 
FIKKKFSPPPKKQK 


4494 


18395 


A 


4524 


348 


94 


KPGEGGKGPQNPPPGGLKPGNPLNPGGG 
GSPEPNFPPWPPPRGAKKKTRPQKKKKK 
KKGRKESRLNYGMEKRMKEKEKVFLSAL 
N 


4495 


18396 


A 


4525 


313 


161 


RGHPKNLKSKEVNIRREAPGHPRRNKKK 
TPNLKKKKKKKKKENKKQKQKP 


4496 


18397 


A 


4526 


118 


3 


FFFFLRGSFALVAQAWQWRYLSSLQAL 
PPGFKRFSCP 


4497 


18398 


A 


4527 


153 


2 


KDAINKMKKWPGWAHACNPSTLGGQGG 
WITRSGAQDQPSHYGETPSLPK 


4498 


18399 


A 


4528 


258 


2 


GGQKRGLPGFIKKPPFRKIFLKNLGEGP 
GKNSLS KFFFFFFFETESHCLTQAEVQW 
LNLSSQQNKPPRFKQFLCLNHPSSWDYR 
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K=Lysine, L=Leucine, M=Methionine, 
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nucleotide deletion, ^possible 
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R 


4499 


18400 


A 


4529 


296 


39 


SIRIINIQILLGFFETGCHSVAQAHLKL 
LGSNDPPASASESTEITEKQHCTRLSIQ 
IFVAFPYSNIKLSTKKEPILIKLLLYNC 
K 


4500 


18401 


A 


4530 


292 


37 


SIRIINIQILLGFCETGCHSVAQAHLKL 
LGSNDPPASASESTE I TEMQQCTRLS I L 
I F VAF P YSNNKLS T KKES IL I KLLLYNR 
K 


4501 


18402 


A 


4531 


3 


138 


GLWNGSCLCVCVCIFMGMGACLVCICTC 
LYCCVPVNTCLCMDGR 


4502 


18403 


A 


4532 


380 


2 


GGPPKKRE S AKAPGGRNPQGGGRE KT PP 
PKKKRPPQKGGGKKTPPGGGGGPPRGGG 
KKKPPPKREKECHRGGR3GGGPPWTPK 
EGPPFFFFFFFFFFFFSLPFFFFFCFFI 
FILFFPNIVINGFY 


4503 


18404 


A 


4533 


3 


191 


IiSKITHAS I PVPVSKKKKKKKKEKKKKK 
XKKKKKKKKK2CKKKKGG3GFKKTLGGHN 
FSRGLE 


4504 


18405 


A 


4534 


370 


48 


GPAPEVSSNGGKGGGCPKGLGKPLGSQF 
NGVKGY PS PKSS PLFQAPLGGQGFPKPR 
GNGGTPPFFPKKKKKKCALGDGTPTSGC 
PRRSWNHPETQDGCGGWGSRVPR 


4505 


18406 


A 


4535 


147 


1 


KGPPLFFFFFFFFFFFFFFFFFFFFFFF 
FFFFS FYFY 1 1 YLF FLS FLRL 


4506 


18407 


A 


4536 


198 


1 


NPPYFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFLSFIFI FLYFNFI 
SSNIVFIGFY 


4507 


18408 


A 


4537 


2 


356 


QNRTTGNSKTQSAS PPPKECSAS PATEQ 
SWMENDFDELREEGFTRSNYSELREDIQ 
TKGKEVENFEKHLHECITTITNTEKCLQ 
ELTELKTTARELRDECRSLTSRRDQLAE 
RVSALE 


4508 


18409 


A 


4538 


347 


220 


KCSGTI IAQCNIKLLGS3NPPVSASQSA 
G I TGMS HHVQPKYC 


4509 


18410 


A 


4539 


308 


2 


FSISCPHDTPAAASQSAGITGVSHCARP 
IFSFICFFFICIMCIKYLLRTGDSVMYF 
HGLGCFGLCCYLES Y I RKLSKMKKLS I L 
ITSREKCKCILQINIGDR 


4510 


18411 


A 


4540 


2 


194 


IKQWSTALSPRLECSGTITAHCSLKLP 
GSASASWARTTGTCHHAQLIYFSETGS 
HYVDVDAA 


4511 


18412 


A 


454! 


101 


2 


MEENLGNTIQDIGRGKDFTSKTPKAMAT 
KAKID 


4512 


18413 


A 


4542 


1 


84 


AEEAEAEAEAEEEEEEE2EEEEEEEIKY 


4513 


18414 


A 


4543 


3 


183 


KIQILFFVSSKTRLRPSAWHPRFVFCFL 
RWSLAIAAQAGVQWHHLSSLQPLLPGFK 
VDAA 


4514 


18415 


A 


4544 


I 


335 


RKKKKKEKKKEKKKKKQGRKKRKRERKE 
EEAKKKTKNKDK 


4515 


18416 


A 


4545 


104 | 


2 


QKKKKKKKKKKKKKKKKKKI YDFFLQ IN 
CMCSHK 


4516 


18417 


A 


4546 


5 


108 


FKRLSCRSFPSSWDYRREPPRPANFCIL 
SRDGIS 


4517 


18418 


A 


4547 


131 


1 


RPRRPQVGLQDATSPI IESLITFHDHAL 
IIIFLICFLVLYALF 


4518 


18419 


A 


4548 


156 


3 


KGGQGMSCLSRSPSALLKFECRVI SSYW 
At '? V7RAF VVF ATRGAEAGES LEP 
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C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Jsoleucine, 
K=Lysine, L=Leucine, MNMethionine, 
N=Asparagine, P=Proline, 
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T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possib!e 
nucleotide deletion, \=possible 
nucleotide insertion 


4519 


18420 


A 


4549 


2 


340 


LNPGSAGCRELRSHHCTPAWTTERGSVS 
KKKKKKKKKKKKKKKKKKKTFFGGGPLW 
EREFLLGNFWEKKGGGANFFPLENPNGP 
QLKTWGKKAFFPRKGGGPLNPPQGFYQG 
A 


4520 


18421 


A 


4550 


165 


335 


PHYLPRRSI IHRPFKSKSPEPGAVAHAC 
NPSTLGGRGGWITSGQEFETSLINMEKS 
L 


4521 


18422 


A 


4551 


123 


3 


GRVDRLSCLSLLNSWDYGHLPLHLANFC 
IFSRDGVLSSW 


4522 


18423 


A 


4552 


3 


131 


HPEVDIPILPGLGIISDIGAYYSGISEP 
FGYVGMDVSYDPVRL 


4523 


18424 


A 


4553 


87 


364 


ILASQSWFRYAASAASGGLLETESQPS 
AAGTASAAVPSLEAQTPAAFRPTPPGLA 
PSSPPAPGPPAPTGKVCGRHFLRSESSA 
SGPLRALL 


4524 


18425 


A 


4554 


114 


0 


LTWAPHSLIETIRNQIIQALLITILLGL 
YFTLLQRSE 


4525 


18426 


A 


4555 


91 


3 


TTTTYIAIATACVCVCVCACVCPCVCVC 
M 


4526 


18427 


A 


4556 


239 


364 


C I K I T F S RPGAWHACNP STLGGQGGR I 
AMSRDRDHPGQQGK 


4527 


18428 


A 


4557 


250 


1 


STEHVKQLRNE FMKI SVI L I FLE I LLY I 
KVHLKLCTLTTDHLLLGIKSLKILFRPG 
AVAHACNPSTLGGRGGRI PRSGDRDKS 


4528 


18429 


A 


4558 


2 


90 


QDGLDLbTWGSACIiSLPKCVJDYRSEPSL 
F 


4529 


18430 


A 


4559 


65 


394 


DPVS KKKKKKKKI KNEKEKKFFGKKKPL 
KKKLRADLILKPKKKNFLKKREWKKREK 
GGDPFWKNKPQVGKTGNNLNLGQNGLGK 
GPSDLKKNTPETKLNLVQK-KFQIFF 


4530 


18431 


A 


4560 


93 


2 


GIISAHCNLCLPGSEDSPASASQVAGIT 
GQ 


4531 


18432 


A 


4561 


125 


4 


GAY I LYLNKKSKS IRPGMVAHACNLSTL 
GGRGGRI TRSGD 


4532 


18433 


A 


4562 


3 


84 


RSCHCTPAWATERETLSPKKKKKKNF 


4533 


18434 


A 


4563 


145 


3 


GFYHWPGGVEQSPCLSLPKYWNYHCEP 
PPLASIGFFFFFFETQSRS 


4534 


18435 


A 


4564 


125 


388 


I RKPQ I LYS PQS SENMQHLH I S VLMG VT 
GKRGKFMDILDVTERCRFSLKKCTGIQE 
DLNKWEGIPCLWI IRLNTVKMAVFSKLI 
CRFS 


4535 


18436 


A 


4565 


59 


325 


VARKGNLVS FLV I FY PLFD I E E V I E PL»W 
VLISSFDEGEKVELEFFFFFFFKKKSPF 
VTQAEFQWPNLSSLKPQPPRLKQFFCLG 
LLIT 


4536 


18437 


A 


4566 


326 


3 


PE KE FRRLVI KLI REAPEKGE AQCKE I Q 
KS IQEVKGEI FKE I DRIKKKQLKHQETL 
DILLVMQNALESLSNRIEQVEERNSELE 
DKVFDLTQSNKDKAKR IRKYEQS 


4537 


18438 


A 


4567 


333 


209 


EQAFDPYGTLSQGI FSPKFHEPFHKLMA 
R PAVALP FQVNLK 


4533 


18439 


A 


4568 


125 


2 


HNQSMSRS IDCSDP ILGHCNLCLPGSSD 
SPASASQVAGTTG 


4539 


18440 


A 


4569 


212 


2 


PI CLSGKKC YGQNVRVI KCKMWKHNHRL 
GWVKACNPNSLVGQGGRFMRSGVLEQH 
EQHABTPSLLKTLK 


4540 j 18441 


A - 


4570 


142 


14 


GRVDRSNPGRFDSTSNSSLYERPREIRP 
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TSQAPSPVNHIIST 


4541 


18442 


A 


4571 


2 


419 


ARDS FLHDSQTS FCFSDS I PTPSNMEET 
QQKSNLELLRISLLLIESRLEPVRYLRS 
MFANNLVHDTSDSDDYHLLKDIiEEGIQT 
LMGRLEDGSRRTGQ I LKQT YS KCDTNSH 
NHDALLTNYGLLYCFRKDMDKVETFLR 


4542 


18443 


A 


4572 


3 


194 


HEGRIiQSQLLRRLRQKNCLKTGGGGCSE 
PRSHHSFPAWVTEHGFISQISLKPLFIS 
HPEPFLL 


4543 


18444 


A 


4573 


2 


286 


CRDGLTYNDFLILPGYIDFTADHVDLTS 
ALTKRITLKTPLVYSPMDTVTEAVMAIA 
MALTGG IGF I HHN CTP E FHANE VRKVNK 
YEQGFITDPW 


4544 


18445 


A 


4574 


44 


325 


RQWLQEEAYITKEQKYSFLHDSQTSLCF 
SDSIPTSSNMEETQQKSNLELIiHISLLL 
I E S RLEP VR FLR S T FTNNLVYDTSDSDD 
YHLLQDLEEG 


4545 


18446 


A 


4575 


374 


33 


LRLARRSGASLCGSRSGGPPTCANICPR 
YAPL I PP CRTS LLE AQGLVMWGPDFRV 
RLVTSSERRFRYQPPPPGFNRFPCLSLP 
SNWD YRHAPP RLAN FNSL F F FF FLVE TG 
Y 


4546 


18447 


A 


4576 


400 


225 


PIFFWFFLFVLFLVETRLCHWQAGLE 
LLGS SNLP I SASQS AKITGMSHPAWPRA 
EF 


4547 


18448 


A 


4577 


2 


372 


ARETI FCFSNS I PAPWNMDETRQKSNLQ 
LLRISLLILLYRTRLTLPCITQLHILQT 
YKVNARCSHFHSSSTCGPLYSCTLYVSL 
TGLDMKLCHSTLLKQNSALMSLLKIQVN 
LNQVS PVFNAH 


4548 


18449 


A 


4578 


349 


174 


GCGEPRSHHCTPAWATRAKRCLKKKLKL 
QLKNWKESKHQPLQNFSHLTTSTINTQ 
EN 


4549 


18450 


A 


4579 


3 


124 


H ELIbN PEGEG CS E PR S CH CTLAW VTRAR 
LRLKKKKXKKKN 


4550 


18451 


A 


4580 


181 


1 


MSSYYFSPCSRLLMATVYNLHLPSSSNS 
PASASQVAGNAGARHYAWLI FVLLVETG 
LCSC 


4551 


18452 


A 


4581 


227 


3 


HRKRNRCVSRTALLFSLGDGVSPCLKI I 
III KVLFWLGEVAHACSPS PLGSRGGR I 
TRSGVRDQPGQHGETPSC 


4552 


18453 


A 


4582 


257 


2 


RVRQLGAFSLSPGSGSFSKILGALKIMG 
PPRAGPVGSPRYSRPLGGFGRQI PRSGD 
PGPPGLPRGNPVPLKKNPKRSLLILPAR 
A 


4553 


18454 


A 


4583 


3 


94 


HE I AMAFL I LTE RK I LG Y I QLRKG PNW 
GP 


4554 


18455 


A 


4584 


345 


50 


IYIYICIFLNRNRVSLHCPGWSKTPGLK 
QSSHLGLPKCWDYSHEPLCPAKTSHFKA 
QDRI PKNDPSCTPS PAQMSTNLIGWASS 
YLQSYSPAPYNNKN 


4555 


18456 


A 


4585 


338 


19 


FQLSRRFRPKNGGNPGSGACSEPRLGPC 
PSAWVTKRDS VSKKKKE YHTLSGLS TTG 
IYFSQFWMLGVQGASRFGVKGGAVSWSI 
DGAVLLRPHMAEGARQLSGLGF 


4556 


18457 


A 


4586 


2 


224 


ARGVGRAGLELQTSGDP PASASRGAG IT 
GVSHRVQLLGDRAKFSLKKKKRG I ILPG 
FFYSPLKKRFTLNTPDNP 
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4557 


18458 


A 


4587 


344 


168 


PPGTWHTYNPSTLGGQAGRIGSLGVQD 
KPGQHRQGLSLGVEDKPGQHSETPSVQS 
NP 


4558 


18459 


A 


4588 


212 


328 


GLAVAQAGLELLSSSNPPASASQSAGIT 
GVSHHTQPYF 


4559 


18460 


A 


4589 


110 


323 


LSNQKNQLYNPNLLIYCWRKLKSSYDK 
QAEVQGSKLGSLQALPHGFTPLSGLNLP 
SSWEYTRPPPRPTNF 


4560 


18461 


A 


4590 


2 


127 


ARGCS EPRSRHCTPAWATEQVS KKKKKG 
REKTFGQFFFWVI 


4561 


18462 


A 


4591 


211 


2 


ARGSLEAGSSTPAWATINQSTWKKTDKT 
ALYWKKMPSRTYKARQKSMPGFKSSKDR 
LTLFLGAKVSCLV 


4562 


18463 


. A 


4592 


3 


120 


HEKTKTNNPIKKWAKDMNRQLSKEDNDN 
KINGWPGAVF 


4563 


18464 


A 


4593 


388 


227 


GHVGQDGLELLTSDAPPSLASQSAGITG 
VSHRAQPKWLLFQHRSPCLPEAAGK 


4564 


18465 


A 


4594 


1 


348 


PPPPPPPPPPPPPPPPPPPPPPPPPPPP 
PPPPPPPPPPPPPPPPPPPPPPPPPPPP 
PLPPPPPPPPPSSPSSSPPNPPATTSVA 
C 


4565 


18466 


A 


4595 


2 


138 


ARAARGFRHVGQAGLELLTSGD PPASAS 
RSAGITGVSHHTQPTNF \ 


4566 


18467 


A 


4596 


315 


2 


ALKPNYSSTSMNIKARLPQYNINNLNliA 
PLYSSTSNQINI I VI ILINKENNHTLFP 
CLVSNSRLKQSSHLGLPKCWDYRRELLC 
LACDKFFISYSAPAIQLLSC 


4567 


18468 


A 


4597 


1 


146 


GTSVDQGSLELLTSGDPPASASQSAE IT 
GVSHRTQPPLTI IQLYLGYC 


4568 


18469 


A 


4598 


293 


3 


GGTPWPRKKLGQRKFPHRPPGGSGNPDF 
SLLFFFFLRQSFALVAQAGVQWCDLGSL 
QPPPPVNPFLLRKQFGFSTSCFPILPAS 
GSFFGREASASC 


4569 


18470 


A 


4599 


174 


2 


RSVAPFLKSARPQTYRKEPTPDTGSPQP 
PPPRFKRFSCLTRLSSWDYRHAPPHPAS 
C 


4570 


18471 


A 


4600 


59 


252 


LAPFGHELGTSESAHQTDNRHRLEGYQK 
RLDASGLERASYPLAAEFKVGGRGCSSQ 
PLTLCGYP 


4571 


18472 


A 


4601 


233 


366 


I HYLPD AVAHACNPS TLRGCGGR I TRS R 
DRDYPGQHGETPFLLK 


4572 


18473 


A 


4602 


182 


325 


LFVFCLFLRQCXSVAQAGVQWHDHCSLY 
PQPPSLKQSSRLGLPSSWDY 


4573 


18474 


A 


4603 


202 


2 


GGSSSGLLRPCAVLGLKQSSYLSLVRRW 
DHNSWLKRSTHLGLPKGWDYRREPPHLA 
CILKFSINLV 


4574 


18475 


A 


4604 


317 


2 


FRLGPQVFFSPPGAGFFFCGPVWGPGPF 
SRSLQIiLTFGAKFFFCLSLPESFGAPLR 
GPPSGFMAHCFFFLVLLFFFFFFRDSVL 
LCCPGWSAVAIHRRDPTTARA 


4575 


18476 


A 


4605 


233 


71 


FFVXjLVEIEDQLNEKKQEGKIRKKKRMK 
RNKQSLQKIWDYVERPNLAGHSGSRL 


4576 


18477 


A 


4606 


114 


1 


HPRPT^RVSS CCPKWSAMVWSRIiAATSAS 
RVQAILQSRA 


4577 


18478 


A 


4607 


348 


3 


LKFQNFRGWHGARCPSFFGGLGRGIPF 
TPEVEVLVNLGPAPALPFWPPSQTLFSQ 
NQKKNQKKKQERKREKERKEERKKKKER 
KKKEKE RER KEGR KKGQE KKRKI TAKSIi 
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LG 


4578 


18479 


A 


4608 


174 


3 


TVVTWVQQCVFVCLLKQGLALSPRLECT 
Gil IAHCNLKLLGPSDPPTSVEKTKKLV 


4579 


18480 


A 


4609 


171 


334 


DPHSSLNIRFSRSVAQAGVQWCNLNSLQ 
PPPPGFQRFLCFSLPSSWDYRCAPP 


4580 


18481 


A 


4610 


2 


164 


ARGSRGHIWSAFCHVGQAGLELLTSRPA 
LASQSAGITGISHRAQPPLNTLYSMS 


4581 


18482 


A 


4611 


2 


164 


ARE PRGHIWS S FCHVGQAGLELLTSRPA 
LASQSAGITGISHRAQPPLSTLDSMS 


4582 


18483 


A 


4612 


180 


334 


CCLMGSSYLLPRKPNTLRTAGFLQQRKS 
LIRPGMWHACNPSTLGGQGGWI 


4583 


1S484 


A 


4613 


189 


1 


SGFSRWENGLKCKGQKGGQS LVLLP I R 
VFFFFFFEMKSHSFAQARVQWGNLGSLQ 
PLPLGFK 


4584 


18485 


A 


4614 


216 


358 


KSTPRLGTVAHTCNPSTLGGQGKRITRS 
GDRDHSGLHKGTPRLSKNI 


4585 


18486 


A 


4615 


179 


393 


EKQEEEGSWGWAKTEDGVALMLSEAIGS . 
CWI KAFLTDAETFTFHAD I CTLSEKERQ 
IKKQTALVELVKHKP 


4586 


18487 


A 


4616 


2 


126 


ARVQRYLFLFFTLDGPLPDPSMIRGSVP 
NQMMPRITPQSGK 


4587 


18488 


A 


4617 


338 


181 


RVGEAGLELLALSDLPASDSQSAGITDI 
SHHTRPGSPFLEHHLYPRYRISGK 


4588 


184S9 


A 


4618 


3 


197 


HEKRPEVQNSTVEFIASSDYMLRPPLPL 
PFCFFLFFFKIKMGVSPCCPGSCQTPSL 
KQSSCLNV 


4589 


18490 


A 


4619 


, 1 


339 


GTRTFGSGNGPTKPDLLQELRVATERGL 
VIVNCTHCLQGAVTTDYAAGMAMAGAGV 
I SGFDMTSEAALAKLSYVLGQPGLSLDV 
RKELLTKDLRGEMTPPSVEERRPSLQGN 
T 


4590 


18491 


A 


4620 


3 


306 


LEPGDGGCSELKSCHCTPAWVTERDS IS 
KKKKKKKKGGPLKGTLGGPKFNGGWQRK 
IFSQKGGQKKPNLGILGRNLNFGGEKNW 
NKFLTKIKVLREKKNF 


4591 


18492 


A 


4621 


397 


29 


NTKE KILS ARKNNQVFPPHTQ KKN FF P K 
KRKNFREGIPPNFPPPKKSPPQKNHPPQ 
KKKPPKEKKKTIARPPKKRAPPGAFKKK 
PPPFFFLSSLCISYSFIYFFSLINLFFF 
FFFFHDSTVY 


4592 


18493 


A 


4622 


2 


235 


VSLCHPGGTISTHYNFCLPGSRDPPTSA 
SAYALLI FVFFVETGFPLVAQAGPKLQG 
TNSI PASASQSAGITGMNHCT 


4593 


18494 


A 


4623 


209 


324 


ILRVLWSGTVAHACNPSTLGARGRQIMR 
SGVRDQPGQY 


4594 


18495 


A 


4624 


52 


287 


N P FS F LF FS FLS F FLFFF FLE KG PRRGA 
RAQKGKPHPKKKKKKKRGGGKKNKTQKP 
FSPLKGPRGPPRGKGQTTPKGP 


4595 


18496 


A 


4625 


102 


1 


THS F P VS I S LS LSHTHTHTHTHTHTE I Y 
RVREIR 


4596 


18497 


A 


4626 


125 


1 


DRHVPQHTANFF WL I EMGFLHVGQAGL 
KLLTSGNPLVPLV 


4597 


18498 


A 


4627 


3 


167 


ERLRQQNCTNPRSEGCSEPGSPHCTPAA 
WATEQDSVS KKEKEKRKKEKKKKI W 


4598 


18499 


A 


4628 


135 


3 


AASTLSRGTITAQGNLCLPVSSNSPASA 
SRVAG I TG ARHDTWL 


4599 


18500 


A 


4629 


320 


210 


WFLGI I FRRVENAESVNIDLTYDIQSFT 
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DTGKTPSS 


4600 


18501 


A 


4630 


98 


326 


SSPTLAFQFQYFFSCATKEAVKLjSFPKS 
LLGTGSRSVAQPGEQWHNHGSLQPQPAQ 
LRLSSCLTLVSGWDHRCAPP 


4601 


18502 


A 


4631 


275 


2 


RVKLLLFYMRQSCSVIQVESSSTVNGSL 
QPQPPGPRQSSHVSLLSRWDHKRMPPCL 
ANFFLFFWRAKGIiAILPVHSNIjVNTBSP 
LISIKSN 


4602 


18503 


A 


4632 


180 


1 


KGLTLYFYFFI fkmrrglamladwsqtp 
elnqfsclglpkcwdfrhepphpvkefk 
pfhr 


4603 


18504 


A 


4633 


234 


5 


kplrdkplrs fsnlpkknlnffwpffpq 
kiwgffffffeteshcvtqagvqwldls 
slqppspglrqflclslsss 


4604 


18505 


A 


4634 


171 


2 


GRVDILTPWQHFETLFSFFSFFWQGLSV 
TRLE CS GV I MAY CS LDLLGS GNP PT S AS 


4605 


18506 


A 


4635 


324 


211 


LRNII FMPVTMNPGMYTGGCMCVCVCVC 
VCVCLCVCVC 


4606 


18507 


A 


4636 


95 


1 


MRQS INLSPRLECSGTIPAHCNLCLPGS 
SNS 


4607 


18508 


A 


4637 


234 


1 


KKGHDTLTEKAM PVTLS S RMLS INVCKC 
IYKVWKNSPNYSSLTHHINKIKPSFTRE 
NTLMFIHLSPILLLSLNPDIIT 


4608 


18509 


A 


4638 


2 


243 


WGKVNVDEVGGEALGRLLWYPWTHMYF 
EFFGAFYISFCLFFLIFPHLFFVSSIHI 
VSS FYLLFYNFP 1 1 FFTLFCH FFL 


4609 


18510 


A 


4639 


118 


2 


GR VDLQCSGAI S AHCKLHLQRSRHSPAS 
ASQVGAHATT 


4610 


18511 


A 


4640 


50 


168 


NKINKT'IHIjITFLKKKKKKKKKKKKKKKK 
KKKKKKKKKKN 


4611 


18512 


A 


4641 


315 


87 


G I FLRDRNLALL PRLECS GT I IAQCNLE 
LLSPKDSPASPSPVARTTGVQPRPDTLT 
LFEXil YIRYKGM I FSQGDC 


4612 


18513 


A 


4642 


383 


2 


KFKKKKFAGGKAPARLSPPLWGPKGGGP 
PRAGVLSPPGPPGETPFFLKNPKFPGHG 
GGPRKSPFFGGLKGSNSLTREKGGPINP 
NIAPPAFPPKGQNPGPFPKKKKKKKNRS 
SPAWNWDWIGTNAGK 


4613 


18514 


A 


4643 


3 


205 


LKRFFCFSLGGQGCSEPRSCQCIPAWET 
QRDSISKKKKKKKTRPGGLTLEPPPFGG 
VKP KGPLGAG I 


4614 


18515 


A 


4644 


149 


1 


FFP IKADLVFFFFFLRRNLADSQGAVQW 
HDLGSLQAPPPGFMPLSCLSL 


4615 


18516 


A 


4645 


474 


344 


AIHEHYLPESSASGDCDEDLQAACTILV 
NALMDFHQRLAAHTV 


4616 


18517 


A 


4646 


165 


357 


LCNALSHLLPQCTPTPLI SILCIIDTIS 
LLIQP ITHAVRLS AN I TACHLLMHLILL 
STLSICTL, 


4617 


18518 


A 


4647 


2 


339 


ARGENDLSPGGGGCSELRLKHCTPTWAT 
RVNCVSQKKKKNQPKPKPNQTKPNQTRK 
GSPGFPLLNLKVISPEKSLNPNILKTAP 
KOTFRCWIiTPEIPPFWKMEEKGSFDPRI 


4618 


18519 


A 


4648 


313 


187 


LTSGDP PASAS QS AG I TGVSHHALPPLV 
QFPSHSLTPTPLFC 


4619 


18520 


A 


4649 


236 


324 


GQVMWPGMMAHACNPSTLGGRSRWIVRS 
R 


4620 


18521 


A 


4650 


225 


313 


GQVMWPGMMAHACN PS TLGGRGRW I VRS 
R 
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X=Un known, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 


462) 


18522 


A 


4651 


240 


76 


FCVHAHTHTHTPS FQASGSSEHMHTYTH 
PSGSPEHTHTYTHTTFLASRSAEPQLP 


4622 


18523 


A 


4652 


273 


379 


KQLAAQCVRPLSPSQAIASPVTCTYMPR 
WPEVTEE 


4623 


18524 


A 


4653 


369 


189 


GTVKQAGAQCRNLGSLQPTPPPGFKQFS 
CKSLYSIMICYSKDCTMSTTLGIRANNR 
LHL 


4624 


18525 


A 


4654 


168 


346 


DRICRASSYTIHLMLYFVFLIFLFYWYI 
YFIjF FVL FHFT I F 1 1 LFYS F 1 1 LLLLLY 
FYN 


4625 


18526 


A 


4655 


56 


354 


WKRTFPFFFFLETGSGFASRPCTLLLFL 
YSLGPNPRVLIRVFLFLSSSRCYYYYLF 
LGLSLLFLFFYIMYLLFIRLFFFVLSGL 
LLLLSSYYRFI IFFF 


4626 


18527 


A 


4656 


50 


186 


SACLGLPKCWDYRHEALRPAMYLLTGSS 
YVAQAGLNLLGSADPPE 


4627 


18528 


A 


4657 


131 


366 


DHFVASSESVKEEGLFFVFVFFFFIFIF 
FFILYICFFYFKFI FI ILLYI III FIFI 
FYFFYLYYLILI FFLIFFI FIG 


4628 


18529 


A 


4658 


290 


155 


LEYIVEKGFHQDDKASPELLTSRYLSDS 
ASQSAGI TGHRAWPTKG 


4629 


18530 


A 


4659 


163 


372 


LRL FAL FAFLFM F F F YFL FLL YVF CFV F 
FLFFFFTFFLYYFFLYFFLYFLCSLVFL 
FFICFIFFWFFVFF 


4630 


18531 


A 


4660 


227 


378 


FI FFSNCTSTALVHPLF YLSRLVLS KNR 
NKFIGRARWLTPVFPALWEAEA 


4631 


18532 


A 


4661 


3 


141 


RSRHCTPAWQQSE I LSQKERKKERKKKK 
ERKKEKERKKERKKKKKK 


4632 


18533 


A 


4662 


346 


3 


TFKFLGGVCPFFSPPKKRVFSKNSPGGF 
FLPPFLGENFFFAPPVNFGPPGGFFKGG 
PPFFFFFFFFFFFFFFFFFFFFFFFFSQ 
DSFDNLGI FVIPYKIQDDFSSSMSNATS 
ILIKIAL 


4633 


18534 


A 


4663 


302 


3 


GDINRRSHSPFWSKPFAPCFMSSLDVF 
YKSSDFFYFFFETEFHSCCPGGISAYCN 
LCLPGSSNSPASASQVAGITGVRHHTQL 
I FAS WETFKITYKN 


4634 


18535 


A 


4664 


2 


76 


RLAPTPQLGGHWPPTGITPLNPLEV 


4635 


18536 


A 


4665 


1 


210 


S PTRS PKP TPP YYQTTLAKPFTKKKKKK 
KXKKKKKKXKKKKRGGAPLKKS PGGPK I 
NRGKKKKFFFLKGG 


4636 


18537 


A 


4666 


2 


356 


GSRPGGGKRTARRWTRPWRLRGACLGTG 
MAFAFHHQVI YKI KS FLRKKKKKKKKKK 
KKKKKKKKKKKKKKKKKKKKKKKKKPRG 
GGFLKKLWGGKFFMGGGKIFFFFIRGGK 
KKMWGG 


4637 


18538 


A 


4667 


1 


323 


RTRGIFCYDVCVESGCADIQLLLLCLKK 
KKKKKKKKKKKKKKKGAPLKKNPGGAQN 
NPGKKKKKFSPKRGPKKTPPGNFEKXPF 
FGGGPNGAPPPQKNKTPKEKKKF 


4638 


18539 


A 


4668 


167 


446 


ELNKGLCKPKKKKKKGGGRSQFKPTRPK 
KKKKKKKKKKKKKKKKKKKKKKKKKKKK 
KS 


4639 


18540 


A 


4669 


321 


52 


GVFSPFSPKKFFFSPKGLNFWGGFAPIF 
SPPKKRFFSKNPQGGFKYPPLRGKILIW 
GPRENWGPPGGFLKGAPLFFFFFFFFFF 
FFFFWG 


4640 


18541 


A 


4670 | 82 


359 


11AYG 1 1 LI KGNEDTKTVKWKSS CQLANT 
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WEFKIRALLGFNCFFLIFKRRGLTLLPR 
LDCNGVI I IHNNLESLGLSNPPMLASQV 
ARTTGLCY 


4641 


18542 


A 


4671 


249 


60 


LLIFIYYFYFFLFFFLFFFFFFFFFFFF 
LFVQVHLP VHL PC YDLS PLYKC VGVLVK 
CPFKYT 


4642 


18543 


A 


4672 


2 


319 


AAASTLVFVFSFFWVLGFFFFFFFFFWG 
PPKKI LKXKGGGGGGGGGGPRGGPRGGG 
KRNPGGQKKGIGGANGGQKKERNFQGGG 
PPRGEKRGGGGGPGNPKGGGLK 


4643 


18544 


A 


4673 


3 


451 


HESFNPVI ISQLERGEDPWVLDRKGAKK 
SQGLWSDYSDNLKYDHTTACTQQDSLSC 
PWECETKGESQNTDLS PKPL I SEQTVI L 
GKTPLGRIDQENNETKQSFCLSPNSVDH 
RE VQVLS QSMP LT PHQAVP SGERPYMCV 
ECGKCFGRS 


4644 


18545 


A 


4674 


446 


189 


SISLSTLVNVCQLISI^DPASGFVDHVC 
CI FT FYS INFALILI FLFYLGVLVWAAI 
INYHRLHGLNTKHLLIMIWMVGSPRLSC 
G 


4645 


18546 


A 


4675 


26 


170 


HLWFLVI VAHWASSRQI ISHWKNCTRH 
DCPVCLPLKNASDKRNQQSK 


4646 


18547 


A 


4676 


3 


378 


HEDEFDKMGNQHQALLrEAMEQQS ISLAK 
AGWCILPARTS 1 1 AVAHS FAGL.YLI FN 
TVSWFFFFSNFIFLILSLFNYFCISIY 
ILFLIIFIFIFISSLIVFFCLISFRFLL 
FIFFSALSRIFIY 


4647 


18548 


A 


4677 


18 


242 


PIGLALSPRLECSGVI IAHLLGSNNPPT 
SGSRVAGTTGIHHHAQPHCFNCI FPMSS 
EVECLFI CIjIjLIH JIjSRA 


4648 


18549 


A 


4678 


288 


2 


PKYRWVPEKPTTASENAEIPSERQKEGI 
KLTIRISSRKKKPDSPPKVLEPETKQEK 
PEKEEEKTMVGRTLRRSPRISRPPAKVA 
QIRDPKAHKKK 


4649 


18550 


A 


4679 


340 


208 


SVGQAGLELLTSGDPPATASQSAGITGL 
SQCYPPDSVPETFLPL 


4650 


18551 


A 


4680 


4 


379 


EENTG INLHDLGFGNGFLDMAPKAQATT 
AKKKKKKKKKIKIKSFLFLGGLFKKGKK 
TPPKGEKNFGNFKFWGGDLTPKI FGDPG 
ASTKKKKPFLKGGRGLKKTFFKKKKKRG 
PHPLESPSPFSVF 


4651 


18552 


A 


4681 


1 


350 


SRFSGAKEAKMEKKITGYTTVDISQWHR 
KEHFEAFQS VAQCTYWQTVQLDI TAFLK 
TVKKNKHKFYPAFIHILARLMNAHPEFR 
MAMKDGELVIWDSVHPCYTVFHEQTETF 
SSLW 


4652 


18553 


A 


4682 


365 


45 


S PKWFVAS YDPYAGPGRPTRPDNSRDET 
GSHSVSESSSGSDSKRESTSSDREANEP 
SHRASPEPEPPPTNKWQLDNWLNKVN 


4653 


18554 


A 


4683 


238 


372 


SATPLISSFSFLFFFFFFFREGVWLCHP 
GWSAVSRDCTTAAVWPG 


4654 


18555 


A 


4684 


3 


462 


KTKES REAVE KEFEPLLNWMKDRALKDK 
IEXAWSQRLTESPCALVASQYGWSGNM 
ER I MKAQA YQTGKD I S TOY YAS Q KKTFE 
I NPRH P L I RDMLRR I KEDE DDKT VLDLA 
VGLFETATLRSGYLLPDTKAYGDRIERM 
LRLSLNIDPDARG 


4655 


18556 


A 


4686 


410 


0 


FFFFFFFFFFFLLGKGGFGI IKFL 
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nucleotide 
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ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
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correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E-GIutamic Acid, F=Pheny!alanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q-GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Va!ine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \-possibIe 
nucleotide insertion 


4656 


18557 


A 


4687 


151 


2 


IFFFFFDRVQWHKLGSLQPLPPRFQQFS 
CLSLPSSWEYRHAPPRVRPRV 


4657 


18558 


A 


4688 


342 


183 


FFLFLRWSFALVPQAGVQWCDLGSLQPL 
S PGFKRFS WAEVGGLHB PRS QRLQ 


4658 


18559 


A 


4689 


126 


404 


NCQVWPLRFTGHSVFSVHPQWFILFYCF 
ALYFI FLEMGSYSVTQAGVQWHSLGSLQ 
PPQWSYVLTFLWHPSSWGYFVQNHLTPC 
FDGKVPWV 


4659 


1S560 


A 


4690 


2 


128 


GESLFYIVSPRDWVAKERDQDDKIDWL 
LEKKKYEVPL I LLL 


4660 


18561 


A 


4691 


407 


2 


YFFFPLPRMKI FFFPHQKGFFFGGCP I I 
SPPPNEVFFSPKEGGFFFSPPKKRVFFH 
NPPEGFFPPPPFFMSPPPPFFFFFFSLF 
FFFFFFFFFFFFFFFFFFFFFFFFFSFR 
IILLNQKKNLSACVELRSCCTFM 


4661 


18562 


A 


4692 


1 


176 


SRRLNPGGRGCSEPRSHHCTSTWSTERD 
SVS KKKKKRFFALKFLGEANVKPT I GGQ 
KN 


4662 


18563 


A 


4693 


1 


403 


SKAEIGRRLGLLRQTVSQWNAKEKFLK 
EVKS ATPMNTRM I RKRKS L I ADMEKVLV 
WIEDQTSRNIPLSQSLIQNKAIjTIiFNS 
MKAERGVEAAEEKFEASRGWFMRFKERS 
HFHNI KAQGEAASADVEAAASY 


4663 


18564 


A 


4694. 


343 


425 


HCFLSLQLNSFEQLCINYTNEKLQQIjF 


4664 


18565 


A . 


4695 


395 


2 


KNKGLFFFFFLRWSFTLVTQASNVTTAV 
CNVT I LAHCNLHLPGS S NS PAS AS RVAG 
ITGSCHQG 


4665 


18566 


A 


4696 


429 


0 


VTKWCYITKKKKKKKKKKKKKKKXKJCKK 
KKARG 


4666 


18567 


A 


4697 


371 


217 


IPVFKQSSCLSFPRSWDYRHMPPCVTQK 
KKIEYQHISLYKKPSKRCALFLI 


4667 


18568 


A 


4698 


415 


59 


LQQQCECFISELTKKKKKKKKKKKKKKK 
KRGGGLEKKKKKGFFAPYPPH 


4668 


18569 


A 


4699 


431 


0 


LKKIFPPGGRVLSPSPPPPPPPPPPFPP 
P 


4669 


18570 


A 


4700 


371 


245 


TLGDPPASASQSAGITGVSHHAQPLFGI 
F I YKFT Y I CRKTRT 


4670 


18571 


A 


4701 


410 


0 


KQCWFTQKKKKKKKKKKKKKKKKKKKKK 
DRG 


4671 


18572 


A 


4702 


340 


402 


PFAGGGYRLGAAPEEESAYVA 


4672 


18573 


A 


4703 


188 


2 


TSLPKCWEYRHHPPCPAHTLLI FLFFI F 
VDMGSCHVAQVALELU3SSDPPTSASQS 
AGIIGM 


4673 


18574 


A 


4704 


423 


34 


LKKKKKKKKKKKKKKKKKKKKKKKQKDL 
KKMVGGGNYKNVGGAQILGYKKLCFSFF 
YRVF 


4674 


18575 


A 


4705 


2 


297 


PR VRERE KEMAS MKEEFTRLKE ALE KS E 
ARRKELEEKMVSLLQEKNDLQLQVQAEQ 
DNLADAEERCDQLIKNKIQLEAKVKEMN 
ERLEDEEEMNRSLP 


4675 


18576 


A 


4706 


2 


179 


QENGMNPGGRACCEPRLRYCTSS WATER 
DSVSKKNEKKRSASLRRFDILFGALTGR 
KAS 


4676 


18577 


A 


4707 


432 


578 


PRIVCFQELRKPSYAEICQRTSKEPPSS 
PLQPSSSSSSSS 


4677 


18578 


A 


4708 


419 


35 


GSFFFFFLKDQPKKGRKKEGCPPPPPKK 
IIYPPPATKWGGGGGKKPPPPKKRGPPK 
KNRGRXKKPPKKKKKKFPPPPKKRAPQK 
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nucleotide 
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ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F-Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L==Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=G!utamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
^W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KKKKPPPPIFFYLYFFKNFIFFFFFFFY 
FFPFFFFKFI FNLKKF 


4678 


18579 


A 


4709 


147 


31 


NFFFFFFFFFXFXFFFFFXXXKFFLFFF 
*FFFLKLKIFF 


4679 


18580 


A 


4710 


3 


229 


HASAHAS DQPERLNAGTYFLFYTLVG SL 
PLL I AL I YTHNTLGS LNI LLLTLTAQEL 
SKKKKKGGAVLKNPWGAQS 


4680 


18581 


A 


4711 


1 


158 


PTRPPTRPRSCSELRSHHCTPAWVTERL 
CLKKKKKKFPNIiSVGFKILGILKG 


4681 


18582 


A 


4712 


163 


404 


KKKKKKKKKKKKKKKKKKKKKKKKKKKP 
KKKKNKKKKKKKKKKKKKHPSRGGAQKK 
KKGGGAQNRAGGKRRPSRGGGEKQN 


4682 


18583 


A 


4713 


3 


378 


FEEPVYI KSRQKRKESNPPKLVSSQPHG 
LHDFFKKKKKKKKKKKKKKKKDGGGPLK 
KK PGG AKN KPGG E KKNFFLKGGG K KNP W 
GFFKKKTFFGGGKIGAKPPKKKKSLKKK 
KKFLRG KGGKKT 


4683 


18584 


A 


4714 


2 


208 


VSNPAVSVPHLPFSVYKSSPMASMTFSK 
KKKKKTKKKKKKKKKKKKKKKKKKGGPQ 
KKKPWGAQNKPG 


4684 


18585 


A 


4715 


290 


56 


SAPPPIFFLFFFLFLFIFFIFFFFFKQI 
LEKKGGPPPFFLFFFFIFFFFFFYRMGC 
DRWWFILIQGTFHRKQKPVLKV 


4685 


18586 


A 


4716 


50 


424 


GGFKIKFLFTQFLFLLFSPSMFLISKSP 
AYLWQSS P KQVPMWFGKTNFLVS I E CK 
KEGIYFFCIPFVPFGFPHPKILCSLLPF 
LAFLRR S IiALL PRLE CSGAI LTHCNP CL 
PGSSNSHAWA 


4686 


18587 


A 


4717 


161 


2 


GRPGPADFRVRPQLLQRFLFIYLFTEME 
SCSVTQAGVQWCNLGSLQPLPPGLQ 


4687 


18588 


A 


4718 


2 


115 


VYTTAMAI EEYPEAWGSGVE VLVS VLVG 
LAMEVGSTRP 


4688 


18589 


A 


4719 


1 


416 


GNQGGG YGGGYDN YGGGN YGSGN PSS S S 

ssssssssssssssssssssssssssss 
sssssssssssssssssss 


4689 


18590 


A 


4720 


1 


454 


QPDSG I SS I RSFKT I LPAAAQDVYYRDE 
IGNVS TSHLL I LDD S VEME I R PR FPL FG 
GWKTHYIVGYNLPSYEYLYNLGDQYALK 
MRFVDHVFDEQVIDSLTVKI 1LPEGAKN 
IEIDSPYEISRAPDELHYTYLDTFGRPV 
IVAYKKNLVEQ 


4690 


18591 


A 


4721 


3 


173 


DAWAGFHYVGQAGLELLTS SHLPASASQ 
SAG I TDVSHRTQLDVPFSFS LS PLPHQC 


4691 


18592 


A 


4722 


2 


201 


LKPGGGGCSEPRSCHCSTPAWRQSETPS 
QKKKKGKWGLERIGWGGSTTNNGAKNQR 
GEKFGKKGGF 


4692 


18593 


A 


4723 


2 


183 


IiKPGGGGCSEPRS CHCSTPAWRQSETPS 
QKKKKGKWGLE 1 1 CWGGSPTNKRATNS P 
VEKF 


4693 


18594 


A 


4724 


1 


395 


QDATS P 1 1 EEL I TFHDHAL IIIFLI CFL 
DLHALSLTLPSALT 


4694 


18595 


A 


4725 


388 


3 


S FS PLEENLGNT I QD IGRGKDFTS KTPK 
AMATKAKID 


4695 


18596 


A 


4726 


223 


44 


WLFFFXPLFFFFFWGGIIFFLLKKLYIF 
FFFMSKIFFFFFFFFFFFFFFFFFFFFF 
FFFF 


4696 


18597 


A 


4727 


41 


374 


KKKKKGKKKKKGGGGGGNPKKKKGKMEK 
P PGGNKRGEGGKKKI^FLKKKGGGLFKGK 
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residue of 

peptide 
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Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine, 
G=G lycine, H=Histidine, I=lsoleucine, 
K=Lysine, Lr=Leucine, M=Methionine, 
N=Asparagine, P=ProHne, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KP PKKKNGGRKP W KKKNFKKKKKKAP F F 
LGPKI FKKKKKRGLKKKLGFFSQSVGP 


4697 


18598 


A 


4728 


323 


69 


FFFLRQSLGSASQAGAQ3HNLGSLQPLP 
AGFQLFLQPPPPGSKQFGRLNLLGNKDL 
WGLPVLRAHPQAPPLAFIL 


4698 


18599 


A 


4729 


3 


197 


CLALLHLSRQRQYALPLLGLPGCPHIHT 
HSHTEAHDHTHYTSPHAHRHAHTHTHMH 
TLPYTTHCI 


4699 


18600 


A 


4730 


309 


443 


VLTLSPRLECSSMNIAHYSLKLLGSSDH 
PTSASQSGGI IGMSYHA 


4700 


18601 


A 


4731 


3 


172 


DAWAFFNGPGVSLCCTGWSRTPQLKRST 
CLSLPKCWDYRPEPLSPAPKTLFSYVLL 


4701 


18602 


A 


4732 


98 


232 


QTVFLLIWLIiALCSPGVLQTVKRFLEFR 
CIFSLGIVTFICLYRN 


4702 


18603 


A 


4733 


238 


1 


LGPPPPSPLGPPPFLVIFRGFSFPKTPL 
FFSPFFFGPPPFFFFFFFFFFIRRSFAL 
VTQAGVQWRDLGSLQPLPPGFKR 


4703 


18604 


A 


4734 


121 


424 


LIQGHRWIYFSQLRKVRGRQACDSTLIj 
GHLPKWRRMFPFLS FTVAGLEPTSHYR 
MFVDWLVDQHHWRYQTGKWVQCGKAEG 
TMPGAR 


4704 


18605 


A 


4735 


319 


8 


NPPQKKKKKIFFPPKKKTPPKKI KKNPP 
PFTIFFFFFFFFFFFFFFFFFFFFFFFF 
FF YFQHVW FCFE 1 1 HVT FFLIE VLFLAF 
HVGDS IADAWSTKNSNRTK \ 


4705 


1E606 


A 


4736 


297 


442 


TQGFFGGGPRKIGFFFRGLKNGGGGNPK 
LRFLKPGQLVGRGKTKKNQG 


4706 


18607 


A 


4737 


430 


0 


CVLGSKDKKKKKKKKKKKKKKKKKKKKK 
KAKKKDS 


4707 


18608 


A 


4738 


137 


1 


INYYFFLKQGFPLLPRLECSGAI IAHCS 
LSFPGSSDPRPSASQVA 


4708 


18609 


A 


4739 


188 


3 


ISPKALWPRREVGPIFPPPKKKVPPQNP 
PGGFFTPPHKEKNFSLPPPVNLGPPKDP 
LKRPP 


4709 


18610 


A 


4740 


188 


2 


TSLPKCWDYRHEPPCPAHTLLI FLFFIF 
VDMGS CHVAQVALELLGSSDPPTSASQS 
AGIIGM 


4710 


18611 


A 


4741 


410 


1 


RRPGGGFCPPPPPKIFFSPPPPPFFVGV 
LPQTPPPPKKFFFLNPPPGFFFPPPKKK 
KI FFFSPPFFSPPP I FFFPPPPPFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
GKDRVSLCCSDWSRTPGPKPSSLL 


4711 


18612 


A 


4742 


225 


444 


ENLKLKSFPCSHFHWLRWFNLTIGALSR 
S FDEAALLFFQNDTYWPGTVAHACNPST 
LGGRGGRITKSRDRNHS 


4712 


18613 


A 


4743 


2 


452 


VSNPAVSVPHLPFSVYKSSPMASMTFSK 
KKKKKTKKKKKKKKKKKKKKKKKSGGPQ 
KKKNR 


4713 


18614 


A 


4744 


3 


144 


LAAVLLKLGGYGI IRLTLILNPLTKHIA 
YPILLLSLNPDIITGFSS 


4714 


18615 


A 


4745 


206 


2 


IQNPQTERVNSTSKLKISSKDTINRAKK 
QPMDWEVI CKKH I FGKELISRMYKELQL 
NNKKQIVPPTRP 


4715 


18616 


A 


4746 1 


523 


178 


RHRRFFMDIEREQVKEQQRQKEQKKDIE 
KIKDKREQECYAAEQRILRMNFHEDPYS 
GEKLSEILAQLQLQEIKGTREKQQREKE 
YLRYVEALRAQIQEKMQLYNITLPPLC 


4716 


18617 


A 


4747 


3 


518 


ENRLInLEGGGCGEPRSRHltTPTWAT 
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peptide 

sequence 


Predict- 
ed end 
nucle- 
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correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A~A\anine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Iso!eucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Pro!ine, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unkno\vn, *=Stop codon, /-possible 
nucleotide deletion, \=possible 
nucleotide insertion 


4717 


18618 


A 


4748 


491 


145 


LCHVGQAGLELLTSSDLPTLASRSAGIA 
GVSHRTRPHSHFYFYFLYSSSFFFFFFL 
AQRGNRAPRGKKTPFGLATLN I KSNGGV 
FWFLFLGFGFLKREPTFALGLKDRGGNL 
V 


4718 


18619 


A 


4749 


6 


362 


NKLPNGTEFLSSLVPNTFFFSFLFFPFF 
FFFFFWGKGGPPRGGGAKFYPLEFPPPG 
VPPPPKGGKKGGPPPPTNFLFFKKEGGS 
PFGPGGVKTPGPKGFPPPSPQRGGKKGQ 
WAQKPP 


4719 


18620 


A 


4750 


380 


3 


S LLLAFALLCLP WLQEAGAVQTVPLSRL 
FDHAMLQAHRAHHLAIDTYQEFEETYI P 
KDQKYS FVHDSQTS FCFSDS I PTPSNME 
EPQHKSNLELLRISL.LLIESWLEPVRFL 
RSMFANNLVYDPS 


4720 


18621 


A 


4751 




345 


LFYI FKKTWSLFLtCFLWMMCVF I FFFFL 
LFVFFLCLGGILVSFI PPGGEKREKKKR 
RGGEKKKKKKNWFKKKKKNPWGGGGGL 
KKQRARGGKKKKKKGGGGSGGGKKKQS P 
LRG 


4721 


18622 


A 


4752 


1 


407 


QEFEETYIPKDQKYSFLHDSQTSFCFSD 
S I PTPSNMEETQQKSNLELLRI SLLLI E 
SWLEPWFLRSMFANNLVYDTSDSDDYH 
LLiKDLEEGIQTLMGVRVAPGVTNPGTPL 
ASRAGGEKYCCPLFSNKARDQEN 


4722 


18623 


A 


4753 


357 


1 


PLFFFKRNLFFFFFFTKKKEKKAPVFFF 
FPPPSREWGGFFFRVGGVNTQFLHPFFF 
SQNTSVFFSTLFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFLFYCFIYFSR 
DRGLTLL 


4723 


18624 


A 


4754 




207 


LGLAGVEEEVASRLNAGGRGCSKPRSHD 
WTPAWAMEQDSMSKKTKNEYQRRQLIHL 
QSSTSGFLYVHEQ 


4724 


18625 


A 


4755 


2 


336 


HEERERERERERERERERERERERERES 
RAQFRARKNSQAPHPRVCETLWTEEGGS 
FYPLTEKDFAQCERAHFCVLGVSHPPGW 
GLCKKMNPECERATHSLHGDADISGVL 


4725 


18626 


A 


4756 


230 


446 


VGFSSLFSRYCTILISASSLTKELCEKK 
RSFLFCFAFWLLGFLRQSLALSPRLEYN 
GAI SAHCNLHLPGS SD 


4726 


18627 


A 


4757 


3 


390 


QTSFCFSDSGPTPFNMEETQQKSNLELL 
RISLLLIESWLEPVRFLRSMFANNLVYD 
TSDSDDYHLLKDLEEGIQTLMGRLEDGS 
RRTGQ I LKQT YS KFDTNSHNHDALLKNY 
GLLYCFRKDMDKVETFL 


4727 


18628 


A 


4758 


234 


387 


XiAK I FLS FS S ATGW I ENRPQS PATGRTP 
VFVSPIPPPLPPPFPFAFVT 


4728 


18629 


A 


4759 


396 


0 


I YTHTHTHTHTHTY I YMYVFMI I 


4729 


18630 


A 


4760 


176 


384 


MVIPLFNKDTQHIILFWGYVLIFFFLKR 
NFVFVAQAGGQGRYFGSLKPPPPRLKPF 
SCLSLLGTWDYRR 


4730 


18631 


A 


4761 


419 


208 


IYPPPPPWEPKGGSFLXKKKKKKKPGPG 
VPPFIPPPLGGQAGGFPNSKIQIPPAPQ 
GKPXjFFQKWQNYLA 


4731 


18632 


A 


4762 


158 


2 


GNQLNPPPPSGSAFCFLFFLRRSLALVT 
QARVQWHDLGSLQPSPSGFKRFSC 


4732 


18633 


A 


4763 


489 


398 


AGFELLTSSDPPASASQSVGITGMSHCT 
OP 
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Predict- 
ed end 
nucle- 
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last amino 
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peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=lso!eucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P^Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
\V=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop cod on, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


4733 


18634 


A 


4764 


420 


1 


FFFFFLGE I FPPGGKGGVFPPPPLKKFF 
FSPRGLFFLGGGGPNFFPPKKRVFFKKP 
RAGFFPPFGVLKRGPGPGFLTPPQKEKN 
FFFPPPGVLGPPKGFFKGGPPFFFFFFF 
FFFLGKKKKKKALFFTKKKGGGGTVRKA 


4734 


18635 


A 


4765 


167 


3 


I YFFFFLKKI FFYKKKINKLKKKKKKKK 
KKKKKKKKKKKKKRAAARDLBLAKAS 


4735 


18636 


A 


4766 


424 


222 


FQNIHKKNFPSPPLLKTGPAPLFKKPPK 
KKKKYFFPPPEKLGPPREFLKGPPPFFF 
FFFFFFFFFFF 


4736 


18637 


A 


4767 


254 


423 


V I KS WARRN KNKWD W P GAVALS CN PS TL 
GGPGQANPLRSGVQDHPGQHGETLSLLK 


4737 


18638 


A 


4768 


363 


0 


KTTTSQARGPTSVIPAAQETEAGDSSWP 
SGYGSYSSSSSSSP 


4738 


18639 


A 


4769 


51 


169 


YYTHTHPRTGFVKKKKKKKKKKKKKKKK 
KKKKKKRGGGF 


4739 


18640 


A 


4770 


188 


3 


ISPKALNPRREVGPIFPPPKKKVPPQNP 
PGG F FT PPQKE KNFTLP P P VNLGP PRD P 
LKRPP 


4740 


18641 


A 


4771 


1 


131 


KLDKENALDRAEQAEADKKAAARGKQLE 
DELGS LQKKKKGGP F 


4741 


18642 


A 


4772 


363 


452 


RLECSGMI IXHCSLNLPG S SDS PTSASQ 
LA 


4742 


18643 


A 


4773 


121 


1 


RPRRPVAQAGVQWHNLSSLQPLPPGFKQ 
FFHLSLLSSWDY 


4743 


18644 


A 


4774 


31 


226 


MISSQLS I I PTPTQEQGLKKKKKKKKKK 
KKKKKKKKKKKKKKKKKKGGGLLKKILG 
GAKFLGGEKKNFFFF 


4744 


18645 


A 


4775 


161 


1 


KPPPPFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFLFFFFLKKSWRP 
WGLKSAPAL 


4745 


18646 


A 


4776 


394 


67 


SPPRGEGGVAPPPPPKNFFFPRAVFFLG 
GGGGPMPPPQKRGFFPKTPGGVFKPPPK 
GGKFFFPPPGEGGPPPEFFKGPPPPFFF 
FFFFFFFFFFFFFFFFCFFFFFFFF 


4746 


18647 


A 


4777 


2 


409 


GVCIFSGHKKNKAPLPFSVLYSIQYINS 
LEKKKKKKKKKKKKKKKKKKKKKKKKKK 
KKT 


4747 


18648 


A 


4778 


456 


0 


FFYKKKKKKKKKKKKKKKKKKKKSSS 


4748 


18649 


A 


4779 


438 


0 


LLKIDKKKKKKKKKKKKKKKKKKRTSKR 
GG 


4749 


18650 


A 


4780 


58 


330 


SPFYQLKKKKKKKKKKKKKKKKKKKKKK 
KKKKKKKKKKK KKKK KKKKKKKKKKK KK 
KKKKKKKKKKKKKPQQKGGE KKKNPGEL 
GKKNCGGGGVLVGKPHKEKKRGGGKKKN 
FSG 


4750 


18651 


A 


4781 


435 


73 


FFPPPPRGKFFFKKPPGGKLFSPPELGV 
FFPPFPLKIFFFPSTPFFFWGGFPPFSP 
PPKKVFFPKFPPGFFFPPPLKKKFFFSP 
PPFFLPPPGFFFSPPPPFFFFFFFFFFF 
FFFFFFFSP 


4751 


18652 


A 


4782 


420 


109 


LFKKFFS PTEREGVFPPQF PKNFFFS PN 
PLFFGGGGAQNFPPQKKNFFPKNPPGFF 
FSPPKKKKKFFSPPGKILAPPGYFFKGP 
PPFFFFFFFFFFFFFFFFFF 


4752 


18653 


A 


4783 


230 


2 


LGLWGALYSPFLGGLGGRKFLLPWRPR 
VPLTPISPPPSRLGGPPKKKKSQAWWHA 
PWPTTREAEAGGSjjEPRkL 
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SEQ D> 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I^Isoleucine, 
K=Lysine, L=Leucine, MHVIethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


4753 


18654 


A 


4784 


271 


440 


VLYS KRDNNKS FKNNTYTG I GPGAVAHA 
CGPSTLGGRGGQIMRSGVRDKPYRHGET 


4754 


18655 


A 


4785 


2 


266 


KKYGII I FCSFFLSLFLKDQGLAPLPRL 
ECS WFAAHRGLGLLGWS DPP KS AFLVA 
GITGMHHHAWLI FKVQFSNLMGPWS YTW 
SAVD 


4755 


18656 


A 


4786 


216 


435 


HRS GDSS LKKKKKKKKKKKKKKKKKKKK 
KKKKKIWEFWKGPFLWGGKKRKSGGG 


4756 


18657 


A 


4787 


249 


437 


MNNEWPLKTBTSRPDWAHACNPSTLG 
GRGGQ I TRLGVQDQPCQKGEAPSIxPENR 
KYLAWSV 


4757 


18658 


A 


4788 


107 


368 


AHGSLDFIiPGGPKKSHQWFPKPKDGKPP 
PFSKKPLKKKTPGKKGGGHPTPGARSKK 
GGYPRGGDPGGPFFFKKFGQKKKTPFLL 
RRD 


4758 


18659 


A 


4789 


1 


225 


PTRPFITNNIFPASPFQTTIPLYLKLTA 
LAVTFLGLLTTLDLNYLTNKLKIKSPLC 
AKKKKKKKI KKKKKKKNEG ' 


4759 


18660 


A 


4790 


2 


216 


SGRNSRVDGRQEAACRDYQSSLEDLTFN 
SKPHINMLTILAEENLPFAKEIVSLIEA 
QTAKVFIHPAASYYF 


4760 


18661 


A 


4791 


389 


30 


EKKSPPQKKKKKFFPPPPQKKFFSPPNQ 
KTCGGGGPKKPPPKKKGSPQKNPKGVKK 
PPPKKKKKIFPPPQKKGPPPKIIjKKPPP 
PFFFFFFFFFFFFFPFFFFYFSLFFSGG 
FSYKIFL 


4761 


18662 


A 


4792 


164 


3 


LFPGFFNPGVKGEKGISFPPFFFFETES 
HSVIQARGQWRRLGSLHPLPPGFKR 


4762 


18663 


A 


4793 


550 


413 


AEMGFHHVGQAGLELLTSSDPPALASQS 
AGITGISHRAQPHRPIC 


4763 


18664 


A 


4794 


456 


352 


RSQDGLYLLTS S FACLGLP KCWDYRRE P 
PCLAVFF 


4764 


18665 


A 


4795 


242 


3 


KPFSGGGGAFQTAFFITFGKKFWSRLG 
LLKKKVFLKFLPQAFFFFFFETESHSVA 
QAGVQWRSLGSMQPSPPGFKRLS 


4765 


18666 


A 


4796 


218 


346 


GFSCFVFETVSLCRPGWSAWRSRLTAT 
SASRVQAIILPQPPE 


4766 


18667 


A 


4797 


383 


80 


KFFFNQGIGGFFPPFPPKNFFFSLKPFF 
FLGGFSPFFPPPKKIFFSKIPPGFFFSP 
PLKKKIFFFPPRLILAPPRFFFKAPPPF 
FFFFFFFFFFFFFFRPI 


4767 


18668 


A 


4798 


1 


126 


ILIKLSSTWEGIQAGKELEEQHGIHCNM 
TLLFS FAQAVACAE 


4768 


18669 


A 


4799 


81 


356 


ILFFFFFFPGGGTPKKSQVFFKNPQNPI 
GKPTRMGKGEKPPPPRGKKKKKGGPPTL 
GKNKNPVPGWPGAFLPPLEEKKKPFPGP 
PPLKKKK 


4769 


18670 


A 


4800 


1 


99 


GCSEPRSRHCTLS VJATRAKLHLI KKKKK 
KKKGA 


4770 


18671 


A 


4801 


238 


5 


NHQNSVQNYFKNLRIFQLIYSLSMNLPR 
SSFKLKSMTHSISFSFLFSFFEIQSHSV 
AQDRIKWHDLGSLQPQPPGLK 


4771 


18672 


A 


4802 


166 


62 


I FLFS FFLFLRDRVLLC YVGWS VMVQST 
QTPGLK 


4772 


18673 


A 


4803 


102 


6 


AASTTES P S VAHAGVQWCDLGS LKPGSR 
RFQ 


4773 


18674 


A 


4804 


17 


374 


FGSRAATLFFFFFFLGRGGFFLGPRGPK 
KKTPGGEGFTRFFDPGWGKGPEKKTPPP 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, LHLeucine, M=Methionine, 
N=Asparagine, P^Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /= possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














WGGKGGPKKMNLWGAGKIKFLEIKGKSF 
PRGTTPNFFGPPLFFS PPKTQGPGNFLD 
FSFKLKN 


4774 


18675 


A 


4805 


59 


175 


SFYFFLMMKKKKKKKKKKKKKKKKRGGA 
LKKKTGGGPN 


4775 


18676 


A 


4806 


406 


300 


SSGVLYSRAPTYFCGQTLTFRQVDIKTS 
HLPQEAE 


4776 


18677 


A 


4807 


123 


1 


KS IAWLGTMAHS CNPS TUGGQDGQTTRS 
GVRDQPGKHGETP 


4777 


18678 


A 


4808 


319 


406 


LYKSYSSWPNWAHTCNPNTLGGQGGQI 
I 


4778 


18679 


A 


4809 


236 


351 


GFWPAPGVHACNPS I LGGQGGRVTGPKL 
GNLLGQHGEI 


4779 


18680 


A 


4810 


173 


2 


EVLFLGFRSILISLLQMNYLLYLIAKFA 
FVAQAGVQWHNLSSLLPPPPRFKRFSCL 
R 


4780 


1S681 


A 


4811 


2 


94 


TRILKVWQKRKCS VKNGFLTISHGTVS I 
LF 


4781 


18682 


A 


4812 


358 


2 


F FFFFSETE SCS VAQAGVQWRDLGSL 


4782 


J8683 


A 


4813 


362 


3 


YSETVCIISLQQAHDSLSVINFHLAECVI 
QLPVNLRELPGAVAHSPLGSELCFSDSG 
SVPSHTRSFPKAAIRLTANHRRGCTESG 
W P FS LVFLFER CMW PGTVAHT CN P S TLG 
AQGGQIT 


4783 


18684 


A 


4814 


341 


1 


wnqkrahitksilsqknkaggimlpdfk 
lyykatvtktawywyqnrdidqwnrtep 
seimphi ynyl i fdkpdknkkwg kdsl f 
nkwcwenwiatcrkleldpfltpytkin 

S 


4784 


18685 


A 


4815 


2 


283 


VYTGEE I LQKKE I T I RDQEAP YLLRNLS 
DHTVAISSSTTLDCHANGVPEPQITWFK 
NNHKIQQEPELYTSTSPSSSSSSPLSSS 
SSSSSSSSS 


4785 


18686 


A 


4816 


266 


3 


AASTLQAHRAHQLAIDTYQEFEETYIPK 
DQKYSFLHDSQSLRQPLHTHTGSVGTRT 
IiAASLSSGRTAHTRTHMHTQHTRKRTAH 
TNA 


4786 


18687 


A 


4817 


365 


1 


SFFFFFSETESRS IGRLEFSGTISAHCN 
LRLPDS SNS P ASA 


4787 


18688 


A 


4818 


87 


372 


ASYKKDKYCMIPLIRVIKFPQTERKKWL 
SGLGGEGNKELLFNGCNMAVFLVLGGGF 
ETEPPSVTQAGEQWCDLSSLKPLPHRLE 
RVSCLSLPSGW 


4788 


18689 


A 


4819 


5 


175 


FAFLT I PLSRLFDNAMLRARRLYQLAYD 
TYQEFEEAYILKEQQDSFLQNPQTSLCF 
S 


4789 


18690 


A 


4820 


235 


351 


FFXFFFXFFXFFFFFFFFFXFFFXFXFF 
XFFXFFFFFXF 


4790 . 


18691 


A 


4821 


2 


155 


IFLLLIESWLEPVRFLKSMFANWLVYDT 
SDSDDYHLLXDLEEGIQTLMGRL 


4791 


18692 


A 


4822 


90 


2 


LFFFFFETES PS VTQAGVQWHDLGSLQP 
P 


4792 


18693 


A 


4823 


247 


3 


KVKGSPSHRRAMVAKGNLSLHERMKSPG 
ISNYMGNSKILFSYFKWCWPGTVAQAYN 
PS TLGRGG W I TRSGVRDQ PGQYS ET 


4793 


18694 


A 


4824 


93 | 


407 


QHPKKKKKWGGRGPLKNKEKVFKHKTFN 
FFSRQKKKKKKKKKKKKKKKKKKKKGGG 
GLLKICFV7GGPKFPGGKKKFFFFFGGGKK 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

etb 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G~Glycine, H=Histidine, I=Isoleucine, 
K= Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /--possible 
nucleotide deletion, \-possible 
nucleotide insertion 














NPPGVFLKKNFFWGGEKKKKK 


4794 


18695 


A 


4826 


2 


129 


TTAHCSLNLPGSSDPPTSRSLVAGTTHV 
CHNAWVNFVFCRRG 


4795 


18696 


A 


4827 


342 


140 


GGFFFFPPPPFFFKFPLFLPPPPPPGPS 
CVGGVQKFTFPPPPQLSRSLFFLFFFAP 
PPETDNPFFFS 


4796 


18697 


A 


4828 


1 


259 


GGGCS E PRSCHCTPAWVTE YDKKKKKKG 
RGKG KKK I GKGG FQRKF KG PN PHGR FLR 
ESHNSPKKGFPKKFFGNPPPGTSKKVPL 
PH 


4797 


18698 


A 


4830 


125 


1 


KGTRTFETESRSVAQAGVQWHDLGSLQP 
LPPGFRRLTCLSL 


4798 


18699 


A 


4831 


1 


323 


ARGEREREREREREREREREREREPPPL 
SRGGSESYLKSAWRERLLLERDPLFFFG 
GKMGNLFPPPPPPVGRGYTGGGLITGGP 
FWWGECVSHTHHTHTHTHTYMSV 


4799 


18700 


A 


4832 


3 


297 


TRRERERERERERERERTL 


4800 


18701 


A 


4833 


1 


150 


ARERERERERERERERERERERE KKKKN 
HSRGGGPGPQFPLGGGVKKFGGGV 


4801 


18702 


A 


4834 


1 


98 


ARGERERERERERERERERERERERERE 
RERGAGGGPT 


4802 


18703 


A 


4835 


2 


66 


HEERERERERERERERERERAAR 


4803 


18704 


A 


4836 


2 


73 


PEERERERERERERERERERETAR 


4804 


18705 


A 


4837 


146 


2 


FGLVYS I SQFCLFFVRRSLALVTQVGVQ 
WRNLGSLQALPPGFTAFSCL 


4805 


18706 


A 


4838 


354 


0 


PPPPPPSPPPPPPPPPPSSPPPPSPPSP 
SSSPP 


4806 


18707 


A 


4839 


62 


358 


GLVHDKHSPHGGCGSHHVLPSLNPSFPT 
FSTKSPSVPPARALWPPDSRWALQVGHL 
GLT P TTRQES AQ APRRKLALLEG S FQGQ 
AQWLTPVT PALWEAE 


4807 


18708 


A 


4840 


121 


13 


RPGRPPPEFTPFSCLSLPGSWDCRRPPP 
RPANLLYF 


4808 


18709 


A 


4841 


3 


22J 


liVMGFVGFSSKPSPIYGGLALIVSGWG 
CVI I LNFGGGYMGL I VFLI YLGGMMVGF 
GCILDFSLNFNVGLNF 


4809 


18710 


A 


4842 


3 


84 


CHCTPAWTANLDS I KKKKKKRKKFKN 


4810 


187)1 


A 


4843 


33 


230 


IiTVT I SKAAE VLKHKKKKKKKKKKKKKP 
KKKTPKKNPNPPPQKGELGKKIFFFFWE 
RFLPQKKRP 


4811 


18712 


A 


4844 


3 


368 


HELNPRGRSCGEPRPCHCTPAWATRAKV 
HITKKKKKKKGFNLFKPGGSLSNFFPKK 
KETPWSPQNFEKKGPRRFSPRGKNPPLA 
LRGKKKGGKSLSGKI FPPFPRGKTPPWA 
FMGTQKGEYS 


4812 


18713 


A 


4845 


163 


2 


HHGHS IS KFLLTVS S PKSFNQLR PDAVA 
HACNASTLGGRGGW I TRSGVQDQLV 


4813 


18714 


A 


4846 


387 


234 


KGCGEPRS CHCPPAWATIAKLLFKKKKT 
KPKKNPKKQEKTFPYNLNRVCY 


4814 


18715 


A 


4847 


189 


349 


KFSRFSGQKALLLKDILRLGTAAHTCNP 
STLGGQGRWIMRCQVFKTSLAKMVK 


4815 


18716 


A 


4848 


3 


95 


HENF YCPGSALGRNEGNI FPNPEATFVK 
EM 


4816 


18717 


A 


4849 


2 


325 


DSHTI IMGDFNTPLAILDRSTRQKVNED 
IQDLNSALHQADLIDIYKILHPKSTEYT 
. FFSAPHRTYSKIDHKIYSQGNHETDCRY 
GKKGGGWKDERFPDVDCEKFKSQ 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E^GIutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K-Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, V=Tyrosine, 
X=Unknown, *=Stop codon, /-possible 
nucleotide deletion, Y=possibIe 
nucleotide insertion 


4817 


18718 


A 


4850 


241 


348 


SFFSQTQASEQLTKSKQGLALRTLPPVQ 
FPPPPPRP 


48)8 


18719 


A 


485] 


2 


176 


LVETGFCHVGQAGLELLTSGECWDYRCE 
PPSPAYCFLHTIVTIYPLLCLAFFSYLC 
DC 


4819 


18720 


.A 


4852 


2 


191 


VGRVGLELLAS SDLPTSGS PS AGITGMS 
HCPWPNFLLIFIGYFFTVLTPCEYYSGG 
MTVPLWL 


4820 


18721 


A 


4853 


67 


376 


IKCTTFNIKKKKKKKKKKKKKKKKKKGG 
ALKKKPWGGQKKTGEKKKKFFSKRGGKK 
KRG PGR VLKKG E RE KI F F VINFL KKNF F W 
GGGNLGQPPPKKLRALGKK 


4821 


18722 


A 


4854 


278 


363 


IWPDWAHTCNPSTLGGQGRWTMRSGVR 


4822 


18723 


A 


4855 


3 


282 


RGCSEPRLRYCTPAWATEQDSASKKKKK 
KKTQI FWEGPGGGVSPFKPHFFKGPGGX 
KPWGQKFKNPPPQWEKPLFFFKKKKTKF 
SPGGGGPP 


4823 


18724 


A 


4857 


382 


224 


PVIQLIGGVRQEKNFNLGVKGFNEPKSP 
PCPPTWVAKQNSLSKKKKKKIGKK 


4824 


18725 


A 


4858 


392 


102 


FFFFFFLRQGLSWQARVQWCFCSNLGL 
LGSSNPPTSASLSCFLNKKEYRWENVTN 
KV 


4825 


18726 


A , 


4859 


3 


111 


FHRISQEGLILLTLGSAHLGLPECWDYR 
REP PR PA 


4826 


18727 


A 


4860 


282 


83 


AQIYKVQNWGSSGNKCICVCVCVCVCVC 
VCIYIYMLCMSRFKYKCIVGCMNIDSYV 
VYIPNIQCYI 


4827 


18728 


A 


4861 


3 


417 


NHSNLGGRGCSDPRSRHCTPAWVTERDS 
ISKKKKKKKKKIFFGLREKKLTIiFPFLG 
KKNENSGFPLNPNFFFGGKGKKATLGPL 
GLYLRGVPFFRGGETQNGENQKPGTTLE 
KFEKHFWVGPPPLEKISPHKGKQTQK 


4828 


18729 


A 


4862 


1 


88 


MEEMYKKAHAAI LENPVYEKKPKKKKKK 
K 


4829 


18730 


A 


4863 


274 


369 


VSLCCAGWRAAVRSQITAALISRAQAIL 
PPQP 


4830 


18731 


A 


4864 


37 


444 


DPRVRSEKKI PFKI IiliLIDNAPSHPKAL 
MEIYEEINVIFMPANTTSILQPMDQGVI 
STFKSYYLRNTFHKALAAMDSDVSDGSG 
QSKLKTFWKGFTILDAIKNIRDSWEEVK 
LSTLTGVWKKLI PTLIDDYEGFKT 


4831 


18732 


A 


4865 


341 


210 


VYMCHVGLLYHQPVIYIRKRIEASEITP 
HIYNYLIFDKPSHQF 


4832 


18733 


A 


4866 


206 


412 


QLCLSQACTLARGNAKGFLKY I HRNNVS 
MPSVAGHTWGPDQLVKGQGETCPATIKF 
SDSPSLKYTMYPM 


4833 


18734 


A 


4867 


248 


78 


SHFEAGLAAQLTVSWFPQGTPVFVHAGP 
FANIAHGNSSIIADRIALKLVGPEGFVG 


4834 


18735 


A 


4868 


637 


2 


EFDGRWPRQIVSSIGLCRYGGRIDCCWG 
WARQSWGQCQPVCQPRCKKGECIGPNKC 
KCHPGYAGKTCNQDEHIPAPLDQGSEQP 
LFQPLDHQATSLPSRDLNECGLKPRPCK 
HRCMNTYGSYKCYCLNGYKLMPDGSCSS 
ALTCSMANCQYGCDWKGQIRCQCPSPG 
LQLAPDGRTCVDVDECATGRASCPRFRQ' 
CVNTFGS Y I CKCHKG 


4835 


18736 


A 


4869 


276 


426 


VSFFLFFFFWGANPPFVPQAGGQGGFLG 
S LNP L PPGLKH F FCLTPPS SGN 



844 



WO 01/64835 



PCT/US01/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 
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ing to first 
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Predict- 
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sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R^Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop cod on, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


4836 


18737 


A 


4870 


100 


485 


EERNREMTERCSVTQAGVQWHDLGSLQP 
SPPGFKRFSCLSLPSSWDYRLECDGMNL 
AHGKLRLPGS S ESPASASRVAGI TGMCH 
HAPLARLVSNTLPQVILPPLGLQQAEEL 
PRRQEVEHNTPSLVPI 


4837 


18738 


A 


4871 


718 


408 


EVLTLQLAHFPGFLGAHWWNQQDAALGR 
ATDSKEPPEELCPDVLYRTARTLHGQET 
YTPRL I LMDLH YDVSLGALVLTAAGYRA 
SSCSQSVAVCTGSVLEARA 


4838 


18739 


A 


4872 


392 


246 


RGGCS ELRS CHCTPAWATRV ICbHLKKKE 
KRKCKIRPEGNEILVNNAKG 


4839 


18740 


A 


4873 


1 


156 


GGGGYSEPRSCHSTPAWTTRAKXiRIiLKK 
KKKKKGGVCFGVGFLGFRPKGGFF 


4840 


18741 


A 


4874 


246 


2 


AHG I S I YASMLAMS I FSRRJSFVS FRRAGQ 
VKAVFPVSVVFGPLHRHHLGEHFLFFLR 
QS FTL VAQAGVQWCHLGS PQPLPPA 


4841 


18742 


A 


4875 


72 


2 


CFPAAPDEDSTTNITKKQKWTVE 


4842 


18743 


A 


4876 


375 


221 


GRLRQENRLNPAGGGCGEPRSHYCTPAW 
VTEQDS ISNI CI Y INIFS WHGQ 


4843 


18744 


A 


4877 


151 


2 


RAPFFFFFFFFVAQAGVQWRVIiGSLQAP 
PPGFTPFSCLSLPSSWDYRRP 


4844 


18745 


A 


4878 


401 


70 


PHRREGGCVPPPPPKNFFFSPGGYFFWG 
GGGQNPPPPKGGFFPKTPPGFFFSPPQK 
KKIFFFPPPERGPPPGFFLRPPPPFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFF 


4845 


18746 


A 


4879 


4 


152 


LPYSTPSTPFRAKKKKKKKKKKKKKKKK 
KKKKKKKKKKKKKKKKKKKKKKRGGNFK 
KR 


4846 


18747 


A 


4880 


267 


1 


TPPKKKKKKI I PPPKNGPPPHI FKKTPP 
PLFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFF FFRTRGRTRGRTRGRT 
RGR 


4847 


18748 


A 


4881 


2 


408 


LQDATSPIIEELITFHDHALIIIFLICF 
L VL Y AL F LTLT T KKKKKKKKK KKKKKKK 
KKKKKKKKKKKKKKKKGGGGPKKKNRGG 
PKKLGGEKKIFFFFKGGKKKTRGGVGKK 
KI FLGGGN FGPP P PKKKKPRGK I 


4848 


18749 


A 


4882 


413 


69 


LPPPPPGFKHFWAPPPPGRGVPGPPPPP 
RVNFVFLGKKGVSPFGPGGPRPLFPNPG 
GVGAGDPLDPRGQGFHGPKFMPCPPARG 
TKREPPFPKKKKKEKEEKKEPERKEKKK 
EI 


4849 


18750 


A 


4883 


46 


394 


KQYQVSVKLLFVTQNLGTKKKKKKKKKK 
KKKKKGGALKKKPGGGQKKPGGKKKNFF 
LKRGGKKKPPGIFEKKTLFGGGENWGTP 
PQKKKPPGKKKKILKGEGGKKTLYFGRG 
KNFS 


4850 


18751 


A 


4884 


399 


3 


FFFKKFFSPNEFWFFFPPFPLKIFFFPP 
RLFFFWGGLiAQFSPPPKKVFFSKFPRGF 
FFPPPLGKNFFFFPPGYFWPPPGFFLRA 
PPLFFFFFFFFFFFFFFFSVS IRNAIAI 
RMGTMRSRRLAMGMLLRRG 


4851 


18752 

i 


A 


4885 


391 


47 


PHFFWGGVFPPPPPKKKKFFFLPQKGGG 
GGAPKKKKKKKKFLPPNFSGGFFFFSPP 
GGPPRFFFF.TFFFFFFFFFFFFFFFFFF 
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FFFFFFFFFFFLS KMFTF I LKYTLKQNE 
FY 


4852 


18753 


A 


4886 


32 


379 


LLFVKKKKEKKGEKKKKKRKKKKKKKKK 
KKQKKKGGGPSIKI IGGGHLSSGGRKNN 
SFSHRGGLKKPLFFLGRTYFLGGGGLWP 
HPPPKKIAGGAKNI ILRGWGEQKPLLFP 
PGEK 


4853 


18754 


A 


4887 


178 


33 


I FKRKWFFFRFFFFFFLYYFFFKFFFF 
FFFFFFFFFFFNICHHPKLY 


4854 


18755 


A 


4888 


238 


3 


KPFNPFKKKKIGPQKKKKKFPPFLKINP 
PKYKTQNINIYKYIKIFFFFFFVRGSDS 
VAEAEVQWHHLGSLKPASPGFK 


4855 


18756 


A 


4889 


93 


1 


DGILLLSPRLKCSGTISAHCNLCPPGSS 
NSR 


4856 


18757 


A 


4890 


6 


219 


IEKAHKALGTVPGSTFVRYHHYHHHNHY 
HHHCHHHHPHQDNSEWEKAQSLEPDKLF 
LNPGFATSQLRGLR 


4857 


18758 


A 


4891 


1 


364 


TSSRSRAGRPMDFGFFFKKKGHREPPGG 
VEKKKKNWGSRKTPPSGVKNFPGFDPPK 
MWNKGGPPTPPVNFFGFKKKRGFHKGLK 
PPPPEIAPLNPQKGGEKKGGPPPPPTPF 
LGGKFQGKR 


4858 


18759 


A 


4892 


387 


81 


GVFSPPRGGPPPGGKTPPFFPRGGPPCG 
PGGRGPIFPPLSHKKFFFSPPPPLLGGG 
GGPKSAPPPKGVFKEFPENFFWGAFIKK 
SFFLKKGGFFGPPRGFF 


4859 


18760 


A 


4893 


213 


14 


KTPPPFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFMLE VMFLVNRRGKI CRVPFTFFNL 
SL 


4860 


18761 


A 


4894 


384 


10 


PLPPFIPPPPPPPPLFGGGVCPYFSPPP 
HLGSPSPPPGGRPSPPPPPPRSTPPPPP 
PPFFPPPPPFVSGAPPPPPRSCLAPPFF 
FFFPPPFFFFFFFFFFFFLPFFFLKKSD 
FIFSLQMKYFMLI 


4861 


18762 


A 


4895 


240 . 


355 


LI ITPALFFKIFVLWLGAVAHACNPSTL 
GGQGGQIMRS 


4862 


18763 


A 


4896 


339 


3 


GVGWHLFFPPQKKFFFFFQVTGASCAAK 
SSKESHKGIiHLNIYLPKIFFYRPPPPFF 
FFFFFFFFFFFLFFFFFFFFFFFFFFFF 
FFFLWREGYNQHFRGMGPIGRVGGRV 


4863 


18764 


A 


4897 


56 


335 


TCFLYKLKKKKKKKKKKKKKKKKKKKKK 
KKRGGPLKKNLGGPKKNRGGKKKIFFFK 
GGEKKNLWGFFEKKPFFGGGKICPTPPQ 
KKKPPGEKX • 


4864 


18765 


A 


4898 


7 


206 


LCLQPDNRWNDDQSKSFLIEKKKKKKKK 
KKKKKKKKKKKKKGGGPLKKNPGGAPNN 
PGGEKKNSPP 


4865 


18766 


A 


4899 


74 


250 


INI CS EKLP KKKKKKKKKKKKKKKKKKK 
KKKGKKI KRGLN S QTS LLKKRE KKKKRG 
KI 


4866 


18767 


A 


4900 


151 


312 


KYHFKKNYFFSLLYFSQOTSQLSPDGPL 
PQLPLPYINSSATRVFFGHDRRPADG 


4867 


18768 


A 


4901 


1 


364 


LNLGGRGCSEPRLRYCTPAWATEQDSAS 
QKKKKKKNPNF 


4868 


18769 


A 


4902 


1 


96 


GGGGCGELKSCHCTPAWVAEQDSVSKKK 
KKRG 


4869 


18770 


A 


4903 


3 


225 


G FKR VG QDG FDL P TS W TARLGLP RGWDW 
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RREPPRPAVWAGLQLLTSSDPPASDSQR 
AGITHLSHLTLFCHLRF 


4870 


18771 


A 


4904 


384 


194 


HWVGRPFFNLWGQKNPPPPPPKRLEYQG 
WGPPPPPPFFFFKKTPIFSQVFNPPPFF 
FFFFFFG 


4871 


18772 


A 


4905 


3 


387 


TIITPILLTLFLITQLKILNTNYHLPLP 
AQYPLPQKKKKKKKKKKKKK 


4872 


18773 


A 


4906 


1 


327 


PTRPIEPIPENPKFSVPPITPHPKKKKK 
KKKKKKKKKKKKKKKKKKKKKKGGALKK 
KTWGGPKKRGGGKKI FFFLRGGKKKPRG 
EFLKKTSFWGGENLATTPPKKKKARKKK 
KNF 


4873 


18774 


A 


4907 


134 


2 


ALYSLKVKWPGAVAHACNPNTLGGQGEW 
IRRSGVRDQPGQHSET 


4874 


18775 


A 


4908 


59 


449 


ATRHYTI MLAYT S VG I F I TAWLGKQAI 
KGARFQVRRTTENYDKQQLIHKGGGRPP 
GCHSTHTVLLPPVTWVKTTGNSTVANFV 
LILVCLSFLLLVYRCIQQLQRDSNQREG 
AMMMVWLSKRKGGYAGK 


4875 


18776 


A 


4909 


337 


3 


LKTAWATRGKP PLYKKNTQ I SGARGEGP 
PIPLIGGGLSQKNFFTFGGENSINPDPP 
PSPPPGAKKETPPPGKKKKKKPTTKELG 
KDWTLELWBSDSQRAGKLAEVPGESSP 


4876 


18777 


A 


4910 


14 


162 


AESGAKRETAFPFAPQQGATDKRLN 1 1 Y 
AAKKHMKTCLPS LAIREMQ I K 


4877 


18778 


A 


4911 


331 


2 


LPLLAPKEGQGKILGYKKPPPPGLPPFF 
GSTPPRNGEKGGPPPPRGNFWVFKKNGG 
FPRLGGRGPFPGPQKPPPRPPQKGGKQG 
GNPAAGPFFFFFETEFKSCCPGWSA 


4878 


18779 


A 


4912 


116 


316 


ACAHAHTLFFFFFLKKKPPFVPLLGGKG 
GNLG YLKLW LPGL KKVCLT P P RTGN YGG 
APPHRIIFCFL 


4879 


18780 


A 


4913 


93 


16 


SPPGWPGTVAHACHPSTLGGRGGRIT 


4880 


18781 


A 


4914 


316 


2 


KPRPGNPLRARVFNPPGPPGETPFFPKN 
PKFTRGGRGGP I YNPFPGR VRPKNFLYP 
RGSRFHWPNSIPCPFAWGTKPNFFSKKK 
KKKRKKKEKLCFSASVPDAQ 


4881 


18782 


A 


4915 


3 


289 


TSCNPSTLGGRGGR ITRVGVRGQPDQHG 
ETPSLLKNY I Y I Y I CVDVY LY VCMY VGA 
Y I YI C I CG Y I Y I WRYLS LTS EHTHTRAP 
GQYRVIYFFCG 


4882 


18783 


A 


4916 


1 


222 


ARGERERERERERERERESRHQGGGAGP 
PTHIiWGEEGWLS S PQTGVRE KNAP I WG 
CAAPVLAPRVGRTPFGVD 


4883 


18784 


A 


4918 


1 


281 


ARGERERERERERERERERERERERERE 
RERERERERGAPPPPPRGGVRSRGTPCP 
SARGARALSPTISCGDPHRGPKIYRGVG 
VHTQRYFSVGSSLCINTHT 


4884 


18785 


A 


4919 


1 


259 


ARGERERERERERERERERERVFVEKGD 
MVTLPAGIYHRFGGGEKKYTKAVRVCVR 
KFGWRAHNR PADH FGARG PHVKFLAQT V 
YE 


4885 


18786 


A 


4920 


797 


910 


NTMQPGWTHTCNPSTLGGQGGRITGSG 
VQDQPGQHGE 


4886 


18787 


A 


4921 


2 


67 


LEERERERERERERERERERERSS 


4887 


18788 


A 


4922 


1 


698 


TLLLAELGTI CDP YRSCS I SEDSGLSTA 
FTIAHELGHVFNMPHDDNNFJCKKRGGPF 
KG PNFS P P VRA S KN FFFG P PNLNS WARF 



847 



WO 01/64835 



PCT/US01/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ 10 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/535,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
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LTPGEGKTPGVPPFNPFATKAEL 


4888 


18789 


A 


4923 


323 


2 


LVCRGT I HKFRCVPHLTGRRFEHGVTDC 
YTLFRDAYHLAG I EMPE FHREDDWWRNG 
QNLYLDNLEATGLYQVPLSAAQPGDVLL 
CCFGSSVPNEAAIYCGDGELLHH 


4889 


18790 . 


A 


4924 


3 


225 


HEEALHLFQTLMECMKRKELITVFHIGS 
DEHQDIDVAILTALLKGKPLRTFLFVRP 
FI LCMYMCVYVC ICVCI 


4890 


18791 


A 


4925 


371 


169 


HTQS IS VVLVERRFHHSGQENPLNPGGR 
GCGELRLCHCTPAWATRAKSRLKKTKQT 
KKDAI KCSLCN 


4891 


18792 


A 


4926 


2 


352 


ARAARAGRI IKELFFFFFFGNLKKKKGL 
FWWPGGVKNPDLGNPPPWPPKGGGITGL 
TPRPGPMGVIFKKFGKKGSPSPTGPKSL 
GPREFFGLALQRGGDLGLNHRGPPFFFG 
VLETM 


4892 


18793 


A 


4927 


2 


379 


ARANMEETQQKSNLELVRISLLLIKTGL 
EPVRILRSMFTNNLVYDTSDSDDYHLLK 
DLQEG I QTLMGRLENGRRRTGQ I LTQTY 
SKFDTNSDNHDALLKNYRLLYCFTKDMD 
KVE T FLRMEQCRS V 


4893 


18794 


A 


4928 


175 


31 


I LGD L F P PAWLH P FLFLLL P F S RPS LAV 
TEVAVQWRNLGLKQPLPPGF 


4894 


18795 


A 


4929 


368 


247 


VDRLFPCCPGWSPSLELNQSACLSFLKC 
WDYRCEPLCSVS 


4895 


18796 


A 


4930 


2 


110 


ARGEPRSHHCTPAWAMSETVSKKKKSGG 
LFFLPRLV 


4896 


18797 


A 


4931 


150 


1 


KYVAPCRPLFLVLSPCRRSCFPFPFCHD 
CKF PE S FSEATMF FLQPAE PRA 


4897 


18798 


A 


4932 


2 


155 


ARDDLNPGGGCSELRSRHCTPAWATERD 
SIS KKKKNPQN I WGNLKKKWNLK 


4898 


18799 


A 


4933 


144 


249 


ETLYPQPPQSLNASCVCVCVCVCVCVCV 
CLCVCV 


4899 


18800 


A 


4934 


2 


153 


ARESFVEKGFRHVGQVGLKLLASGNPPN 
LATQSAGI IS I SHCTQPWRQGL 


4900 


18801 


A 


4935 


1 


355 


GTSQE FEGRKDRLAS PLETGNAGAGRER 
LAEVSTCPSGAS KPLQTPRPGGQEGTPG 
LGGRRMRHSAGPSPNPAWESIRDPSLTS 
TALLVACS IFEIHEDPILKDVLISRCMS 
IKTSHR 


4901 


18802 


A 


4936 


112 


358 


NIIKFSKAFFFSFYIFILINSVGLASTF 
MWGGELKFSFTSG I LI I FQPFKI KGHPQ 
RG I LRTGTFKEDPDDPEGNKVSSLLQ 


4902 


18803 


A 


4937 


1 


258 


GTSPYVTQQFFSGWGRQGLTLSPKLEKS 
ELRLCHCTPS WVTERDS I S KKKKKKLF Y 
LAGPTFFPFGGGNFNLALGRLLKRGGGE 
KK 


4903 


18804 


A 


4938 


380 


1 


FF FFFS ETEP YSVAQAGVQWC 


4904 


18805 


A 


4940 


255 


1 


LGNNLAGYKIRGLYS XSLRNNRHPSPVI 
SLCGNLFYVIRFFDMESCSVTQAGVLWC 
NIGSSQALTPGIAPFSFISLPSTRQNSR 
P 


4905 


18806 


A 


4941 


10 


173 


KYIEYDTNKWKHNSCSQIGRINFVKMSL 
PPKAIYTFSVFPIKIPSVYLPVHLSI 


4906 


18807 


A 


4942 


144 


3 


DKC I RRPGTVTHTCNS STLGDQGGRVMR 
SGVRDQQGQHSETPSVRV 


4907 


18808 


A 


4944 


484 


274 


PSSQRSPRTARMIALSTRPTTRSMKFCH 
VAQAGLXLLGS SDLPASAS QNVG ISGVS 
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HHTWPRFYYLS 


4908 


18809 


A 


4945 


103 


412 


KKKKKKKKKKKKKKKKKKKKGGGPLKKN 
PGGAKKKRGRKKKIFFFKRGEKKNPGGI 
FEKKPFFGGGKNWAPPPKKNKPFGSKKK 
FLRGKGGKNPPIPPGKKKL 


4909 


18810 


A 


4946 


309 


81 


KFFFFLKGLIFLGGFCPIFPPPKKIFFF 
KIPPGVFFSPPFKKKNFFFFPPLIFGPP 
RVFFKGPPPFFFFFFFFFF 


4910 


18811 


A 


4947 


38 


264 


NYTQRFKVEYLNNIINQLDLTDIYRILY 
PTVEYTFFKNPHGTLCRTDHMLGHNTGL 
NKFKRTECTQSMSSDHNGIKYHSAP 


4911 


18812 


A 


4948 


418 


293 


QLLRRVRQENCFNPGGGGCSEPRSPPCP 
PTWVTKQVFFSKP 


4912 


18813 


A 


4949 


3 


252 

-* 


DSAKHLGLKHWYCGLENVKRLTDGKLE 
VPHFDS KGEVEE YFWS IGI PWTS VRVAA 
YFENPLAAWRPVKASDGDYYTLGKTK 


4913 


18814 


A 


4950 


434 


270 


KRGFPPKTPPGFKKPPPKKKKKI FPPPP 
KIGPPQKILKKPPPPFFFFFFFFFFF 


4914 


18815 


A 


4951 


399 


1 


VGFSLFPPPQKKGFSPKPPRGFLFPPPK 
GKKKIFPPPGKIGPPQGFFKRPPPLFFF 
FFFFFFFFFFFFFFGPP3VTLLFLVTIL 
LQIQTTALGRKSWGKGFRIPKPVCPVPA 
QQTKQGAKHLLGQWE INSCA 


4915 


18816 


A 


4952 


235 


56 


FTWPPPKKKKKNPPGGGGPVPPPWGGK 
NKKIFLPPKFKGFFSFGAPLPFPPGQKK 
KPL 


4916 


18817 


A 


4953 


13 


194 


S I LI SLQ I SLI I TFTATEU 1 FY I FFET 
TLI PTLAI ITKKKKKKKKKKKKKKKKKG 
GGAF 


4917 


18818 


A 


4954 


177 


1 


PPQKKKKNI FSPPGKIGPPQGFFKRGPP 
LFFFFFFFFFFFFFFFFRFEGECSDLDG 
RV 


4918 


18819 


A 


4955 


450 


0 


LPNKADKKKKKKKKKKKKKKKKKKAR 


4919 


18820 


A 


4956 


29 


158 


AM I VPL YS S LGD F FKKKE KKKKKKKKKK 
KKKKKKKKKKGGGGF 


4920 


18S21 


A 


4957 


397 


0 


PSSPPPPSPPPSPPPPPPPPPPPPPPPS 
SPPPAPPPPPPPSPPP 


4921 


18822 


A 


4958 


257 


397 


F FFLS FT F VAQAGXQGXX Y GS LD P P PPG 
FNLFSCPSLPRIWNYTAPP 


4922 


18823 


A 


4959 


419 


266 


RFHHDGQDGLELLTSSDLPASASQSAGI 
IGVSHRAQS I KSLAPQYW YFPS F 


4923 


18824 


A 


4960 


300 


190 


FFFFFFFFFIVAFFFFFLFCFLFFFFFF 
FFXIFNNIF | 


4924 


18S25 


A 


4961 


3 


179 


DAWVLPFI IAALAALHLLFLHETGSNNP 
LGITSHSDKITFHPYYTIKDALGLLLFL 
LS 


4925 


18826 


A 


4962 


2 


182 


RVNAKDS KWLTPLKRAVAS CSEEAVQVL 
liKHSADVNARDKNWQTPLHIAAANKAVK 
CAKL 


4926 


18827 


A 


4963 


273 


440 


KIHIQHCWWEYKLLQSLWKAAWHFLKEL 
KAELPFNPAIPLLDIYPEEYKSVYLKDT 


4927 


18828 


A 


4964 


179 


3 


SLPFCCNLTPKKKLFFKFCRKNGFFFFK 
MESHSVTQAGVQWCNLNSLQPCPHGLKQ 
FS 


4928 


18829 


A 


4965 


268 


364 


TVCVIELLTSGDRPASASQSARITGLSH 
HAQP 


4929 


18830 


A 


4966 


374 


2 


QNFFLKKKKKKRVGPGVPPFNPPPFWGQ 
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G=Glycine, H=Histidine, Wsoleucine, 
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GGGFPRSILGNPIKPLFFLNSPKISPPLf 
WPGPFFPLPWGVWAGKNPLPRGGGFPLT 
KFPPLPSSLGKKKKFRFKKKKKKKGQAR 
WLTPAI PTPREA 


4930 


18831 


A 


4967 


79 


346 


DKI FSNISCGI VI DLLAI YNWDAYATRL 
GIYKHWDFI I IDKDTSSCRLSFSSYPRF 
LES LDDF Y ILS SGL I LLQTTNS VFNKTL 
LKQVI 


4931 


18832 


A 


4968 


3 


93 


FLHVGQAGLGLPTSGDPPASASQSVGLQ 
A 


4932 


18833 


A 


4969 


158 


1 


SKSNFAYI FPI KDNGLTGKKT INRWTK 
NWANDLNRHFSKVNIQMAKRYMKK 


4933 


18834 


A 


4970 


2 


266 


VGQAGVELLSS SDPPASAYEGAGI PGVS 
HHTWPKHFFPALFVACISSLVNCLFQLF 
AC I S I GLS FF F F F FREGDLGNLLKI AGL 
TFLG 


4934 


18835 


A 


4971 


3 


96 


GVRHHAHLI FVFLVETGFHHVGQDGLNF 
L 


4935 


18836 


A 


4972 


253 


348 


NFIFLLFIYFEMESHSVAEAGLQWHCLS 
SLQP 


4936 


18837 


A 


4973 


326 


2 


PMEIPQHKLSLWPGFAISVSYFERKLLF 
SADVSYKVLRNETVLEFMTALCQRTGLS 
CFTQTCEKQLIGLI VLTRYNNRTYS IDD 
IDWSVKPTHTFLKRDGTE I TYVDY 


4937 


18838 


A 


4974 


3 


244 


DAS LVFKVAETANEEEVKKMCM YKYPGN 
KCGREGGREKKREEERREGRKEGREGLR 
EGETDEEREEGEEGLSYS PFKKfS I 


4938 


18839 


A 


4975 


6 


79 


THFSLTITSLQPEDIGTYYCQQYD 


4939 


18840 


A 


4976 


15 


339 


PGEAGNCLNPGGGS CSEPRSRHCTPAWA 
TERNSVSKKKKKKKKKKKI PRGRGLPPV 
SHPFWKAGGADWFDLGTLEPPWPTGETP 
VFKKKKKLTGGGGAPLVAP TWGGG 


4940 


18841 


A 


4977 


204 


275 


I IEVDPDTKEMLKLLDFGSLSNLQ 


4941 


18842 


A 


4978 


3 


352 


RRPS PHGLVGAVS VGGAGVMAVETLS PD 
WEFDRVDDGSQKIHAEVQLKNYGKFLEE 
YTSQLRR I EDALDDS IGDAWDSNLEPMA 
WRRL P YEQ SS VLE L I KTENKVLNR\A7TG 
YAGL 


4942 


18843 


A 


4979 


219 


3 


AASTMAMS FEW PV7QYRFPP FFTLQPNVD 
TRQKQLAAWCSLVLS FSRLHKQSSMTLM 
KSRLTSLS ITVMLDC 


4943 


18844 


A 


4980 


142 


3 


FQGFFFLRDKVLLCCPGWSRTPELKRSS 
CLGLPKCWDYRRELPRLA 


4944 


18845 


A 


4981 


351 


2 


FFFFFSETESRSVAQAGVQWRDLD 


4945 


18846 


A 


4982 


133 


3 


SFFFFLFSTECCSVPQAGVQVJCDLHPLQ 
PLPPGFRRFSCLSLP 


4946 


18847 


A 


4983 


3 


161 


LGSLHDTANTLWPRLERSRTIMAHYSLD 
LPGSSDPPTSASHVRGTTGMRSTRP 


4947 


18848 


A 


4984 


363 


3 


GPSNKKRGRSKKAHVLAASVEQATENFL 
EKGDKIAKESQFLKEELVAAVEDVRKQG 
DLMKAAAGSFADDPCSSVKRGNMVRAAR 
ALLSAVTRLLI LADMADVYKLIjVQLKVV 
EDGILKL 


4948 


18849 


A 


4985 


101 




LQVCYGRRWMNSREYGAWKQQVESKKM 
PFQDA 


4949 


18850 


A 


4986 


341 


1 


P P PMS P P D VLKTS P VADAAG WVDVDKE T 
LQHQRYPNVFGIGDCTNLHTSKTAAAVA 
AQSGILDRPISVIMKNQrPSKKYDGYTS 
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SEQ 10 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
Inst amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, f-Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
VV=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














CPLVTGYNRVILAEFDYKAEPLETFPFD 
Q 


4950 


18851 


A 


4987 


86 


1 


KCSGTI SAHCNVRLL.GSSNSGPSASRAI 


4951 


18852 


A 


4988 


124 


3 


LPWLCSKFETKTFQFQIYYKAKGVKTVW 
YSHKDRNIDQWN 


4952 


18853 


A 


4989 


305 


150 


WLLNYRYSHSAEVQVCFQFLTWVFSTLG 
S PDHFS KFFNQYFFKI CFDYWKN 


4953 


18854 


A 


4990 


64 


340 


KKKKKKNLLKFPGQREKPLSPNFLGGLG 
GGKNGPPGGKPSRDPDWPPGIKPGEKKK 
NPLKKKKKKKKKPLSKGGGGKKTPLKKK 
KKPPPKGF 


4954 


18855 


A 


4991 


3 


158 


PSLVQTRLRHAGQAGLKLLTSTDSPASA 
SQSAGTTGVSHHAMPKDCSLNSN 


4955 


18856 


A 


4992 


116 


66 


FFFFFFFFFFFFFFFXE 


4956 


18857 


A 


4993 


3 


107 


GQAQWLTP VI AALWEAKVGGS FEVRSLR 
PAWPTQ 


4957 


18858 


A 


4994 


1 


105 


KLDRLARHGLYEKKKTSRKQRKERKNRM 
KKVRGTW 


4958 


18859 


A 


4995 


110 


3 


RRIGRYKTVFLCTQLEILMARYPLPPSP 
KPIKIKN 


4959 


18860 


A 


4996 


109 


3 


DEVSILCPRLECNGTILAHCNRRLPGSS 
DSPALAP 


4960 


18861 


A 


4997 


1 


148 


ACCP F CT I YLL PM FM 1 I KAPLMGTSKKK 
KKKKKKKKKKKKKKKKKKKKKKKKKKKK 
K 


4961 


18862 


A 


4998 


3 


110 


VI LQGSNDVELVAEGNSRFT YTVLVDGC 
SKKTSTRP 


4962 


18863 


A 


4999 


3 


330 


P I ADRGAEYVSARE WMTICFEIiLQLLKA 
HKKAI RRATVNTFGY I AQAIG PHDVIjAT 
LLNNI>KGHERQNRVCTTVAI AI GAETCS 
PFTALPALMNEYRVPELNVQNGVLK 


4963 


18864 


A 


5000 


3 


110 


VI LQGSNDVELVAEGNSRFTYTVLVDGC 
SKKTSTRP 


4964 


18865 


A 


5001 


25 


329 


NS RRRRNDQGS PNLCQTFTFMTLP YLPE 
HRS LLLKI RS CAERE TKKKDD I P EEDKG 
NIKQCEINYVKKFQSFQDHKLKISKEES 
KILKKAQKDGFLHETLLN 


4965 


18S66 


A 


5002 


278 


345 


EDEEGYNDGEVDDEEDEEELGG 


4966 


18867 


A 


5003 


101 


3 


VRI I ISGTGKKKKKKGKLPKN YDPKVTP 
DPER 


4967 


18868 


A 


5004 


18 


350 


VSHECLIFHIQNVLTGLVISLSCPSVPS 
HTHKHTHIHTHTKTHTHTHTFS 


4968 


18869 


A 


5005 


26 


147 


KEEVRMALFANDMIVYLENPFVSAPYVL 
KLI SCFCKGRLG 


4969 


18870 


A 


5006 


85 


1 


SASRVAGI IGKHHHTRLIFVFLVETGQS 


4970 


18871 


A 


5007 


370 


3 


RAR KPLL W VLKKI WS P WI YKKKTR INFQ 
KEKKSPFFSRPFFLFKKGGFLPPGVFFS 
RGFLKKIPPFFFFFFETESRSVTQFGVW 
SVQWCNLGLLQPEPPGSKQLLCRGLPSS 
WDYRCLSTRP 


4971 


18872 


A 


5008 


1 


85 


I RI LSKI KNALTHFLPQGTPTPL IPILF 


4972 


18873 


A 


5009 


218 


93 


FFFFFFFFYFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFF 


4973 


18874 


A 


5010 


3 


250 


RPRRRRLRQENHLNLGGEGCSESRSRHC 
TLTWATEQDSVSKKKKKRVFPPPHPRGK 
FFWAGGPTFFPPLKTVPFPGGGKPNF 


4974 


18875 


A 


5011 


336 


0 


SSSSPPSSSSSPPPPPPPPPPPPPPP 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^^Phenylalanine, 
G=Glycine, H=Histidine, I=lsoIeucine, 
K=Lysine, L=Leucine, MHVIcthionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine. S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


4975 


18876 


A 


5012 


2 


80 


SNQNGTGGES I YGEKFEDENFHYKVK 


4976 


18877 


A 


5013 


53 


324 


FLVFFVPENSEKISLQLHLALTSNSSWV 
QSPSHLELMNLCRHINIRVDPTGLREGM 
HYTEVCGYDIASPNARSLFRGPITAVIA 
AQVNES 


4977 


18878 


A 


5014 


329 


1 


FFFFFSETESRSVAQAGVQWRYLGSLQA 
PPPGFM 


4978 


18879 


A 


5015 


193 


32 


PLLILCSERLCHCTPAWAI ERDSVSKTN 
KTKQQQQQQQNQGTKCKFLALLGQ I 


4979 


18880 


A 


5016 


354 


235 


CGEIGMLLHCWWECKL.VRPL.WTNLFLRD 
GSTLTVGSKSSF 


4980 


18881 


A 


5017 


348 


2 


RLLVGKLMELHGEG S S S G KATGDE 


4981 


18882 


A 


5018 


76 


1 


IFFLSQVSPLSKEDAGEYECHASNS 


4982 


18883 


A 


5019 


68 


356 


YFGGVGGFFFFFFFFFWFGPSGGVFFVG 
VQAI FFFFFFFLGGFFFLVRD YFFVAVF 
WWGLLFFFVFFFWFCFVFFFFFGWKRK 
NIFYGWVFLFCW 


4983 


18884 


A 


5020 


1 


288 


FFLNL E KN I KMI>S S YTDNG I WATAE D F 
MQNFKNLVGYHNS ITEENLPQLGANENL 
ESQSGNFS WF I FFNADRKRGMVLLLPN 
NEMTILXPRTSV 


4984 


18885 


A 


5021 


133 


1 


WATETMWSTKPKMWPGTGAHTCNPSTL 
GGQGGQITSPQKFKTS 


4985 


18886 


A 


5022 


2 


337 


RRSDPNFKNRLRERRKKQKLAKERAGLS 
KLPDLKDAEAAQKFFLEKIQLGEKLLAQ 
GEYEKGVDHLTNAIAVCGQPQQILHVLQ 
QTLPPPVFQMLLTKLPTISQRIVSAHSL 


4986 


18887 


A 


5023 


1 


325 


VDG C PANIiLS SHRS L VLRAET I S LGEHP 
CDRGEQVTLFLFNDCLEIARKRRKVIGT 
FRSPHGQTRPPASLKHIHLMPL»SQIKKA 
LDIIETEDCHNAFALLVRPPTEQA 


4987 


18888 


A 


5024 


135 


2 


ATMFLNSKVSKYSGYLLGFHECREKGWM 
TWDGERDPSPGILQLQ 


4988 


18889 


A 


5025 


2 


94 


KTATKL IGGXH YDSXNIKAI RDGLLARR 
HAL 


4989 


18890 


A 


5026 


2 


362 


QELERSMAQRCVCVLALVAMLIjLVFPTV 
SRSMGPRSGEHQRASRIPSQFSKEERVA 
MKEALKVFPTWSTS FIQHE WSEYSHIi 
FTIQGSDPSLQPYLLMAHFDVVPAPEEG 
WEVPPFSG 


4990 


18891 


A 


5027 


222 


84 


AASTFCHVSQAGLELLTSDDPPASASQS 
AGITGMSHRTQPMVHLY 


4991 


18892 


A ■ 


5028 


366 


0 


SGRSKKKKKKKKKKKKKKKKKKKKKKKK 
KKKKKKRGGR 


4992 


18893 


A 


5029 


231 


68 


AGMGSRALPKPLLMSHSSLKAVELPDSF 
SPELRSLLEGLLQRDVNRRLGCLGRG 


4993 


18894 


A 


5030 


2 


366 


LNLGGRGCSELRSCHCTPAWVTETPFKK 
KKKVPHGGYFVAQLLPPRKMRTHRRPQA 
QGERTGPGWGAPVHTGLLPRRGQAFTQP 
P S AFKG F FHPTNKS S S EAKKKKKK PRVG 
RKKNVGYIL 


4994 


18S95 


A 


5031 


241 


36S 


QVERNFKSQSGAEAHICDLSTLGGCGGQ 
I TRSGVHDQPCQHG 


4995 


18896 


A 


5032 


279 


396 


DGGMWPGTVAHACNPGTLGGQGRWITRS 
GVRDQPGQHGE 


4996 


18897 


A 


5033 


173 


398 


SNESSLRVNPFFFFFFLKTNFSFCPPGG 
RQGPNFSLLDPPPPKVKEIFCLTPPKRW 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=G Jycine, H=Histidine, I-Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P^Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X-Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














EYRPAPPPPRNFGFLIKTG 


4997 


18898 


A 


5034 


2 


143 


SNF PALTSQSMG I TS MTRHALP I ACFLV 
FVFLFFEMESHSLAQAEVQ 


4998 


18899 


A 


5035 


350 


2 


QEISSVQTSTQLFNGMTVKARATTREVM 
ATYTIEDIVIELIIQLPSNYPLGSIIVE 
SGKRVG VAVRLFI F I TQKS FI FLFS FLT 
LCLCLQHFHNDFLLLIVPILIAMAFLMIi 
TERK 


4999 


18900 


A 


5036 


1 


100 


HECGSSSQRTLSVQEAAAYLKVSNEIRI 
LIAIF 


5000 


18901 


A 


5037 


247 


334 


LQIDI SAWAYT IAVKEDGELNLMKKAA 
S 


5001 


18902 


A 


5038 


2 


296 


DKAPMLKVIVNSLKNMINTFVPSGKVMQ 
WDEKLPGLLGNFPGPFEEEMKGIAAVT 
DIPLGKVHLEALKKKVIKFFYKFPLRCD 
I HTAKCTG LR YTAQ 


5002 


18903 


A 


5039 


152 


335 


RPIWSYYLLFFFLFWGLYLDVNKRIFTF 
ILSEKYFDMKKNQCKEGLDIYKKFLTRM 
TRISE 


5003 


18904 


A 


5040 


336 


1 


GGLTISSLLKEKEGSEVAKFTLEQLCLI 
CNIMSTAEYCVAATQQLEEKLKEKVDVS 
LFDRINLSGDMGTFSPVISSSIHLLVQD 
VDAACD PAMVAMS KMQ CQNVQHVG VKSS 


5004 


18905 


A 


5041 


3 


204 


LNLGGGGCSEPRS CHCTPSWVTARLHLK 
KKKKKKRGKNPLKNGGKENFKFLQILVW 
PKNSLENLAV 


5005 


18906 


A 


5042 


382 


148 


WCNHRGPRSRKKRSEGSTKSRRLGATIR 
MVTPHTTRTCARDPELTSKEKCVYIEEH 
THTHTH I Y I HTHTHTC I YVKTH 


5006 


18907 


A 


5043 


137 


1 


RPRRRYMKKFSTSLIIKKMKIKITMKYH 
L.SHL I P VKMAITKKTXD 


5007 


18908 


A 


5044 


198 


34 


KPAS R F CHVGQ ADLiE LLTS SDL PAS AS Q 
SAG I TGVSHCSQPNFI TLCLWTDHFF 


5008 


18909 


A 


5045 


11 


357 


LLTYFIIMFKILEIYEKEEQLIIKQISE 
GQEKVKELRQFKEHRKAKDSALQS I ES K 
MLELTNRLRESQEE I Q I iMRKKKRKNAAR 
GPYWKFKYSQRWKSPPGPFQWGWGPRKN 
FFV 


5009 


18910 - 


A 


5046 


2 


193 


GLQLLTSSDPPASASQSAGIAGMSHCTQ 
PQVHLMPSLYHFRFLQVDTKDLLRASAD 
LIHRGIT 


5010 


18911 


A 


5047 


1 


196 


KLCLVMNSPMKAAPRNFSCAPSLSLLPF 
SFRKTTTTLTSNTADDFAYFDLCKMEPC 
^MVLFFVFC 


5011 


18912 


A 


5048 


386 


276 


AQAVLELLDSSDLPPSASQSAGISGVSH 
CTQPDSIF 


5012 


18913 


A 


5049 


1 


324 


VDAAAE KLEASTGWLMRFKE RS CLHM I K 
MHGEATVADTEAAAG YP EDLAK I TDKGV 
YTQQQILNGDEIAFCWQKIPCRI FLARE 
QAVPGCNASKARiyTVLLWANAAGD 


5013 


18914 


A 


5050 


1 


129 


PRSCKCFPAWVTEQDSV3QKKKKKKKKE 
KEKKKNPVEKSAKVI 


5014 | 


18915 


A 


5051 


193 


356 


rsfipsasasastlclkcsqngqpgava 
hacnpstlggqsrqltrsgvqdqsgqy 


5015 


18916 


A 


5052 


192 


341 


akrvkrnnffffflkqfcsvpqaggrgp 
dpgslkplppglkgfscptpln 


5016 


18917 


A 


5053 


340 


1 


KR I PDKPQKELRRLATKLI REAPEKGQA 
QGKEIIJKS IQEAKGE I FKAIDRI KKSQF 
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SEQ ID 
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nucleotide 
sequence 


SEQ ID 
NO: of 
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sequence 


M 
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od 
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26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=Gtycine r H=Histidine, I=Iso leu cine, 
K=Lysine, L=Leucine, M=IYIethionine> 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /= possible 
nucleotide deletion, \=possible 
nucleotide insertion 














DRQEALDILLVMQNALESLSNRIEQVEE 
RNSELEDKVFDLTRSSKAKGKRIRNYDH 
S 


5017 


18918 


A 


5054 


2 


383 


WKVATQPADNPLDVLSRKLHLGPNVGR 
DVPRLSLPGKLVFPSSTGSHFFMLGIGD 
IVMPGLLLCFVLRYDNYKKQASGDSCGA 
PGPANISGRMQKVSYFHCTL1GYFVGLL 
TATVASRIHRAAQPC 


5018 


18919 


A 


5055 


2 


383 


GS VLS KKGD YLKYH YNASLLDGTIiLDS T 
WNLGKTYNIVLGSGQWLGMDMGLKEMC 
VGEKRTVI IPPHLGYGEAGVDGEVPGSA 
VLMFDIELLELVAGLPEGYMFITOGEVS 
PNLFEEIDKDGNGEV 


5019 


18920 


A 


5056 


2 


383 


AVIDEVRTGTYRQLFHPEQL ITGKEDAA 
NNYARGHYTIGKENIDLVLDRIRXLADQ 
CTGLQGFLVFHSFGGGTGSGFTSLLMER 
LSVDYGKKS KLEFS I YPAPQV FTAWE P 
YNF I LTTHTTLEHSD 


5020 


18921 


A 


5057 


26 


452 

• 


KLMSLRLRQAWHEAAI DE FRTGTYRQLF 
HPEQIilTGQEDAANNYARGHYTIGKENI 
DLGVDR I RKLADQSTGLQGFLVFHS FGG 
GTCSGFTSLLMERLSVDYGKKSKLEFFI 
YPAPHVFTAWEPYNFILTTHTTIiEHSD 
CA 


5021 


18922 


A 


5058 


2 


385 


AVIDEVRTGTYRQLFHPEQLITGKEDAA 
NOTARGHYTIGKE I IDLVLDRIPKLADQ 
CTGLQGFLVFHS FGGGTGSGFTS LLMER 
LSVDYGKKSKLEFYIYPEPHVCTAWEP 
YNF I LTTHTNLEH S D C 


5022 


18923 


A 


5059 


3 


390 


GDAANNYARGHYT I GKEI I DLVLDRI RK 
LADQCTG LQGFLVFHS FGGGTGSGFTSL 
LiMERLSVDYGKKSKLEFSIYPAPQVSTA 
WEPYtfSILTTHTTLEHSDCAFMVDNEA 
IYDICRRNLDIERPTYT 


5023 


18924 


A 


5060 


103 


3 


KIFFFLRWSFALITQAGVQWRGLGSLQP 
LP RAT 


5024 


18925 


A 


5061 


262 


412 


KAS P FRTGTAFGNG KT S D YLLLGNFG YT 
FGGITGCLKAGLETSYWTWFTH 


5025 


18926 


A 


5062 


293 


406 


VIIGSIFEVIWAVIKPGTSFGISVLRAL 
RLLRIFKVTK 


5026 


J 8927 


A 


5063 


419 


15 


WEEEGPLPKKKKGGFSNKRGNI IMGPFL 
KRPPEKKPPPPPRKINPFFFFFKRKGPP 
PPRVKGKGAIKGSRNPPLFGSRDFFSPA 
PQKSGAPGGPPPPPVNKNQFFFFFFLVE 
TGFHHVTQAGPELLSSSSPPTI 


5027 


18928 


A 


5064 


2 


216 


GLTNLFIFCRDRILLCCPGWSQTPSLKQ 
SSHLSLPKPWDYMYEPLCLYFLYPWFCL 
SLSLS FPLSHTYFFG 


5028 ( 


18929 


A 


5065 


259 


36 


SQPHWDYRASRAEEIATFLEVWLQAIFV 
CCLFLRQSPFIAQAGVQWHYLKSLQPPP 
PGLKQFSCFSLLLVPRLA 


5029 


18930 


A 


5066 


2 


404 


GKGAPTTSLI SVAVTKI IAKVLEDNKLP 
GAICSLTCGGANIGTARWDERVNLLSF 
TGSTQVGKQVGLMVQERFGRSLLELGGN 
NAI IAFEDADLSLI VPSALFAAVGTAGQ 
RCTTARRLFIHESIHDEWNRL 


5030 


18931 


A 


5067 


1 


400 


GENM I TGTSQADCAVLNVAAGVGEFEAG 
ISKNGRTREHALLAYTLGVKQLIPGVNK 
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Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutaraic Acid, F-Pbenylalanine, 
G=G!ycine, H=Histidine, I=lsoleuctne, 
K= Lysine, L=Leucine, M=Methionme, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V= Valine, 
\y=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














MDSTEPPYSQKRYEEIVKEVSTYIKKIG 
YDPDTIAFVPI SGWNGDNMLEPRANI PL 
VTGWKATPIDCDASGTTLLDA 


5031 


18932 


A 


5068 


41 


392 


GSPHHPCAHIERKKKPYNSNIGFYTKRN 
ALRVAEVWMDD YKS HVY I AWNLPLEN PG 
IDIGDVSERRALRKRLKCKNFQWYLDHV 
YPEMRRYNNTVAYGELRNNKAKDVCLDQ 
GPLEN 


5032 


18933 


A 


5069 


3 


395 


GTPTRPHILLQALFIRAMLPCPDYDTDT 
KTGLDQALKI CQAMLDEAANQG WLVT VL 
NNTNLIQMAIQGRWVKDSSLLTLPYIEN 
HHUILFKKRKP I P4KGPHAKS RTS I ECLP 
ELIHACGGKDHVFRSLVQQ 


5033 


18934 


A 


5070 


3 


405 


PRASEVCGFSCHITCVNKAPTTCPVPPE 
QTKGPLGIDPQKGIGTAYEGHVRIPKPA 
GVKKGWQRALAIVCDFKLFLYDIAEGKA 
SQPSWISQVIDMRDEEFSVSSVLASDV 
IHASRKDIPCIFRVTASQLSAS 


5034 


18935 


A 


5071 


3 


393 


ITRQEFIDGILASNFPTTTIEMTVGADI 
FDREGDGYIDYYEFGAALHPNKDAYRPT 
SDAPKTFHQGTRQVAQC I CAKRFLVEH I 
GENKYRFFLCNHFGDSHQMRLVRILLST 
VMVLDGGGWKAliDQFLTT 


.5035 


18936 


A 


5072 


3 


394 


I TRHE FI DG I LAS KFPTTK I KMTVGAD I 
FDRDGDGYIDYYEFVAALHPNKDAYRPT 
SDAYKIEDEVTRQVAQCKCAKRLLAEHI 
GENKYRFFLGNQFGDSHQLRLVRILRST 
VMVPVGGGWMALDE FLVND 


5036 


18937 


A 


5073 


1 


393 


GEDAANNYSRGH YT I GKE 1 1 DLGLDR I C 
KLADQCTGLQGFLVFHRFGGGTGSGVTS 
LLMEHLSPDYGPKSKLEFSIYPAPQVFT 
AVHEPYNSMLTTHTTLEHSDCAFMGDNE 
AIYDI CRTNLDI ERTTYTN 


5037 


18938 


A 


5074 


39 


482 • 


LGLHSAWRDDKIGYNPDTVSFVPISGWN 
GDNMLEPSANMPWFKGWKVTRNDGMASG 
TTLLEALDCI LPPTRATDKPLRLPLQDV 
YKIGGIGTVPVGRVETGVLKPGMVATFA 
PVNDTTEVKSVEMHHEALSEALPGDNVG 
FNVKNVSV 


5038 


18939 


A 


5075 


1 


390 


GVSMAVSLVIIYQYWRNMPDPHNLPIV 
AGWKKYPLFFGTAVFAFEGI RWLPLEN 
QMKES KRFPQALN I GMGIGTTLYVTLAT 
LGYMCFHDEIKGS ITLNLPQDVWYQS V 
KI L YS FG I FVTYS I QFYV 


5039 


18940 


A 


5076 


330 


462 


VNFFFREGGTESPSIAQARVPWCSLNSL 
QPLPPGFKRFSCLSLP 


5040 


18941 


A 


5077 


1 


403 


GGWGVEAEDFEYAPDVEPLEPTLSNI IE 
QRSLKWI FLGGKGGVGKTTCSCSLAVQL 
S FCGRESVLI I FTDPAHNISHAFDQKVS K 
VPTKVKGYDNLFAME IDPSLGVAELPDK 
FFEEDNMLSMGKKMMQEAKSAF 


5041 


18942 


A 


5078 


348 


1 


RPQGNNIERWQGGEVLI IQLLRIAKSVK 
NVQLLWKTQFLI KLHI LLFYDPRI PFVS 
NYPKEKKTYVHKKTYAQIFI SLPRTFLF 
LRQSLNMLPRQDLS S 


5042 


18943 j 


A 


5079 


248 


3 


PGQVLEREHTCRRPWAWNVC I CLAGPA 
KPNREDVSSSQEESLQLNSIPPTPTLTS 
TAVKSRQPLWGLKEMEEEDGSELDF 
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nucleotide 
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ed end 
nucle- 
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correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E-Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoteucine, 
K=Lysine, L= Leu cine. M=Methionine, 
N=Asparagine, PHProline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Vah"ne, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, Y=possib!e 
nucleotide insertion 


5043 


18944 


A 


5080 


3 


192 


GDGSELRSCHCTPAWLTERDTSSQKKKK 
KEKKTWNPAPELFFFLLLEKKGQNQGKP 
NMKKVL 


5044 


18945 


A 


5081 


10 


171 


CFCLGPVGVCRSLLKPQLCFLGFPWLQD 
ARQKFRSVLVEATVKLDELVTRSTRP 


5045 


18946 


A 


5082 


94 


1 


DFWPGKVAHACNPSTLGGRGGWITRSGV 
QDT 


5046 


18947 


A 


5083 


142 


3 


AASTGSHS VAQAG VQWEDLSSLQP LP PG 
FKRFSFLSLPSSWGRKIA 


5047 


18948 


A 


5084 


257 


177 


ISLGEVAHACNPSTLGGRGGQITRPG 


5048 


18949 


A 


5085 


143 


1 


GGESHSDTQAGVQWCYLGSTQPPLTLTF 
KQFSCLNLPSSWNYRCVPP 


5049 


18950 


A 


5086 


3 


87 


HVGQAGLVLLASCDPPSLASQSDGITV 


5050 


18951 


A 


5087 


375 


220 


LHSLSQHIATRS KKPAFNLFAFGTLSPV 
STNWVGFNP I S FEGLMLI I FFFFF 


5051 


18952 


A 


5088 


40 


203 


IFLPLLLVSFFFYILTWGFTFFWGLTLI 
NDLLNFFFGNSGI S FS WFGSRAGELG 


5052 


18953 


A 


5089 


1 


145 


MDMRVPAQLLGLLLLWLSGARCDIQMTQ 
SPSS LS AS VGDREDGD VDAA 


5053 


18954 


A 


5090 


146 


328 


FMPFPLLESLEIFSSKLFNPPNLFFFFE 
TESCSVAQDGVQWYDLGSLQPPSPGFKR 
LSCLS 


5054 


18955 


A 


5091 


338 


1 


PNLPSVQIiPPTTSCTPLIGDLIiVARAHF 
AGWKHQGINSPSLPMAPREGMEAPRNPF 
HPHIFLCLIHGFHLAVSKLYSLFYYYYY 
YFSRWSFTLVAQAGVPWRDLGSLQPLPP 


5055 


18956 


A 


5092 


2 


327 


PPLTPPIFPSSPKNKNPPPPHYNPPPPP 
FRTPPPPPYPPLSPPKSPPPPPRVDPPL 
PYS I FPRPNLIS PPPYSPFYLLPQLQAF 
P 


5056 


18957 


A 


5093 


133 


3 


AQTCTPSTQ INS KW I TDLNVKGKT I KLL 
KDNIREKLDDLGCGE 


5057 


18958 


A 


5094 


170 


350 


aggqggnfyslqpnppwiresshstlpk 
fwnyrhtptgppnfgffflkmgfglvaq 
tifn 


5058 


18959 


A 


5095 


133 


274 


RDI I IFLESGI KGYFL.FFSDKITSELVS 
KIGDKNWKIRKEGLDEVAG 


5059 


18960 


A 


5096 


2 


188 


REMQ I KTTVR YHLTTARMVL I KKSEKNR 
CWHGCSEKGTLLHCWWECKLLQPLWKIG 
WRVDAA 


5060 


18961 


A 


5097 


235 


81 


FPKKKKAQ I KKPQAAQSLNLSLSLY I YA 
FTYTHTHRHTHTQTHTHTYIHIS 


5061 


18962 


A 


5098 


355 


169 


KI LLL I TDSCAH I S CRFSECMKYFNFFF 
HPHSVYVTDDKIRLLEBQLPHVFSNKME 
PFKVCN 


5062 


18963 


A 


5099 


160 


1 


FSASYFNSKKKLIFLLLKLRWNFFVCFF 
GMESHSVAQAGVQWCHLGSLQPLPS 


5063 


18964 


A 


5100 


364 


151 


QMGFCHIGQADLELLTSSDPSASASQSA 
GITGVSHYASQEFLKKEFHSAHLI PLQG 
TCV QGKTAQP YSEAIi 


5064 


18965 


A 


5101 


205 


3 


AASPGVQWPDSRLLQPRPFGVKQFFPLS 
FPS S Vf GYRGAAP P PGHFFFFFPSRDE I S 
VCLPGWSPTPE 


5065 


18966 


A 


5102 


349 


207 


DGASLCCPDWGLELLSSSGLPALALQNP 
EITGVGHHTQPWTLSLLIY 


5066 


18967 


A 


5103 


425 


0 


CEFFKNKDFCFLSKKKKKKKKKKKKKKK 
KKKQSSSQTSYS 
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beginning 

nucleotide 
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ing to first 
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peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E~G\utamic Acid, F=Pheny)alanine, 
G=G!ycine, H=Histidine, I=Iso!eucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


5067 


18968 


A 


5104 


437 


184 


LFLFFFPPPKKVFSQKTPRSVYSPPPKK 
KKKNS PPPLKLGPPQE I FKTPP PP I FFF 
FFFMFFFSFFFFFKGHRPRFISLWSPPL 


5068 


18969 


A 


5105 


401 


10 


MFS P P FGGPRGGVPLR PGFKNP PGPKGK 
TPFFLKNQNYLGPPLYS PFLERLNKKKA 
FTLEAKGPNNPNSSRSPPPGAQKKNPPF 
KKKKKKKRKI IFMPQAHPAVYWCSGNQ 
LPITFPYSIISGPLSSFT 


5069 


18970 


A 


5106 


413 


2 


PNKKPSPPPFSPPHKTPSSPSPPPPPWK 
PTPTPPPPNKKNFPPPQPPPKKKNPPQT 
PPPQKKTPPPPKKKKHPPPPHKKTPPPP 
HTKNPPPPPPLPPSLSIPPLFPLYINSP 
FFI PFFFFFFFKI FFYLGRVGGRV 


5070 


18971 


A 


5107 


269 


115 


GFFFFFFFSFFFFFFFFFFFFFFFFFFF 
FFFFFFIKKCHFIXFKYIGSTSN 


5071 


18972 


A 


5108 


238 


4 


QWLPLVDRIWVILLFFDSLNLKMFPQYQ 
IGLQKI KVKNNTTFYFLFS FFLRWS FTL 
VTQAGVQWCDLGPLQPPPPRFK 


5072 


18973 


A 


5109 


496 


0 


STPSRASPKKKKKKKKKKKKKKKKKKKK 
KKKKKKKKKSSK 


5073 


18974 


A 


5110 


1 


243 


DHLSLGGRGCSEQRSCHCTPAWVTERDS 
VSKKKKSFRALDI FFFNGKKI LGTLRGK 
NQKPPREVSSGNYFHFKGLFGPCYK 


5074 


18975 


A 


5111 


78 


4 


LGMVAHACNPSTLGGRGGRITRSGV 


5075 


18976 


A 


5112 


2 


96 


AGVQWRDLGSLQPPPPGFEQFSCLSTMP 
NFL 


5076 


18977 


A 


5113 


128 


6 


IYKDVLEPGVLRLLDVDNR WLPIEAP I 
RIIIPSQDVLHS 


5077 


18978 


A 


5114 


132 


2 


DDLSSLQPLPPRFKRFCCLSLLDSWDYR 
RPPLRPAI QEAEAAE 


5078 


18979 


A 


5115 


1 


146 


AVETGFCHVAKAALELLHSNDPLASASQ 
SAG I TGMS HRAW P S L FS I LS 


5079 


18980 


A 


5116 


213 


3 


GAVGFSSGCFLLYLYALLGPFPPLPLLQ 
LKF I CWPGAVAHVCNPSTLGGRGGR I TR 
SGVRDQPDQHDET 


5080 


18981 


A 


5117 


407 


0 


LGWAWWFTPVI PTLREVKAGGSPEVRS 


5081 


18982 


A 


5118 


215 


414 


KIRPKE IRNNGGSQQRKRAQLGKLEQTN 
PVLNFKAPFFGEQGDYYKSFFKTCLDNL 
PRRGKGVFFF 


5082 


18983 


A 


5119 


40 


326 


KKKKKKKKKKKKFKKTGGAKWAP FLNLK 
KKGIKPQKKKGFFQGKKKRGKKKKIPKI 
LIQGFFQKKKNFSKKKISQKKKKNGFWP 
LGKGFCPKKKT 


5083 


18984 


A 


5120 


99 


2 


GRSTRHSPAHTHTHTLRHTGTLLHTHSH 
RHSP 


5084 


18985 


A 


5121 


3 


219 


ELRLCHCTPTWATRVRLSQKKKREKKKK 
NFVGKKKGKKGKKPPNPKKKGSGLGFKG 
FLFS KFFKTRGTT 1 1 


5085 


18986 


A 


5122 


262 


351 


TRPGTLAHACNPSTLGGRGGWIMRSGVQ. 
DQ 


5086 


18987 


A 


5123 


340 


0 


KKKKI SQAWWWVP WPATWEAEVG 


5087 


18988 


A 


5124 


94 


2 


LMPI IPAIiWEAEVGGAPGQEFQTSLANM 
VK 


5088 


18989 


A 


5125 


310 


68 


LSPFYKLGFWGRVIYPPPPPKVSPPVFG 
APPPPEKKKKPPPPPPKSSIKKPTEKKI 
CLAASGXGLEVCFCKKPLIQSYFL 


5089 


18990 


A 


5126 


343 


131 


SRPRRPGLELLTSDDLPAPATQSAGITG 
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E=Glutamic Acid, F=Phenylalanine, I 
G=Glycine, HHHistidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
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W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














VSNHTQIWPISVGDLMNTGWIPENRIV 
SELLREJ.Hr LRICL 


5090 


18991 


A 


5127 


2 


145 


FSLLGSSVGRLEGTGTISTHCTLPASRD 
S PASNY RLTP P CPADFL YF 


5091 


18992 


A 


5128 


1 


130 


SRSVTRLECSGTISAHCNLPGSSDSPAS 
DYRLTPPCPADFLYF 


5092 


18993 


A 


5129 


1 


338 


WRKEGIRFEKEKSKDFKNKVI KYLETL 
LYSQQQVCKLWVTFI 


5093 


18994 


A 


5130 


346 

• 


3 


DFFFFHP I LFFFFFKKTLFFS P PEGNGG 
HTLFLKNTPPRGRI FFFLPPLLLWGGWA 
P PPPPKKNFFFFFFFFFFFFFFFFFFFF 
FFFFFFFMPPTQSLFKDQEGPVQGGEEA 
PV 


5094 


18995 


A 


5131 


3 


114 


FLHIGQAGLELLTSDDPPASASQSAGIT 
GMSHEAQP 


5095 


18996 


A 


5132 


292 


180 


AASTDSIEGHGASLPSKKTPSEEDFETI 
XLISNGAYG 


5096 


18997 


A 


5133 


338 


216 


RGENRLPPGGRGFSGPKSHFCPPAWATE 
RDSLS KKKKNPV 


5097 


18998 


A 


5134 


337 


3 


IWLGNGKGIIRNNQIFSNKEAGIYILY 
HGNP WSGNH I FKGRAAG I AVNENGKGL 
I TENV I R ENQWGG VD I RRGG I P VLRSTL 
I CFGYSDGWVGDEGKGLIEGNP I SAN 


5098 


18999 


A 


5135 


227 


343 


MYVKEKLI PTWNWMVS IMDSTEAQLPYG 
SALTSWDPGQ 


5099 


19000 


A 


5136 


166 


375 


ATFVSLGVFCSAVI LLYFKNMMKLDLSL 
TPHTTINSKWIKRLHTRPEAIKFLEVNI 
GKKFFDIGHEIIL 


5100 


19001 


A 


5137 


90 


2 


KTTLWPGAVAHACNPS ILGGRGGRITRS 
G 


5101 


19002 


A 


5138 


213 


379 


PTFQSVGETGSLLKMHILGPGAVAHACN 
PSTLGGRGGRITRSGVQDQPGQYGETP 


5102 


19003 


A 


5139 


384 


2 


FFFFFFLSVMESCSATQAGVQWCDLSS 


5103 


19004 


A 


5140 


41 


317 


TMS RDRPS DKTWT YNRSNWMPDDGAP F 
RYSFSALKDRHNAVEVNWIDPNWGWETA 
TELVEDTQAIARYGRNVTKMDAFGCTSR 
GQAHRAGL 


5104 


19005 


A 


5141 


29 


523 


VKAEAAY KKADD I WNLRKDD YFVNDEAR 
AR YWDDRE KARLALE AARKKAEQQTQQD 
KNAQQQSDTEASRLKYTEEAQKAYERLQ 
TPLEKYTARQEELNKALKDGKI LQADYN 
TLMAAAXKD YEATLKKP KQS SVKVSAGD 
RQEDSAHAALLTLQAELRTLEKHAG 


5105 


19006 


A 


5142 


432 


144 


GGFLPKAFLPPKKKKGAFFPPPGKGAIP 
PLFWGGWTVFLI PEKNPPRGFPPPKRGE 
KPVGGPFFRIFAPPPQKKIGGGGFFLER 
FFFWDWGKTGP 


5106 


19007 


A 


5143 


403 


6 


FFFFKKQIGPRGGGPPQKSPPLGGGGRG 
GSPRPGVLTPLGPQGKNLFFLKKKKKLG 
GGGGPRNPNFLGGGPGKSLYPGGQRFQG 
PKILPFFPPRGKKKKIFFPKKKKKKSHR 
KL I QTLI LQTGKLMKMLWRT 


5107 


19008 


A 


5144 


115 


427 


KKKKKKKKKKKKKKKGGGPKKKKNFSPG 

GGKKKKKCFEKNPFFGGGKKKKKKCS SS 
YPCPKKQ 


5108 


19009 


A 


5145 


2 


79 


GCSEPRSCHCTPAWVTEGDS ISKKK 


5109 


19010 


A 


5146 


317 


444 


I HQ P VCVAHA YNF S TLG G R GGH I TRS GD 
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nucleotide insertion 














RDHPGQHGETPYPL 


5110 


19011 


A 


5147 


3 


413 


S FYRGF I P VLQTVTVLI IGD PS LHGDAW 
SWMEFFLTVIFIALWMLPLIGLTIWNA 
IWFQDIADLAFEVSWTKPHPFPSVSQI I 
ADMLFPLLLQALFLI QGIAVSLFPIHLV 
GQLDSLLHMTL.LYSLYCSEYRLVIN 


5111 


19012 


A 


5148 


409 


247 


QTKFRHVAQAGPHFLGSSDVPTSASQSV 
G I TGMS LHAQHQHFFTASMGRAFGEN 


5112 


I90I3 


A 


5149 


284 


1 


AEASRAHMEAHAR7QGTPGSKAEGVGPV 
EE YR I WRRPGPDRAHASDSEESGEPGGP 
P PH PAN FVFLVETGF FRVGQAGLE FPTS 
GDPAASASPV 


5113 


19014 


A 


5150 


439 


120 


j? GSPRLECSGVISAHCNLRLLGSSDSPA 
SASHVAG I TAR PNAR I I KAFGAP S TCTV 
VWEKHEQQTCIQRSDLPAGRQKKTGPKN 
R PGLVAFPFNAS PLiGGPGGR I A 


5114 


19015 


A 


5351 


415 


0 


FFFFFFFFFFFFLPFVQSFYIYIYMYI 


5115 


19016 


A 


5152 


67 


179 


I CLSLFLTVHVCVCVCVCVCVC VCVCEM 
VFGVSIPCN 


5116 


19017 


A 


5153 


280 


2 


KTPFFLNLKEKTRAPFFLKKTFFKKKPF 
KIFFSKKRNFFFFPPPGKRPPTLFiCKKF 
F FFFQ KKKN F FF S P KKKKKKKKKKGRR S 
RSRTSPRV 


5117 


19018 


A 


5154 


296 


406 


DR VLLCH S S CS E VAE S RLTAAS T S WAQ A 
ILLPQPPEW 


5118 


19019 


A 


5155 


114 


402 


VWR I KAS VE KKKKKKKKKKQ KKKKKKKR 


5H9 


19020 


A 


5156 


110 


1 


QRFLLTLKS L I LRLGAVAHACNPS TLGG 
QGGRIMRS 


5120 


19021 


A 


5157 


382 


95 


SSSSPHFLTPSQLRLFFPFFPLKIFFFP 
KAFNFCGGVFPIFSPPKKKFFFQNSQVG 
FXNFPQKEKNFSFPPPVKFGPPKGFFKR 
PPPFFFFFFFFF 


5121 


19022 


A 


5158 


2 


350 


TLQPGRQS STP FQKKKKKKKKKKKKKKK 
KKKKKKRGGGFLKNLGGGPKIWGVEKFF 
FFFFGGGYXKPLGFFLEKIFFLGGGIWA 
PPPPKKxS * wGKKKFi? WGGGGKTPLFFC 
GGFCK 


5122 


19023 


A 


5159 


1 


208 


PTRTLYLVKLGLMGPAEIFLSSFQEIRL 
I D YD KMVDH RGVR Vv AFC^ WAG V AG KY A 
RAY I TH FLNWLGE 


5123 


19024 


A 


5160 


1 


208 


PTRPLYLLKLSLMGPVEIFLSSFQEIRL, 
IDYEKMVDHRGVRWAFGQWAGVAGKYA 
RAY I TH FLN w LGE 


5124 


39025 


A 


5161 


419 


25 


AQKKKKKKXKKKKKKGGGPLKKNFGGGK 
NKRGKKKKNFFLKRGGKKKTGGIXiEKKN 
FFGGGKKGEKPPQKKKALKEKKKFLRGK 
GGKKPLNWGGKKKW 


5125 


19026 


A 


5162 


475 


150 


KPRTSGTARVPGDPPPRCLDDKRLPSRP 
VI FVFLAETECDRVGQDGLQLLTSGDHP 
HLTFPNCYNYRREPPRPTSSGFQHDIET 
LHYRRGKADRFKAHLPK 


5126 


19027 


A 


5163 


237 


409 


FHNQKTYLDSALCTLVYFWFVLRDGI F 
LIiPQLECSGT VMAHCNLKLLGSR S P PAL 


5127 


39028 


A 


5164 


373 


476 


LGFILLSFLETRSHSVAQAEVQWCDHGL 
LQPQTP 


5128 


19029 


A 


5165 


374 


132 


I FYS P FL YYHMLGKYHKI KNLK I KHMWL 
DAVAHACNPSTLGGRGGRITRLGDRDHP 
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G=Glycine, H=Histidine, ]=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
l — I nreonine. V=Vahne y 
w— i ryptopnan, Y— lyrosine, 
X=Un known, *=Stop cod on, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














VjUHDb rPbLiIjFLWTr ASSLSGFPIj 


5129 


19030 


A 


5166 


332 


2 


RPQGLANCFPGALP PTKS EGF PGQS PGF 

QI WGGKAFFLRGFPF'FPQVES KGAVSPP 

PIAJDD T^r^T D r>T?c ?\ ri'T'Ti'nL'TvTr^'rvv'DO'n'n'n/^ 
\_r\.v flr'fVjLiKf t bArl t'PKJN^L/AKUt'lrPo 

RATGFFFFFVFLVETGFHHVSNDGLY 


j 130 


19031 


A 


5167 


238 


1 


tjlar l^ibvjl\ii2.rAGlVHRb£»r VrllvbAT 
SRIGVGNLYRLMRGYATGGYVGTPGSMA 


5131 


39032 


A 


5168 


I 


89 


FFFFKTESRSVAQAGVQWCTLGSLQPPL 
P 


5132 


19033 


A 


5169 


3 


127 


UtiUc oUr jJCiKAUUbUr VroDAoPP.K.1 Kl 
SPKQVSYLIWVLS 


5133 


19034 


A 


5170 


20 


254 


IAPPARIRHEERERERERERERERERER 
ERERERERDTRAEWRARLFFFFPFFLPA 
DQCFF I KPSGQRFLPPGGGGV? 


5134 


19035 


A 


5171 


36 


479 


TXGXDNKKDLL I S VGDLVDRGAENAE Cl> 
£. L I T F P W FRAVRGNH EQMM I DGLS ERGW 
VNHWLLNGGGWVFNLDYDKEILAKALAH 
TADELPLIIELVSGDKKYVICHADYPFD 
h.iJbrGKP VUHHUV X WNKh.KlbNbyD(jl V 
KEIKGADT 


5135 


19036 


A 


5172 


278 


448 


RFLCPLLHPFFSSETKSLTLLPRLGCSG 
VI XAHCSIjKL'ijGSSNPPTSASRVAGMTG 
S 


5136 


19037 


A 


5173 


205 


1 


HARLIFVFLVETGFHCVGQAGLELLTSG 
DPPSASQSETPSPLjKILKSAGCGGANST 
AtASGLLFSLGP 


5137 


19038 


A 


5174 


409 


194 


GFTPFS CLSLLS I S FQPSAT YMRFLKT I 
TIVEYSFAYFPISQP 


5138 


19039 


A 


5176 


339 


472 


AKFTC I S SN I KLSNTRPGTVAHACG PS T 
LiGGKGGQI xGSGDGDH 


5139 


19040 


A 


5177 


426 


148 


GGRGPPPLSHPFGGGGGGGPLRAGGQKN 
PGPKGETPFFGQTQKKNRGGATPPFSQK 
LGGEKHKNSFTPGKENSIKFDFPPAPPT 


5140 


19041 


A 


5178 


328 


474 


NNTEYNKNRLGAVAHACNPSTLGGRGRW 
ITRSRDRISP 


Ji4i 




7\ 

A 


5179 


A AO 

44o 


330 


GIIGVSPHAWQ 


5142 


19043 


A 


5180 


259 


14 


LRYKAILCSWIRTTNIVKMAELPKVIYI 
FNAIPIFKIPDMFCRIARFILKCMWNVK 
VSTIDKTI FKKENRVGIPPRVLMW 


5143 


19044 


A 


5181 


2 


423 


KF YATL VR YVG DR KNIj VCREMSMAliL SN 
LAQGDALAARAI AVQKGS IGNLI S FLED 
vjv iiYiAy iyyoyHNijMHiyiypppjj£.ppjbVD 
I4MCRAAKA1jLAMARVDENRSEFLLHEGR 


5144 


19045 


A 


5182 


182 


312 


FFIQVGFHHVAQAGLKLPSSSSPPHLAS 
QSAGI TGVSHCAQPG 


5145 


19046 


A 


5183 


301 


472 


t 'DTT'D'CTI'TrT DDOT73VT T 7 TV piT^/^T/MTkTC) l^iT /-•o 

vjJL Pr rt r r LiKKor AJjVAyjJtyVyvvKUJUGb 
PQPPPPGLKQFFRNSTAWTKGLQPFPRL 


5146 


19047 


A 


5184 


216 


413 


K.P AA KKKKKKKKKlUvKJ^KJvKltXGGAVK 
KKGPQKTPVKPPGGFFFAPPLGVS PPPG 
VFFAGGGAPP 


5147 


19048 


A 


5185 


357 


465 


I KPTKI LAPGPGAVAHACNPSTLGGQGG 
WITRSGD 


5148 


19049 


A 


5186 


467 


0 


FFFFFFFFFRFLFFFFFSEKK 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Iso!eucine, 
K=Lysine, L=Leucine, M=Methionine, 
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nucleotide insertion 


5149 


19050 


A 


5187 


346 


3 


RFLPLGQGGVE I L.TPRSTCLGPPKWWD Y 
KGEPLRPAQKCFFFKKKRR RKKVQNS VA 
VYMV YHSMCGENKLRWRWWGI FTHTHTH 
THTHTQKKQN I LSRTHKKLVI TTGSEEG 

rpY 


5150 


1 905 1 


A 


5188 


11 


109 


GETPSLLKVQNIXWAWWHMP VI PAMWEA 
EARES 


5151 


19052 


A 


5189 


3 


151 


QLLRRPRQENHLNPGSRGCSEPRLHHCT 
PAWATVQDS I SNTNNNNKKCP 


5152 


19053 


A 


5190 


2 


373 


AVAADKGVPL YRH I AKLAGNPDLILP VP 
AFSVINGGSHAGNKLAMQEFMILPVGAS 
SFKEAMRI GAE VYHHLRGVI KAKYGKDA 
TNVGDEGGFG P 1 1 LDNK DAL ELVKTA I Q 
AAGYPDKGAIGR 


5153 


19054 


A 


5191 


55 


421 


ARVADVCESMKEHLLVLVERAKYIPGFC 
ELPLDDQGALVTAHAGEHLLLGATKRSM 
VFKDVLLLGNDYIVPSHCPELAEMSRVS 
I R I LDE LVLP FQ ELH I DDNE YAYLKA 1 1 
FFDPDKETEA 


5154 


19055 


A 


5192 


163 


466 


TCFLGSQSAGITSVSHCPSREVFFLKLI 
HWRQGGQVALLVATPHS PCCPQYRLAPI 
PRARHD FACAS L I FAC I LLVHVLLMP RV 
bA(jKu V \j V H FAu ± yAVj K 


5155 


19056 


A 


5193 


470 


0 


ECLWPGTVAHASNPTTLGGRGGRIMK 


5156 


19057 


A 


5194 


491 


281 


RGLALIiPMGQCSGMMLAHGSLDLLDSSD 
PP AS AS QS VG I MGVSHHAW PSLVNI S FV 
CLIHRS PKTKPEGR 


5157 


19058 


A 


5195 


3 


222 


LS FHSLHRCLYMLGTTS ENVS P FSLLEL 
LS RLATLLGD Y CG S LS EGT I S RNVALVY 
ELLDEVLRLE S RCVAQA 


5158 


19059 


A 


5196 


20 


191 


STWWWS SRGGGCS E PRSHHCTPAWVTER 

DSVSKTKKKINNKPALALPAVHPTSHPG 
c 


5159 


19060 


A 


5197 


460 


350 


RRLRRENHLNPGGGGCSELRSSHCTPAW 
VTERGPSD 


5160 


19061 


A 


5198 


174 


41 


SLGELLTECLFETKSHSVTQAGVQWRHP 
GSLQPPPQGFKRI PPH 


5161 


19062 


A 


5199 


131 


454 


FEYFKNRKPFFFFFFFGKGVSFCAPGGK 
AGD P FKLiRE P P P PRVKGFFG P P P PSGGN 

TPGGGPPPPAPQRGGINGLDPPAR 


5162 ■ 


19063 


A 


5200 


168 


51 


TVKTP I R PGAVAHACNL ST LGGRGMW I T 


5163 


19064 


A 


5201 


414 


497 


LLFXXXXDNGSIYNPEVLDITEETLHSR 


5164 


19065 


A 


5202 


381 


614 


VQP GAP P E Wv ALS TCP S AAP EGS QQP Y I 
P PT FC FFP VKWLLS VTWL YLF I YFLS W I 
SVAQAGVQWCNHGSLQPRPSRL 


5165 


19066 


A 


5203 


377 


485 


PKEPGPQPCAPQPQPHAPQPQPRAPQPQ 
PCAPQPQP 


5166 


19067 


A 


5204 


54 


434 


MRTPENLSLTNPQE FGSSWAAVECPDTL 
DPRDMCVLNPLREPFAKKECS ILLSEVF 
EICHPWDVTWFYSNCLTDTCGCSQGGD 
CEC F CAS VS AYAHQ CCQHGVAVDWRTPR 


5167 


19068 


A 


5205 


201 


418 


GDAGAHAVLRLLPGHPGTSHLPFSSLTT 
SPLGLKRVTGGFNSKNRCDARTYCYLL.P 
TFAFAHKDRDVQDETY 


5168 


19069 


A 


5206 


27 


177 


NSFHHAGQSGLELLTSSDP PAS A SQ5AG 
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K=Lysine, L=Leucine, M=IVIethionine, 
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I TGVSHRAQPKAGTFLS PFHR 


5169 


19070 


A 


5207 


6 


1175 


DLGPHTPAWTRPKREDLVYQSTVRLPEV 
RISDNGPYECHVGIYDRATREKWIiASG 
N I FLNVMAP PTS I E WAADTP AP FSRYQ 
AQNFTLVCI VSGGKPAPMVYFKRDGEP I 
DAVPLSEPPAASSGPIaQDSRPFRSLLHR 
DLDDTKMQKSLSLLDAEMRGGRPYTERP 
SRGLTPD PN I LLQP TTENI PETW S RE F 
PRWVHSAEPTYFLRHSRTPSSDGTVEVR 
ALLTWTLNPQIDNEALFSCEVKHPALSM 
PMQAEVTLVAPKGPKIVMTPSRARVGDT 
VR ILYHGFQNEVFPEPMFTWTR VGSRLL 
DG S AE !FDG KELVLERVP AELNGS M Y RCT 
AQWPLGSTDTHTRLI VFENPNI PRGTED 
SNGS I GPTGARLTLVLALTV I LE LT 


5170 


19071 


A 


5208 . 


401 


3 


FFFFFS ETESHS VAQAGVQ WHHLGS LQG 
PPPGF 


5171 


19072 


A 


5209 


402 


229 


GFFKKGPPFFFFFFFFFFFFFFFKMWJL 
RLSGSKVEtmraSQKLYMNFGCCRGRSW 
FT 


5172 


19073 


A 


5210 


198 


2 


PPQAKILSSSSPPIRPPPKKGFFPKNPQ 
VGFYSAPHKEKTFTLPAPVKFGPPKDPL 
KRPPP 


5173 


19074 


A 


5211 


516 


222 


GSTLRLTQRSMPASSSTMAPSSKRLLLP 
RP ER S VP AAAGTAGHHEASRN CGRGGAG 
ADEGPATKGDSHPKPGYCRAAHPSAAPW 
PPGPEKNFMRVG 


5174 


19075 


A 


5212 


103 


203 


DKVSLRCPGWPQTPGLKQSSCLSLPKCW 
DYKCE 


5175 


19076 


A 


5213 


409 


249 


RRNQVGGPRQLNPGKRI KSPGIDPYI YG 
QLIFDKAVRAIWWRKGSSFLQKALE 


5176 


19077 


A 


5214 


2- 


170 


ERKILGY I QLRKGPNWGPYGLLQPFAD 
AIKLFTKEPLKPATSPYPLQSPAFPLKP 


5177 


19078 


A 


5215 


270 


464 


GQLNKNVWGWDPGKVTFRDGVSLYCLHW 
NAVAIHGHRHSVPQPQTPGFKQSCLRLP 
SSWDYMPLY 


5178 


19079 


A 


5216 


156 


248 


PDKSYI ICVCVCVCVCVCVCLCVWVCME 
CP 


5179 


19080 


A 


5217 


412 


2 


PPPQIFFFSPPHFFSPPPKGGFFPPPPP 
KIFFFPPPLFFFWGFFPPFSPPPKKFFF 
PKSPPFFFFPPPKKKKFFFFPPLFFFPP 
PGFFLTPPPPFFFFFFFFFFFFFFFFFF 
FRMGCDRWHGEFWILRDGFD 


5180 


19081 


A 


5218 


165 


1 


QVKYHNKKTHNPIKIGKRSEQTFHEKDT 
QMANKYMKRSPTSLRI KG IQIKT IMRY 


5181 


19082 


A 


5219 


408 


88 


PSSSPHFFSPPPLGGVFPPFPFKIFFFP 
PGLFFFWGVFPLFSPPPKRVFFPNSPPG 
FFFPPLFGKKFFFSPPPFFWPPPGFFLS 
PPPLFFFFFFFFFFFFFFFF 


5182 


19083 


A 


5220 


3 


259 


NLLLLIVPILIAMAFMLTERKILGYIQ 
LKKKKKKKKKKKKKKKKKKKRGGAFKKK 
PRGGQNFPGEKKKKFFFLREVKKKPRGN 
F 


5183 


19084 


A 


5221 


10 


479 


KLMPAESDGRHRERERERERERERERER 
ESSSARARCVSLSLDRYKNRVYMKVAEA 
LCECRLLAYISQAPTQMFFLLRLINIIH 
AHTLTQENDMCLHTTLE I LTLARQRERV 
PLYLRVLQRMEHKKKHPGFVLNNSHNLV 
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nucleotide deletion, \= possible 
nucleotide insertion 














RLWQQL YLHRDRDS I C 


5184 


19085 


A 


5222 


142 


56 


KFQPGAVAHTCNPSALGGWGGQIKRSGV 


5185 


19086 


A 


5223 


34 


471 


LEMRSEIDSQNYCKFCDAGRNEWTSDTIV 
LSQKEHLPVTQ I VMTDAGRPHSEAAYTL 
GPLLCRGDQSFWNSASFNTETSYLHFPA 
FHGELTADVCFFFKTTVSSGVFMBNLGI 
TDF I R I ELRGKLATL.DKSQGTHYLAI KA 
LTQKKKKKKTRG 


5186 


19087 


A 


5224 


480 


0 


FFFFFFFFFFFLIVAQFYFL 


5187 


19088 


A 


5225 


16 


422 


VRRTAR I RHEG KP Y E CNACG KA FNRS AH 
LTEHQRTHTGEKPYVCKECGKTFSRSTH 
LTEKLKIHSCVKPYQCNECQKLFCYRTS 
LIRHQRTHTGEKPYQCNECGKSFSLSSA 
LTKKKR I HTRERP YQCKKKKKKK 


5188 


19089 


A 


5226 


99 


436 


GHPSFE I PATMTAAPAGFPPQVPWEDVR 
YLFDE IMYGGHI TDDGDCKLCRVYLEE F 
MNPSLLAALSGTHACGEKPRIiPANSHVS 
EPSWKWILQPQSSLQMTAALANILTATS 


5189 


19090 


A 


5227 


473 


9 


FQITATPHLAVYDPTVQFEFWFSEKQIA 
DIRQVEASTRYLGTALYWIAAS INIKPG 
HD YYFYI RSVNTVGKSAFVEAVGRASDD 
AEGYLDFFKGKIAESHLGKELLEKVELT 
EDIASRLEEFSKEWKDASDKWNAMWAVK 
I EQTKVGERDVTSG 


5190 


19091 


A 


5228 


1 

- 


473 


PPPIDRLPNCTACRNSARVMTDAGQPHS 
EAD YTLG PLLCCGD KS FWNS AS FN TET S 
YLHFPAFHGELTADVCFFFKTTVSSGVF 
MENLGI TDFIRI ELRGRLATLDKSQGTH 
YLAI KALTQKKKKKTRGGAGPPFPliIGS 
R I T I HG P PFNNAAMREK 


5191 


19092 


A 


5229 


170 


25 


KTPLLGPGTVAHACNLSTLGDQSGCIMR 
SGVRDQLRQHDETPSLLKRI 


5192 


19093 


A 


5230 


465 


0 


WLYPPQKAQKKKKKKKKKKKKKKKKTK 
ARG 


5193 


19094 


A 


5231 


419 


56 


CVLLRSTKKKKKKKKKKKKKKK 


5194 


19095 


A 


5232 


79 


451 


AGSNLQEHRGLRPES PFNPGGGGCREPR 
RRHCPPAWAKRAKLRLKKKKHGRNQEVA 
QGRHGPFFPGLGPEGFLPRRPGSRKRRK 
GRNVPGPELKGPGAVLFSRHRRGGADLR 
PPRKGPRGGGGG 


5195 


19096 


A 


5233 


39 


467 


VQQQQRAQEQQQQHPVLHLQPQQI MQLQ 
QQQQRQI SQQPYPQQPPHPFSQQQQQQQ 
QAHPHQFSQQQLQFPQQQLHPPQQLHRP 
QQQLQPFQQQHALQQQFHQLQQHQLQQQ 
QLAQLQQQHSLLQQQQQQQIQQQQLQRM 
HQQ 


5196 


19097 


A 


5234 


194 


45 


MILKFFFNFKIHFFFFLRQSLALVAQAG 
VKWCNVGSQQPPPPGFKRNST 


5197 


19098 


A 


5235 


204 


475 


AS ITVHLWYVEKCYSQSTEKVTPMPSSS 
WFGPQPDAQLKHGLRPGAVAHACNPSTM 
GGRGGRI TS SGDRAHP 


5198 


19099 


A 


5236 


59 


357 


NHWRKIYCKVYNRKKIKHNTTNTTKNYQ 
NFQRNRTKSSFTIFRIQSHYNKHIKKHI 
QRKKQSI aTDLiEMTQMljliLVW I FKWIV 
I LTKI KTPKTLKING 


5199 


19100 


A 


5237 


447 


166 


ELLAFWQTQ YNNMNAVNLFFFW I KAPGI 
PTPSFASRRTHSPFFSFLFFLFFFLGKK 
GFPLCGPGGPKSLNLKTPPLGPPKGGK 
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5200 


19101 


A 


5238 


34 


296 


LS VAE FLAFWQTQYNNMNAVNLFFAW I K 
VFG I PTPSFASRRTHSP FFS FLFFLFFF 
LGKNGFSPCGPEGPKFLNLITPPLGPPK 
GWE 


5201 


19102 


A 


5239 


340 


472 


SLNETHVRS PAHHRPGATAHACNPITLG 
GRDGL I MRSGDQDH PG 


5202 


19103 


A 


5240 


460 


0 


PKKKI FTI YFFYFFFFFFFFFFFFFL 


5203 


19104 


A 


5241 


363 


46 


SFYSTHRKPCHWPGSVAHACNPSTLGGR 
GGRITRSRDR 


5204 


19105 


A 


5242 


175 


36 


LKYYLIFIYLLSFFFLRQSRSVGQAGVQ 
WRDLS S L.QP PNSTALQ YR 


5205 


19106 


A 


5243 

1 

i 


11 


442 


DTIRWGLPTLGSKSTTNBKKREKRRKKK 
EQQQSEANELRNLAFKKIPQKSSHAVCN 
AQHDLPIiSNPVQKDSREENWQEWRQRDE 
QLTSEMFEADLEKALLLSKLEYEEHKKE 
YEDAENTSTQSKVMNKKDKRKNHQGKDR 
PLTV 


5206 


19107 


A 


5244 


459 


333 


FLRVTQAGLKLLSSSDPPTLASPKCWDY 
RHEPLCPAQWSVS 


5207 


19108 


A 


5245 


229 


436 


FNSTWRKRGLGGFIHLNIiIQTSFFAHL 
WSLTYVLCVSFFVCLFLRQSHFVTQAG 
VQWHNLGSLQPLP 


5208 


19109 


A 


5246 


227 


30 


VFFARWVFLRQDLDLSSRLECSDVILVH 
CNVRLAGSRGTPSSVSKVAGTTGMLYHT 
WVFFYEFHR 


5209 


19110 


A 


5247 


173 


442 


FLGSSSKAAITLYYCQYMEFFFFFFFFE 
KNFFFSPPGGGEGPNFILLEFPPPGLKK 
ISCPTPPRGWDYGVAPPPLVNFWFFKKN 
GVPPPW 


5210 


19111 


A 


5248 


255 


358 


TGPGT VAHTCDPS TSGGRGGQ ITMSGDR 
DHPGQH 


5211 


19112 


A 


5249 


3 


130 


QPQLAAP S I S WAPTSASQVAGTTGVCHH 
AWL I FLFWVESRRG - 


5212 


19113 


A 


5250 


90 


1 


S TRLENNGTKSAQCNLRLTGS SNS PNSA 
SR 


5213 


19114 


A 


5251 


358 


447 


SFISYRRRSPSPYYSRYRSRSRSRSYTP 


5214 


19115 


A 


5252 


55 


202 


CWNTSMRGPIWEAEAGESLEPGGRRSCG 
E ART CHC I PAW WRAKLVDAA 


5215 


19116 


A 


5253 


330 


3 


EEMGFSHVGQGGFKILASGDTPAWAFQK 
GGI SGVSPRAWAGFI IFI WNFSGGPQTG 
FPFYPFRGIFFPRAKINYIRVLNPKGLC 
QKGVFFFFFSGFPHYRCRLCQISV 


5216 


19117 


A 


5255 


175 


3 


MYLRFPLGLSLSERPGMVAHACNPCTYW 
ESKAGGSLSLGVQDQPREHGGTLCLQKS 
K 


5217 


19118 


A 


5256 


263 


1 


RPRRRFNRDKTALLQEMPFRTFTALGKL 
I SDFiCDSXDGLTLYFVETASRS WLAGL 
ELLTSSDPPTSASQSTGITGVSHFSQPG 
LAF 


5218 


19119 


A 


5257 


1 


103 


CHVSQAGLELLASSDPPASASQSVGITG 
VSHRTW 


5219 


19120 


A 


5258 


128 


2 


IHKWRDCKMVQPLWKRAWQFLMNLNTHL 
SCDPTVLLLDILTQ 


5220 


19121 


A 


5259 


381 


2 


FFFFFSEMESPSVAQAGVQWCGLGSLQA 
PSPGFT 


5221 


19122 


A 


5260 


2 


173 


ENYLNMGGGGCS EPS LHHCS P AIiAI ERD 
SVSTTTTTTKQQQQNNQLGLFSYPFNPL 
I 
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Amino acid sequence (A=Alanine 
C=Cy stein e, D=Aspartic Acid, 
E=Glutamic Acid, F^Phenylalanine, 
G=Glycine, H=Histidine, J=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proh'ne, 
Q=Glutamine, R^Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


5222 


J 9 123 


A 


5261 


1 


175 


I KGN Q I GKKE VKL FLFTDDM I L YMEN P K 
DSAKNYYKQS KTFCLP WLGRS CKRTCP 
LA 


5223 


19124 


A 


5262 


2 


145 


Q VSLKLLGSS D P PALTSQGAG I MGVSHR 
TRPRNTDFKNNTQKS KNTQ 


5224 


19125 


A 


5263 


182 


3 


S TT S FLN FFLTEM VS HYV SQ AG L KVLGS 
SDPPS SASHSAGI TGMSHHTR PQQPLLN 
LKV 


5225 


19126 


A 


5264 


183 


369 


WNSLDFFFFFPSFFRGYSSMGKVQDAFI 
F YRRF I DKS KPRANTWGS I GGL YQQQ I H 
PirlL/AJjR 


5226 


19127 


A 


5265 


329 


207 


RGCSELRSRHCTPVWVTEQDSVSQKRKK 
KSPCHLQLGTSQ 


5227 


19128 


A 


5266 


177 


13 


TLTILLNFFFLRQSYSVAQAGVQWHNLN 
SLKPPPPGFKRFSCLSLPEFQQGHNIR 


5228 


19129 


A 


5267 


474 


122 


FICSPHPGGFLTLGPKLNILGPSQPLFP 
LRGGS KKQG I P FYS Y KI KG P P GAG P PG S 
ARYSNPFGGYGGRVPLPPEFPDPPGEHK 
PRFFLKKKKKGHQVPTSKYLPEYLFAYY 
NASLE 


5229 


19130 


A 


5268 


55 


311 


GIQTFGKNVWAGRSKNVGMP I AMLLHT 
DGEHERPGGDATVT IAHRYTPKEQLKIH 
TQLAD 1 1 1 VAAGS LSI SHAGVQWRNHG S 
I* 


5230 


19131 


A 


5269 


238 


133 


NKNIV/VQKKCVFY I YLPLICVCVCVCVC 

vcvcvcv 


5231 


19132 


A 


5270 


336 


517 


PVLYILEHSPYNASIYQLALKKYQSRPG 
AVAHACDPNTLGGRGGRITRSGDRDHPG 
QRGE 


5232 


}9133 


A 


5271 


262 


467 


VLSMRPRIHGSAAREEDEHPYELLLTAE 
TKKWLVDGKTKGTF PTTPGRTNS KG I F 
KVCDPEWKGKKS 


5233 


19134 


A 


5272 


213 


2 


MKIYINVYILLFSLKXRQGPTLSPRLEC 
SGMIIVHRSLKLVGSSDPPALTSRIAWL 
IGARHTPDCSVCKF 


5234 


19135 


A 


5273 


284 


105 


YSCSYSYFDEPVELRNSSFFRWNHSSDS 
YWKKKS S KDTEPVLKP P3YSAR YECKTV 
GSS 


5235 


19136 


A 


5274 


290 


449 


I FYFSGRVRAKLSAPLAGMGNAKADSRG 
RSRTKMVSQSQRMYCLSALLYLSLC 


5236 


19137 


A 


5275 


248 


423 


NKVNITLMTKLEKDTTKKENYRPISLLN 
IDMKILSKILQNQIQQYIQSIIHHDQVK 
LL 


5237 


19138 


A 


5276 


425 


30 


TRRGMPHRQGS PRKAP VS VRRQRVREEC 
GREPLLWFLQSQLLGRLRQETHLNWGGR 
GCSE PRS CHC I PAWATRANS I FCNFQAS 
SVEVRRSARKKLFSDILKRHNTITWRVS 
GLLLVDSYFGRLATPVRTQ 


5238 


19139 


A 


5277 


25 


185 


SLFSVAEFAFNAYKVQTEKEBKEEARSK 
YKEAKESFQRFLENHEKMTSTTRYK 


5239 


19140 


A 


5278 


263 


483 


VFTWFFQLFCVYKLI FSFLTECRS VAQ 
AAHAGAHWHNLGS LQPPLSGFKQFS CLS 
LPEFPRMYSVLSPSP 


Dz40 


mi a i 
19141 


A 




25 


TCI 


LTSGDLPAS ASQGAG I AGMSHHTRP I LL 
HFIYFILLYHPPCLAYFIIF 


5241 


19142 


A 


5280 


326 


203 


IETESCYVAQDGLELLGSSNPPTSASQS 
AGITGVSHHTWTT 
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G=Glycine, H=Histidine, Wsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q^GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possib!e 
nucleotide deletion, \=possible 
nucleotide insertion 


5242 


19143 


A 


5281 


210 


459 


NKNNKKFNEKKKKNYKKWARELYRHFSK 
DDIHVANRYMKRYPTSLIIRKIQIKTSM 
RYHLTP IRIATI PR 


5243 


19144 


A 


5282 


262 


454 


LLFTDYHEGMLHSVKIiLFCFLGWSFTLA 
IQAGGHWRDLGSLCPLLPRLKQSSCLNL 
LSSWDYKR 


5244 


19145 


A 


5283 


451 


303 


GTEFHHVGAGLELLGSSDLPVLASKSAG 
I IGMGHHAQPKQYS FVILTVQ 


5245 


19146 


A 


5284 


217 


489 


PVS IS I ILRCLSLPTPGSRVLCMLEGMD 
GISLGWVPLPSIiPVPLHPLDLSLPVCRQ 
VGGTKTGWR YVGETDFAKGEWCGVE LD 
KPLGKND 


5246 


19147 


A 


5285 


247 


99 


FFFFFFFFFLYGFFFCFFFFFFFFFFFF 
FFFLKNKTQPYF x YKRNI LSM 


5247 


19148 


A 


5286 


2 


117 


PRVRPRVRPRVRPRVRPRVRPRVRKKKK 
KKKKKKKKKKGGGF , 


5248 


19149 


A 


5287 


121 


3 


KFWPGAVAHACNPS PLGGRGGW I TRSGD 
RDHPGQHSETP 


5249 


19150 


A 


5288 


416 


48 


IGKPGSFFPPPPSPRGAPPPPPKIFFLP 
PPPFFFRAGFPLFPPPPQIFFFPKIPPP 
FFFTPPLKKKFFFPPPPIFES PLiGLFFF 
PPPPSLFSFFFFFFFFFFFFFFFFFFFF 
FFFFFFLGVR 


5250 


19151 


A 


5289 


503 


0 


FFFFFFFFSPFFFFFSSSFFFFFFFFFF 


5251. 


19152 


A 


5290 


70 


236 


I FCNSQLTSPHKHQKKKKKKKKKKKKKK 
KKKKKKKKKKKKKKGGGD FKKK I GGAK 


5252 


19153 


A 


5291 


443 


287 


QFTKKKKKKKKRKKKKNKKKKTKQKKT I 
IPKPPLF 


5253 


19154 


A 


5292 


334 


115 


KNPNALFFFFFFVDRISLLPRQWHNLGS 
LQPLPPGFTQFSCLSLPNSWDYRCEPLR 
PALLFFSSKSKIHIKPL 


5254 


mss 


A 


5293 


413 


3 


SSSPPIFFPPQKKKIFSFPPPKKFFFPP 
FCTFFFLRGFFQI FPPPKKNFFPKKPQNF 
FFYPPKKKKIFFFPPPKIFPPPKIFLKT 
PPPFFFFFFFFFFFFFFFFFFFFFFLVS 
LEKG F PDAWADAW ADAW 


5255 


19156 


A 


5294 


291 


11 


IQPYI KRLMQHDRVGLSSEKQSWFRI YI 
YINVPGFQNKGFI I IGSSQ KAQKMI FFF 
LDRVLLCHPGRSTWQSQPTAALKNWAQ 
KTLPPQPSE 


5256 


19157 


A 


5295 


2 


401 


NPRSTEAAI KYFLTQATAS I I LL I AILF 
NNILSGQKKKKKKKKKKKKKKKKGGGPK 
KKKIGGGKKKPGGKKKIFFFIRGEKKNP 
RGEFEKKTFFWGGEKRANPPPKKKTLGE 
KKKI LGGGGGKKFFFFVGKKK 


5257 


19158 


A 


5296 


49 


363 


RLQKKNKEKKKKKKKKKKKKKKKKNKRK 
KKKKKKKRRNGGGGKLLWKGGGQILTKE 
KI L I LVAVS YTCN I NNLGRQR KW I IGGG 
KSEPNYEFKRNIGAGRGGSLL 


5258 


19159 


A 


5297 


100 


323 


EYLNTCVRICHSHAHFLPPSYVSFALLP 
RFLFKWLLYMQVFGEGTEAVKKSLEGI 
FDDIVPDGKVKINVCFLF 


5259 


19160 


A 


5298 


595 


0 


VCVCVCMCAC I C I C 


5260 


19161 


A 


5299 


412 


141 


TVDTS YS E KKI LiFQ I L LL 1 DNG Fv^HLtRA 
LMEMYKEINWSMPANTTS IMQLMNQGV 
I S I FKS S YLRNAFCKAI AAIHSDS FDES 
GQSNLL 


5261 


19162 


A 


5300 


216 


2 


SODWLFXFCYHPRVFQRQPIQKLMGLFF 



866 



WO 01/64835 



PCT/US01/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


JM 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
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W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possib)e 
nucleotide insertion 














FCLPNMVKNGVFFFFEMEGRSVVQAEVQ 
WHDNS PLQPKPPGLK 


5262 


19163 


A 


5301 


368 


3 


IFFFPQGKEGGGLFYIFFPPKKKGFFFK 
KKIFFFYPPPKKKKKIIRAABKRGPPLF 
FLKKTRPNFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFLVREK 
VFNSTI STQ 


5263 


19164 


A 


5302 


389 


2 


AMAQASGSCLCSRLLVDGFI PWGVFRYF 
IVEAGSCYVDQGGlAILASSYPTThVSQ 
SAGMTGLSHRARPPLNGPPPGLTPSLSM 
VPKANVFSLLEKPTGLTQALRITLANVS 
NAPGQTRSGLEGWGCFF 


5264 


19165 


A 


5303 


84 


333 


FHDCCRPERSRILRNFLLSSAGIQSIAD 
AYRLALPQGLLYGPTNFAPI INHVARFA 
AHAAHQGTASVRWGGSGSGGNQGLP 


5265 


19166 


A 


5304 


485 


93 


PPLFRRFSPPPPPKFFFSPRAFFFGGGF 
FSFFSPPKKSFFFKNPPGFFFFPPLKKK 
KIFFLPPLFLAPPKDSFLAPPPFFFFFF 
FFFFFFFFFFFFFFFFFFFFSRENI FFI 
III FI YLFRDRFLFCCLG 


5266 


19167 


A 


5305 


157 


2 


PPPPFFFFFFFFFVWWVITLNVELEPSF 
SPNTESQIGPEEAMERLQENRVE 


5267 


19168 


A 


5306 


1 


133 


TQES IMDLENDKQQLDERLKKKKKRPAV 
LKNPWGGQSLPGMARE 


5268 


19169 


A 


5307 


239 


2 


PPRNWGFFSPLSPLKSSSPPKAFNFWGG 
VGPNFPPPKKRFFSKNPPGGFFYPPLKE 
KNIFSLPPVNLGPPRVFFKGPPP 


5269 


19170 


A 


5308 


454 


116 


FRLPSSSNSPAPASRVAGISGFRCVGRA 
GLQLLTASDPPASAS QGAG I ADGVS FTQ 
CSMVPRLECSGMISACYNLHLPAACLGP 
PKCRDCSLCLAATPSGK 


5270 


19171 


A 


5309 


123 


1 


CSVSKWYPIAFLKNI IMLWEAEAGGSRG 
QEIKTILANTVKP 


5271 


19172 


A 


5310 


153 


33 


WPGWAHTCNPPPLGGRSGLITRSGVQP 
QPGQHSESPSI 


5272 


19173 


A 


5311 


350 


3 


NSRNLILTQEHSPEEKNEFLFSLPLQSV 
SMNTTHSPLLSNSLPHFIISFLPTRFLV 
GPSPTRSFPPAQSPDRISFSSRLECSGT 
ITAHRSLNLFGSSDPLISVSQVAGTTCM 
CHH 


5273 


19174 


A 


5312 


1 


392 


RTRGRTRGI CKS I T I IHHINRTNEKKNH 
MIISVDAESAPLQIHSAKLKK 


5274 


19175 


A 


5313 


1 


239 


LKT S FHHVGQAVLE FLAS S DLPALAS Q S 
AGITCMSHCAQP Y I S INF I PAAETQQS S 
PELVPLPASQKTPSFSTPLFSLP 


5275 


19176 


A 


5314 


345 


237 


LYFYIYFLKSACVI ILSTLCVCVCVCVC 
VCVCLCM 


5276 


19177 


A 


5315 


268 


412 


GMISSNNSSDSFGNCLNPGGKGCCEPGS 
YHYPPAWATDTFSKKKKKKL 


5277 


19178 


A 


5316 


112 


224 


P WMI/ENELGLHAS YLAMSTPLS P VE I E C 
ASKKIFTFC 


5278 


19179 


A 


5317 


377 


139 


KPPPRIFYLGPQKKKKLFYPPPFKNCFF 
LGPPPPPLFMIFFFFFFFFFFFFFFFFF 


5279 


19180 


A 


5318 


216 


385 


RGVLLCCPAWSQTPGLKGSSCLSLPKCW 
DYKHELPRPATS CNS I LFFNLQGAFMYH 


5280 


19181 


A 


5319 


376 


3 


RIDFGGPKKKILLPPPPAVKIVSLKGPP 
LFFFFLNS FFAPAGGQWGVFQL I S SGD P 
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T=Threonine ? V=Valine, 
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nucleotide deletion, \=possib!e 
nucleotide insertion 














PPPAPESVRI PGVSFCAGVPQGFFFFFL 
NSSVAQAGVQWRDLGSLQPLPPRSKRVS 
CLSHPSSWDYSH 


5281 


19182 


A 


5320 


377 


287 


ELLTSSDLPALASQSAGITSMSHHTQPG 
SF 


5282 


19183 


A 


5321 


381 


65 


SSSPSSRTSGPFFFFSPPKKKKNFQKKK 
NFLYPPFFFPPPFYKNPPKKKLKGPKKK 
KKISPPPPKKNLSLKNPPPFFFFFFFFF 
FFFFF 


5283 


19184 


A 


5322 


132 


396 


EGGFVLKIIREGIGPHFLGLEEEGSKFH 
EHI FWEKHRESFPKQGP I PHFMELGTWG 
LSKNPYFRVKQRVEHIEGFKNFFNEKKD 
FLKE 


5284 


19185 


A 


5323 


366 


1 


GFLKKGVPFFPPGKRGGGNQTPPKPFVL 
GGKQPPPPPPLKKKNPKGGPPQQKKIFF 
FLTRGGVSLFGPGGGFPFFFQPPKKPPP 
GGFPFFFLKKPPPPPLFFFFFFFFETGF 
CSCRPGRTRG 


5285 


19186 


A 


5324 


131 


1 


DGALSPRLEWCSGTILAKCSLRLPGSSA 
SCVSASRVAGITGVC 


5286 


19187 


A 


5325 


112 


1 


DGI LLLLPRLESNGAISAHCNLHLPSTS 
ESTASAFLS 


5287 


19188 


A 


5326 


404 


18 


PVSPHGKNHEKHTVTCVGSTSGAFLHGE 
EHCHGNTHHPFEPSNPQTACQSQANRIR 
HGNS FKNLWRS WAWWRTP WSATQDAEM 
GRPLEPRGLS LAWATQQDPSLKQKQKQK 
KPTHLWSYILGHHQLP 


5288 


19189 


A 


5327 


109 


3 


KCVLTP VI RALVJEAETGGSRGQDIET I L 
SDTVKPR 


5289 


19190 


A 


5328 


1 


117 


ETGFCCVAQSGLQLLDSSNLLALVSQSA 
GVTGMSHRSRP 


5290 


19191 


A 


5329 


3 


120 


DAW VAAE AEGQNDT I EEPNKVQKRKRG I 
NDNVPAGQAH 


5291 


19192 


A 


5330 


105 


2 


RSGWAH I CNPSTLGGQGRW I TRSGVQD 
QSVQDG 


5292 


19193 


A 


5331 


204 


411 


SKKKQKKKKKKKKKKKKKKKKKQDKKG 


5293 


19194 


A 


5332 


185 


2 


FFFETESHS ITQAGVQWRDLSSLQPLPP 
GFKSLPSSWDYRCLPPCPANFCIFSRDR 
VSLCW 


5294 


19195 


A 


5333 


561 


325 


WSTLHSFPIWLLNTGYYFKELTGFFFFL 
TVALSPRLECSDTI IAHYSLRLLGSRDP 
PTLVFQSTKITSVSHCTQPSVRF 


5295 


19196 


A 


5334 


3 


96 


EGKAAFSQEKSRRVKEENPEIAVSASTI 
PE 


5296 


19197 


A 


5335 


375 


228 


LSP WLE CSGTI LAHCNIjRPLGSSNSPGV 
FGFYFFLTLILGSGVHVQLC 


5297 


19198 


A 


5336 


403 


0 


NISVPASKHISPFPKKKKKKKKKKKKKK 
XKSQKKKSRG 


5298 


19199 


A 


5337 


397 


80 


1 1 ITS KKKKKKKKKKKKKKKKK 


5299 


19200 


A 


5338 


406 


34 


KRTLKDSKKKRKEKKKKKRKKKKKKKNR 
EKKKKPKKKKKKKKRGGAQKKKNNHPAG 
G KAY FFFLGGTKKKRGGGDTKPLGEKNS 
PAQTKKLGGNNPPFVERGKKKKTPG I 


5300 


19201 


A 


5339 


410 


0 


KKQKKKKKKDAPGGGGGGRKKKGGPQKK 
KKGSHLKTPKKEKTGGA 


5301 


19202 


A 


5340 


306 


392 


IQYKILPQKENDWDKIQGQLAIPVSSLN 
F 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E-Glutamic Acid, F=Phenylafanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 


5302 


19203 


A 


5341 


3 


173 


TFTATELI I FYI FFETTLIPTLAI ITRG 
GNQPERLNAGTYFLFYTLVGEGSLLARS 
R 


5303 


19204 


A 


5342 


247 


426 


GRAWWFTPVITPLWEAEAGVQWRDLSSL 
QPLLPGFKRFSCLRLPNNWDYRHVPPRP 
ANFL 


5304 


19205 


A 


5343 


3 


149 


SWFLHFGEAGLELLTSGDPPALALPKCW 
DYRLVSNSWPEVILQPQPPK 


5305 


19206 


A 


5344 


464 


147 


LRRLRWENRLRPGGGGCSEPRSHRCTLA 
R VTES LS RVQCQRDPRRLAQTSRRPGAS 
S FPFAGPAFSAVNSTYQRALP PAQGAPW 
T PRL PLG PHGTTS S TS S YL P V 


5306 


19207 


A 


5345 


228 


1 


VIRLEETYFLLFFVLFLFLFFRQTRSGS 
FAQAGVQ S SHLSLPS SWDHRYTPHLANF 
CI FLQKQSIiAMLPRLVSNTK 


5307 


19208 


A 


5346 


2 


106 


FVLELLGSNNSPSSASESAGITGVSHCG 
QPLFNF 


5308 


19209 


A 


5347 


140 


436 


FLCDMRVLYVTFFFFGKGVS FFPPGWRA 
GGEFGLIENPPPGFNPFFLLNLPKKGEL 
RAPP PAP I FFFFFFEKRGFTRLTQEGLN 
FWTWEPPPLTLPRGG 


5309 


19210 


A 


5348 


1 


465 


LESRRFEDGTRVPGST I S WDP PEEAVCP 
FSDLQLRAGRTTTLFKAVRQGHSSLQRI 
LLPFVWQCPAPRGGVYRGRQASLSCSGL 
HPVRASWLLCFPSEAWAMSGTPPPASLP 
PCNLISDCCASNEQGSVGIGPSEPGVEY 
NLLLRHLLKPEEKRS 


5310 


19211 


A 


5349 


183 


58 


E RD YEEG PGAVACAC IPS TLGGRGG WI T 
RSGDRDHPDEIVDP 


5311 


19212 


A 


5350 


288 


404 


lEXNGMILAHCNFQLPGSSNSPASASPV 
AVITGMNHYAR 


5312 


19213. 


A 


5351 


413 


305 


AE AGLKPLS STYLRAS ASQS AG I TGVSH 
HTQPLLFF 


5313 


19214 


A 


5352 


91 


356 


TVPKRNTFSPKMVAITTAGTVSTSVLAL 
GNRESALPTRLGWALCCFIFWGGRGHRV 
P LCH P GW S TVAR S RLTATSTS QVQA I LL 
PQPP 


5314 


19215 


A 


5353 


2 


410 


FVQWQFLKTGSDRSHKAVESLGYEKVEK 
AMTVNYS FRS LNTKGKLELLGKI S VNLY 
AKWKHFREKGRRYTNTLGPTATAAFYST 
EKTEFLPWSQTCKPFIDSFNKYLLPGA 
ADHACNPSTLGGRGGKITRSGDRD 


5315 


19216 


A 


5354 


3 


288 


SLDDDLLKLLLPLMLQYSDEFVQSAYLS 
RRLAYFCARRLSLLLSDSPNLLAAHSPH 
MM IGPNNS S IGAPS PGPPGPGMS PVQLA 
FSDFLSCAQH 


5316 


I92I7 


A 


5355 


187 


3 


KKILWQTPGVNLFSFKGAPLFFFFLRRV 
LLCCPGWS AVARS PLTATSASRVQA I LL 
PQPPK 


5317 


19218 


A 


5356 


383 


261 


GYKINGQKSQAFLYTNNRQTESRLWYCY 
DSPQRLIHLSIK 


5318 


19219 


A 


5357 


367 


241 


FFFRHVAQAGLQLLGSNWPPAS PSQS VR 
VTGVSHCTQPLCFV 


5319 


19220 


A 


5358 


217 


1 


SCDRENTGLKLLSILLLIHCETDTRISL 
LCISLSIKREI WLGAVARACNP S TLGGR 
GGRITRSEDRDPGETP 


5320 


19221 


A 


5359 


255 


412 


TQLCRKRNAFFFLATNHSQKGQSRWGTV 
AHACNPS TLGGRGGR I TR S GDGPH 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoIeucine, 
K-Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *==Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


5321 


19222 


A 


5360 


391 


271 


R S CH I AQAGLJCL LAS SNP PAS AS QNAE I 
TGVSEHLTPICC 


5322 


19223 


A 


5361 


232 


397 


EGLKEPAGLFTMEYYTARKRNKIMPFW 
TWMQLEAI IliGESTQEQKTKFCMFSPV 


5323 


19224 


A 


5362 


213 


392 


LFI KVEGKFFLRS INI LNESLLFLSSNG 
PGLLKKLGVWPGWAHACNPSTLRGQGG 
RTA 


5324 


19225 


A 


5363 


306 


418 


SDGITGTLPLASQESAWEDLLYVLVGV 
DGRYVSAQP 


5325 


19226 


A 


5364 


347 


48 


XKRWLKPGGGKNPPKKGGKKGGTPPFFQ 
KKGFFFPPFFFFLEKRDLSPQAGGKGEK 
KKPFQ PWKAEGKKNPWGNP PQRGGYKGN 
PTRPPPQGFFFFFFL 


5326 


19227 


A 


5365 


412 


172 


PDSNLSFGAHYQASLLGS FS LMPLLLYS 
KCSGQWDVARSAECTSKSSQNKERKKER 
KKERKKERKKERKKTRKQKKNPLT 


5327 


19228 


A 


5366 


400 


296 


LVEGGFHHVGRAGLEbPTSSDPPTSASQ 
NLGLQA 


5328 


19229 


A 


5367 


162 


1 


APS SLDETPTLLKKTLt FGPGL VAGI CNP 
STLRGRGGRITRSGVRDQPDQHGETP 


5329 


19230 


A 


5368 


152 


403 


TTVLCKLVQPLWKTVWRFLKELEVDLPF 
NPAVPLLGIYPTEKKSYEKGTCTHVFIA 
AQFQ I AR I WNQS KCPPI NEQKKKKGAP V 


5330 


19231 


A 


5369 


170 


367 


IVFEILVISYFKSFPQKTTEWNTFFFHP 
FKVFNWPSTVAHACNPSTLGGRGRQITR 
SGDRDHPAR 


5331 


19232 


A 


5370 


396 


1 


QMCVRAHTFAGGRARTHTHTHTHTHTHS 
WATPRVSHNHCVCVCVCVCVC 


5332 


19233 


A 


5371 


139 


3 


PKLECGAS I IAHCSLELLGSKDPPPSAF 
QVAETTGCAPPCLAFFF 


5333 


19234 


A 


5372 


1 


341 


PLINILSHIiILLCRNYFSFVKFCIRHTS 
LLYWI S LFHVLLCW S FLTDRRAHNAS 
VT YG FVL P VTFKNS PL PGAVAHACNPS T 
LGGRGGQ IMRSGVQDQ PGQARS S RPAL P 
T 


5334 


19235 


A 


5373 


280 


.449 


ICHNLQLCPSKDTIEKVKRQPIEWEKIF 
ANHI SHKGLSS 


5335 


19236 


A 


5374 


170 


1 


QXRGFSMGTPMVS I FRPQDLTPLGFPKA 
WDPRVNPWGPWPCPLFFKGNPQIiFQAPG 


5336 


19237 


A 


5375 


462 


165 


PPTRFLHWQTGLELPTSGDLPASASQG 
AG I TGVSHHAQPRGHVFDVLLEYSQQGV 
FTLLSFLGFSVFFEPLFC 


5337 


19238 


A 


5376 


3 


451 


PRAKLGTRRGLRNLDTIGVAVDLILLFR 
ELRVEQESLLTAFPCGLQFLKNIASRHE 
DSQSIGWVHAFPPPVWPCLSLPPPHTGS 
SSS ITLPPLLPHRPHPPPHSHPPMATLS 
RPPYPPPPHPPPKPLPPSLSPPALSPPP 
PPPPPPPSL 


5338 


19239 


A 


5377 


460 


300 


KVLLC C PRLE CRGAI MAHHSLELLGS SN 
PPTSASQSAGVTGVSHCRSPWEFL 


5339 


19240 


A 


5378 


452 


191 


DSRFHHWQADLEHPTS S DLHASASQS A 
VITGVSHRTRPDIAFAPSISSYFSCVAE 
ISLLHLSESFPKQFL 


5340 


19241 


A 


5379 


240 


395 


JLAT S S VANIiRNT F Y L KLtKS S VLG VE AHS 
CNPSTLGGQGVRITKSGVRDQPGQ 


5341 


19242 


A 


5380 


210 


411 


TSVNSAPNSGQKASVEECFGQVTKVKIQ 
NVFLGWAQWLTPVI PALWEAGAGGSRDQ 
EIDTILANTVR 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=G!ycine, H=Histidine, I=lsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine } S^Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 


5342 


19243 


A 


5381 


406 


300 


FKYVGQPGLELLTLGSTCLGLSKCWDYR 
REPPHQA 


5343 


19244 


A 


5382 


364 


479 


INDGISKISLFLSHLKDQA6QVWWLTPV 
IPALWEAEAG 


5344 


19245 


A 


5383 


182 


395 


HHLRGSAASFLKSAQEPTDFRHTGFKRF 
SCLSLPSSWDYRCPPPRRGGGVLCLPGW 
RTWQSWLTTTS I SQ 


5345 


19246 


A 


5384 


37 


478 


GGIHSYPVICLFILMGSSVSLPFSSVAC 
LLAWLLPSFLPSLLPSFLPSFLPSLTGR 
AFL 


5346 


19247 


A 


5385 


201 


458 


WN P P FLS PR KWAARADTE WGVGKTAQE P 
GIiSSRLQAYEERQRHWQREREALREDCA 
AYAQCAQRAQQLLQLQVFQLQQEKRQLQ 
DD 


5347 


19248 


A 


5386 


206 


407 


FFS PGGRAGE KFG FME P P P PG FKKTLWL 
NPPGKWKQRAPPPIPVYFLEFKKKGGWP 
NMVQVGLNPRP 


5348 


19249 


A 


5387 


468 


0 


NFF P PRPAP P FFFFFF FVAQAGVQWRVL 
GSLQAPPPGFTPFSCLSLPSSWDYRRPA 
RVSPHC 


5349 


19250 


A 


5388 


136 


2 


LFFFFLFETGFCSVPRLECSGTI IAHCS 
LKLPGSCNSPTSVSQV 


5350 


19251 


A 


5389 


397 


0 


ILYPSQKKKKKKKKKKKKKKKKKKSQ 


5351 


19252 


A 


5390 


299 


3 


AQPLAPFPGSFILMYFFMLLLFGIPLLY 
IVtEVlMGQWLHVDNIRVWKQLVPWLCSMS 
YAHSLVNEGPSPRPSPISLCGLQVSAHR 
VHPQVCASVSLYNS 


5352 


19253 


A 


5391 


264 


202 


FFFFFFFFFFFFFFFFSENKI 


5353 


19254 


A 


5392 


2 


150 


PR VR FL I ETE FCHVGQAGLKLLTS VDPP 
TSASQSAGITGVCHLVRPEKS 


5354 


19255 


A 


5393 


382 


0 


NSRAQAWWLMPVISPHKEAEVGGSLEPT 
S 


5355 


19256 


A 


5394 


395 


239 


SLPSSWDYRHAPQCWANFIFLVETGFSS 
VILCSFRLGGFGLSLNILCSLKSL 


5356 


19257 


A 


5395 


24 


265 


LL PHRKSRNYRTRPTRSLRQKNGMNLGG 
GACSESRSHHCTPVWATERDSVSKKKLF 
TI ELVYVFLKKNSKGEFL PKVLGL 


5357 


19258 


A 


5396 


174 n 


118 


FFFFFFFFFFFFFFFFRSS 


5358 


19259 


A 


5397 


406 


134 


PSAPSSPPPPPLIWGPPGFFPPPLFKNP 
PPKFFFGAPKKKIYLPPPRALNFVLLKG 
PPPFFFFFFFFFSFF 


5359 


19260 


A 


539S 


294 


64 


IFFFFFFFFFFFFFFFFWELHILWGALF 
LIjPCSRIPPGKKKWTCKIFLCTPLTGYIP 
LLKCFQKQNKKSQSQNTELKS 


5360 


19261 


A 


5399 


1 


211 


LLLPTPYHH INQDSSNKYHETKKKKKKK 
KKKKKKKKKKKKKKKKKKKKKRGEKKKK 
KKKKGGGGGYKKKR 


5361 


19262 


A 


5400 


305 


1 


SPPRPPFLLPFLFFLPSFFFKAPHTINH 
FFSPPPLFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFLKLYLFYKMHRINYTSNI 
LKINI FVFSSPGSGGRR 


5362 


19263 


A 


5401 


416 


3 


ENTERDRGRQRREDRSRAETQETNRAGQ 
QESTPPPQKKCNTYSIPSSSSSSPTPQK 
GKKGGPPGGPPPPAGEKGGPKKKKRGPQ 
KKKIRPPPPGKKKPPPKTPPLFFFFFFF 
FFFFFHKPQTFIVGVNTYILLRSVS 


5363 


19264 


A 


5402 


154 


416 


KKKKKKKKKKKKKKKKKKKKKKKKKKKK " 
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SEQ1D 


SEQ IB 


M 


SEQ ID 


Predicted 


Predict- 


Amino acid sequence (A=Alanine 


NO: of 


NO: of 


eth 


NO: in 


beginning 


ed end 


C=Cysteine, D=Aspartic Acid, 


nucleotide 


peptide 


od 


XJSSN 


nucleotide 


nucle- 


E=Glutamic Acid, F=Phenylalanine, 


sequence 


sequence 




09/515,1 
26 


location 
correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


G^GIycine, H=Histidine, J=IsoJeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine ? P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y^Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \ppossible 
nucleotide insertion 














KKKKKKKKDSGGGAF 


5364 


19265 


A 


5403 


327 


2 


PPPQKFFLGGVFFFPFFFYNPPPPFFFF 
SPKKKKYFSPPRGKIFFFFFPPPPPFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
LR WS LALVAQAG VWWLKHG S LQSL 


5365 


19266 


A 


5404 


372 


181 


YKGLAHTRCS IRNFLNFI LI IEMGSRFV 
AQAGLEPLGSSDPPTSASQSDGITGVSR 
RTRPINIC 


5366 


19267 


A 


5405 


110 


1 


PVPVSSTEVKKLVLWPGTWHACNPKTS 
GGQGGRIT 


5367 


19268 


A 


5406 


2 


401 


GFHLTTYGNRKQMQRHVREASARPLQGT 
PPPGHALEWTLENVLLAAPRPRKAQVLF 
AIVASETSSWDREKLWTLSLEAKCKGIT 
LFVLALGPGVGTHELAELAELVSAFSEQ 
HLLRLQG VS KP E VNYAQG FTR 


5368 


19269 


A 


5407 


3 


181 


QLILFHDKLSPKLLSWGNVDILINFFLS 
HCSPERARRLQAKMMTKLVMAKDRLQLL 
GIN 


5369 


19270 


A 


5408 


371 


222 


SVEMGFHHVAQPSLEPLGSSDPPASASQ 
SAE I TGVCHCARPNHDS YRRS 


5370 


19271 


A 


5409 


204 


416 


NQVGEWVLRKGTLGDLIMSMLFHFLWR 
SKDEKNTKIKPGAVAHTCNPSTLGGRGG 
QITRSGVRDQPGRHG 


5371 


19272 


A 


5410 


236 


308 


AGRPGTVTHTCNPSTLGGQGGRIT 


5372 


19273 


A 


5411 


375 


216 


GCSEPRWCHCTTPAWVTVQDSPPKKEKK 
RKE KKKKLLTT ENS KKAL KRNG YL 


5373 


19274 


A 


5412 


1 


381 


RFKHVAQAGLNLLG SNDL S AS AS QSAG I 
TGMSPRTQP 


5374 


19275 


A 


5413 


144 


3 


RPPPI FFFLAKKLRALFFFFHS VXQAGV 
QWRNLGSLQPLPPGFKQF 


5375 


19276 


A 


5414 


412 


32 


LFS KKKKKKKKKKKKKKKKKKKK 


5376 


19277 


A 


5415 


414 


2 


NNNSFVFFSKPIFLKTVGGAPPPSPIYN 
P I P I FLY I YP PH KWGG VY I P THM YGGG D 
IWGYMYTPPPPPLFIYFFPPPPFYIEKK 
KKTVFCFFAQGGVRLCLKKKKNKDRGWG 
DTMTMGHGGRLEMVANVCGLKRYW 


5377 


19278 


A 


5416 


1 


240 


GGEGCSEPRSCHCTPAWATERDSISKKK 
KKKGGPFKRTHFSSPGLPSFFFFFGAPK 
FNSRARFLTPREGKNPGLPPFYPA 


5378 


19279 


A 


5417 


2 


63 


LCLKI^NNl^WNNNNNND 


5379 


19280 


A 


5418 


322 


1 


WPLCFSRCSKKRGS PLGFPFS P PLGQS K 
KFVRFACKTPSAFKPFPSLS FFQRWAGP 
P PFLQGLGELPKGLFFQQCFFFFFFLRW 
SLTLVAQAGVQWHDPGSLQPPPL 


5380 


19281 


A 


5419 


1 


392 


FRMAGSYPEGAPAILADKRQQFGSRFLS 
DPARVFHHNAWDOTEWSEEQAAAAERKV 
QENS I QRVCQE KQVDYE INAHKYWNDFY 
KIHENGFFKDRHVJLFTEFPELAPSQNQN 
HLKDWFLENKSEAPSQKK 


5381 


19282 


A 


5420 


273 


91 


RWYIVLDNLIFILENICCETSLKIYLGP 
GWARACNLSTLGGQGGQITRSGVRDQQ 
GQHGE 


5382 


19283 


A 


5421 


1 


149 


GGGGCSEPRLRRCTPAWATERDSVSKKK 
NLKKFFCPGLGLYQKQNTWTL 


5383 


19284 


A " 


5422 


381 


179 


S S CLGLP KS WD YRCE PLCPARTALNLKT 












I KKSKDNRRTOGCGEKEMLMHCWWECDM 
VWLCVPTQISC 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

resid ue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A^AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F^Phenylalanine, 
G=Glycine, H=Histidine, T=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine 7 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Val in e, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


. 5384 


19285 


A 


5423 


195 


369 


CSLSFIKECKLHISGLQDSSISLVNLGL 
AIRNIRNKPGWAHSCNPSTLGGQAGWI 
T 


5385 


19286 


A 


5424 


253 


414 


KKKKKKKKKKKKKKKKKXKKKKXKXKKK 
KKKKKXXRGGAAKKKPHSPAPAERDL 


5386 


19287 


A 


5425 


160 


2 


NLKGSAPFSKKTSPFFLVKENPLFFFFF 
2MESLSVAQAGVQWRDLGSLQPLP 


5387 


19288 


A 


5426 


53 


426 


ASSSFFFFFFLGKGGSIFPPKGGGEGKN 
FFL WKPGAPGGGG P P P PTPRGGGKKGP P 
PPGQKI FLGMFFEKKRFHHGGGGGFKLR 
GKKKPPPIAPKRVKNKGGNPGGGQPLKF 
KKTLPEKMGLKK 


5388 


19289 


A 


5427 


60 


1 


KWPGTVAHACNPSTLGGRGG 


5389 


19290 


A 


5428 


87 


247 


K KKN P Q KKKKKKKKKKKKKKKKKNKKKA 
PAGAGDLSLYYPRAGRNNLFFNYLP 


5390 


19291 


A 


5429 


209 


405 


KLNLVNFSSLSSLKAMISAILLKLSVPI 
FPMKIVILSTLNSGSWPGAVAHACDPSA 
LGGRGGRIT 


5391 - 


19292 


A 


5430 


324 


237 


VMNAFVC VCVCVCS C I CVCVCVCVYFLF 


5392 


19293 


A 


5431 


409 


3 


FFFFFSETESRSVTQAGVQWRDLGS 


5393 


19294 


A 


5432 


270 


402 


MKI I FHL I R I TGWAQWLTPVTPALWEAE 
TGGSRGQEFETTVKPC 


5394 


19295 


A 


5433 


107 


277 


YNFNNHRYREAIQKWDEALQLTPNDATL 
YEMKSQVI IMKNFTFLMKLCYVCSVSEQ 


5395 


19296 


A 


5434 


95 


1 


RNRKNNGQSGTVAHACNPSTLGGRDGWI 
TRS 


5396 


19297 


A 


5435 


412 


105 


GGEGCSE LRSCHCP PAWATRARLHLKI K 
KKERWDLAMLLRVGYSGHRQVHCTDHGT 
LQPQPF 


5397 


19298 


A 


5436 


276 


488 


PLPQPNKDSRLLPLLVCLGFLFAPLLML 
GHGFKRARLPIFFPQEAYFITFRLVFAE 
XKGYRGPLTMCLAPR 


5398 


19299 


A 


5437 


179 


2 


GISPLKGGTPSLKKKKKAKDFNRRVSKE 
DI QFTNKHMKRC ITLL I I KQMQ I KI TMR 
YLP 


5399 


19300 


A 


5438 


3 


156 


TPLYSQILI SLROEHYFKAEGRGCNELT 
S YHCTPAWAKDGDSVS KNKQNT 


5400 


19301 


A 


5439 


47 


173 


SLQLFLQTEERKDSDDEKSDRNRPWWRK 
RFVSAMPKGNFIKY 


5401 


19302 


A 


5440 


3 


122 


TTLTLTSLIPPILTTLVNPNKKNPPKLI 
RILVLFQTTEK 


5402 


19303 


A 


5441 


390 


489 


KYKSYSLQPPPPKFKHFSCVTLPKHWDY 
RREPP 


5403 


19304 


A 


5442 


86 


275 


RQEFETSQQGETPCLLKIQKLAGRAHLN 
PGGRGCSEPRLCDCTAAWVTRAGLSQIK 
KIKKKKK 


5404 


19305 


A 


5443 


424 


166 


CFFVSPKAKSQGWLGSPPPPPPGIiKHF 
SAPTPRGLFLFFFFPRGVPPLGKGGSKS 
PPPGVFPPWPPQKGGDSGGYPPPRAKIF 
LF 


5405 


19306 


A 


5444 


356 


3 


AFFWFTNPFP PGFKGFFCPHPLSG I SKG 
GPPPRGNFFFFGKAHFFSPFFKFLPPIF 
LPFLAPQKVGFPGFGPPPKPSQGVFLFF 
FFFFFSFLFFFFFETESSSVAQAGVQWR 
DPDAW 


5406 


19307 


A 


5445 


198 


86 


P W PLG AQA I LLPQS SCXNLP KC WD YRRE 
PPHTALLNS 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V^Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


5407 


19308 


A 


5446 


198 


86 


PWPLGAQAILLPQSSCLNLPKCWDYRRE 
PPHTALLNS 


5408 


19309 


A 


5447 


143 


3 


TMIYFYFYFFLSQSLALLPSLEGNGTIS 
AHCNLRLPGSSDSSADAW 


5409 


19310 


A 


5448 


324 


1 


KKNSATS FFSVEP FSTGGVLANKGALTQ 
EKILPCFKIKKIRKRVGAVAPARNHFLF 
FSPFFKEGLQGIVYFFFFFFELESRSVT 
QAGVQWRNLCSPQPLPPGFKRTRG 


5410 


19311 


A 


5449 


403 


0 


IjFFFFFFFFFFFFPKENH 


5411 


19312 


A 


5450 


114 


409 


LNIWTLKYLISKHLLIVCLPDTILSSLC 
LIKYIGLQGKVLI FFFKQFSAIDKLGQN 
IAWGKFGFAHYSLLTKKWKLFGNITQE 
QNMIVTGGLSWWND 


5412 


19313 


A 


5451 


192 


406 


RNFFLGHLKYVKRFEQIGWGRDFMFKTP 
KARATKNKMDKGDL I KLKS F CP AKETT F 
RGTGHPT KGEKI FAT 


5413 


19314 


A 


5452 


254 


48 


NAFG WL CVH YALQ F I D KS VKNS VCWLGA 
VAHACNPSTLGGQGGWITRSGVRHQPGQ 
YGETLRTRASTR 


5414 


19315 


A 


5453 


1 


288 


PLLFILIPSPFSFPHPFCGPIKFILPRE 
KKKKKKKKKKKKKKKKKKGGGPFKKKK I 
FPRGGKKKI FFFWAQKKKI RGGVLKTGG 
RKKPGGKKNKSG 


5415 


19316 


A 


5454 


285 


415 


TSGCCFVFLVLKLS I FCKGKVLNAI EDN 
GLKNSTFTYFTSDHG 


5416 


19317 


A 


5455 


372 


125 


ITPAFSFSPMGKKGVFPPPKFFGFPPGF 
PPPPFFKPPPGIIFLGPKKKKFFSPPPR 
VYFFFFKRPPPFFFFFFFFFFFFFFF 


5417 


19318 


A 


5456 


3 


159 


AHASAHAYDT I KDALGLLLF I LSLMTLT 
LFSPDLLGDQKKKKKKKKKRGGP 


5418 


19319 


A 


5457 


109 


392 


N I I I I KRSTS LFLRP I FPPHLFP FI L YL 
FISLRTLLFFFFFKREFCFLPQVGGQGP 
FLNSLKPLPRGLMQFSCLTLLRSGNYGP 
PPPPPINFFF 


5419 


19320 


A 


5458 


243 


402 


FLKIGFIHKYLKEFVWGILNIVFNRFNS 
HERAPHAIVFVNGTTIEGHVEKCYW 


5420 


19321 


A 


5459 


394 


105 


I PPPS PRPGGPGAAPP P3GGGSTTPPPK 
KKPPFFPKNKNQPGGGGGAPPSPPPRGI* 
GPKIPLTPGTKVFKETKPPPFPPPRGAP 
PKFLSKKKKKET 


5421 


19322 


A 


5460 


56 


379 


VCFSPSIYGAWLNKSPLRSHGSSSTSG 
WGGLPS PFPAQRLPLASQ VQEHNGHVFA 
S YQVS I PQS CEQCLSY I WLWDKALLCSG 
EWIiPHQAPKPSMS PTTPHCSRGLQ 


5422 


19323 ! 


A 


5461 


184 


420 


FFFFFLNNNLFFFFFFFFFFFFFFFFLG 
RN 


5423 


19324 


A 


5462 


295 


384 


LGAVAQARNP ITLGGRGG W I TR SG VQDQ 
PG 


5424 


19325 


A 


5463 


413 


257 


FLNPGGRGCSEPKSHKSTLAWVTERDLI 
SKKKKKEKCWIPQRMPSRGQWSPD 


5425 


19326 


A 


5464 


2 


181 


SIiYCSGWSQTPGLKQSSCLSFPKCWDYR 
CELPRPVYVLLYKTSFPKRYVFGIHLCF 
MSQ 


5426 


19327 


A 


5465 


1 


229 


TGS CYVAQANLELLS S S DASTS VS QNAG 
TTGVSHHVQPYTS SFSQVPPLRAQSAAT 
ETTSILWVQKLRIiQWRCSV 


5427 


19328 


A 


5466 


1 


368 


KERETKKEREEERRMRRRREGRKEKGRK 
ERKKEREKERKKREEKKRNNLGOSAGT T 
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C=Cysteine, D=Aspartic Acid, 
E-Glutamic Acid, ^Phenylalanine, 
G=Glycine, H-Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
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X=Unknown, *=Stop codon, /=possib!e 
nucleotide deletion, V=possibIe 
nucleotide insertion 














GMSHHTRPKSHNVLSKFTNLCWAAFKAI 
LGHMWGRLDKLNVEQCFSTELSALPS 1 1 
ATSHMRLWVI 


5428 


19329 


A 


5467 


183 


3 


VSLCYGEKFS PQMNESQLYFRPGAVAHA 
CNPSTLGGRGRQITRSGDRNHPGQHGET 
GRV 


5429 


19330 


A 


5468 


1 


128 


GGAG YS ELRLCHCTRAWATRAKLHLKKK 
KKKG PLRALKK I GG 


5430 


19331 


A 


5469 


1 


155 


NKFPVLISNVINI IGMCHCTPAWATERD 
SVSKRKKKEKKRSKLARHSVLCL 


5431 


19332 


A 


5470 


113 


1 


FFFFFFQKTSSRLGTVAEACNPSTLGGR 
GGRITRSGD 


5432 


19333 


A 


5471 


407 


281 


NGMIPGGRACSELRSRHCTPAWATEQDS 
ISKKQNKSKNPGLK 


5433 


19334 


A 


5472 


1 


177 


GGGCSEPRSRHCPPAWATERDSYLHKMK - 
QSKTKGERMEVACYCI I ILFMLTDALEV 
LMA 


5434 


19335 


A 


5473 


76 


267 


KERREEKEKEKRKRSERKGRKKEGRRDR 
EKEKEERKDERREGEEGEEGRKEKKEKD 
IKRRGWP 


5435 


19336 


A 


5474 


133 


2 


LCRMWLFCFEAGSHSVTQAGVQWCGLN 
SLQPLPPGFKQFSCL 


5436 


19337 


A 


5475 


5 


394 


YGVGTKGFRAGVXEFQALFLLITVHSQL 
CHIEGTRHHSRHI IVWDFINQIENGTF 
FPLVLSNGCIWLLIVYLFWNIFTRQEGQ 
FFYGKLSHYS VF VFCFLRRS FALAQAGV 
RWCDLGSLQPLPPGFKRF 


5437 


19338 


A 


5476 


163 


2 


SEKPWVVALTLMTFSHFYFFIYFYYIFI 
FQMES CS I TQAGVQWRBLNSLQPLP 


5438 


19339 


A 


5477 


111 


2 


FLFLKQDLPGAVAHACNPSTLGGRGRQI 
TRSGDRDH 


5439 


19340 


A 


5478 


232 


0 


HLVWRMESGSVAHAAVQWCDLSSLQPPP 
PRFKPFPSS 


5440 


19341 


A 


5479 


388 


225 


ALF S S KKKKKR K KE RKRK KE KKKKKES M 
QRKGGHLPRVCCGASMVSPGHKLSL 


5441 


19342 


A 


5480 


150 


2 


HRVAVIYEVFFVCLFFYFLFLRQSLTLV 
TQPGVQWRNLS KLQ PPHPGFK 


5442 


19343 


A 


5481 


156 


395 


DSVCTCVLHTCHSSGLSQGRSCLHCSLG 
LPNSNCPCVCVC VCVCVCVCVCVCVS FC 
PLKQGVCVCVCVCVCVCVFLSAHFTKGC 
VCVCVCVCVCVCFSPPI 


5443 


19344 


A 


5482 


391 


115 


RACPDCRVRFPHTSPPCLPCGPEAEPGP 
GPALRELVQPLPGQLQPPFGMPPSGLGA 
APSPAPPACAWTRPPHLHPSSFSSSVPQ 
ISSLFLCF 


5444 


19345 


A 


5483 


392 


2 


FFFFFSETQFRS CCPRLECNGAI S VHCN 
LRLPGSNNSP 


5445 


19346 


A 


5484 


381 


282 


PVPWGQAI FLPPPPP YliGIPGPPPPPGL 
FLFP 


5446 


19347 


A 


5485 


407 


3 


FFFFFFLRRSLTVAQAGVRWCDLGSLQA 
PPPGFT 


5447 


19348 


A 


5486 


2 


417 


QENHLNPGGVGCSEPRSCHCTPAWVTEY 
DKKKKK 


5448 


19349 


A 


5487 


3 


413 


VYLHPSLQPSLSSLVPDQALSHDIWLKK 
KKKKKKKKKKKRGGISPPSPVGGVGPRE 
TWGVWGPGANLFQRGGGVRELFSQKKKG 
EKGQRNPEKKPPHKRGPGGPKEKFIFFE 
KKKKKGGKPFGKKTKPPPPRVNPGG 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
R=G)utamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, Wsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
1 = 1 nreonine, V=Vahne, 
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X=Unknown, *=Stop codon, /-possible 
nucleotide deletion, \=possible 
nucleotide insertion 


5449 


19350 


A 


5488 


229 


411 


IKGPGFKNGGRGKTLGLPNLMALKKIPF 
SPGGYKPKNP 


5450 


19351 


A 


5489 


190 


50 


Y HH H HH LHHjH HHH Q (JHiiKHHHiJHH YHGD 
NDYS SHHSRSTCHVAVTV 


5451 


19352 


A 


5490 


615 


402 


RWCLTLSPRLECSGMI SAHYNLHLPGS K 
QLSCLSLPSSWDYRCPPPGKTTFSRGIR 
DALKPRKPKRINIjI 


5452 


19353 


A 


5491 


338 


53 


KGPPFFFFCKKGGLFCSNWFLTPGGKWF 
SPP 1 PPRSGGKRVPPEGGGKGr SLFFFV 
EGGS CHI AQAGLKLIiGSSDLVTSASQSA 
o 1 Mvj ivo tr rift. W Jr 


5453 


19354 


A 


5492 


266 


412 


NFNFFERKSCSVSQAGVHRRELNSLKAA 
PSGYTPASCVSLPSSWDYRRP 


5454 


19355 


A 


5493 


285 


412 


RDGVSALSPGLECGGMI IAHCNLELLGS 
SDPSALASQSTWNY 


5455 


19356 


A 


5494 


1 


1350 


MGFMTGICIEDNWGPSPESGSVLTVGG 
VRIQMLDRCHTAHCPVI PGLLRGQLWTE 
NLAERNSHDRRPFPTWCSHFAQDLWPE 
QS I KDS YQKVI LRKFEKCGHGNLHFKKG 
CESVDECKLHKRGYNGLNQCLTTTQSKI 
FQCGKYVKVFHQFSNSKRHKRRHTEKKP 
LKYIEGDKAFNQSSTHTTHKKIDTGEKP 
YKCE E CG KAFNRS S HLTTH KI THTRE KP 
YKCE E CG KVFKYFS S FTTH KKI HSGE KP 
YICEECGKAFMYPYTLTTHKI IHTGEQP 
YKCKECDKAFNHPATLS SHKKIHTGEKP 
YTCDKCGKAFISSSILSKHEKIHTGEKP 
YKCEE CGKAFTRS SHLTMHKI IHTGEKP 
YKCEECGKAFTWSAGLHKHRRTHTGEKP 
YKCEECG KAYTTSSNLTEHKTTHTGEKP 
YKCKECGKAFNWSSDLNKHKRIHIGQKP 
RT 


5456 


19357 


A 


5495 


4 


146 


PPTRPTCYYLSFYvSYPTGYKEDTRi!WN 


5457 


19358 


A 


5496 


5 


388 


CGCFFFS PPPKKKVFPPPFFFFAPGFFP 

DDT T?T?TT)DT3nPl?T?tJaDt?V'YTrMGDDD'DPV' 

trxrLtk r I f fPy r r r W/Lt'i\JvKJU^c>PPPPv7l\. 
KIFFLKGPPPP 


5458 


19359 


A 


5497 


231 


163 


CCT? T? TT T?t? CtP I? I? TPCTTYYTWirKI V 

rrrrrrrcrrrrrr ^x , AJiiJMAWx , 


5459 


19360 


A 


5498 


163 


2 


SVKFTLGKVSQFLKKHTPAGWAHTVI PA 

T MJ?Zi'FAr*^!C , Df2f"YK , TTr r PTT TTJTVtTDU 
JbV¥ll/iJC»/iOvjoKoyil»X c* J. ±1jI AN J. V J\.frl 


5460 


19361 


A 


5499 


229 


388 


RTHLQAS SDLFCLYSLKTGS CCVAYAGV 
yKrlI/l4ool-»yPPPPKr )\Kbo\-JjKPPc» 


5461 


19362 


A 


5500 


405 


289 


HVAQAGLKLLAS SDP PTPASQSAGVTS V 
SHCTHPRCLFV 


5462 


19363 


A 


5501 


286 


375 


GDFVRFS FWPGGGAHSCNPSTLGGRGGQ 
VT 


5463 


19364 


A 


5502 


20 


374 


I LQFQWFWI VLFKWFYHFGKFLiI YILNW 
FSGLFWYYFSELSCFSLSSKINILNLLS 
GILNVFFSISGVRKKIFLIKRYCWLGTV 
THACE PS T LGGQGRQ I TR SGDRDH PGQR 
WKIQN 


5464 


19365 


A 


5503 


1 


185 


FHH IGQAGLI LLTS SNLPASASQKAG I T 
GMSHCTQP III FLYSLFLQLS FS WQDFV 

1 V J i\. IV X\ J7 


5465 


19366 


A 


5504 


202 


402 


NSFSSTAKLREKYRQFLYTP I PTGTQPL 
PLLTFCTRPDTVAHACNHSTLGGQGGQI 
TRSGACNQPGQ 


5466 


19367 


A 


5505 


118 


7 


I KKENNWPGAVAHACNPS I LGGRDGR I T 
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E=Glutamic Acid, ^Phenylalanine, 


sequence 


sequence 




09/515,1 
26 


location 
correspond 
ing to first 
amino acid 
residue of 
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peptide 
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G=GIycine, H=Histidine, I=Isoleucine, 
K=Lysine, Lr=Leucine, M=Methk>nine, 
N=Asparagine, P^ProJ ine, 
Q=Glutaraine, R=Arginine, S=Serine, 
T=Threonine, V=Vnline, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possibie 
nucleotide deletion, \=possible 
nucleotide insertion 














RSGDRDHPG 


5467 


19368 


A 


5506 


205 


422 


ANHFTS IRMAI INNNKQTENNKCWQGQG 
BVGTLVHC WWECKMVWMLRQ I VWSFLKK 
LNIQLSRDPATLLLAI 


5468 


19369 


A 


5507 


3 


425 


RSCECTPAWATERDSVSQKKKKKKKKK 


5469 


19370 


A 


5508 


3 


86 


SKKLQVLLNCMTE I YYQFKKDKAERS K 


5470 


19371 


A 


5509 


395 


108 


GVLECNDTITAHCSLDLVGSSDPPTSAS 
QVTGTTGTCHHAQLGLPKCWDDRHEPRL 
PAWNKTLRRKNWHSLLPWPAIQSLEVG 
GSLRPRSLRLE 


5471 


19372 


A 


5510 


421 


59 


FFFFFFLRRSLAVS QAGVQW WDLGSLQA 
LQA 


5472 


19373 


A 


5511 


168 


3 


EICKMCFPVPFGVFFLWIVFVFVFVFVL 
RWRMESLSVSQAGVQWCNLSSLQPPP 


5473 


19374 


A 


5512 


446 


149 


FFFFFFFCFFETESCSVARLESSGMISA 
HSSHNLCI 


5474 


19375 


A 


5513 


74 


398 


DRI IDAGPKGNYARFMNHCCQPNCETQK 
WSVNGDTRVGLFALSDIKAGKNHFRILQ 
LTSEFQGFCFLQTASSDYNFKHFYMYRS 
LELSSFTFLCNHHHHPSPELFSSS 


5475 


19376 


A 


5514 


73 


335 


EGGSLFCPGWSAGALFWLTAPSASRGWG 
GAPGVPWGEVGQKNCWNPGGGGCNELG 
LHHCI PPWGTGKTLSPKKKKKKGATDMG 
GAS 


5476 


19377 


A 


5515 


416 


0 


WLG PGAVAHTCNPTTLGGRGRW I TRSRD 


5477 


19378 


A 


5516 


242 


423 


AGSPLIHQGGOAI ILTQNPTPGWGTMVF 
QP VLR P V KGLYNANHVTSS P VAS Q P 1 1 F 
.TTAG 


5478 


19379 


A 


5517 


2 


151 


GRVGFVGMGSHCVAQAGLELLGSGGPPT 
SASQSAI IAGVEHRAWPYFPS 


5479 


19380 


A 


5518 


312 


405 


GQVRPGAVAHACS PNTLGGRGR I TRSGV 
PDQ 


5480 


19381 


A 


5519 


3 


387 


ACSGPRSCHCTPAWATHQDSVSKKKRKK 
XREKP FLL3 KKRG P TLTWGE P KKG I PLG 
ERNGPPSHFKKGPPLAPKPPGGNLHPEI 
AQGFGSPTDGNKKGDLNPTPVFYKFKTP 
EQAINWAAPPFPSEK 


5481 


19382 


A 


5520 


292 


140 


KKGPPRARPGGGGSPPLFFFFFFFFFFF 
FFFFFWESTLKKRELHRSDLCV 


5482 


19383 


A 


5521 


1 


399 


KKWLFLSSAQIPTKLSRRRRNRETWPKE 
RSKINLHKPILKKQRHELPDKEFKIALI 
KMLYELKVTMHEQNENIMKETENTKKKK 
KKKENPPLFGTGKNQSPPEKFQMGGQPP 
PPPPQKKKPPPPNRFFVFSPP <■ 


5483 


19384 


A 


5523 


316 


399 


MLLY W PGTVAYACN PSTLGGRGGR I TRL 


5484 


19385 


A 


5524 


400 


0 


FFFFFFFCFFFFFLFFFS 


5485 


19386 


A 


5525 


3 


429 


TTNIGRAMGATTS I VGSDTSQAERPGGT 
TWSPGASSTSQSSRPGTSVTPDSSAStf 
SETVTTKEFSGTTAI SRTSHTGTPAASG 
GQATGSLTATTGVAPGTTVAPGS SNTEA 
TTS VGE KKKDQKKKKAS S P 


5486 


19387 


A 


5526 


135 


3 


GGKICTL I KQS VI QES WPGTVAHACNTS 
Tj_iGGRGGW IMKSGDR 


5487 


19388 


A 


5527 


105 


5 


KKKKKKKSGRWPGAVADACNPNI LGGQG 
GQITI 


5488 


19389 


A 


5528 


411 


31 


GQR VPLS RG VAKKKKFL WNPGGGAGKNG 
GKNFLKKKPPGAPPPSGGPKKGPGPPPP 
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Predict- 
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peptide 

sequence 


Amino acid sequence (A=AIan ine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=GJycine, H=Histidine, I=Jso leucine, 
K=Lysine, L=Leucine, M=M eth ion ine, 
jN=Asparagine, P=Proline, 
Q=G!utamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, V= possible 
nucleotide insertion 














KKPLILVSPPPRKGGAHNMHFRQKKSGN 
KGGEGSTKGGKEKI FLGKFVQKNFFFFF 
DPEFHSCCLGWSAMP 


5489 


19390 


A 


5529 


411 


142 


PRFSFLFSPPRKKEVFPPPKILGGPRFF 
PPPRFLKPPPHFFFWGPKKKKKFPPPPG 
EKFFFFKGAPPPFFFFFFFFFFFFFFF 


5490 


19391 


A 


5530 


174 


401 


QVQPQQKKKKKKKKKKKKKKKKKKKKKK 
KKKKKKKKKKKKKKKKASPGGGGKKKKK 
ERGGGGERKNI I WVPLLFPP 


5491 


19392 


A 


5531 


410 


0 


QPQS KKKKKKKKKKKKKKKKKKKKKKAS 
QKKKKKKKKKKKKKKSSGGGFFKEV 


5492 


19393 


A 


5532 


383 


145 


GGFFFNPPAKKGSFPQPHNLGGPPVFSP 
PPVFKTPPRNIFFGAPKKKKFFPPPGAK 
KYFFKKAPPPFFFFFFFFFFFFF 


5493 


19394 


A 


5533 


418 


328 


KTGFRHVGQAGLKLLASSDLPALASQSA 
GL 


5494 


19395 


A 


5534 


169 


3 


I I LI IHPCSHVPRRETLGLKACTFKNLF 
WPGMVAHACNPSTSGGRERRITRSGDR 


5495 


19396 


A 


5535 


1 


254 


PTRP CKP PQTS PN SG KLS P S SNG CMNTL 
HISSTNTVGEVIEALIiKKFLVTESPAKF 
ALiYKRCHREDQVYACKLSDREHPLYLHD 


5496 


19397 


A 


5536 


43 


404 


QGGLTVLPRLVLNSWPDKRSSRLGLPEC 
WDYRYEPPHQAN 


5497 


19398 


A 


5538 


2 


212 


KENHLNPGGGGCSELRSCRCTPAWATER 
NSVSKKKKKNFRGGGLKGKGPGFYPLEK 
KFPGVGFFPKTNLV 


5498 


19399 


A 


5539 


2 


387 


IFQEKKNCGEMSEIEAKVKYVKLARSLR 
TYGVSFFLVKEKMKGKNKLVPRLIiG ITK 
DSVMRVDEKTKE VIjQEW PLTTVKRWAAS 
PKSFTLDFGEYQESYYSVQTTEGEQISQ 
LIAGYIDI ILKKKKKKD 


5499 


19400 


A 


5540 


317 


407 


GLSmlVFFEEGKLKKPRIKDKDKKVPEP 
DN 


5500 


19401 


A 


5541 


120 


3 


NF FF FFLRRS FALS PRLECSGT I SAHCN 
LHLPGSSDSP 


5501 


19402 


A 


5542 


430 


0 


QKAQSNvj PEKyE KRGVIQNFKR i JjSKKK 


5502 


19403 


A 


5543 


172 


2 


FIFNFLHIFSFWFITKFGCFIVLFLRQS 
LALLPRPGVQWRDLGSLQPPPPRVRPRV 


5503 


19404 


A 


5544 


149 


2 


EKDI QELNSALHQADL I DI YRTLHRKST 
F FS A PHRT Y S K I DH I VGS KAL 


5504 


19405 


A 


5545 


84 


276 


TLKKRPKE I RNQGGS PQRKRAQLGKLQQ 
KYPVLNFKAPFFGERGDYYKSFFKTCLD 
NLPRRGKG 


5505 


19406 


A 


5546 


2 


914 


RPAWHEEQKS KNE PEDTEDKEG CQMDKE 
PSAVKKKPKPTNPVEIKEELKSTSPASE 
KADPGAVKDKASPEPEKDFSEKAKPSPH 
PIKDKLKGKDETDSPTVHLGLDSDSESE 
LVIDLGEDHSGREGRKNKKEPKEPSPKQ 
DWGKTP P STTVGSHS PPETP VT.TRS S A 
QTSAAGATATTSTSS TVTVTAPAPAATG 
S PVKKQRPLLPKETAPAVQRVVWNS S SK 
FQTSSQKVJYMQDSSSSSSSSSPQNQQQQ 
PQSSQGTRYHTRQAVKAVQQKE ITQSPS 


5506 


19407 


A 


5547 


2 


244 


TRLIFVFLVETGFRRVGQAGLKLLTSGY 
VGTSASQRMTAHLCLPRCWDYRCEQIjHP 
AQRFFKRMLSVAVDAALLHTSGGM 


5507 


1940S 


A 


5548 


406 


1 1 


FFFFFSETESCS ISQARVQWCNJ..SSIiKP 
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Predicted 


Predict- 


Amino acid sequence (A-Alanine 


NO: of 


NO: of 


eth 


NO: in 


beginning 


ed end 


C=Cysteine, D=*Aspartic Acid, 


nucleotide 


peptide 


od 


USSN 


nucleotide 


nucle- 


E=Glutamic Acid, F=Phenylalanine, 


sequence 


sequence 




09/515,1 
26 


location 
correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


G=Glycine, H=Histidine, ]=]soleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V^Valine, 
W=aryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


5508 

- 


19409 


A 


5549 


417 


43 


I RPARKGCS SGL FGAPLAGGGAP P P P P P 
RQTS P P PPHTKGGGGGFHKP P PKKKR P P 
PKKKGGGKKNPPKKKKKNTPPPPKSAPP 
PAAKTDPPPPLTVPPFLPPYIFTKKIPF 
LKSYFFFFY 


5509 


19410 


A 


5550 


3 


136 


QPRFPTLTNINENNPTLLNP I KB1MGS 
LFAGFLITNNI SYHSP 


5510 


19411 


A 


555) 


75 


194 


NKELSRNKKKKKKKKKKKKKKKKKRGGG 
LKKKNGGGAKT 


5511 


19412 


A 


5552 


343 


493 


1 1 FS FFFFFFLKGS FVFAPQAGGHGGNF 
GPLKPPPLGLKEFS CLTLLS S W 


5512 


19413 


A 


5553 


148 


3 


DAVSLFCPGSLEHSDTI IAHYS PDLLGS 
SSLSASTSRVAGITGICHHA 


5513 


19414 


A 


5554 


407 


252 


KKKFLKKLAGQGGMRLQSLLLRRLRQED 
HSSSGVQSCSELWGGHLIYSKNR 


5514 


19415 


A 


5555 


3 


231 


HEFVFLVETRFHHVGRAGLELLI SSYPL 
ASAFREAKARREDLLSPGVKDKPGQYRD 
TLTLKKKKKSFFIYLGECW 


5515 


19416 


A 


5556 


3 


131 


SVVVS WEESKDAKELVGYYI EASVAGSG 
KWE P CNNNPVKGS R 


5516 


19417 


A 


5557 


3 


131 


SVWSWEESKDAKELVGYYIEASVAGSG 
KWE P CNNNP VKGSR 


5517 


19418 


A 


5558 


462 


255 


RSHHS AQAGLELQAS SDPPALTS QSAGV 














TNVSLRAQPILACSSVEVSSRCQVCLHV 
RRGKCPRAESYR 


5518 


19419 


A 


5559 


445 


29 


KS SN I L I LQ KVENGDLSN K I LK I TD FGL 
AREWHRTTKMSAAGTYAWMAPEVIRASM 
FS KGS DVWS YGVLLWELLTGEVP FRG I D 
GIAVAYGVAMNKLALPIPSTCPEPFAKL 
MEDCWNPDPHSRPS FTNI LAS CQAWR 


5519 


19420 


A 


5560 


225 


475 


GAVES S CTEEHMW I PEKAVEGS LTPTPE 
SRSVAHAGVQWHNVSSLQPPPPGFRRFS 
CLSLPSSWDYTSAGEGCAASYRSKVLK 


5520 


19421 


A 


5561 


248 


142 


NKNIWVQKKCVFYIYLPFICVCVCVCVC 
VCVCVCV 


5521 


19422 


A 


5562 


262 


456 


NFPLKTTYFLKKKKKKKKKKKKKKKKKK 
KKKKKKKKKKKKKKKKKKKKK KKKKKKK 
KKKKKASSPPPLIKRGPKN 


5522 


19423 


A 


5563 


314 


444 


LKSMFCFVLFCFFE I ESRSVAQAGVQWC 
HLSPLQPPPPGFKRS 


5523 


19424 


A 


5564 


220 


486 


LNSSHPESPLPLFGLVLQFFYPSNPKSF 
NKKNLEIRMLLTVDRGPNHPDSLFFVHS 
NVDTVFLPQEGPSFLQPLGQGITEAFEH 
NQAYV 


5524 


19425 


A 


5565 


176 


492 


CTKTAVAS KAR YLQALLYALAS VTI AAT 
ATRIiASISGASPI FRSFPYLQPTAPGLV 
LPS I L ILVDVGLRKEKGPDTVAHACGPG 
TLGGRGGRITRSGVRDRPDRH 


5525 


19426 


A 


5567 


24 


470 


PMDXKXPXXXETPLNGGAGENQDDMFAK 
LKEKLFNEINKIPLPPWALIAIAGVAGL 
LLLTCCFCICKKCCCKKKKNKKEKGKGM 
KNAMNMKDMKGGQLPQDDDDAKTGLTEG 
EGEGEEEKEPENLGKLQFSLDYDFQANQ 
LTVGVLQAA 


5526 


19427 


A 


5568 


45 | 


485 


GXDGRLLRGHNQYAYDGLDYIALNEDLS 
CWTAADTVAQITQRKWEAAREAEQLTAY 
LEGLCVEWLRTHLENGKETLQRADPPKT 
HVTHHPVSDHEATLRCWALGFYPAEITL 
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TWQRDGEDQTHDTEIjVETRPAGDRTFQK 
WAAVh, VP 


5527 


19428 


A 


5569 


221 


422 


FFFXjTGRVRQSIjADIjEQKI EVHbMPPr E 
SLALSPTLYCRGAI LGATSASQVAGI TN 
ICHHAQPSYLF 


5528 


19429 


A 


5570 


401 


477 


IFSRDKDVAMLPRIVLNSWAQVIUj 


5529 


19430 


A 


5571 


102 


36 


E VEKNDD I SHHHHHHHHHHST 


5530 


19431 


A 


5572 


106 


407 


AGRQGWLEGCRPAAWPTLLTPPATSSR 
GASRPPTATCPTPDPAPATTTWQPWPPR 
PLWAGVQWHDLGSSQPPPPGIKQFSCLS 
CPS S WDYRGRRAVCHH 


5531 


19432 


A 


5574 


253 


363 


YHSNQCVGLRKWPGAWHACNPSTLGGR 
GGWIMRSGA 


5532 


19433 


A 


5575 


1 


132 


MECHYVAQAGLELPGSSDLPASASQSTG 
ITGVSHCAWPGFFFFF 


5533 


19434 


A 


5576 


18 


378 


TPGGGGCI EPR WHHCTPAWATRAKLS YK 
KKKKKKK 


5534 


19435 


A 


5577 


21 


232 


ILRCTTIETLLKVQDNHLNPGGGSCSEP 
RSHHCTPTWARERDSVSKKKKKGGAALW 
DPGGGQPSAGSQTR 


5535 


19436 


A 


5578 


377 


2 


LSLI QT I TLCLGAI TTLFAAVCAVTQND 
IKKIEAFSTSSQIGLIIVTIGINQPHLA 
FLH I CTHAFFKA I LFMCSGS 1 1 HNLNNE 
QDIRKIGGLLKTI PLTSTSLTIGSLALA 
G I P FLTGFYRTRG 


5536 


19437 


A 


5579 


2 


227 


I ES WLE P VRFLRS M FANNL VYD TS DS DD 
YHLLKDLEEGIQTLMGRLEDGSRRTGQI 
LKQTYS KFDTNSHNHVDAA 


5537 


19438 


A 


5580 


113 


3 


N FP F FFFLRQS LAVAQAGVQ WCDLGSLQ 
PPPPWFKQ 


5538 


19439 


A 


5581 


184 


3 


IMPPHSRLSPTARFCLQKKTNNXJNTPIE 
MASIQCPGWSQTPGLKPPSCFVLPKCWD 
YRCE 


5539 


19440 


A 


5582 


381 


3 


FFFFIiGDRAPPGGEKNPLGGGLKTPGKP 
PGGVfGVS P P PQKKKMPPPNFFPPGKKKS 
FGGGGPI FFPGKNFPPPTQKKKRGGGGF 
FFFFKKPGGFFFFFr PFFFFFSSFFLGK 
S F I KI S FKKRRRG 


5540 


19441 


A 


5583 


314 


389 


FSCLSLLSSWDYRCVLPHLGNFLYF 


5541 


19442 


A 


5584 


131 


1 


FPLWTSEQGVGRNKQTYVTWQADCKENA 
GGDYYWTFFPQPTFV 


5542 


19443 


A 


5585 


484 


3 


MTEILLKQAMVGI VGS IGSAIGGAVGGG 
ASASGGTAIQAAAAKFHFATGGFTGTGG 
KYEPAGI VHRGEFVFTKEATSR IGVGNL 
YRLMRGYATGGYVGTPGSMADSRSQASG 
TFEQNNHWI NNDGTNGQ I G P AAL KAV Y 
DMARKGARDSNSAGDQI FHK 


5543 


19444 


A 


5586 


136 


48 


NTMQPG WTHTCNPSTLGGQGGR I TGSG 
R 


5544 


19445 


A 


5587 


306 


479 


DSFFPGVGSAFLYQVAAWTTKYFKKRV 
ALSTDIARSEMRLTFLLAPFTKFL IDVC 
DW 


5545 


19446 


A 


5588 


201 


2 


ILETGFHHVAQAGLNLLASSNLPASTSQ 
S AG I TG VNHHAW PS J? L I QGT. KHS F KNS K 
PVFHIKLSKN 


5546 


19447 


A 


5589 


30 


477 


TPSLARGVYPCWL YTGFLHDFLFS PKIE 
ERFMDVGVLRATE P SNSLFAI I YSTPIE 
LAIKSWHI IGDGLS I EIiVAVERGNRTTI 



880 



WO 01/64835 



PCT/US01/04927 



SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, HHHistidine, 1=1 so leu cine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
j=Inreonine, V=Vaiine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














X SKLPEFEKSS LSDQSLVTXiTSGYFAVY 
RVKLTAKKLEGIHEGAFQITTDYEILTI 

DWTfE'PTJiiT P 

IT V tVMK WM ] il . 


5547 


19448 




5590 


208 


91 


SLCYPGWSAW 


5548 


19449 


A 


5591 


325 


467 


ADLCSPCDILQLDFRHIRKTVDTLLALG 
EKAPPPTSALRSRDLISFCF 


5549 


19450 


A 


5592 


332 


403 


XjbL» L. V jLHi Hi HTH xH 1 H XRk, X FX Y 


5550 


19451 


A 


5593 


292 


26 


FLLDMEGAEAGAELGLYHCLLCAWBTPS 
RLAVLQHLRTPAHRDAQAQRRLQLLQNG 
PTTEEGLAALQS I LSFSHGQLRTPGEEE 
GTGE 


5551 


19452 


A 


5594 


387 


51 


LKKI FLGRGQGVSLFFSPPLGGPGGRFL 
GLEIGALFGLPGKTPPFPKIKKNYPGLW 
GGALFPLPLEGLGGKMAFPRKGGAPFSQ 
KNP PPPPLGGKTKP P PPLKKKKKKKVL 


5552 


19453 


A 


5595 


88 


362 


AETSPGQGSYQDGLRRPGFAEGPPAGRN 
PRRKKTQTDRRGGSAGSQPQACADGVRL 
CTRLQCSGAISAHCNLCLPGSSDCPASA 
SRVAG-lT 


5553 


\9454 


A 


5596 


92 


366 


DQPWPGQLPGRSPEAGIRGGSTSRKKPQ 
EEEIv SDRS PGRQRG I PASGVRGRCAALY 
QAAVQVJCHLGSLQPIiPPGFKRLSCLSLP 
SSWDYSH 


5554 


19455 


A 


5597 


3 


138 


TRPRTRGVGQAGLELLTSSDPPTLASQS 

7\ 1-1 "frozen rmm7i t*> n/"\fi r» f» 

AGITGVSHRARPQSSS 


5555 


19456 


A 


5598 


150 


3 


SCVI YS WMKFFS FLRWSFTLVTQAGVQW 
CDLSSLQPPPPRFKLFSCPG 


5556 


19457 


A 


5599 


148 


3 


TQHLN LAKKS Q I T PGMVAKS VAHAGRQ W 
CDLGSPKPLPPGLNRASCLP 


5557 


19458 


A 


5600 


195 


1 


MVY FKKFCGFFLFVRS RIACSVAQVGMQ 
WCNHSSLQPRTPGLKQSSHFSFPSRWNN 
RHMQPCPGA 


5558 


19459 


A 


5601 


228 


415 


VNVS I PKDVFLLII I ILFFFGPAPQAGG 
RGCNLNSLLPWPPGLKQFSCLTLPRSWN 
YGLPPP 


5559 


19460 


A 


5602 


405 


2 


AATG VDRHAQRTQAGL P LKT PQTPCPN F 
TSKARGTPSAQHPSCTRQRHQASNPTQR 
RRRRPQAEHS IRTDAHGPGLPASGHMAG 
ERPTRKAQSRDVSVLRQRXJUjLPRLECT 
SAISAHCSQNLTGSGNPPVSCL 


5560 


19461 


A 


5603 . 


3 


236 


RQENGMNPGGGACSEPRSRHCTPAWATE 
RW S V b KKKG X K X KGHL b b AMHHQ Y XjS P r 
GWRQVCSHCVCPKPWIaLVRGH 


5561 


19462 


A 


5604 


184 


2 


RPFLRXQGKIRMFSGSRFIFNFLILETW 
SHSVPQARVQWCHRSSLQPKPIiVFKSSS 
YLIjP 


5562 


19463 


A 


5605 


261 


2 


GWFLKGPPLFLGPKKPGGLGFFQKKLVL 
QKKNPPFGIGPPKNLKKTPXjFFFFFLRR 
SLALSPRLECSGTISAHCNLRIPRTSNS 
PA 


5563 


19464 


A 


5606 


134 


2 


gspfknlvfflgqsltlsprlecsgtis 

AHCKLRLPGPHQS PAS 


0->04 


19465 


A. 




3 


zt>y 


PnT^QPQMPP.PP^PP.FPPQPPPTPIlUTT 2\T 

ERDSVSKKKKRKRNFLNFIGKKKKPCLP 
KP KLTKKKKWGG I TALVLKL YKRAI APK 
TGGY 


5565 


19466 


A 


5608 


36 


380 


PPFXXPPGGETFGGTPRGPPPKKKKSPR 
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E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, l=Iso!eucine, 
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KRGGGGKKKPPRGAPPFKHLLGGKNRGG 
GTPPPPRKTPPLNPKKKRGGGKRGVFFP 
PKGGI FQKRGFFFSRGGGGGAPPPHPPP 
GGG 


5566 


19467 


A 


5609 


3 


411 


SSRSRAAALFYFFFFSRRGQKEKEPFLG 
GKTPPVFVFFPGGVGKKNRGFPPPPPAG 
KGPFSGGGGREQTPIWSPPPPPRGKWGK 
PPKPVT PNFNFLGRVGFLGKNPS PGGVC 
PPPKTP I FFFGRPPPGGFFWGGGG 


5567 


19468 


A 


5610 


364 


483 


ISWVLFFLFLKWSFVLVAQAGEQWRGLG 
SLQPLPPGLKGF 


5568 


19469 


A 


5611 


399 


285 


GGCSEPKSRPCTPAWGTERDFVSKKKNK 
NNYLVKKSN 


5569 


19470 


A 


5612 


408 


3 


KEVFSFPPGLEHGGVFYSLPPLPPRVKG 
FSPPPPPGGGGPRGPPPPPGYFLLFPKK 
GGFPPLPGGFSSPQIWPPRPPQKVGVPG 
GAPRPGPNFFFFFFRESFPFLARGFPKK 
KKKTCWNVTISHSASVQSSVLL 


5570 


19471 


A 


5613 


1 


112 


LAQHVFFLLNTS IASPADSSELLMEVHG 
NGKRPSPER 


5571 


19472 


A 


5614 


2 


389 


F VREG ER KKE KRKKEKKR KEGTKE KGKE 
EGRKERKKEKVRKKTVDDRRRPQVTFQA 
VNLES EGAPGNCGVTAP PGGH I WLKQVE 
VQTHLENRCHRLKATVTTTTRPGAEPRS 
PPFI ITCGEIMNVFSDM 


5572 


19473 


A 


5615 


385 


3 


KPPPGDPGKTPFFLKSQKITGGGGGAPL 
FPPPQKVKAKKRGYPRRQSFKRARLGPC 
PPSRGGKKKAGFKTNPPTKKKNQTKKPM 
KAKTS KLLQENRSG PGT VAHACNPS TLG 
GQGGVflMRSGDRDTK 


5573 


19474 


A 


5616 


218 


402 


R S YS FS QL VG T VI KARK PE S S CWLCAPN 
RKNAF WPGAVAHTCNPSTLGGRGGR I MR 
SGDGD 


5574 


19475 


A 


5617 


2 


235 


KSFGCASRbQMHGRTHTGEKPYKCKQCG 
KAFGCPSNLRRHGRTHTGEKPYKCNQCG 
KVFRCSSQLQVHGRAHCIDTP 


5575 


19476 


A 


5618 


347 


421 


I YLXXKKKKKKKKKKKKKKQKKKKK 


5576 


19477 


A 


5619 


92 


1 


ENLALLPRLECNGTILAHCNLRLPGSSD 
SP 


5577 


19478 


A 


5620 


3 


402 


APPPSFFFKKGLIMAISPTLGAIKLINF 
LFFFKKGKKKKKPPLEKKKKKKKKKGGA 
PLKKKKFS PPGGGRKI FFKGAPKKISRA 
RVKKQRRGEKTRGNPKKKRG KKKRPFPN 
TVSR 


5578 


19479 


A 


5621 


420 


3 


HPPPQGSANFFPPPRKIGINGPPPPAGI 
I FVFLECRGFPQ VGQGGFKI RPPNKPPP 
PPPKNLGLQKKSPPPPPIFFFFFLWRWG 
LMTLPRLVSNSWLKQS SHLGLLKH WDYM 
SEAPHLAKNFLKEYEWSLSFHVEGS 


5579 


19480 


A 


5622 


246 


132 


E VAPHLLG CISP TMC VCLC VC VCVCVCV 
CVCVCLCSW 


5580 


19481 


A 


5623 


219 


424 


I LNVMVLG IGTFGRLLGLEGRALTNG I K 
ALI RRDMGPGAVAHTCNPNI LGGREGW I 
MRSGDGDHPGQH 


5581 


19482 


A 


5624 


32 


454 


GLEKVHSMTXDRNSRLSVQLRDWDGGGE 
LLQFS VHLDGEDTAYSLQLTAPVAGQLG 
ATTVPPSGLSVPFSTWDQDHDLRRDKNC 
AKSLSGGWWFGTCSHSNLNGQYFRSIPO 
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QRQKLKKGI FWKTWRGRYYPLQATTMLI 
Q 


5582 


19483 


A 


5625 


166 


2 


LFGLVKLTALKGLKMILPSQVWWLTPVI 
PTLWEAKAGGSQGQEFETSLANTVKP 


5583 


19484 


A 


5626 


120 


406 


GEAATQENLAELRPEPELLSPSTVLSRE 
PELPSPSTVLSREPELPSPSTVLSRKPD 
LLSPSTVLSRKPDLLSPSTVLSRKPDLL 
SPSTVLSRKPDLLSPSTVLSRKPDLLSP 
STVLSRKPD 


5584 


19485 


A 


5627 


324 


16 


LSPPPPLFKTAPRPQKKNPPPKKKKPAP 
PKKKNLFFFFFFFFLVFFFIWCSWRLAE 
WNETLGECDMRWGQAVGTGTSRELRPGG 
SRSQPRRLAGKELGPHCAP 


5585 


19486 


A 


5628 


378 - 


72 


GRGGERPPPPQKSGAKKKPGVLNFFPQR 
GKRSGPPGPFGFFKIFSQEAPGFFSSFF 
WGRS LALFPSWGARGGFLVPANLGP PAG 
APKGNPPSKKKKEKIED 


5586 


19487 


A 


5629 


3 


204 


QESCLNMGGRGCSEPRSCHCTPAWATEK 
DSVS KKKKKWKFGGFYEVIGNFLHLFPK 
WSYPPSFRIF 


5587 


19488 


A 


5630 


177 


1 


SFHEMRFWKLIFI IRIYVHFERRSCPVP 
QSRVQWHDLGSLQPLPLGFKRFSCLGLL 
SNW 


5588 


19489 


A 


5631 


154 


3 


KW FQMGS R S VTQ AG ARQCDH SSLRPPTP 
QLKRSSCLSFPGGWDYRHVPPC 


5589 


19490 


A 


5632 


239 


17 


EGVRNYLI LQPRSLCRLNCS FFPNREKE 
LCHHSSFSSPLAAAKESQGEKGRLLSQD 
EGLLLW3VFVEDVEINS 


5590 


19491 


A 


5633 


2 


110 


GCS E PRS HHC I P AW VTEGDS VS KKQ P P P 
QKKKCIAG 


5591 


19492 


A 


5634 


433 


0 


FFFFFFFFFFLKKI PPPEGGGF 


5592 


19493 


A 


5635 


306 


433 


FSISI ERS SSGRAQLLTP VM PAL WEAEA 
. GRSRSQEIKTILAN 


5593 


19494 


A 


5636 


2 


285 


QSRTSVI FNSEKAERGEETTEEKFEASR 
GWLVKFKERSCLHN I KVQSEAI SADVEA 
AASDPEDLAKI IDEDGYTKQHI FGQALC 
LMSNPTTLGG 


5594 


19495 


A 


5637 


160 


1 


L YCGK I YWKFI RLNFFLSQGLTLS P KL 
ECTGT ITAHCS LKL I SPSNPPARAA 


5595 


19496 


A 


5638 


3 


261 


KIQLNPGSQGCSGLSLCHCSPAWATETL 
SQKKKKKNPPFGFKIKGEIEPLREPGTK 
KKKGGFLKKFPPHPLGFFLEPFTWGGFW 
N 


5596 


19497 


A 


5639 


2 


121 


S ELRS CHCTPAWRQS KTTSHKKKKN I KK 
KNKFFLPFREN 


5597 


19498 


A 


5640 


370 


ISO 


VEMEFCHVDQSGLELLISGDTPTSASHN 
AGITGVSHRTWQPSFFSFFKERRNLKSV 
TSLNVSP 


5598 


19499 


A 


5641 


254 


1 


FYILRKVFVFLFPAVSYGKGTYFAVDAS- 
YSAKDTYS KPDSNGRKHMYWRVLTGVF 
TKGRAGLVTPFPKNPHNPTDLFDSVTNN 


5599 


19500 


A 


5642 


292 


106 


NFVILFYLFKQGSHYVAQAGLEPLSSSD 
PLATASQRAGITSVSHRAQTKQAETYW 

DUCT FT 


5600 


19501 


A 


5643 


3 


100 


DTFFE VE VEMEKE VCRDLMCTS PKDEEG 
FLLI ' 


5601 


19502 


A 


5644 


1 


282 


EKVPEAKRLYGKRGDPFYEAQENHNLIG 
VANVFLECLFCDVKI.QYAVP.T ISQQGEY 
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STGQPWRTMVVQVFCFGLFWFSFYLSYV 
LEVDQETKAN 


5602 


19503 


A 


5645 


142 


67 


NKNRPGTVAHACYPNTLGGQGGQIT 


5603 


19504 


A 


5646 


350 


213 


GLELLTSSDPPASASQSAGITSMSHRPW 
PPLASRLDLCCGTLYLI 


5604 


19505 


A 


5647 


3 


180 


EHNAERAFLAQAHLGPGAPCTPGFPSFI 
PTQFPPFRSSGLPNIPVQTISRAAAEKL 
FG 


5605 


19506 


A 


5648 


366 


1 


RSWFTTYKERSLLHIIKVQREAASADIE 
TAASYSEDIAKITHESGYPKQQI FNVDE 
TAFYWKMIPLSTCIAREEKSISGFKASK 
DRLTLWLG ANA I GD I KLKPM I VYHFKTP 
RALKNYASSL 


5606 


19507 


A 


5649 


1 


355 


QIQSSQSGKGLLLTSFIDEETKLFFFGN 
KVWFCI PPGPGGKLCFKKKKI FGLYPGG 
GGFWGPQTPPLAPTFPFGFPPPPGPNFS 
PRGPFPKMGPALRGPGKKKKPPEKTKPA 
KGGTGP 


5607 


19508 


A 


5650 


348 


124 


RAPPVFFWVFLGQFPKKKSTNSPRGGGP 
PRSFQPFSLPPFFRNVFKKKPPPFFFEG 
GRGPPAPPFKNPFFFFSRL 


5608 


19509 


A 


5651 


194 


43 


KKKFFFr FYEMESHPVTQAAVQWHHLGS 
LQPPTPTFERFSCLTLSLI AT P 


5609 


19510 


A 


5652 


27 


244 


KKQQSTLFLFYFLSFILFFFLVKTGSHY 
AAQAGL3LRASSNPNTSVSLPKSWDYGH 
EHTVPS PTWFLAAKTP 


5610 


19511 


A 


5653 


3S 


429 


GLFLSCLFFFFLEKGVPLTPPGATGENP 
APKKKGTGQKLLKNRDFFLKKAMFGKQK 
QKKFFFAPTI TGDQRAPRVFKG PGAQKE 
PFAQKKXERGGGKKTRAAPNKKKKPGAF 
FSLKKTPLWAGGGPPPFF 


5611 


19512 


A 


5654 


391 


68 


NLYPHGGVPGVS PGKGSLPKAPNPLGGI 
PPSGEPPPLFPQGFFPCLKPKKRLGKAP 
GPFFFFFLAILLGWFQISNTDSCVQVFI 
IQTIWII FTT 1 1 KGCHLHFTFLF 


5612 


19513 


A 


5655 


159 


34 


GARFLFFSPVLS PLIEHCVCVCVCVCVC 
VCVCVCVCVLCACMCT 


5613 


19514 


A 


5656 


88 


388 


S I F CFI YVFFLRQS FTLVAQAGVQWRDL 
GPLQSPPQKKKGKETFSFLGENRYLQPK 
NFPTFLTQVCFFEKIPPNGLINGKRPKW 
IPGCEFLKTLWGPWV 


5614 


19515 


A 


5657 


176 


371 


HFYFCFSDINLAAE P KVNRGKAGVKRS A 
AEM YGS VTEHPS PS PLLRSGTLL FITAL 
CPSVGXFSF 


5615 


19516 


A 


5658 


3 


113 


CSE PRSRHCTPAWATERDS VS KKQQTKN 
LSYKKIHW 


5616 


19517 


A 


5659 


409 


67 


SSSNKPPTRPVFILPFFGHGVPRLPPGK 
KPPGPPPGPPPPQVEQMYGRKGGFALNL 
PPRRQNKNMLYSPELGEK 


5617 


19518 


A 


5660 


2 


377 


GR I S CSRNTPLRHR SLMLI VCGLRMPLS 
GFLPVJVGQVFLALIPVNPQPSSLGVRTI 
MDMIiRCCRDFIYGQFWGQFMARLWGACS 
QQYTEFFRFKI I FISSLLFNI VLEVLAR 
Al Ry c. K. b laiNHaM Li**) 


5618 


19519 


A 


5661 


256 


381 


NFNTGQAWWLMPVILAIWEAEAGESRGQ 
EIETILANTVKPHL 


5619 


19520 


A 


5662 


19 


313 


LYSSIiAILGFLPLLINFII 1 1 1 FFFFER 
DPGQQNONFVSLKKRETPPPPFSLGVPG 
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KRSLLPPPPGPKHAGPPVLPAGUWGGPE 
PPTWGRGAKQKREI 


5620 


19521 


A 


5663 


78 


365 


ETGRKRKETRWNLKKVCLLKMSLQGEKE 
ERKREKYRRTEMKRQKNSEKQRGREKGK 
KSKYQRRRKKEKKGENQTKPKPXVEESS 
SRQ 


5621 


19522 


A 


5664 


159 


28 


DTSTLPTPTGFKRFFCLSLPSKWDYRRP 
P I CLPNFFFFLWTG Y 


5622 


19523 


A 


5665 


350 


J51 


I R CLC PDK FLAF CRDG VLL CCS G WSR TP 
GLKRSPCPSLPKSWDHRLCLADIYIYIF 
LKDTSTIMSF 


5623 


19524 


A 


5666 


394 


248 


LTPGGGGCGKLRLRHCTPAWATRANLPL 
KTKKKRKRFLFTLAETSKVY 


5624 


19525 


A 


5667 


1 


194 


RLNPGGRGCSELRSRHCI PAVJATRTKLH 
LKKKKKGDQG FRGDHTKN I WLF PRVFLP 
LHSLNTLK 


5625 


19526 


A 


5668 


2 


312 


RGQNS ARG III I AI LKKVCVYI YRCGLT 
LSPKLKCRGTI IAPCSLELMGSCDPSTS 
ASQ I ARTADMHHHTRLI IASLTFCLLKK 
XKNVFCGTYNHCPPKGFYE 


5626 


19527 


A 


5669 


1 


245 


RLNPGGGGCGE PRSSHCT P WIATRVKLS 
liKKKKKKRPFFFFFGIRQPPQGGPFQTL 
TPLNLLKNLTTLLGAKKFFFFTGGK 


5627 


19528 


A 


5670 


1 


135 


ARTPSPSL3ESSEDEKPTKKHKKGKALR 
LKRRFWWLMSALPCIH 


5628 


19529 


A 


5671 


97 


2 


XKF THVW p G WAHACS PSTLG GR GGR I T 
RLN 


5629 


19530 


A 


5672 


362 


173 


VFSPPPPFKTPPPPPKKIPPPKKKKTPP 
PKKKKLFFFFFFFFFFFFFFGTRIVIDN 
ILSGNRV I 


5630 


19531 


A 


5673 


237 


13 


SPGGGGGFAPYWMFFLDFERQSCSVAQA 
GVQWHNLS SLQP S P PRFYF FSRDGVKNR 
PVPRYAPPSPKGISLRPYK 


5631 


19532 


A 


5674 


245 


14 


ANFLKAHIVSTLGLiQIIWSLLQLLNFAI 
VFYFLRQS FALVAQAG VQWRD I G S LQPL 
PPRFKRFACQRFSWEAEVAVS 


5632 


19533 


A 


5675 


2 


249 


SHPTHSHPQHLPLTPTWNWSSTPVDFIF 
RKAPPVFPHWQHHRAGPGTSLI PTDAPQ 
EASPPPPYPSSPSSTPTIPRETFLRQ 


5633 


19534 


A 


5676 


115 


241 


SSTLGGQGS ITLAQELRSCHCTPAWATG 
QDS IS KKKKTLLGP 


5634 


19535 


A 


5677 


460 

- 


2 


RLGVTGRWQKLKPQERMRAWRENGKGPK 
ALPAS WEGPTVNLLWKHEHRAWVS CHCD 
SGFCAWCQGLGSEQSGLPSLIPSWDGLV 
LTSPPNPTGPPPPLKTRSVLRGDDVLLP 
CDQPSNLARALWLLNGSMGLSDGQGGYR 
VGVDGLLVTDAQ ; 


5635 


19536 


A 


5678 


2 


346 


EN CLNSGG EGCS KLRLCHCTPA WVTERD 
S I TKEKKKKNFKP VLEKFGE I FFLKGWK 
FFQLNPLKSSKEMEFFNTTLPLEPCFNG 
NKPFWAAKIWTLNFFQTQKGSI FKWGGR 
KK 


5636 


19537 


A 


5679 


181 


4 


PKEVRTLFGRFFFFFHMESHSWQAGVQ 
RHM 


5637 


19538 


A 


5680 


549 


410 


FFVQTDFCHVAQAGLQLLGSSN P PAS AS 
PKYWDYRREPVCWPSSGF 


5638 


19539 


A 


5681 


in 


403 


SRQINGFFWSRTKVAIGQFIiVGPSVKI 
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FENNWE I CMI KS IQSY I VLNHKHLKVYS 
SNIWQLKNILLRGGAVAHACNPSILGGR 
GRR I TR SRDRDH P . 


5639 


19540 


A 


5682 


189 


3 


NNNRKASWKKKDRVLLYCPGCNWCDLGS 
LQPL PPGFKRFS CLKLP KCWDYKHE PLH 
PAPHV 


5640 


19541 


A 


5683 


2 


227 


I KVCVCVRVCVC I LREDSAQ PARCVCVC 
VCVCVCVCVCLDPRRELSPTSQLESSCH 
PLYVPSGGSRINRPYPGGC 


5641 


19542 


A 


5684 


318 


421 


DVPCFPVAFCCDCKFPEASPAMLNCESM 
KPLSFI 


5642 


19543 


A 


5685 


213 


398 


SLCCLLFCCCYFETESRSWQAGVQWHD 
HSSPQPGPPRLKSSSHLSLWRHDHSSPQ 
PGPPRL 


5643 


19544 


A 


5686 


398 


1 


S FPTS S TYS SSVQGGGQGAHS YKSCTAP 
TAQPHDRPLTAS SSLAPGQRVQNLHAYQ 
SGRLSYDQQQQQQQQQQQQQQALQSRHH 
AQETLHYQNIiAKYQHYGQQGQGYCQPDA 
AVRTPEQYYQTFSPSSSHRE 


5644 


19545 


A 


5687 


2 


288 


SDRPRSPSFSFPSSPAGVGRASSIFPIL 
SAI IiLLLGGVCVAASRVYKS KRN 1 1 LGA 
GILFVAAGERQREGATGAAHLGAHVCVC 
VCVCVRARARA 


5645 


19546 


A 


5688 


314 


2 


CRPGSQLQELGDGGGGESFVGGHGQQHP 
GAPQRGRYLQPLLMCGASPHASPRQPLA 
ILAAGVKFRVLRHEAGRPLRLLMQINPL 
PYSRWHTYRLPSCGCLTCI 


5646 


19547 


A 


5689 


122 


3 


MGGGE F S KKMN I NGQG WAHACNP S TLG 
GRGRQITRSGV 


5647 


19548 


A 


5690 


144 


1 


EAVKKY TCP T W AGAVAKACN P S TLGGRG 
GWI TRSRGRDNNGKHGE I PR 


5648 


19549 


A 


5691 


377 


65 


LG VFP PFFFLRG EGLSFP PPGGLVKNQF 
FFPPGQKGFPGTPERGFQGFLGPKFFPP 
LPPQGPKGIFFLKKKKKKKI PICALAVL 
TYQHFCYRTGQERVDGLREL 


5649 


19550 


A 


5692 


12 


147 


YERRFCHIGQAGLELLTSGNPPASASRG 
AEITGVSYHALMRNIS 


5650 


19551 


A 


5693 


183 


395 


GHRFGFLHP P P PG FGG F FRP PP PKTEGK 
RAPPPPPRFFFFLGGGGFSGGGPRGVLF 
LEHENPPPGAPKGVG 


5651 


19552 


A 


5694 


177 


283 


THFLNRTDGRRWSIiASLPSSGYGTNTPS 
STVSVSA 


5652 


19553 


A 


5695 


266 


460 


LSQRKYAFFFFFFFFGKKIFFSSPGGEI 
RAQFFLTEPSPFWVKGFSPLPLPKTWDY 
IGRDPPPLG 


5653 


19554 


A 


5696 


.1 


412 


MQPLKVNLLEQKNIjGLEFVAPFTWIKCP 
SYWSIAYNVYFSQYSFIWVFCTMFITVN 
I PS KKKKKKKKKKKKKKKKKKS S S PGS P 
GYAPKKAPKNKKKK 


5654 


19555 


A 


5697 


379 


0 


FFFPSQKKKKKKKKKKKK 


5655 


19556 


A 


5698 


1 


187 


PTRPPTRFRFENGVNLGGGGCGEPRLCH 
CTPAV7VTERDS VS KKKKKKKKAVGGPWF 
FQKKJCV 


5656 


19557 


A 


5699 


374 


0 




5657 


19558 


A 


5700 


219 


413 


PFCPQGGGGGPNFVFLGPPPPGLKEIPP 
PPPRGGGKKKPPPPPRGGFFFFKKKGVS 
PGGPGGSPP 


^5658 


19559 


A 


5701 


381 


249 


FHPAGQDGLDLLAPGSAPLGLPKCHDYR 
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REPPRPAFFFFFFF 


5659 


19560 


A 


5702 


382 


197 


LANFCIFSRDGILPCCPGWSGTPGLKQS 
AKLCLPKCWDYTHKPPCPAPQVIKPSIN 
ILIVN 


5660 


19561 


A 


5703 


1 


381 


ERGSKHPPRGLKHIRGRTCFVLKGCMIQ 
LCVVPPGQGKKKKEGGGEKKKDKKREEE 
GREKKKNGNQGERVKKKKSKRKKKKRSS 
SS 


5661 


19562 


A 


5704 


291 


167 


NGPPFFFFFLVKMGFCHVGQAGLEFLTP 
GDPPALASRLLGL 


5662 


19563 


A 


5705 


367 


23 


KNPCPPPPPEFFLAPKKKGPPPRKKTRF 
VPPPLFLIKGPPFKRGKGPPPGGFPPLF 
CPLGGKRGGVFLSPRVGKPRGKGGNPLF 
PKKKKKKKRRRRKKEGKKGRKERQGYIH 
TV 


5663 


19564 


A 


5706 


411 


116 


FLKRWGSHYAAQAGLELLDSSDLPVSAS 
QSAGTTGVTHCSRLSYSALACQLCQPVW 
RI PWRGRGVGRETSFRGWGRRYGRAWSS 
SGGGAESLPGLSPG 


5664 


19565 


A 


5707 


162 


2 


SYCFPNSDDFIPFQPLFHTWFLWPRPFF 
IFKTAHGWARWLTPVIPALWEAEAG 


5665 


19566 


A 


5708 


1 


416 


LPLLYKI PPPPAILFLFI KDSP PPPPPP 
PFFFFFFPPP 


5666 


19567 


A 


5709 


403 


142 


LSKEKKWGWPRGFSPRIFEKPPPPKIFF 
APPKKKIFPPPPGEKKFFFKGAPPPFFF 
FFFSFFFFFFDCYNSSTWNSAWHIRCSI 
SIC 


5667 


19568 


A 


5710 


339 


461 


PS T F I LF I LRRS FALVAQGG VRWRH LG S 
PQPPPPGFKRVPA 


5668 


19569 


A 


5711 


408 


42 


TGARI I FSGPPKKTYPSPPRAFI ILS FK 
RAAPQL I YF YLNLFAYKRLVKLTTV I MR 
RRPTFFLLLFKTFLFVEPGFCHVAQAGL 
QFLGSSTCGSLPQCWDYRCEPQHLAKRS 
TFIKSESAV 


5669 


19570 


A 


5712 


277 


9 


gvcvcvggwvppmglqtpsapwglsla 
pplgtlcsvqwlaesihlcicqalve.pl 
rrqlyqvpvrklllassivsgfgdsiwn 

GSYLR 


5670 


19571 


A 


5713 


411 


0 


FLQ I LLFL ID I AFFLVFFFF FWKK 


5671 


19572 


A 


5714 


205 


400 


HYLI QS S S SMAS E S VQLD WMK I PLT W P 
GAVAHTCSPSTLGGRGGRITRSGDGDHP 
GQHDEAPSL 


5672 


19573 


A 


5715 


1 


89 


FFFFKTESRSVAQAGVQWCTLGSLQPPL 
P 


5673 


19574 


A 


5716 


176 


392 


GVHLVLEWKLSMVCAEDDELFPLCSWKK 
FFFFLFLRQSRSVTQAGVQGRNFGSLQP 
LPPGLKRISCLTRPSS 


5674 


19575 


A 


5717 


1 


89 


FFFFKTESRSVAQAGVQWCTLGSLQPPL 
P 


5675 


19576 


A 


5718 


306 


42 


GSGLRRCALNSVKCLNTLECILLGIYLR 
QIKICVYTKTCTCKFIVTLFI IAKKYEY 
PSVEQLINKMWHIQI IFLVIKRNEGCGG 
SCL 


5676 


19577 


A 


5719 


1 


263 


PTRPGRFFLILGDRGCSELKSCVCTPAW 
VTEGDS I S KQNKKQVS LRS PAAHLLGG L 
VPNRPQASTRIGDPCTKKLLEL ISKFSK 
AAA 


5677 


19578 


A 


5720 


3 


278 


NQETT.TWTOSSRTVQPHFLEFLLSLGWS 
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VDVGRHPGWTGHVSTSWS INCCDDGEGS 
QQEEVISSEDIGASIFNGQKKSMHIIKQ 
IKYFLSH 


5678 


19579 


A 


5721 


127 


3 


KKKKKKEEEEQGEGRRRRRKKKKEKEKK 
EKEEDGSGKEKGE 


5679 


19580 


A 


5722 


204 


96 


VTSRSSLFSTPGI KRASCLGLPKCWDYR 
PQPPHLA 


5680 


19581 


A 


5723 


3 


261 


QIVRQWKYNDDDDDQLFYTRVYLDPGLR 
VGKAEEWAHPAALWEWRPQAEWDLGSKG 
GCKEKNVCLGWAHACHPRLLGGLGRRI 
A 


5681 


19582 


A 


5724 


134 


358 


PPPPPPPXLLFFVSAVPPSPSPSIiSFPP 
HRPCYSPPSPTLSSLPPPLLFPPPPSSP 
PPPPRPSAPSPPPSSPFFS 


5682 


19583 


A 


5726 


248 


470 


LiLNVQNGIRNVCSCPHYTPFHKVVKYLS 
VFLTEVDSSQLRRQLCGGSQAAIERMIH 
FGREKVQILHLALIYFLV 


5683 


19584 


A 


5727 


1 


281 


NQHNRGFFDSCDGFFTNYNWREEHLERM 
LGQAGERRADVYLGVDVFARWNWGGRF 
DTDKGGGGFLSKGQRPSAS PGTSFPHGP 
SIPVCTSKE 


5684 


19585 


A 


5728 


460 


3 


SQTRTLTLDRE I TLPSSGTALI ShVDGS 
GNPVSVEVQSVTDGVKVKVSRVPDGVAE 
YSWELKLPTLRQRLFRCVS IRENDDGT 
YAI TAVQHVPEKEAI VDNGAHFDGEQS G 
TVNGVTPPAVQHLTAEVTADSGEYQVLA 
RWDTPKGGPVTV 


5685 


19586 


A 


5729 


70 


465 


SQCS PHHS VLCQKVQDLESGTLQSNWKK 
ITPWDENVCVCVCVCVCPCVSMCVYVCV 
YVCLCVCVCVTVCMCVCLY I KGLLFLLA 
KKETAGSFFHSQLTCLGVPCAFTIHPVP 
FLPQCRSHCVSGGRFWPGVL 


5686 


19587 


A 


5730 


281 


427 


SWLTAAPNFQLQMESRSVAQAGAQWHDL 
GSLQAP P PGPQS AETQKQM IV 


5687 


19588 


A 


5731 


418 


194 


TKFHHVGHAGLELVTSGDLLSSASQSAR 
I TDMHHCS WPASVLSMNRPSGSTRCLRN 
GIFGPLKDFKPNDLTVQL 


5688 


19589 


A 


5732 


229 


2 


PGGAPPLVRSGFPGWGGAPPFVFVPNGE 
PPKNFFFFFDGVSLCRPGWSAVAGSQLT 
ASSTSRVHAILLPQPGIVA 


5689 


19590 


A 


5733 


435 


138 


PPRYPPPLWGAP.PHLSRGGGSFTPLTPQ 
G KPR F FIjKI KIN PGGG P P P YS PP FGGGG 
PRNFFSPGAKGPLVLISPLFFPPGDQKK 
NSFFKKKKKERERE 


5690 


19591 


A 


5734 


113 


409 


KDVCGGVLTGLSGVLTSPEYPNWYPNSM 
ECHWVIRAAGPAHVKLVFVDFQVEGNEE 
CTYDYVAVLGGSS PTPVHQYCGS I SAPY 
LFFLAKNIMSYSLPN j 


5691 


19592 


A 


5735 


3 


122 


GGWQVQFVSEPCSLSINGVIFGLTSTDL 
LFHLGAEEISR 


5692 


19593 


A 


5736 


14 


428 


FLALRWGHIRLPASGPRDTATLFSTLDT 
QLLMTLYVGKDETGFYVSKALVHTGVAIi 
VPRGLTLAPADGPTTDEVTLQVSGEREG 
SPSTAVRYPSGSVALPSQWLLIGHHELP 
PVLHTTMLRVHPTLGSGTAETRPPEN 


5693 


19594 


A 


5737 


415 


1 


QHINTLGQFWTTKAFLPRMLELQNGHIV 
CLNSVLALSAIPGAIDYCTSKASAFAFM 
ESLTLGLLDCPGVSATTVLPFHTSTEMF 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, FHPhenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
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T=Tbreonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possibJe 
nucleotide deletion, \=possible 
nucleotide insertion 














QGMRVRFPNLFPPLKPETVARRTVEAVQ 
LTQALLLLPWTMHALVILKS I LPQAA 


5694 


19595 


A 


5738 


335 


152 


GVEMGFHHWQAGLE FLSSSDP PASASQ 
SAGITGVSHQASPFFLPLSGITLPLSIY 
SSGSD 


5695 


19596 


A 


5739 


35 


462 


IDASDFGECSQAAINDNWIVYELLEEM 
VDNGFPLATESNILKELIKPPTILRSW 
NSITGSSNAGDTLPTGQLSNIPWRRAGV 
K YTNNEA Y FDAAEE I DAL I DKS RS TVFA 
EIQGVTDACLKLSGMPDLSLSFMNPTLIi 
DD 


5696 


19597 


A 


5740 


429 


325 


NPGGRGCS3LRSHPCTPAWVTEQDAILG 
GRSGRIT 


5697 


19598 


A 


5741 


344 


423 


F WPGAVAHACN P S TLGGRGGR I TRSR 


5698 


19599 


A 


5742 


431 


103 


RGPFFPRPRQKKGGSPGGGPPQNKKNGP 
P LHTGGAKGGS F F KKKKKKKVP KKGPGL 
FFPGPQGPR KG PVGAPPVL I PNRLGPPR 
I PPGWEPGRQI FFLAPVFFFPS KPK 


5699 


19600 


A 


5743 


31 


224 


EEMGFYHAEQADFELLSSNDPAALAFQG 
AGIASMSHCSWPGKNFNMFCCIWKMKRIj 
AIHWECIA 


5700 


19601 


A 


5744 


'217 


400 


HSVMGGSPSFFFFFFFFFGEKGLFFFCP 
GGPNLGKGNPPPPGKXGFPPPPPPKGGN 
NGAPP 


5701 


19602 


A 


5745 


4 


455 


DGLEFPGRRFRGQRCSSPPRRGRGQVEA 
LLTSQTGRRGRGAPHVSDDGRRGRDAPH 
FLDGLVAGKRRSSLPIWDGGRAETLLTF 
QTGQPGRGAPHVPDDGRPGREAPHFPDG 
VAAGQRLQSRHFGGQGRRLGGGGCIEPR 
SRHCTPAWAP 


5702 


19603 


A 


5746 


381 


212 


PRGGGGGR Y S P P PGGGGRGNS FY PGGGG 
LNKPHFGPCPPPGGPKMKPPPQKKKKSL 


5703 


19604 


A 


5747 


334 


432 


MFCENYKE I AWPGAVAHSCNPNTLGGRG 
RHITC 


5704 


19605 


A 


5748 


190 


I 


I PPKKGGLNFFSPPTPKKGVKSKPPQGK 
VFFFLRRS FTLVTQHGVQWRDLGS PQPP 
APGSKHE 


5705 


19606 


A 


5749 


425 


297 


ESAFHHVAQAGLELLGS CDLP VS ASQSV 
GVTGMSHHTRLYCS 


5706 


19607 


A 


5750 


425 


0 


KKKQRKKERKKERKKERKKERKKEKKRT 
KSDE 


5707 


19608 


A 


575! 


136 


3 


AQSSRPAWLI FLFLVETGFHNVGQAGLE 
LLASGDLPTSQVPRRP 


5708 


19609 


A 


5752 


3 


237 


GCSEPRSRHCTPAWVTEGDSVSKTNKQK 
TQRGLKS IALFPAIHSLLPWIKLAMPI 
LAMDI FPHPI VEFSYHWEQM 


5709 


19610 


A 


5753 


1 


358 


GDRGCS ELKS CHC I PAWVTEQDLPS KKK 
KKKKGGKKTRNGPQTCGGDLPPPPLGGG 
KNFPKGETFWFHSGKKKRFFSLGKGEKC 
FF I PRAPKG PGKTLGGPNRGGGAKKKLK 
RPPGPVY 


5710 


19611 


A 


5754 


2 


191 


QGCSELCSYHCTPAWETRARPVSKKKKP 
KTKNQTKKNRLGKKPPNLIFPPFQKKKK 
WVQIPGK 


5711 


19612 


a : 


5755 


42 


408 


KHAAP PAS LS LSLLLHHGQKRAC FPFAF 
CRDCQFLEGSPAMLPVQPAKLTPRSTPP 
HPCSLENDSAHFFN 


5712 - 


19613 


A 


5756 


179 


410 


LQAFLR PS PP VLFLL YADT.^SNQNLWPGM 
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Amino acid sequence (A=A!anine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, k=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
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X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














VARACNPTTLGGRGGRIT 


5713 


19614 


A 


5757 


166 


2 


GRPG PADFRVRPQLLQR FLF I YLFTEME 
SCSVTQAGVQWCNLGSLQPLPPGLPD 


5714 


19615 


A 


5758 


334 


3 


EEKVFGLPS P I LLS KGQGGAASGP PCP I 
o PRAPIG PQKKG W E TMV F I TKAP P S FGF V 
LVKQSPGTGPVPRNFFFFLRRSLALSPR 
AGVQWRDLGSMQAPPPGFTPFSCLNL 


5715 


19616 


A 


5759 


373 


470 


S S Cw S LRNS S PGAVAHACNPNTIjGG RG G 
RITR 


5716 


19617 


A 


5760 


210 


384 


EALDTTI FFFFFFETKFCFVPQAGGQGQ 
NLGSLGAPPTGLTPFSCLTLRXTWDCGP 

PP 


5717 


19618 


A 


5761 


257 


400 


SLGSEYTWVCFCVFETESRSVGQVGVQW 
CNLSSIiQPPPPGFKRFSCLR 


5718 


19619 


A 


5762 


400 


2 


ARAVGS VGEDPGLGTGPS PARETWEWAV 
VCRACRGSQLLWGKLFILRHWDLRKVLA 
RGNGTEDCLKQLWSQCGAGMGRGPFQGL 
KGKPVGCLCSSREVFYLFLYLFRDRVSL 
CHPGWNAVKQTRLTAASASQ 


5719 


19620 


A 


-5763 


34 


440 


RVRAPLGWEEGNYFVLRAKPRSGKRSCL 
PFLRCGHQE PLLWG I STAS PLRHLVGAS 
QICGVLFLCFQVSLKPAGSWKWIGNNG 
ERKAQRLCPLRATRVEGI FCFRFVFLRW 
SFTLVAQAGVQV7RDLGSLQPPPL 


5720 


19621 


A 


5764 


297 


468 


LVGAGVPTGCGQRNGWNSHKSVQAQWL 
MPVI PALWEAEAGGSRSQEIETILANMV 
T 


5721 


19622 


A 


5765 


1 


205 


GFKLTDSFASGAKAILRSQLLGRLRQEN 
RWNLGGGGCMGLKIVHNCPPAWAIEGNLL 
FKKKKKXGGGTF 


5722 


19623 


A 


5766 


307 


1 


KERDPPPTPFGKTFFFFFFKRNFCFFPP 
KEGQGGILGPRHPFFSRLSFLGRGVPRG 
FPPGPRNFFFFWILVKMGVPHVGQADFK 
LLT S GDLNKTASQN AGI K 


5723 


19624 


A 


5767 


2 


319 


LLGNGEELSRFPFQRSSQAWVFESSAGI 
FGGKAGATGLEKLTGSFQQLTGHPDPRT 
PEELGDPEKSSASEKTRGLQEANGVPEI 
LKAVT YTQAW PKEAKVDP I TP 


5724 


19625 


A 


5768 


787 


1118 


EAAWRELEAERAQLQSQLQREQEELLAR 
RKAEKEQLSEE IAALQQEHDEGIiLLAES 
EKQQVREPWRGLCCSLSCSSSGAGPCSA 
TWQLGALGQATALLGASVSLPAGGED 


5725 


19626 


A 


5769 


2 


452 


NGAGTMSVSLEADENPFAQGALKSEDCF 
ILDHGKIX3KIFWKGKQANTEERKAALK 
TASDFITKMDYPKQTQVSVLPEGGETPL 
FKQFFKNWRDPDQTIXjL/jZjTYLSSHIAN 
VERVP FDGGNLDTFTAMAAQHGMDDDGT 
GQKQIWRIEG 


5726 


19627 


A 


5770 


2 


130 


GTQGKIVDLVKELDRDTVFALVNYIFFK 
GKWERP FEVKD WKS 


5727 


19628 


A 


5771 


422 


8 


H I ARWGARAPQGGWALLKAELCLNP AP P 
PIGARTAPLWPGCCFSLRPVFLLLFPFF 
SLFIQSFRILYPQPWPLQGRRSTTLSRM 

f~" 1 "ITlT-r\ 7TT 7T 7\ CTTNTD AT»TT Flf"* 7\ T WD TJI^T?! 

D RHV F I DN PTLQER R KCMRRJLG AS RN 


5728 


19629 


A 


5772 


432 


3 


YSLKLKKRWFVLTHNSVHYYKSSEKSAL 
KLGT WINSLCS WPPDEKI FKETEYWN 
VAVYGRKHC YRL YTKLLSEAS RWS S AI Q 
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C=Cysteine, D^Aspartic Acid, 
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K=Lysine, LHLeucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X^Un known, *=Stop codon, /^possible 

_ ■ _ _ A * 1 * ft - • \ •» ft 

nucleotide deletion, ^possible 
nucleotide insertion 














NVTDTKAPIDTPTQQLIQDLKENCMTSH 
VVEQIYKRKKILRYTHHPCIAAAVEDHD 
ND 


5729 


19630 


A . 


5773 


20 


432 


SRAAALLEAVTETLFYYEVAEKIWSNRA 
NRQCADCGSSRPDWAAWLGWICKQCA 
GQHRALGSGI SKVQSLKLDTS VWSNE I V 
QLF I VLGNDRANRFWAGTLP PGEGLHPD 
ATPGPRGEFISRKYRLGLFRKPHPQ 


5730 


19631 


A 


5774 


433 


2 


RGAFQTLQKKDLQAFVLSSVQLQVLTGS 
CFKLRTVHN I PVP SNKVDDECPGLYGFL 
HVIVHFAKGFKQSATLSCALEVDSFGYF 
VSKAKTRVFRDTAE P KWDE E FE I ELEGS 
PSLRILWYEEYNNKSKVKNLYSAAVGFS 
AWQ 


5731 


19632 


A 


5775 


133 


436 


MLI S LPH PSTHLTS LPF CADFGTRQTTG 
AASHVLTLSAHSSVCSCPPLMGEATTSR 
AQDLPADEHTAFAHELEAPALS KGQGLS 
ILRALTGHLPLGQEVNL 


5732 


19633 


A 


5776 


397 


58 


KGT I S IGKLEE KLRGAARQALADAI I EF 
QLLPASLCTEDTPTGMQVKRSLGSRECH 
RS KQDS VAGERA3CVGFRGS T S EN YY I RT 
TYSQLSVSLSGHHPGATLAAVNGCSCSP 
T 


5733 


19634 


A 


5777 


1 


324 


MRRCVSAGGVELEEYYPAFLDMVRSMLE 
GS IDPTQYEDTLREMFT IKAYVG FTMDK 
LVQNI ARQVSRAGVGPALRTAEMALP S F 
HSPLGRGPWSTANLSCFFLIETK 


5734 


19635 


A 


5778 


1 


398 


RLGSRPSLHDQSPLELRSEIQELKGDVK 
KTVKLFQTEPLCAIQDAEGAIHEVKAAC 
REE I QSNAVRS AR WL FETRSLDVTNQD P 
SQVRLIRGISLEEGARPDVSATRWIFD7 
QSLDAIREILVDEKDFPPSP 


5735 


19636 


A 


5779 


281 


12 


VACNLGWRVRTPTPASYPPCVPQLiSVRG 
KNRKEL LGGFLRN I VKS ADE AL I TGMSG 
LKEVDDFFEHERTNLLEYKTLYCGRSED 
HATSR 


5736 


19637 


A 


"5780 


406 


23 


DSQATGPGISRGRRI SSPS SLCTRFSTS 
IVKCSPLSHVLKYLDPPPPPTRCSRPAS 
LRSSPKHCVQQQFPIACRPGAAAPPPPP 
AQCPNPGPAAASARHEPKWGSRTEGGPA 
RARRREGTGRRKKRR 


5737 


19638 


A 


5781 


415 


18 


TSQAGDDLLHG S LLRRGPEMG YL PG YD P 
D PTLATPP AGQTIiAVP SLPRATE PGTGP 
LTTAVTPNGVRGAGPSAPELLTPPPGTT 
APPPPSPASPGPPLGPEGGEEEPPPTII 
TTTTVNTTVTSPMYCGRSEG 


5738 


19639 


A 


5782 


168 


432 


GMRRQTGLDQQSRGVGKNWRVDFFFPSS 
LSFFFFFFGKESYFVLKFGIHGLKFGYL 
EPLSPGLKEFSRLSPPKGGHLKCRPPSP 
LFFF 


5739 


19640 


A 


5783 


1 


423 


TDDDLNWLDHSRTFREQSVDENETLLLR 
RKFFYSDQNVDSRDPVQLNLLYVQARDD 
ILNGSHPVSFEKACEFGGFQAQIQFGPH 

VFPnfWKHDftlFT nTWFPT DKT7VT IfOPp AT?W 
v Cinj\TL]\.c\sr J-iUI-txVCi r J_i.tr ivd X -L IxyKtiMr, J\ 

RIFQEHKNCEKISERSAKV 


5740 


19641 


A 


5784 


273 


400 


DSWLLMLSPGVCPHVQVWPISCTLGAT 
FGYVAGLVISPLWI 


5741 


19642 


A 


5785 


445 


28 


DTNRAPNHMEIKGSGGTEAKVNGADSID 
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T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KE AAGDETMEARTME AEATGAKP TGVE A 
TGAKVTETKPTGAEVREMESTEEEANME 
SKPTGAQATDTETTGVEAMGVEATKTKA 
EEAEMQAYGVGAGQAEPPVTGTVLRPL 


5742 


19643 


A 


5786 


380 


2 


QRQQESPEASSLHILSRQVQELQQLLVE 
SQEEKESLGREVESLQSRLSLLENERGN 
TSYDVTTLQDEEGELPDLPGAEVLLSRQ 
LSPSAQEHLASLQEQVAVLTRQNQELME 
KVQIMENFEKDECI 


5743 


19644 


A 


5787 


663 


2 


FGVERRATRTIRLTRTVLDLYSFLAGVS 
ENLRHATQDDASRTRAPGLS SQH P KPDT 
TVSGDTE TGQS PG VFNTKES GMK D I KS E 
LAEVKDTLKNKS DKLEELDGKVKG YEGQ 
LRQLQEAAQGPTVTMTTNELYQAYVDSK 
IDALREELMEGMDRKLADLKNSCSYKLT 
GLQQQCDD YGS S YLGV I EL I GE KETSLR 
KEINNLRARLQEPSAQANCCDSVY 


5744 


19645 


A 


5788 


398 


1 


TRCYHSALRYGGSFAELGPRIGRLQQAQ 
LWNFHTGSCQHRAKVLPPLEQVWNLLEL 
EYKRNYGAKRGGPPVKRAAEPPWQPVP 
PAALSGPSGEEGLSPGGKRRRGCNSEQT 
GLPPGLPLPPPPLPPPPPPV 


5745 


19646 


A 


5789 


116 


371 


RHPKIWGLRGPPGLTLEKRGKGEEPLDP 
RSWGFKEA2TPPRPFTRGKKGTPRLKKK 
PKERKTRERGGKKEGKKKGKKNEGKTNP 

L 


5746 


19647 


A 


5790 


406 


284 


RRGFFHVDQAGLELPTSGDPPTLASQSK 
PGPLCPALLHTF 


5747 


19648 


A 


5791 


200 


3 


GDALIYMEPEKQVMSRSSDECWALCDQ 
WLVS YSKKI PLVNLLTFDLKKNIS ICPL 
PGTWIMEMY 


5748 


19649 


A 


5792 


372 


278 


LQIQSPAGQSRCGGFLVREDFVLTAAHC 
WGR 


5749 


19650 


A 


5793 


1 


167 


VAPPKLDPHSGRVRAQA3LELLTLCDPP 
AS S S Q T AA I TG AS HHTRQVLGSY KALS 


5750 


19651 


A 


5795 


112 


3 


SSWGKVGAHAGEYGAEALERMFLSFPTT 
KTYFPHCI 


5751 


19652 


A 


5797 


356 


238 


FLFCNNGVYSLEHFRCLAENAGDDAFVK 
DVTVLQNTDGR 


5752 


19653 


A 


5798 


379 


60 


VKIRHCHTMVSMS LLKNT RLI LEEEQVE 
GNYS FCVLDNQNLQQLWDWDHRS LTI KA 
GKMYFAFNPRLCVSEIYRMEBVTGTKGR 
QSKGDINTRSNGERASCE 


5753 


19654 


A 


5799 


377 


3 


KVAFPGEGRQGES WSAFQGKTR WS DLK 
WAVYEPDLMVTSS VGTYI YIWDI KDTRK 
LTVALFWAGASQVKWNKKNANCLAXSH 
DGDVRI VJDKRKPSTAVE YIiAAHLS K I HG 
LDWHPDSEHNLY 


5754 


19655 


A 


5800 


143 


316 


CPTSPPRLPYTEFFGGV3GLTVEQFRKI 
NGFPNAFWGWGGEDDDLWNRYLCRSFGV 
F 


5755 


19656 


A 


5801 


390 


2 


LFFFFFFFIiRQSLTLVPQAGVRWRDLSS 
LQPPPPGFKRFSCLCV 


5756 


19657 


A 


5802 


342 


465 


VIFFFKLKQGLTLSPRLECSGTI ITHCG 
LDLLGS SES PTFT 


5757 


19658 


A 


5803 


1 


378 


VYSSEFENIKEEYRGRGFPTICYFEKGR 
FLIQYDNYGSTAEDIVEWLKNPQPPQPQ 
VPETPWADEGGSVYHLTDEDFDQFVKEH 
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C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine, 
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SSVLVMFHAPWCGHCKKMNPEFEKASEA 
LHGEADS S G VLAAV 


5758 


19659 


A 


5804 


152 


1 


KPEIFFFESRSVTQTGVQWCNFGSLQPP 
PSGFKLFSCLSLPSSWDYTAHV 


5759 


19660 


A 


5805 


207 


31 


LYLQLLQQTASSGNLNTLSSLHPMGGKC 
FTAAEEQQH PGQQTQLTTVLGVWHRFCE 
NIF 


5760 


19661 


A 


5806 


199 


2 


GSECSCLLSRGGEMHVSLSSFKVTFSCD 
AI IHFSFLFFWRQEFPSVQAGVQWRDLG 
SLQPPPPGV 


5761 


19662 


A 


5807 


385 


239 


QDQPGLQKMC VCV CVCVCV CV CVCVI I C 
KSCIRLS YMKMP INFGLKNL 


5762 


19663 


A 


5808 


39 


184 


GR 1 1 1 KFFGEMRSHHAAQTGLELLGSSD 
PPTWASQSAAITGVSLRIRP 


5763 


19664 


A 


5809 


376 


2 


WAEPTSFHVCITSYTQFFRGLTAFTRVR 
WKCLVIDEMQRVKGMTERHWEAVFTLQS 
QQRLLLIDSPLHNTFLELWTMVHFLVPG 
ISRPYLSSPLRAPSEESQDYYHKWIRL 
HRVTQPFILRSV 


5764 


19665 


A 


5810 


271 


120 


FLCFIGKTSERGSFSLYSRDTGLPGCQV 
S IiM I KKSNEM VFKH INKGS I NR 


5765 


19666 


A 


5811 


367 


3 


DKSFKEWHKMLEDANAPTEERDERIAGL 
YEE I ERDLMLLGAS AVEDKLQEGVT ETV 
TSLSLANIKIWVLTGDKQETAINIGYAC 
NMLTDDMND VFV I AGNNAVE VR E ELRKA 
KQNLFGRCI 


5766 


19667 


A 


5812 


294 


37 


FRGENPDDGVRGS P PED YRLRQVAS SLF 
RGEHHSRGGTGRIiASLFSSLEPQIQPVY 
VP VPKVSHWAFF PNS S LEPLT YKA I FLP 
TA 


5767 


19668 


A 


5813 


184 


3 


1 1 S YLKNYLLFS PLQRQNFVLLPRLECS 
GTI TAHCNIjKLVGSGDP PAPASQVARTT 
GMRH 


5768 


19669 


A 


5814 


468 


1 


DDMEYEGSVSVFTQVPRKSVWKGLDYTL 
MVAGEFGLGKSTLVWSIiFVSDLYRDRKL 
LGAEERI MQTVE I S KHAVDIEEKGVRLR 
LTI VDTPGFGDAVI STECGKRRAMY 


5769 


19670 


A 


5815 


24 


314 


QAPFGPKCFNVNLCFSL3LSKGEPEFHY 
IAGAHGNEVLGREIiLLLLVQFVCQEYLA 
RNARIVHLVEETRIHVLPSLNPDGYEKA 
YEGVMALAHLTQ 


5770 


19671 


A 


5816 


474 


3 


TTLRKG YSRHKGLLHS I RQEVQATPLEG 
I IVSGQMSTMNLSSTLWPSPKRVRRLCI 
GRTLRSLRIRPCKRTPLQTRRMPQKMRA 
LYGDPGDSPPLRSRAIPASSPGSTGSFR 
ARPPHPLHPPPHLTREGEPPLLPVLQPR 
RGPPLPLHSQASRPIV 


5771 


19672 


A 


5817 


511 


389 


GMYCTEGREWPTFRWEI EIEEDHCGRV 
SASIILTHCSNR 


5772 


19673 


A 


5818 


510 


2 


PLSCEVPNPLFPVPGVRMIQHHIEACAQ 
VRGMQEIVLIGFYQPDEPIjTQFIEAAQQ 
EFI LPVRYLQEFAPLGTGGGLYHFRDQI 
LAGSPEAFFVLNADVCSGFPLSAMLEAH 
RRQRHPFLLLGTTANRTQSLNYGC I VEN 
PQTHEVLHYVEKPSCIAAAIEDQASSRQ 
R 


5773 


19674 


A 


5819 


282 


152 


FSLFLFLLQSERHFIMQVVCEATQCPDT 
RVSVRLYEKSVLHLS 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Pheny (alanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
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X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 


5774 


19675 


A 


5820 


347 


1 


E PHPVTP PAVGTSRYNPLIMMDRNRSAV 
TPPSHPPQQPSSMQTGMNPSAMQGPSPP 
PPPPS YMHI PRYSTNP I TVTVSQNLPS G 
QTVPRALQ ILPQI PSNLYGS PGS IYIRQ 
TKV 


5775 


19676 


A 


5821 


482 


13 


I YEKL I TGCYN I LANHADPNSGLDES I L 
EECLQYLEKQLESSQARKAMEEFFSDSG 
ELVQIMMATANENLSAKFCNRVLKFFTK 
LFQLTEKSPNPSLLHLCGSLAQLACVEP 
VRLQAWLTRMTTS 


5776 


19677 


A 


5822 


608 


7 


FRFAEKVVEGMFIIVNSITIKIHSKAFH 
AS FELWQLQGYS VNPNWQQSDLRLTRI T 
DPCRGEVLTFKE I TWQTLR I EADATDNG 
DQDPVTT PLRL I TN QGRIQ I ALKRRTKD 
CNVIS SKLMFLLDDLLWVLTDSQLKAMM 
KYAES LS EAMEKS AHQRKSLAPE PVQ I T 
PPAPSAQQSWAQAFGGSQGNSNSSSSPV 
LRPL 


5777 


19678 


A 


5823 


328 


3 


AEVASEDCSLPCFLAVWNRI IEPVAAMR 
KEADMLRLFPEYLKGEELFGLTVHAVLR 
IAESLPGVESCQNYLFRYGRHPLMELPL 
M I NPS GCAR S EH KI LTHYKRPSLY 


5778 


19679 


A 


5824 


69 


308 


TSVPSCVRCRYI I LRTS S ALTN I LGTHS 
NSSFFHASSSALHSCCFPFFSWLQTLDI 
NVKAPALMTKAWPEMEKRGYRE 


5779 


19680 


A 


5825 


424 


2 


LQRAFSESIRKLRGYQGADRKQIYHRRC 
SFANHSVRPSADEKCNSNFFEQRHGGSH 
QSSKWTPVGPAPSTSQSQKRSSGLQSGH 
SSQRTSAGSSSGTNSSGQRHDRESYNNS 
GSSSRKKGQHGSEHSKSRSSSPGKPQTV 


5780 


19681 


A 


5826 


107 


2 


SSLTAGVRMG VPAQS TQGTVNGS S PQMS 
GTAALTS 


5781 


19682 


A 


5827 


2 


437 


FPTEDSRTSKESMSEADRAQKMDGESEE 
EQESVDTGEEEEGGDESDLSSESSIKKK 
FLKRKGMTDSPWIKPARKRRRRSRKKPS 
GALGSE SDKS SAG SAEHIGP CDS TGDME 
VSSGLPGSRRPENPVFLCIWVAVRAICP 
GFLWS 


5782 


19683 


A 


5828 


487 


22 


MPEPVFPLSHFRQFIAAIKLQFQARLSR 
CVRDLVRSLAAPNHDTLRMLFQHLCRVI 
EHGEQNRMSVQSVAIVFGPTLLRPEVEE 
TSMPMTMVFQNRWELILQQCADIFPPH 


5783 


19684 


A 


5829 


2 


871 


RGICSQRWRREGSQSRGPGLVITSPSGS 
LVTTASSAQTFPISAPMIVSALPPGSQA 
LQWPDLSKKVASTLTEEGGGGGGGGGS 
VAPKPPRGRKKKRMLESGLPEMNDPYVL 
S PE DDDDHQ KDGKT YRCRMCSLTFYS KS 
EMQIHSKSHTETKPHKCPHCSKTFANSS 
YLAQHIR IHSGAKP YS CNFCE KS FRQLS 
HLQQHTRI HSKMHTETI KPHKCPHCSKT 
FANTSYLAQHLRIHSGAKPYNCSYCQKA 
FRQLSHLQQHTRIHTGDRPYKCAHPGCE 
KAFTQLSNLQ 


5784 


19685 


A 


5830 


463 


1 


LPESRIjPSPHKKEEGSRARVIMTSYPIE 
PHERRKGSLADWATLKQKKLEEMTRTE 
QEDSSCMEKLLSKDWKEKMERLNTSELL 
GEIKGTPESLAEKERQLSTMITQLISLR 
EQLLAAHDEQKKLAASQIEKQRQQMDLA 
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K=Lysine, k=Leucine, M=Methionine, 
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RQQQEQIARHQQLY 


5785 


19686 


A 


5831 


462 


2 


MILFDSSLYFLCGWYLSNLILGTYGVRK 
PWYFPFTASYWKSVGFLVEKRQYFLSSS 
LFFFNENFDNKGSSLQNREGELEGSAPG 
VTLVS VTKE YEGHKAWQDLS LTF YRDQ 
I T ALLGTN GAGKTT I ISMLTGLHPPTSG 
T 1 1 INGKNLQTVY 


5786 


19687 


A 


5832 


396 


2 


VPSTPASKRKGIVPRCDIFIYDRGGALP 
KD I HVAG I L FASAWMNVWPLLS AI E FHS 
AWAMGMGVNFFVAS THHVSLNMSGSGRY 
APNGRKVYH YDMKTEMGELLLS EVDSHP 
LSSLAYPTADNWNAYATLY 


5787 


19688 


A 


5833 


384 


137 


EEICbNPGAGGCSEPRLWHCTPAWATKR 
DS S KKKE KNCLS KNKKE KE KEMFACV C V 
CVLFS I YAFPS KAQE LSNPCKHYGNF 


5788 


19689 


A 


5834 


171 


2 


FVEGINISGNFYRNKLKYLAFLRKRMNT 
NPSRGPYHFRAPSRIFWRTVRGMLPHMY 


5789 


19690 


A 


5835 


278 


404 


SVLI KFEKKTLKYKAI WPGAVAHACNPS 
TLGGRSRGITRSGE 


5790 


19691 


A 


5836 


417 


3 


SKGKSGRCEWQYDLPSGTHAVFQFFLKQ 
QGRVPGLTQAVQVAKMSLI DLAGS ERAS 
STHAKGERLREGANINRSLIALINVLNA 
LADAKGRKTHVPYRDSKLTRLLKDSLGG 
NCRTVM IAAI S PSSMTNEDTYSTHV 


5791 


19692 


A 


5837 


409 


230 


AGRAYCYNGMCLTYQEQCQQLWGPGKAL 
LGSTALPSPPLPSQPPLALRVW3NLGFK 
WHL 


5792 


19693 


A 


5838 


3 


409 


LRS VP CKDYLTQNHYITS PLS SEEAAFP 
LAYVMVIHKDFDTFERLFRAIYMPQNVY 
CVHVDEKAPAEYKESVRQLLSCFQNAFI 
ASKTESWYAGISRLQADLNCLKDLVAS 
EVPWKYVINTCGQDFPLTTTRPV 


5793 


19694 


A 


5839 


385 


174 


GLAVEIGSRRIAEDGLELLASSDPPTSA 
SQSAGSTGVSHHAWPDTVSSVYWCFLCC 
TKQVWGALS IVSL 


5794 


19695 


A 


5840 


260 


2 


PliPRYTAACSRLLVQ YKAAFRQ VQGSE I 
SSIDEFCRKFRLDCPLAMERIKEDRPIT 
I KD D KGNLNRC IAD WS FY I TVMD KLRQ 
CI 


5795 


19696 


A 


5842 


299 


3 


FSGIKYI CI I FLFVCFFL YLLG IGS P YV 
AQAGLTLLiGSSDPPPSASQSPGIIGVSH 
YVQ P IHT VAQLS P S S I FRTFS S FQTETV 
PIKHTSHSPSPSLY 


5796 


19697 


A 


5843 


391 


2 


APHAPAFLRARGEPQDPLSHPRVPAVSA 
NCRMWKKLPVHSSPTPRLTPLWKLQARW 
LIjPQDVYLQGWGSYSLLRPAALI smvll 
ARE FbYP AKMSVSEVCS SGLSS PLLEQH 
KTNLI FYASGDICSANV 


5797 


19698 


A 


5844 


479 


59 


F VGME PGVGHAS LAMHGLH I RSLGK I KL 
RKRKCTLFFNTQEKSARRRGHLLGENIY 
IiLL FA I ALR I IjNCIjLVQTS FVPDE YWQS 
LEVSHHMVFNYGYLPWEWTERLRSYAYP 
LICASIYKVLHLLGKDSVQMLVSGNKSS 


5798 


19699 


A 


5845 


406 


548 


SVQTKPFFETESHPVTQAGVQWHNLCSL 
QPLPCRSKQFLCLRLPSSV7 


5799 


19700 


A 


5846 


2 


394 


TLCHRAKDPTVHHESGR IMYLNGLCFLM 
GPAQLTQRLSVSRQGEC 


5800 


19701 


A 


5847 


472 


201 


LSILSDTSLCLGRFFTYENGCAYFHEEE 
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REGLARICRLS IHSR YBDFWDGFNVLY 
NKKPVIYRSAPARPGLGQSLCNQVMWYQ 
VAILNK 


5801 


19702 


A 


5848 


127 


3 


S SNVKAAWGKVGAHAGEYGAEALERMFL 
SFPTTKTYFPRCI 


5802 


19703 


A 


5849 


238 


1 


MAETHPNPTGLPLAPALPCLAGSATHPC 
LFSPEISLDVDADRDGWEKNNPKKVPS 
FQGRHHPEPRASEDTAPDPAGMY 


5803 


19704 


A 


5850 


423 


2 


LSPLGNWSSANOKWFDSVKPSRKDLFA 
VDTQVGTVP SLTAGEQGFGDGG KGRAGE 
QATQAAGDWSSNMWGSDGI LSHSAPLTV 
PQPLLTGPQGPCLCRPLPNQQTSFPALL 
PFS TQLCSMCPG PAS ARPP PLLLKPTM Y 


5804 


19705 


A 


5851 


326 


682 


PSYSLLSLLQVKNEVEKLPRQQRKESMK 
QKMEEHTQKKQLLVSPKPQLGSAGAGAR 
PGMVTRPPLCLCPI PAPQI PRS EYLNLA 
S S FMAPLS P P P CS LHRLS CLHLEVRNS I 
SGYPNCI 


5805 


19706 


A 


5852 


263 


30 


HEKTDDERGPGQSARSGAI TKPPGPPLP 
IEPHETTPEHPAPSGTIPEPPLPVELHE 
TTPQHPVPSGTIPEPPYLLSQ 


5806 


19707 


A 


JOJ J 


^R9 


7 


GLPTQREKFGASMRTRMTIQS IAVVTSF 
VCLVGEGNNVQG FRAES RCWR YDPRHNR 
WFQIQSLQQEHADLSVCWGRYIYAVAG 
RGYHNDLNAVERYD PATNS WAYVAPLKR 
RCMPTKAKRWRGRCISP 


5807 


1970S 


A 




^79 


2 


GRSLRYSGS CSGEENSTNNS AGQSRAVT 
AAAARRRGNSHNEYYYEEAEHERRVRKR 
RARLWAVEEAFTH I KR LQEEEQKNPRE 
VMDPREAAQAI FASMARAMQKYLRAAKQ 
QNYNTMESILQHC 


5808 


19709 


A 


5855 


374 


2 


SDAGAPVN I YE FRHRPQCLEDTKPAFVK 
ADHADEVRFVFGGAFLKGD I VMFEGATE 
EEKLLSRKMMKYWATFARAGNPNGNDLS 
LWPAYNIiTEQYLQLDLNMSLGQRLKEPR 
VEIWNSTIPPCI 


5809 


19710 


A 


5856 


516 


18 


PYECKECGKAFNCGSSLVQHERIHTGEK 
PYBCKECGKAFSRGCHRTQHQKIHRGET 
PIKCKECGKAFSWGSSLVKHERVHTNEK 
SYECKDCGKAFGSGYQLSVHQRFHTGEK 
LYQRKEFGKTFTHGSKLVKERTHSNDKP 
YKNKECGEAFLWTTYSNEKCIASAK 


5810 


19711 


A 


5857 


392 


3 


CERGMGGS PRALGRHWTS FLKLRLNCS V 
PGDSTFYFDVLQALTGPVNLHGRSALFG 
VFTTQTNSIPGSAVCAFYLDEIERGFEG 
KFKEQRSIiDGAWTPVSEDRVPS PRPGSC 
AGVGGAAKFTSSRDLLY 


5811 


19712 


A 


5858 


391 


3 


ARRTTGMVPKAGGGKGRRGAVFRSYIRE 
IEELRSKLVESEAMNESLRRSLSRASDR 
S PYS LGAS PAAPAFGGSPAS S MEDAS EV 
IRRAKQDLERLKKKEVRQRRKSPEKEAF 
KKRAKLQQENSEETNGE 


5R19 


1Q713 

J7/JJ 


A 


JOJ7 


409 


j 


SLPGEGDPWWAGQEELLFVQEGKLSLPV 
AERVLLRIACRYDPRSNSWAEIAPMKNC 
REHFVLGAMEEYLYAVGGRNELRQVLPT 
VERYCPKKNKWTFVQSFDRSLSCHAGYV 
ADGLLW I SGGVTNTAQ YQNRQC I 


5813 


19714 


A 


5860 


405 


135 


NLS RLS LCRQPMTLVTWVLLF YDS EEKA 
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FWL LVAL WKRML P D Y YNTR WGE CPR VL 
PRLGLQVPAGSMRVAALPRPSSRLHVGS 
LGVRG 


5814 


19715 


TV 

A 


5861 


ion 

389 


2 


VVCT?T T?C/^T TTTD TTTvl A TTT Kirit7T>T t/"DT T^T T 

FVTFQDSRMAKRVRKDYKYVQCGVQPQQ 
SSVTTIVKSYYWRVTMAPKPKDI IWKHL 
SVRRFFWWARFIAINTFLFFLFFFLTTP 
AI IMNTIDMYNVTRPCI 


5815 


19716 


A 


5862 


525 


0 


SHLPESERIHHTVGQKEQVLMDTSKTRP 

NJNL) V PbPPMfc'iAUy VbNUDKPJiGoV,fc«.L)k 

EKKESSLPKSFKRKISWSTKGVPAGNS 
DTEGGQPGRKRRWGAS TATTQKKPS I S I 
TTESLKSLI PD I K PLAGQE AWDLHADD 
SRISEDETESKGDDGTHDKGLTIC 


5816 


19717 


A 


5863 


399 


2 


ATRRNRNR VPSGM i RTNVREMI AA vG PG 
PSPYPLPPPPEGTSS IEYSNQGNTCQGH 
GNFDFPHGNPGGTSMNDFMHGPPQLSHP 
PDMPNNMAALEKP I SHPMQETMPHAGSS 
DQPHPSMQQDKARNTPQPMY 


5817 


I97I8 


A 


5864 


3 


714 


RRPFIALCLSNVAFMLPWQFAQFILFTQ 
IASLFPMYWGYIEPSKFQKI IYMNMIS 
VTLSFILMFGNSMYLSSYYSSSLLMTWA 
1 1 LKRNE IQKLGVS KLNCWLI QGSAWWC 
GTIILKFLTSKILGVSDHICLSDLIAAG 
ILRYTDFDTLKYTCS PEFDFMEKATLL I 
YTKTLLLPWMVITCFIFKKTVGDISRV 
LATNVYLRKQLLEHSELAFHTLQLLAFT 
ALAILILRLKLVL 


5818 


19719 


A 


5865 


423 


1 


APPVSTAVAQSNSSEEEAREVGSPAQEF 
AYQKSiiPPRFURQQQQQ^EQijYAMQhW 
QPVYPPPSHPQRTFYPHHPQMLGFDPRW 
MMMPSYMDPRITPTRTPVDFYPSALHPS 
GLMKPMMPQESLNGTGCRSEDQNCVPPC 
I 


5819 


19720 


A 


5S66 


497 


? 


AVGAGQKGGRGGGGRELMPPQKIWGGGA 
FTNNAHVG P LK I M P R1j I RTGMLE E I HLE 
PGWQGLFPCVDELSDIHTRFLSQLLER 
RRQALCPGSTRNFVIHRLGDLLISQFSG 
HSAEQMCKTYSEFCSRH3KALKLYKELY 


5820 


19721 


A 


5867 


382 


74 


LALSPRLKCSGQI IAHCSFDLLGSSDPP 
PS ASR VAGTTGAR RHAR L P YS IjR H YH FV 
LRLFNPSKHSKLKHFHPILIRNPGKVGS 
YS FQMSDLLQSRRARQQS 


5821 


19722 


A 


5869 


378 


3 


or bKbAN Jj J. bHyKJn 1 OliKir r yLALUGK 
SFSRSPNLIAHQRSHTGEKPYSCPECGK 
S FGNRSS LNTHQG I RSGSKPYECKECGE 
SFSYNSNLIRHQRI LTGEKPYKCTDCGQ 
RFSQSSALITPV 


5822 


19723 


A 


5870 


375 


3 


IHVJ. VFrJyAI GHViyiAis-R /FDTYSPHEDE 
AMVLFLNMVAPGRVL I CTVKDEGSFHLK 
DTAKALLRSLGSQAGPALGWRDTWAFVG 
RKGEAECHWADTELNRRRRRFCSKVEGY 
GSVCSCKDPTV 


5823 


19724 


A 


5871 


373 


1 


QPEEVSGALSPPSASAYVKLVLLGLIMC 
VSLAGNAILSLMVLKERALHKAPYYFLL 
DLCLADGIRSAVCFPFVLASVRHGSSWT 
FSALSCKIVAFMAVLFCFHAAFMLFCIS 
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E=Glutamic Acid, F=Phenylalanine, 
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N=Asparagine, P=Proline, 
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i i in cum lie, v — vjiiiic, 

X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














VTRNMAIAHHLY 






7\ 


DO 11 


ISO 

JO J. 




i\jr*j\4X\j\jxj-x x. ii±Ji\. x n x r r d\^jr xi>v,ij 
KAFRE KS TV 1 1 H YR TH TGEKP YE CNE CG 
KAFTQKSNLIVKQKTHTGEKTYECAKCG 
5SFIQKLDLIIKHSTHTGKKPHECNECK 
KTFSDKS TL 1 1 PHV 


JGZJ 


1 mo/: 


7\ 

y-i 


JO ID 


3 1\) 


i 

3 


MnPWflDW T ^f^TM <ZC1 POKMT .MnQHFPTn 
J JOix vi\j\\^x\ ± our v rio vjzr ^/i\i ¥ JJLji T jyo^J7 r X\> 

GQQGFCEGKEPYQAMSQNMGNTQDMFSP 

DQSSMTMSNVGATRLSHMPLPPASNPPG 

TVHSAPNRGLGRRRSDLTIS INQMGSPG 

IGHLNSTTCI 


jozo 


\y F/.I 




JO/4 




L 


SLVRGQISEYNISLRASDGGSPPLSTET 
HITLHVIGINBNPPTFPHLSYSACIPEN 
NPRGAS I FSVTAQDPDSNNNAR I TYAL7 
EDTLQGVY 


con 

5827 


19728 




CC7C 

5875 


369 


I 
I 


PGDAVEKWLiNDLIjCIjDCIiNIARIVSVCP 
LPEACDLSYVNRDTLFWCHKASEVFLQR 
LMALYVASRFKNSPNDLQMLSDAPAHRL 
LNKCLLCPPPV 


5828 


19729 


A 


5876 


119 


39 


VIQDYTTPPNEELSRDLVNKLKPYMR 


5829 


19730 


A 


5877 


461 


2 


RRGWTSRRKPKEDPSGAAVPEMPKKSSK 

PGMKSMPGKSPSAPAPSKEGERSRSGKL 
SSGLPQQKPQLDGRHSSSSSSLASSEGK 
GPGGTTLNHSISSQTVS3SVGTTQTTGS 
NANSVQLPQPLCI 


5830 


19731 


A 


5878 


503 


3 


PSPQVPGECPSPKKLGARAAFTTPDPAP 

GSGMKDVPSWLKSLRLHKYAALFSQMSY 
E3MMTLTEQHLESQNVTKGARHKI ALS I 
QKLRERQSVLKSLEKDVLEGGNLRNALQ 
ELQQI I ITPIKAYSVLQATVAAATLY 


5831 


19732 


A 
/A 


jo/9 


421 


34 


T . \7T4T07T , Q A C\n/T \TV\ W FfJT\7TC T TT> Y QTQK 

i-» v nrv. V JLio>\OiN V J_i v %J±\Sj\j X v XV _L IJJloXiJlv 

RLADI CKEDVFEQTRVRFSVNAMHYRTG 
NKGDWRLGLLMLSLSQGQECAEYPVAI 
PSGLPGGFQDFLKKCVCLDDKKSMESPA 
WETKQN KS PANVLRPL 


jo3z 


iy/33 


7\ 


coon 

jooU 


-Ian 

3 /y 


3 


QNWKSPGLSLKTRAPGQGSVLSSSDLPR 
AVLTGTFAVMSVMVGSVTESLAPQALKG 
SMINETARDAARVQVASTLSVLVGLFQV 
WNSQEYVPPTTCI 


5833 


19734 


A 


5881 


429 


1 


S LVS YMDTES TAE KLGKELGLQAKELS A 

^PTTF T^VKTTI^MT.F^ r.FMT?RPf»F<5 <?M 
v noor.xiu ivj v I'juoi'ijr oudi luur jjdooif 

TKSFHSAVEI KNRSQHELPCFQHHGIDT 

PTSLQKRSACCPPSLLRLEETASSSQDG 

DPQI WDDLPFSESLNKFLTVLESE IAVT 

QCI 


5834 


19735 


A 


5882 


436 


3 


WSLVSYNDKKS TAE KLGKELGLQAKELS 

A^n-TCI^WWFTf^TKrnQTJT .F^T.FMPFPT.FC; c 

NAKSFHSAVEIKNRSQHELPCFQHHGID 
TPTSLQKRSACCPPSLLRLEETASSSQD 
GDPQMWDDLPFSESLI KFKTVLESEIAV 
TQCI 


5835 


19736 


A 


5883 


195 


353 


DS YS YVR S TAPA VA YDS KQY YQQ P TATA 
AAVAAAAQPQPSVAETYYQTGGFS 
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peptide 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparngine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W^Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop cod on, /-possible 
nucleotide deletion, \~possible 
nucleotide insertion 


5836 


19737 


A 


5884 


432 


1 


LSAKVPVASVSDQAAAMHLSQCPKNLAT 
SLAELRTAS QKAHKLCG PME I DS ALNTV 
QTLKNELQDAKMAAVESQLKPLPGETLE 
KCAQDLGSTSKAVGSSMAQLLTCAAQGN 
EHYTGVAARETAQALKTLAQAARGVAAS 
TTDP 


5837 


19738 


A 


5885 


313 


1 


GKS PHLLILQTS PLSKAPQPLILLLLYW 
SWVGWSVTAASRETEAGIQVSSEEPGP 
RNVS PHSRLQDETNLRLEAENNLAAYRQ 
VREVQGRDREGRKGLMATLY 


5838 


19739 


A 


5886 


432 


3 


GSGFCSPILCLWWVEEAVGI IRLHLWLR 
ELGCWHSFSLPGTYSLEFLSTCTQVLPY 
SWS GLLFAS GN YLS LYL V PSRAETD S RG 
SQPLPVARHGVGACKCKPRQTRPGLS PQ 
VCVEKLMPLSSFCSAFQQNTYNKQPMSR 
PV 


5839 


19740 


A 


5887 


417 


2 


AS LMVAPDYAE I S PLAMPAPS SGWCTP 
IMSTSSSEAWSTPLMLAPDSGELSPILM 
QDMNPGVMSTQPVPAPSFEAMSPLQITD 
EDTEAMS KVLMTALASGE I S S LLM S GTD 
SEAIS SLIMS AVASGGTSPQPTSTLY 


5840 


19741 


A 


588S 


420 


3 


KFDL. I KKLDLKTMS SYDLD I E AS DGGGL 
SGKCSVSVKVLDVNDNFPELSISSLTSP 
I PENS PETEVALFRI RDRDSGENGKM I C 
SIQDDVLFKLKPSVENFYRLVTEGALDR 
ETRAEYNITITITDMGTPRLKTEQSV 


5841 


19742 


A 


5889 


321 


414 


FNMRI WPGAGAHACNPSTLGGRGGRI TR 
SAD 


5842 


19743 


A 


5890 


415 


1 


PGLPGTSVKGI PAS KQSPHESPRTLKLK 
TSPI IQQLGLYLSHTAIRYHPQETXKEF 
VQLVCPDAGQQAGQVGFLNPNGS SQDKV 
HNPFLPTPMLPPPPPPPMARPVPLPVPD 
TKPPTTSTEGGADYPTSPTYSTPSLY 


5843 


19744 


A 


5891 


193 


3 


KPSSKVWSRRDPFLLSSVEQPIKDAVIT 
VPVFFNQAERRAVLQAARMAGLKEMQL I 
NDNTACI 


5844 


19745 


A 


5892 


412 


2 


KHQVSDNKDSFYVSLYPDFSSLSRAILD 
WQ FFKWKTVTWYDDSAGL I R LQEL I K 
APSRYNLRLKIRQLPVDTKDAKPLLKEM 
KRG KEFHVI FDCSHEMAAG I LKQALAMG 
MMTEYYHYIFTTLDLFALDVEPSV 


5845 


19746 


A 


5893 


413 


3 


ELLLCSNTSCCRCFCVECLEVliVGTGTA 
AEAKIjQE p ws C YMCLPQRCHGVLRRRKD 
WNVRLQAFFTSDTGLEYEAPKLYPAI PA 
ARRRPIRVLSLFDGIATGYLVLKELGIK 
VGKYVASEVCEES IAVGTVKHEWR 


5846 


19747 


A 


5894 


424 


3 


CSGRRE PS VRGVQCKGDQGHHSARMAP S 
EAPGTRS CTPSHGQNTAAEATPAQKTPA 
KWYVFSTEMANKAAKAVLKGQVETTWS 
FH I QN I S S SKTE RS TAPLNTQ I S ALRND 
PKPLPQQPPAPASQDQNYSQNTRLQPCI 


5847 


19748 


A 


5895 


400 


1 


ASVQNPALRLVTREEFAIMQTPAGELYD 
KSI IQSAQQDS IKKANMKRENKAYSFKE 
QI IELELKEVSTRRHLRKVAAELGGLKL 
VAFPTAHML.FTWWILRPMAHSSDAPDQC 
LVHWHTFPSPLLHS FFHPQLY 


5848 


19749 


A 


5896 


421 


3 


VSS IQMGAVNLLGGGLDSLLGSDLGGG I 
AGS PAVGQSFI PS S VQATCAPS PTPAW 
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C=Cysteine, D=Aspartic Acid, 
E=G)utamic Acid, F-Phenylalanine, 
G^Glycine, H=Histidine, l=Isoleucine, 
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N=Asparagine, P=Pro)ine, 
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X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














SSGLNDLS ELSTW IGMAHGG YVAPKAVW 
LPAVKAKGLESSGTFTRRQGHIYMAMDF 
TNKALQHMTDSAI QFTKNS FGVI PS C I 


5849 


19750 


A 


5897 


384 


1 


PRASRFIHEAGIVPLPMRTHTPKKPFTC 
EVAGVQPSRNVKVKIHMRKHTGERPYSR 
QSCSVRFVHSYDLKNHMKLHTGDRPYEC 
HLCHKAFAKEDHLQRHLKGQI CLEVRTR 
RRRKDDAPPHYPPPCI 


5850 


19751 


A 


5898 


429 


2 


QDIAAWQSLTQVLTPESWRKANIMTEPQ 
KSQERYKGIYVKEKLYRRARHDESLNWT 
SCDHHESQECKGEDPGRHPNCGKNLGMK 
STVEQHHWHVLPQPFTCNNCGVAFADD 
TDPRAHPSTHLGEKSYKCDQYGKILSQS 
LY 


5851 


19752 


A 


5899 


424 


I 


GTMLQALAAHDAGSRAHVLLSLSQQDGI 
EQHMDFDSRYTLLELFAETTS SEEHCMA 
FEGIHLPQIPGKLLFSLVKRYLCVTSLL 
DQLNSSPELGAGDQSSPCATREKSRGQR 
ELEFSMAVGNLISELVRSMGWARNLSEH 
V 


5852 


19753 


A 


5900 


135 


459 


YFMLKAEHVISYVCSENSLMVWNMRFC 
LIGLIHIQEFYFFEMEPHRAIQAGVQWC 
DLCSLQPPPPGLKRCSCIPPSSSRDYRC 
APPYP 


5853 


19754 


A 


5901 


418 


10 


GRSPKPGDRLWESVHLSFPGPADSLSGG 
SAPAS SYEPSQRS S FSSNRSQRGSTSTR 
NSSQKGSS VLS I KQKGKRELYMEKLQEH 
LI KAKAFT I KKKFSQNLVAQLW YFVKCV 
YFGLSAYQIPRGYPTRVLGNFLTK 


5854 


19755 


A 


5902 


402 


2 


GYRHPELEIKSVDGFQGPEKQPVTLSLV 
RSNRKGEVGFPVEDRRINVDVTRARRLV 
TVICDYRTVSSHAFLKPLVEYFTQHGEV 
RTAFEYLDDIVPDNYSHEDSQGSSQAAT 
RPQGPGTSTRTKKQRQEGGMY 


5855 


19756 


A 


5903 


346 


1 


GICPLSHI KMTDARCQIYEMKGGEKSPK 
DTGKB PGHSEAKTGP PQVLAGVPAQPEA 
PQPGPNTTAAPVDSGPKAGLAPETTETP 
AGASETAQATDLSLS PGGESKANCS PED 
PMY 


5856 


19757 


A 


5904 


139 


212 


EVENNDD I SHHHHHHHHHHSNS KS 


5857 


19758 


A 


5905 


2 


345 


PCGCWGRCALLLI SAAAKAKS KCGPTFL 
PCASGIHCI IGRFRSKGFEDCPDGSDEE 
NCTANPMLCSTARYHCKNGLC IDKSFI C 
DGQNNCQDNSDEESCESSQGVGYVCGFK 
KA 


5858 


19759 


A 


5906 


432 


17 


PQTTPHRTFGGGKAAWLLAVGGQFLLC 
WSPYFSFHLYVALSAQPISTGQVESWT 
WIGYFCFTSNPFFYGCLNRQIRGELSKQ 
FVCFFKPAPEEELRLPSREGS IEENFLQ 
FLQGTGCPYKSWVSRPLPCIAAALRV 


5859 


19760 


A 


5907 


407 


1 


PYACGECGKSYRQSSSLVSHRRIHSGVR 
PHHCDECGEFFSRKYDLLIHQRVHSGER 
PYKCS ECGKS FSHS CS L IAHQR I HTGMR 
PYECJbECGISr XH&L.SLi± lyyRVHLGTR 
PYMCS ECGKS FSQSCRLIKHRSV 


5860 


I976I 


A 


5908 


419 


3 


VLAKKTI ITKSARDCHEFGNILHLSTNL 
VASIQRPDKHESFGNWMVDNLDLFSRSS 
AEMKYDNGCAKLFFHTEYEKTNPGMKPY 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenyl alanine, 
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K^Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
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W=Tryptophan, Y=Tyrosine, 
A— Unknown, ==otop cooon, /=possiDle 
nucleotide deletion, \=possib!e 
nucleotide insertion 














GYKECGKGLRRKKGLSLHQRIKNGEKPF 
ECTACRKTFSKKSHLIVHWRTHTGVY 


5861 


19762 


A 


5909 


441 


57 


KVMCsI FKSSLF QAIjVDIQEFYEVTLINS 
QKSCEQKIEEANQVLQKWEKTSLLAPCH 
DRLQKSSBLQSLIGQHSSLETTDGPQPW 
LFPRGSEGHRGKTYDHCHS IGLQIAVDQ 
RKMioH V lib KI KKI R V 


5862 


19763 


A 


5910 


3 


317 


CLLMLWRRWGSRGSRQRRLQATPSWEST 
KTADGQGDRRESGSQRHCSSPHPQSKST 
1 1 bo VFAAKEGAyEAAyCaCaFKKvay JLibKt r 
QVSTRQCAMPPARHRKLENDA 


5863 


19764 


A 


5911 


183 


1 


QNSDSKDSLNRVASRDHAKPNLTCHVSP 
AIQGTGS ISESSI PSVSDTSTPRRSRRQ 
LPPCI 


5864 


19765 


A 


5912 


395 


I 


KxIQFNICvKvFHRj? SNSNKDKIRYAGD 
KTFKCKECGKS FHVIiSRLTQHKR IHTGE 
NPYTCEECGKAFNWSSILTKHKRIHARE 
KFYKCEECGKGFTRSSHLTKHKR IHTGE 
KPY I CEKCGKAFNQS STPV 


5865 


19766 


A 


5913 


449 


2 


FGSHLEKEDEKKQELVDKAI KPS IEATL 
E S I QRKLQ YKRAE S S RPED I KDMTKAQ I 
ANEKVALQKALLYYESIHGRPVTKNERQ 
VMKPIiYDRYRLVKQILSRANTIPIIEEE 
EGSEADSOTKPDFMVNKKTDFSARMVKR 
PLGGSARPD 


5866 


19767 


A 


5914 


464 


23 


SAHKiTETRSKS FDYGSLSLTGPSAPAPV 
AP PARVAP PERRKCFLVRQAS LSRPPET 
ELEVAPKGRQESEEPQPSSSKPSAKSSL 
SQISSAATSHGGPPGGKGPGQDRPPLGP 
TVPYTEALPVFHHPVAQTHKHEKPYLPP 
LYCGRCG 


5867 


19768 


A 


5915 


366 


1 


LVS F INFFTS VLATLVWAVLGFKANI M 
NEKCWENAEKILGYLNTNVLSRDLIPP 

1TT TXTtf PITT rnrrir7"T"V\,rikjI'n»RVt'VrXTT T "T" fci! rm T\7" T* 7-\/"\ XT* c> 

HVNFSHIiTTKiJYMEMYNVXMTVKSIX3^ S 
ALGLDPCLLEDELDKSVQGTGLAFIAFN 
EAMTHSPACI 


5868 


19769 


A 


5916 


246 


366 


TSVKEQQMPGAVAHACNPSTLEGQGGRI 
TRSGVRDQPCQH 


5869 


19770 


A 


5917 


400 


3 


NDNAPEFYQS VYKVTVLENAFNGTLVI K 
LNATDPDDGTNGD I VYSFRRPVWPAWY 
AFTINPNNGEIRTKGKLDFEBKKLYEIS 
V E A V UISajJM X. tf ClftVjM l~ I LiLj V 11 V JjU V N UNA 
PEVTITSLSLPIREDTQPCI 


5870 


19771 ! 


A 


5918 


410 


200 


CTPPQPVKCRFLNRDRVSSCCPGWSQIP 
ISSKQPYFLPPSYR 


5871 


19772 


A 


5919 


214 


407 


MYFYIDRASLLSPRLECIWTIAHCSLE 
LLCSRDLPAS ASQSAG I TG IRHHTWLKT 
HFYSSFKT 


5872 


19773 


A 


5920 | 


476 


39 


VPYGLFVGSTGRLGLPYLRGTSHPLRGC 
LHAAALNGRR LVQPIjTRNKHEG CAEEFS 
ANDDVALGFCGSHS LAALPAWGTQDEGT 
LEFTLTTQSWQAPLAFQAAGWHGDFIHV 

U XI. EiUnii W 0 11 V X!j XvJ\£\3 X v 1 li-T 1 i IN o V tr V JL u 

AQPHM 


5873 


19774 


A 


5921 


430 


2 


SLRPSSTGPSPSGGLSEEPAAKDLDNRM 
PGLVGQEVGSGEGPRTSSPLFNKAVFLR 
PSSSTMILFETTKSGPALGKAVSEGAEE 
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nucleotide deletion, \=possible 
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AKLGVSGSRPEVAAKPALPTQKPAGTLP 
RSAPLSQDTKPPVPQEEAGQDHPPSKAS 
RC 


5874 


J 9775 


A 


5923 


411 


2 


iMSLKVD VEALENSAGAT YI RKKGG KVTG 
DSQPKEQGQGVliKKKKKKKKGMVPKNYD 
PKVTPDPERWLPMRERSYYRGRKKGKKK 
DQ I GKGTQGATAGAS SELDAS KTVS S P P 
TS PRPGSAATVSASTSN IIP PRLY 


5875 


19776 


A 


5924 


402 


2 


LQSGA I MNKF YQ PHE AH I P YLLQLF IDY 
NLYGMNLINIAAVKFRKARRKSNTLHAT 
GSCKNHLSGNSLADTLFRWEQDEIPSSL 
I LEGVE PQSTCELE VDAVAAD I LNRLD I 
EAQIGGNPGLQAIWEDEKQRR 


5876 


19777 


A 


5925 


215 


403 


AETSTYFFFFLERQFPFAPQLGGHGRNL 
NSLNLLPRGLREFSRLSLPGGWIYGGAP 
PSPTFFC 


5877 


19778 


A 


5926 


320 


1 


PGRRDRWI WGQGECLEYLLASMTQWCER 
EISSIAPGELCCLLLSFLPQEECQNYVR 
VL I VAGR KVFMCGTNAFS PMCTSRQVGN 
LSRTTEKINGVARCPYDPRHRV 


5878 


19779 


A 


5927 


400 


1 


WATDGGLPLLAS S ATVS VALQDVNDNE 
PQFQRTFYNASLPEGTQPGTCFLQVTAT 
DADSGPFGLLSYSLGAGLGSSGSPPFRI 
DAHSGDVCTTRTLDRDQG PS S FDFTVTA 
VDGGGLXSMVYVKVFLSDEMY 


5879 


19780 


A 


5928 


398 


62 


HNTEVLVGESVTLECSATGHPPPRISWT 
RGDR7PLPVDPRVNITPSGGLYIQNWQ 
GDSGEYACSATNNI DSVKATAFI I VQGG 
YPS S VPE PGLSP I LTPFLARAAI PTDAP 


5880 


19781 


A 


5929 


403 




LAS DGGD P VLS GTS R I C VKVLDANDN AP 
VFTQPE YRIS I PENTLVGTR I LTVTATD 
ADEG Y YAQ WYFLE KS PGETS EVFELKS 
TSGELTI IKDLDYEDATFHEIDIEAQDG 
PGLLTRAKVIVTVLDVNDNANV 


5881 


19782 


A 


5930 


408 


3 


VASDGGDPVLSGTSRICVKVLDANDNAP 
VFTQPEYR I S I PENTLVGSR I LTVTATD 
ADEGYYAQWYFLEKSPGETSEVFELKS 
SSGELTI IKDLDYEDATFHEIDIEAQDG 
PGLLTRAKVIVTVLDVNDNANV 


5882 


19783 


A 


5931 


126 


391 


PVLT CAH FHFS AGAGGSYFM I SRSLGPE 
FGGAEGLCLYLATTFATAM YILGAIQ I L 
LVSQAEP WSTLAM I KQFLWTPNRRLMVL 
I PTC 


5883 


19784 


A 


5932 


436 


359 


NKEWLPGSKVPEKSINDVKNTSGLL 


5884 


19785 


A 


5933 


365 


3 


LIIGMLTAI IGDLASHFGCTIGLKDSVT 
AWFVAFGTSVPDTFASKAAALQDVYAD 
ASIGNVTGSNAVNVFLGIGLAWSVAAIY 
WALQGQEFHVSAGTLAFSVTLFTIFAFV 
CISVLLYR 


5885 


19786 


A 


5934 


394 


3 


FQYFSRTDRVLKHERMCHENHDKKLNRC 
AIKGGLLTSEEDSVFSTSPKDNSLPKKK 
RQITEKRSSGMDKESALGKCDLKKVKND 
YSPLYSSSTKVKDEYMVAEYAVEMPHSS 
VGGSHLE YASGE I H PP CI 


5886 


19787 


A 


5935 


393 


1 


REDLIAGI DEFLDEVI VLPPGEWDPNIR 
IEPPKKVPSADKRKSVFSLAELGQMNGS 
VGGGGGAPGGGNGGGGGGGSGGGAGSGG 
AGGTSSGDDGEMPAMHEIGEELIWTGRF 
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FGGLCLDIKRKLPWFPSCI 


5887 


19788 


A 


5936 


410 


126 


PI CY YKKAGRARWLTP V I PS S WDYRHQA 
QLIFVFLVETGFHRVAQAGPELMTSGDP 
LAYNFLCSYPNEVFRSQASASDLQFVLR 
NLEPPDAGNV 


5888 


19789 


A 


5937 


426 


1 


DATFHE I D I EAQDGPGLLTRAKVIVTVL 
DVNDNAPEFYMTSATSSVSEDSLPGTI I 
GLFNVHDRDSGQNAFTTCSLPEDLPFKL 
EKSVDNYYRLVTTRALDREQFSFYNITL 
TAKDGGNPSLSTDAHILLQVADINDNAP 
CI 


5889 


19790 


A 


5938 


438 


3 


ADPTKGLLRNGSVCVRAPGGVSQGNSVN 
LKNSKQACLPSGGIPSLNNGTFSPPKQW 
SKESKAEQAESKRVPLPEGCPSGAASDL 
QSKHLPKTAKPASQEHVRCSAIGTGESP 
KES AQLSGAS PKES PSRGPAPPQENKW 
SPW 


5890 


19791 


A 


5939 


3 


191 


YS VCLWSQLLRRLRQEDHLS PGGGGCTE 
PWNTERLSKNKKTKKREREKQCGFHSM 
WYCNLiK 


5891 


19792 


A 


5940 


399 


J 


QYSDDNDMS WKVTRAAAKCLDAWS TRH 
EMLPE F YKTVS PAI.I YRFKE REENVKAD 
VCHAHHSLFKQTSPVQSWLCGLDAMDQG 
GT P LAMLQSQVAN I VKAVH KQM KEKS V K 
TRQCCFTLLTELVNVLLGAVY 


5892 


19793 


A 


5941 


411 


3 


SPCEGPRRFQCKSGKRVDGGKVCDVQRD 
CRDWSDELLKVWCGACLRPLAGLSLLPS 
PSWYLGSRFSSAPCPDTFCSGPLPGFMC 
RPMASHGAFRPQASGLHbYKVLRACPSQ 
VLKNYVFSHKLGLSSFLPRSDHV 


5893 


19794 


A 


5943 


431 


26 


KAWG I PGDMG P PG I TVR PG YNGLFGNP 
GVQGQKGEPGVGLPGLKGLPGLPGIPGS 
PEEKGSIGVPGVPGEHGAIGPPGHQGIR 
GEPGPPGLPGSVGSPGVPGIGPPGARGP 
PVGQG P PGLSG PLV I KGE VSRVR 


5894 


19795 


A 


5944 


396 


3 


PLPVELIRVPAFLDLFMQSLFKPGARIS 
QDLKHK Y I H I LAYAAS WETCKKNKR VS 
INKDELKSTSKAVETVHNLCCNENKGAS 
ELVAELSTLYQC IRFPWAMGVLKWVDW 
TVSEPRYFQLQTDHTPTV 


5895 


19796 


A 


5945 


465 


32 


ERVTLADITWCTLLWLYKQVLSPSFRQ 
AFPNTNRWFLTC INQPQFRAVLGEVKLC 
EKMAQFDAKKFAETQPKKDTPRKEKGSR 
EEKQKPQAERKEEKKAAAPAPEEEMDEC 
EQAMAAEPKAKDPFAHMNKSTFVLDDCI 
AAAL 


5896 


19797 


A 


5946 


414 


2 


ATP PVRCIRKE I RNW YVDI Q P VQE PKAQ 
ACGNGHGIII IAETSTGCLFAGSSLGKR 
GVNADKVAI EAAEMLLANLRHGGTVYE Y 
KQDQL I VFMALANGVS 1 1 KTGP VTLHTQ 
TAILYDEQIVKVNCKENHSVDEQVY 


5897 


19798 


A 


5947 


411 


1 


EPCWRRIACSSVQTDDEDGESRYLLSR 
RRRARRSADCSVQTDDEDSAEWEQPVRR 
RRSRIiPRHSDSGSDSKHDATASSSSAAA 
TVRAMSSVGIQTISDCSVQTEPDQLPRV 
S PA I H 2 TAATDPKVE I VR YI SAPCI 


5898 


19799 


A 


5948 


153 


409 


LPFFPILLPFIS FFRYRVGLLS PKLKCS 
G VI IAHCS LELLGS SNPPTS ASRVI FLY 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
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X=Un known, *-Stop codon, /=possible 
nucleotide deletion, ^possible 
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LSLCLFVFEMESCSVAQAGEQWRDLGSL 

Q 


5899 


19800 


A 


5949 


21 


447 


RAAAMSSRNFRFLSSHRELYFGVDLPSG 
NLVVREPADREQLCRAKAACVLTYDLVL 
EDPLELHKIRIHVLDTNENSPLFPAGDV 
QLHI PEFLTPGARFTLPNAQDDD2GSNG 
ILSYSLSPSQHFRLDMGSRVDGSEYPEL 
VL 


5900 


19801 


A 


5950 


268 


19 


QDNWAYYCTVPEYCKTKLNILAWWLVIN 
YLS GD F YF F F I F FLDGVSLCH PGW S AVA 
RSQLTATSSSQVQGRFSCLSHCIAAAL 


5901 


19802 


A 


5951 


389 


3 


QQMDLARQQQEQIARQQQQLLQQQHKIN 
LLQQQIQVQGHMPPLMIPIFPHDQRTLA 
AAAAAQQGFLFPPGI TYKPGDNYPVQFI 
PSTMAAAAASGLSPLQLQQLYAAQLASM 
QVSPGAKMPSTPQPMY 


5902 


19803 


A 


5952 


2308 


307 


RRRPARKKTWVRDGGPHQGL/ YPQLPSA 
VLQPTQPGHGPRALGKGRELESWQGKRP 
GK/GQTHRRKTRGTAS PAVYFS VEWGD \ 
SGGCPMTGKGAKQQRAGLGAAPNPELSG 
EQGTGRGRAQDSQGHGGGKGRRGQHPPG 
ND/PRGVSGMGQESVHSPEARPTGGGTG 
/EMLGRTWRHQQGLVGGTGNLLETGSWS 
GSGS WGRGWLRPQEGQVTTGMGLAGR / 
YQPARQRAVELS PGAQGLRQRRGG WSG P 
PQETEQSGVPGG*GSWPLESHQGEQ\PA 
AGQKQSLQMLPGGFPCWS/SGMGGGPQQ 
LLE S EGAG P S PGGGRHHGKGRVAVTTT P 
REGD\RGQSPGGHTLQLFPCPLWS * ETQ 
GQGSRTVQHQE WGGEGREGEGAG SDQSR 
ALGIA* ICPHTLRPTSMEPQSTAPAPGQ 
PPGPPS WGHRGKQGMG YGGRCPGQ/ IQG 
RGRAQLGSTVG*RDG/QRG*MQGRDQQG 
PRSNGAGVGSCSHTSQK* IPPSSLCT*N 
SSHGPASG\QLWWSSPFIHSPGETNIPH 
TLTEPHSVPGWCWDTLRRHGAGQGHPGM 
ARSGTGEGQ/ QRGRY* ERGRRGRRQQKQ 
KKQGLKEPG* RAAP TLKGATRPLCRCLR 
KVQKPKQDGDVGS * LLKVFRAPGALGTK 
PQRTCRGPADFFHALSGLSNVHRSYCST 
NSGFGACM 


5903 


19804 


A 


5953 


461 


117 


HPRDI IGPVHQGLPPSPQPPPPCKQAPT 

TPSS\M* p *tmrhdtglsgpvqvpgadw 

KPLNAP*LPELAFGEEPCRPGVPDPPYV 
WLPDS APEHCVTWATS PGLSDLLSYPRR 
KQA ! 


5904 


19805 


A 


5954 


845 


610 


F FETGS Y / S VAQ VG AQWCNFGS LQSQPP 
RLKRSCHLSPPSSWDYRGAPPRLARFFF 
FCI FYRNGVS PCCPGWS * TPELQ 


5905 


19806 


A 


5955 


1 


378 


KTPVSDR \ ATKCCSESLGNRRPCFS ALE 
AYETYVPKEFNAETFTLHADICTLSDKE 
RQINKHTALAELVKHKPKARQEQLKAVM 
DDFADFVEKRCMADDKETCFVEEG*TLV 
AASQAALGLLHRIK * 


5906 


19807 


A 


5956 


723 


343 


GCQEG I CPSPCPVI PATREAEAGESLEL 
RRRRLH*AEIVPLHS\KAQRSLCSSDFI 
RILVIFSGMFLVFTIAGALFLHQRRKYR 
SNKGESPVEPAEPCRYSCPREEEGSTIP 
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IQEDYRKPEPACSP 


5907 


19808 


A 


5957 


55 


491 


AQHFNPHNWP VT* LLPHFTNEGGQVTEA 
RCLKWQAR\CLVPMQKVPNAMPGT** PS 
AEVTPPVCNQHHWPQVEGIPSS/APAFQ 
WPLTMGHPC\WESAPRQDASTSPRPGTA 
SHCPSAVAFQERGSSPCPPHHSDSGLCL, 
LPRLLPAP 


5908 


19809. 


A 


5958 


2 


413 


KKPDQ\AE\REHLCTSL\WSGRNTDKNG 
EELHGGKRVMERLKKAVKIANQCMDPSL 
QMQLYIEILNRYIYFYEKEYDAGTIQVIi 
NQLIQKIREDLPNLESSEEPEQIT*HFH 
NTLEHLRLRRESPESEGP I YEGLILYE 


5909 


19810 


A 


5959 


871 


564 


WMCHATFGSHLN*AWRLRPVNSG/LLM/ 
PSEGKAKKESTLMVGVTNPD \MXGKI RL 
LLHS DGKDEY VWNAGHPD I LG \HPLTLQ 
CPIIKI KKKL* * PKSGRTADGPDHSRI K 
V*VTPLGKEPTTDEVIiAEGKENME*VTE 
TC* IDMN* VLQGQTWNLQR 


5910 


19811 


A 


5960 


390 


271 


GINFFSRN/RSLTT*SRLV*NSWAQVIV 
LPWPSRVLGFQA 


5911 


19812 


A 


5961 


333 


922 


GLSCRVFGRTLSCPPCILRHLSEVETLQ 
TLQKKEIEDLYSRLGKQPPPG IGAPAAM 
LSSRQRRLSKGSFPTSRRNSLQRSEPPG 
PGIMRRNSLSGSSTGSQEQRASKGVTFA 
GDVGRM* IQNRSHVSPPHQGPPWSLCSQ 
NLMLSDQQN*ARKI PTLKG * KA\ GGHGE 
CSS I I VKSQTYVNCLLCGGVS SAAYHLH 

h 


5912 


19813 


A 


5962 


705 


387 


CVAQTGVQ*HDLGSLQPLPPGFKQSSHL 
SLPSSWDYRRVTPRIiPNF\ * FFVETDFC 
HI AQAVLQLL\ S SNNLPAS ASQS AGVTG 
VSHRARPTLTLDIQAEIPDLKPSG 


5913 


19814 


A 


5963 


425 


182 


ASNQ * DSVGVGPSE PGAGYNLLVRFLKC 
\LEKHSSWVGVTQFSRCHLSPLSLTRKG 
NSLTPCTSRVRQCIiALLWLTHGSRTH 


5914 


19815 


A 


5964 


388 


14 


PPMYTQLCSIRSTQAFCFVFFSRDE/SL 
ALLPRliVLNSWVQAILLPWPPKVQG*QA 
QWLTLVI PALWEAMAGGLLEPRS SRPAW 
ATTQDL I S TKEN KT KS LS AS DAAE LC VH 
WRLLKI 


5915 


19816 


A 


5965 




373 


DDQRVKSVINLLLAAYTGDVSALRRFAL 
SAMDMEQRDYDSRTALHVSAAQGHVEW 
KFL\ LEACKVNPFPXDRWKNTPMDETLH 
FGHHDVFKXLQEYQVQYTP * GDSDNGKE 
HHTVHKNLDGLS 


5916 


19817 


A 


5966 


3 


329 


HEETSRYEIYELIHVHPYAVKQSFDLEE 
YSLSQSTLEQVGHF*GFVSSMVYVYKTF 
PGSCRGSTALFSSLYFLPPFQVFLSLSK 
EQELGDFE/EDFDPSVKWKLLPQEEP 


5917 


19S18 


A 


5967 


2 


47 


MEDRRIiVRGI P CPQHNARQCPAVP PGIQ 
AYGAAPFEDLQVDFTSMSKCRGNKYLLV 
LGRTYSG * VEAYPTRTEKAREVTR VLLR 
DLLPRFEL PLRI GSDNR PAFVADLVQKA 

7WTT Ti T rptiTYT Uli 71 "VT»Tri/"\C"» O/"* TTtrnmm/^rn 

AKIIjRI i WiUjHAA.YWPQSSGKVERMNQT 
IKNSLGKVCQETGLKWIQALPMVLFKIR 
CTTSKRAG YS P YE I LYHRPP P ILRGLPG 
TP*ELGEIELQRQLQALGKITQTI / YSP 
SKWPVSLFSPVHPLSPGDRWIKKTWNVA 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F-Phenyla!anine, 
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X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














SLCPLWKGPQTVVLSPPTAVKVE*IPSW 
IHHSHVKP*TA 


5918 


19819 


A 


5968 


2 


150 


KSQSI *QSSLLSYFEKLPQPPQPSA\ TS 
TSKQDPPLATRLWLAEGSDDH 


5919 


19820 


A 


5969 


3 


371 


HEGKSGPMTGWLVAVGEVAMKILLLCL 
CLILLRVRSCRRKAARAALGMEAADAVT 
D*SPDSRLLPD\PPHPVPPQSERPCFLC 
RLCMVCPLLDSPDPSFPCHPTSPQTLRA 
KDSAALDTLKV 


5920 


19821 


A 


5970 


396 


58 


YWVLSKGPVTCMLLNSVFSFQASSCLSF 
LINS*AP*PGIFLGMSVFILPFFVFVGQ 
AGLELLTSGDLPDSASQ\ARITGVSHCA 
QPHTALSQYLEQCLAPSRQLISVSLILR 


5921 


19822 


A 


5971 


442 


120 


ICMKQGLALSPRLDSSSAFSNTVQPMPP 
RFKSSYGLSLP/RSWDYRCTPPHMARSG 
NGV* P \ FMVETGS W * AAQGGLELLSS SN 
LPP / S AFQS AG ITS KGHCTR P AVFLVC 


5922 


19823 


A 


5972 


459 


434 


MWPLPWFSISSPSSPPPT*VSSNNHPCA 
VPRAYPGADI PGPGWQRLPPPHRGLSGP 
PESCLLTTILCSKHPNGCPLTQNPRSLP 
GPRPCLIYSTPSPPP*AQL/YAAPSP*L 
YTWREALIR 


5923 


19824 


A 


5973 


681 


1758 

i 


VANKCQ I PD I KAKT YC I CNTKEKRS YLK 
T*NLHRGF\ I KKQIEEFNIGKRHLANMM 
GEDPETFTQED I DRAIAYLFPSGLFEKR 
ARPVMKHPEQIFPRQRAIQWGEDGRPFH 
YLFYTGKQSYYSLMHDVYGMLLNLEKHQ 
SHLQAKSLLPEKTVTRDVIGSRWLIKEE 
LEEMLVEKLSDLDYMQFIRLLEKLLTSQ 
CGAAEEEFVQRFRRSVTLESKKQLIEPV 
QYDEQGMAFS KS EGKRKTAKAEAI VYKH 
GSGRIKVNGIDYQLYFPITQDREQLMFP 
FHFVDRLGKHDVTCTVSGGGRSAQAGAI 
RLAMAKALCS FVTEDEVEWMRQAGLLTT 
DPRVRERKKPGQEGARRKFTWKKR 


5924 


19825 


A 


5974 


1232 


980 


SLSLSPRLECSGVTAHCSLRLPASSNSC 
FSASQVAGTIGACHHIWLVFVFLVETGF 
HHVGQTGLELLT\ * VICPPWPPKVLGLQ 
A 


5925 


19826 


A 


5975 


1 


259 


LTSYDYRRALPCLVN\FCVYFLKREELA 
LLPKLFWNSWA*ATLLP*PPKIFGLQAQ 
VAMLRLKFPACLPACLPAYLPPFLFS FI 
PP 


5926 


19827 


A 


5976 


24 


223 


PYLTQISTNESSVCSHARAEFLDHQNQL 
/WLGTVTYTCNPS/TLGG*GGRIT*GQE 
FKTS LGNTAR PCl> 


5927 


19828 


A 


5977 


395 


136 


GS WD YTHVPPRS ANF / LVETGFRHVA* A 
GLELLGNPPTSASQSAKITAMSHCTWPS 
SYY*CEYNARFGSLHKRSGMTIYIKSYK 
RQN 


5928 


19829 


A 


5978 


1 


785 


GTRLKSGTNTKKALQAVYSMMSWPDDVP 
PEG WNRTRHVI I LMTD/GSEGTS LLSQP 
PHLLRPACGP* VHL*HYTKGWGPECDS * 
LAVHLS CDPS *GILPKPCDQLSLTLPQL 
AHPAMCI PAFSQFI FLIS YPHGPVSSAG 
LHNMGGDP I TV I DE I RDLL Y IGKDRKNP 
REDYLGE* PA*DPAPHFLRAWTIiILPFY 
PSDVYVFGVGPLYNQVNINALASKKDNE 
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X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














QHVFKVKDMENLEDVFYQM IGREIQGNK 
EPNSPQVPLK 


5929 


19830 


A 


5979 


2 


419 


KAIAPIDSDSSDGSGQNKLKT*KEFIIL 
NAI KS I YDSWEE VKX STLTGVW KKL I P I 
LIEDFEVFK\ADLVEIARERELEVEGEL 
/LQSQDKT 


5930 


19831 


A 


5981 


1 


154 


KKC * MMLTWWCMPVG/ PTPRAE AGGS P E 
TGNLRL* CTMIMSVNSHCTPAWAT 


5931 


19832 


A 


5982 


12 


311 


RKSLNFVEGCNIRLNKCRHTNFVPQTI F 
FFLLFL*LGQC* *VWFSFFFFFFFCFV 
LFFVFVCWFVFFCFCFF/GCLFWFLL 
/YFYELFCIVIIMCYYLR* YV* IVYMLF 
FVCWI FVLYLCFHY* LFLFFFVFVFLLF 
I CGVF WFYELFC I VI IMCYYLR 


5932 


19833 


A 


5983 


3 


203 


IGQKRASEDTT*GSAD/PKKSSAGPKRD 
ARQIYNPPSGKYSSNLGNFNYERSLQGK 
YRWPRLGHTRKS 


5933 


19834 


A 


5984 


368 


138 


STVGKMLSGQHPCYRE/Ih/REQKSQSM 
*QISLLSYFKKLPQPFQPSATTALISQQ 
PSAS RQN P P PAKRL * LAEG S YDH 


5934 


19835 


A 


5985 


2 


155 


LANF + IYFC/RDKCLPVLPRLVSNSWPQ 
VILPPWPPKVLGLQGMSPCAPRHS 


5935 


19836 


A 


5986 


325 


403 


FIYYLFFLSSLYLI ITYALFLSSF*R* V 
VYHMVMVLLKWICHHFLTLFFSFRSLI 
LYLFFILFM/YLLIFVLFFIQFSFYLFF 
FYLFYYLFFNDYCFDLIFFLCLIYYYHF 
QSYYFIS* FIYYLFFLSSLYLI ITYALF 
LSSFSI 


5936 


19837 


A 


5987 


81 


651 


KLQVAGCRTLS Q IDVKFKPS VHS SLAVC 
LRANYFTSQPNVIMQVTLLTRCVAHSKH 
LGTVSYSSWYIKLLFPKSHFIYLFILRR 
S LAPVAPGVQWLDLGS LQP PP PGP KRFS 
CLS/LPR*LDYRHPPPRLANF\ * FLVEK 
G FCHVGQAGLELLASSDLPTS ASQNAG I 
TGVTPAPSPQKATLNLILGGQFHI 


5937 


19838 


A 


5988 


301 


341 


FCFCPCKNRVSLW*WHAPPCPANF\*LF 
I YLFLVETGFSHVGQASLKLLTLDDLPA 
AASQSAGI PGMS HHDGLFL 


5938 


19839 


A 


5989 


71 


1252 


TKEQRSLRSQQLGVRQQRSHRLKFGGGG 
APSARGVGGSWSLATRLGPVSTANMSRP 
VRNRKWDYSQFQESDDADEDYGRDSGP 
PTKKIRSSPREAKNKRRSGKNSQEDSED 
SEDKDVKTKKDDSHSAEDSEDEKEDHKN 
VRQQRQAASKAASKQREMLMEDVGSEEE 
QEEEDEAPFQEKDSGSDEDFLMEDDDDS 
DYGSSK/NEKQKDG*EVQT*KKRKENAQ 
TQTKGYSDAKSSERQRESGSP\QASKAS 
KEKTPSPKEEDEEPESPPEKKTSTSPPP 
EKSGDEGSEDEAPSGED*K* * WSGERFY 
* KKKEKKRKKREEK2CEPT*DRTWFWLWL 
DSWGFQCFFPFVESNISLSLSFFFFFFK 
ANHCMCKCLSYLFVYWSLCQPSPFPMKA 
MSN 


5939 


19840 


A 


5990 


630 


316 


RWNVNSVA \QAG \ I QWCDLGS I GNPRP P 
GF\K* FSCLSLLSS WDYPTHATMSSKFL 
YF* WRRGFTMLARLVSNWLVLNS * PSDP 
PASAAQSAGITGMSHRAQPIYLS 


5940 


19841 


A 


5991 


2 


182 


WQEQKAE / CLRDWTAMCTAALLTI AKG 
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R\NIPCPMMKEWVRKMGPIYTFEYYSAV 
IMQVI LKF* * HATTGMNPDDVI SEVCQS 
QKDK * R I LLS CNNAGNS 


5941 


19842 


A 


5992 


79 


278 


EAK*RWGCSKPTWHMYTYVTOLHTYV/ 
IKPAKGAHVP*KLKYKIIITIIK* *RWG 
LTILPRLVSNSWAQVILLPWPPKILGLQ 
A 


5942 


19843 


A 


5993 


22 


274 


AGDEHRAWGYQPAM*ARLEMNTGPGDT/ 
M PAMAS QRAG I TGVSH * GCPFFS LLNK I 
WRSH*VAQAGEQRLFRE * TPGLS CYSCS 
LPSN*DYTSAPLCPPYDT 


5943 


19844 


A 


5994 


347 


195 


YMHTKHTHTHTHTHTHTHTHP\EKHLKA 
PNEV* KSKNNV* I KPKQSQSCFS 


5944 


19845 


A 


5995 


1 


122 


RYELEDELHLQMTTQLHALKENNYQLR* 
S SHVS I PST * G YRHVPS CPANF /S S LTM 
FPSLVSSCWAQVIHLPWPYKVLGL*ALK 
ENNYQLRT 1 1 IDCTLNN^JI F 


5945 


19846 


A 


5996 


365 


229 


RPPPCPANF \ *FLVETGFHHVGKAGLEL 
LTSSDLPAPALHQVFYA 


5946 


19847 


A 


5997 


357 


17 


I LVDKINNVWNLFQNNLWC * RG WGYK* 
DRSFHLLKTVVAE*WVYEGSLY*SLSSY 
YLF/C*RQGLTMLPRLVLNSWHQAILLP 
WPFSLLLCILEVFQNRELKKKNQGKPMT 
GS 


5947 


19848 


A 


5998 


1 


207 


P I LYNLFQKI EAEG I LPSFS F YEAS VTLL 
PKPDKDITR/ITTY*FILLT*NILNKVL 
SNQIQLY I KRITY 


5948 


19849 


A 


5999 


603 


269 


EDR VL VLLP KAKSAVAWPRLTATSAS W \ 
VQRS FHPSLPSRWDHRHVPPHSANFCCV 
CL*RQGLSLLPGLVLNSWAQVILP?WP\ 
PKVLG I TSMSHHAR PLS F YYGHFKY I QK 


5949 


19850 


A 


6000 


3 


150 


DYRHAPLHLATFKK/ FFCRD/RGLS VLP 
GLVSNS WPQTVLPP * PPKALGLQA 


5950 


19851 


A 


6001 


365 


62 


AI TAHCS LNLLGS SNPTASATR I AGTIG 
EAAMSYF * I S STLTHFS YT/C* LHHVWL 
I FKFFFVDEVYVAQASLELLSSSDPPAL 
VSQN AR FTGVSHCTE P V 


5951 


19852 


A 


6002 


537 


236 


DR VSLCCPGWS AVARFQLTATSAFR \ VQ 
*SSCLSLPSSCDYRSAPPCPANFLSFCR 
DEGGGLPGLPRLVSNSWAQAIHPPQPPS 
VWITGVINGTWPKIPLS 


5952 


19853 


A 


6003 


22 


368 


NFFLQNKENIKRKIF*ERKHKLMQRTSL 
L S Y FKQL P RP PTTS Q QP S TWRQDP P PAK 
R/LRTYKGLDDR 


5953 


19854 


A 


6004 


327 


335 


NQS I FTFNN YPRPF/ CCKFKFKN/ TLHL 
WADKVAHTCNLSTLEGQGGRIA*GQEFE 
ATLANMVKP * LLCKVK 


5954 


19855 


A 


6005 


2 


367 


WQFL.KKLNTKLLYGPVF/PYLGIYKQEK 
WKHVHAETCTQMF I TVLF 1 1 VPK* KQPK 
CPSMEE* ID*M*FVHMLEYRAIKENKLI 
HATWMNLRNTMLSE I SQTQITTCMNLLH 
KMFRKVTETES 


5955 


19856 


A 


6006 


1 


176 


PGF KQFFCLS FWTRGI FIEMG FHHVG * A 
WTY 


5956 


19857 


A 


6007 


1 


289 


E VLTK I P I R / FFVE I GKL I QDLKDTG PR 
IA* TMLTKKTEMGR I VHS DLVAY Y I VAV 
I KMVWY YW * RDRH I NQ/ WERLE I SE I DS 
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K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GJutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W^Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop cod on, /= possible 
nucleotide deletion, \=possib!e 
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QNMFNWVFTKMQKLF 


5957 


19858 


A 


6008 


439 


223 


FFSPGSH/SIT*ARVQW/PHHSSLQPBP 
PSLKQSSHLSLPSSSDYRCAPPHPADLC 
RDWVSPCCPGWSQTLGLK 


5958 


19859 


A 


6009 


1 


252 


MVS IS* PRDPPALASQSAGITGMSHHAQ 
L FFLN / CIS/ELISS TVTPGLKES ACLG 
LPKCWHYRREPQHPACTVLYLQFQSDFQ 
LK 


5959 


19860 


A 


6010 


131 


514 


PWPYSLSSSGLLSSFEN/VEKWVPEIIH 
\ HSLFLGTT I DLRS \ EPCTTEKLN\ KDK 
YNPLTPQMADMMVCDLKWE*LECSAFM 
WKRLK\VFDEALWAALEED*RSSCRKET 
STLLQSTPLLVSYALPCTP | 


5960 


19861 


A 


6011 


625 


141 


ETGSRFVTHAGVKWHNHDSLHPQPPRLK 

* FS YFSLLLSS WDYRHVPPRPANLVYPL 

* RGGP PSMLPQAS LEF\LDS K* I PPCLQ 
PS/AKVLGLQVSHRPSPKVTFHQRAKEG 
DWSHVVSRGKSIPDRGNSQCMHRSIPC 
VFEEQKGGQC YCS I E SKEETSR I 


5961 


19862 


A 


6012 


302 


42 


GLAMLPRLPGTPGLKQSSCLSLPSSWDY 
KHI LS * PAVY / C * FLKNI K 


5962 


19863 


A 


6013 


363 


82 


VSVSLSPSLKTKVNYAVASASFTIAKIW 
KQLKCPSMDKQMK* IWHIHTMEYYSA/I 
KQKI MS YSTTWMNLEDI MLS E I S I SQAY 
KGKKHMIYGI 


5963 


19864 


A 


6014 


329 


99 


H I FGNT Y FFQVAS S F I YLFRDS VS LCRP 
G WSAVA* S WFTATS G \ FRP S ACLG LP KC 
/WDYRREPPRPANVLFSMRVTH 


5964 


19865 


A 


6015 


324 


238 


LAMLSRLVSNSWPQAILLPLPPKVLGLQ 
A 


5965 


19866 


A 


6016 


2 


308 


F WKAI AG I SGDS SDKCRQ/ S KLKT I WKK 
FI SLDAMRNIHDS * EKVKMS I FWKMI PT 
PRDDF/EGLKTSVEKRTEDWKTARQ * E 
LfCVDRT* TDEELLLLDQQRK 


5966 


19867 


A 


6017 


2 


457 


FFFLRWSL/ DSVAQAGVQWRDLGSLQAP 
PRGFTPFSCLSLPSSWDYRRPLPRPANF 
FYF* * RRGFTMLARMVSI S* PRDLPALA 
SQSAGITDVSHRRAERVISKQRIVSVMW 
KPLPEIHIPCILNLSLRFNKRSVAGLRN 
SLI VRMLS ILTEG 


5967 


19868 


A 


6018 


389 


142 


AHMLFAAQGDSS I PMLVAPLFT WKMWE 
QSKLPLADEWSGFW*HIHVTEY/YAIKR 
REILTHATTSLKALCYVI * ALTKGRTL 


5968 


19869 


A 


6019 


384 


232 


LNLPSSWDYRRAPSHSA/NFSYF*RDGG 
LTML\ PGWPQVI QMPRPLKAL * LQA 


5969 


19870 


A 


6020 


333 


84 


GLIDQEFATDTTKSTSLSLT*T/IPSRC 
SLCLRGSSDSPASC/SQPPASVAGITDT 
CHHAHI I FVFLVETRVHHVGRAWTEKPR 
LH 


5970 


19871 


A 


6021 


106 


366 


DTISFFFFLKAPIEPVFIIFIFFYFFFI 
FFFI * FFFLLFLFLF I FY FVF/ IFFFIL 
LCLFFFFLVLLFYLLFFF/ SLFFLFCFV 
FCFFF* FCYFYFYLFFILFFFFFYLALF 
IFFFFGSSFLFIVFLFLFFLFCFVFCFF 
F 


5971 


19872 


A 


6022 


384 


18 


R E KKPG PGTGKP PKKKGG PKFG PKG PKK 
RGE KK KKGG P / NS F FPRGGGT PGGNF* R 
GFPPENQKKRGRKKFFQGGKRGANPGIP 
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PLLGPKAEGGFI PGKKKKKKKNSNKKSK 
QTKKALSRTSHS 


5972 


19873 


A 


6023 


3 


191 


LVKGKIRSWEETWVKTW/RFYPGVMAH/ 

TTtrTkTTMVTTNT Or»/1/inr.TTH ATirtrDVUPT ri\T i ri y 

ITcNPNPIjGGQGR W I A* AQE FKTSIjGInvV 
KPKLYKKKI 


5973 


19874 


A 


6024 


364 


142 


I INGL I IHLHWFKI NGHTLSTW YTF H I L 
LvCvx 1YICLC\CIYVSVCJjCT\/CVYTHI 
QYTY*INKHIC*RLEAK 


5974 


J 9875 


A 


6026 


338 


102 


WGFTILARLVSNS*HLVICHLDLPKFWN 
YkHEPPCLjAIiFIjIF VGT* *VYIFMGYRK 
Y/C*YRHAMCNNHIRVKMVSVGLNF 


5975 


19876 


A 


6027 


99 


77 


REICTPTFTEAFSIEAKGWK*PKQPLTN 

ewintmwcihmrgkf*p/yaktsmkled 

V FHE I SKSQKDKHYYI * FXjS PTYVKIQA 


5976 


19877 


A 


6028 


2 


328 


tityrgakiritsdsssetmqaree*se 
if*vlk/ekkkth*pkilysgklsfksk 

GE I K/ YF * GKQN / LxCEF V S S RPVljQGMIj 
KWLQREGKLYRSETQTYKKKSKASEKE 


5977 


19878 


A 


6029 


8 


241 


GFAPLPRLECSNTNMAHHSLNLSGSSDS 
PASANFS IF\ IETRFGHVAQAVTi* FLSS 
SNPPISPSPSVEITGVShHAR? 


5978 


19879 


A 


6030 


108 


362 


CCYLMVTEISAHQNPAKQGLA/HTAFP * 
KTFFLKVSWTGTVAHV*KPSTLGG*GGR 
TS*SQKFKTSLGIMRPFSLQIKKKVFKI 
YP 


5979 


19880 


A 


6031 


1520 


77 


ASSSSSSSSVILSS*NIPNYNYIVIFM/ 
CL*IIF*AASSSSIYYVPGICYIYH\IL 
CVCVCICMCIYVCICIYSIC+YIVYVSI 
YYIHY 


5980 


19881 


A 


6032 


405 


100 


EGFFLPPRGGGGKR/ PPGPLTPGGPGNP 
RPKPPQKWGKPGPPPGGGPF*WVLLSPS 
LPGGQVESPGAVSGPRQVPPSGSRPFSP 
TAVGTTV TFjjS KKKKKQV 


5981 


19882 


A. 


6033 


42 


309 


CDKFFHKASNHIYVSTYIWVFQICVCVC 
VCVCVMCIbC*AY/CGCLGIMIWFLFVL 
*Cl/CYCCY*SSLFSVIICIFLMCFVSY 
ERRFILFA 


5982 


19883 


A 


6034 


397 


2 


KGPPP*GGPDFF\*PRGGPPQKNPFFPP 
GGGPPPPKNGVGGPPPQKAPFFGPKRGA 
PQKGGPPGGGPPRGEKKSLCPPPNWENT 
PRGLKKGPPPFFFFFFFFFFFFFFFVLR 
S TGLRAP FQS VF I VE L VEDM S 


5983 


19884 


A 


6035 


343 


2 


LWPPQGSFKTAAPFFFFFFLFISDFITF 
HFLYM\ YVFFI PLFLL* SFFCI K*I F Y Y 
VTF* FLQSFSILFFS WFL \ WLL* SLLY 
SF*LIRI YFRFILT* FQ*DTETLFFYYS 
IP 


5984 


19885 


A 


6036 


319 


68 


SLGE I CAPK* FFPPAQKKGP FQKFPSCK 
FSPS PVFKTRPRAKF* KGPP * GKKFYVA 
NPGKI \GPPKGSr KJcPPLr FFF.fr FFFFF 
F 


5985 


19886 


A 


6037 


419 


191 


R I HT I P I TALFAVAKR/ WQS KYPLVDE / 
MDKIWYIHTMED*AAYGKKEIMSHARTW 


5986 


19887 


A 


6038 


434 


401 


YGKNFSFSLiKV*IFCEGLV]jVFFPPKKK , 
/ FFFKNFHRWFPLYVFF* TGRGWFFK 
PPFLEKIFFFLTRVNLGPPRGFF*GAPF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFS 
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*NFYFMN* IFTVT 


59S7 


19888 


A 


6039 


12 


346 


S FS FVR R SLTVTQAG VQ WCNLGSLQ P P \ 
LPPSFK*FSCLSLQSNWDYRCLPSCLSN 
FCIF/M* RRGFSMLARLVSNS * PQ/ CDP 

Tl7\ O "A PArl\PT «T»/^T 7T» T T /CUT T7 + "VTA ■*- O T7>T/" r P 

PASAS QS AG I TG V S H / bHLV * NA* b r KT 
ESHF 


5988 


19889 


A 


6040 


275 


45 


KDVQHQWKIVQPLWKQYGSLKASTBSPH 
DPSILLLGTYPRKMKT* IHIK\TCIRMF 
AEALFLI SKK* KLPKCLLADK 


5989 


19890 


A 


6041 


355 


1 


FGTI KNFFFYI FFFKNYF/ FFLYI I FFK 
KKVLFFF* KKKI FFLFI PNKKI FWVFFF 
FFFFFFFFFCPRFFFFFPFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFLVTRHLF 
EMRKGN 


5990 


19891 


A 


6042 


635 


209 


PLNI KKTFFFLRQSFTLVAQAGAQWHDF 
GSPQPPPPRFK*FSCLSLPRSWDHRRPP 
PRPAN LVF\ D FLVS TG FHRVS QDGLDLL 
T/S / GDLPALASQSAGITGVSHCARPTL 
EFLKVKEMIFCSTTVKAVLDHANFLILR 
FSIT 


5991 


19892 


A 


6043 


12 


348 


ESRYSSICLPFSSVYCSRI ISLQLYSYV 
LLVYILFFF/CFLKREFGFCPPLGVQGP 
GLGSLNPPLLG* KQFSGLI FPGIGNTGL 
APPSPPNFGFFRKKGVSPCGPNRPAISK 
F 


5992 


19893 


A 


6044 


192 


1 

e. 


IFHRFCT/HITLP/TTALNNN*PWLGTM 
AYAYNPNTL * G * GRRIV* AQAFKSS LGN 
MTRPCIiYKK 


5993 


19894 


A 


6045 


338 


42 


WKTAWW/ FLKKLD/ MNYC * DPAILLLG I 
YPKELKAGT*TDIRTSMFTAELFAIAKR 
WKQNVDLKNLRQNECRQFIWAKVEGNWL 
PS WFGE WS I QHRLQAG F 


5994 


19895 


A 


6046 


300 


3 


RLD CH YLHSH THTH THTKN PAG I YTHH / 
H CWNLHTHNPGG I LGVALDL* MQFLLNR 
GWSWGGEQMAWPCPQGHVMWLGDPGKMI 
J_iK.P P L W V G b b Y L;G 


5995 


19896 


A 


6047 


332 


3 


SLESAGFT* YKMYFYYYYKI K\YI YKS I 
b 1 J-J-ir b*xrrr r uLtc r JLr r r £ irrfrtr 
FFFFFFFFFFFFFFFFFFFFFFFFLKSP 
KRVIYFGFGTQNPQMFSLLTLKYVLC 


5996 


19897 


A 


6048 


2 


152 


KNLKTS WT* WLMPVVSAT/ WGGSLKPGR 

T DT f\+ 7\ -i rT 7\ T>T UCPT fTTO 7\ TD D VT 

IjKIa^ AvlAFijHbbJjGlJKAKi' iJb 


5997 


19898 


A 


6049 


1 


288 


PI 1 1 YVLLLLFFETDSHS VT \RL* CSGA 
FSTHCNLCLPGPGDSSASDS/RVAGVAP 
ACLF* PYRGFSGPATLAFRALDPGLPLH 
PGFSLQRPSCSRGG 


5998 


19899 


A 


6050 


358 


1 


FSI FSILiI FDFLGGFLKFFAFr YGWDFL 
RFPFFYR * PFGP KS S FIFPHVF* R\W I W 
FLLLSSWFLKWSFDPQFFLKRFFFFFFF 
FPKGFFYKRVFFFPGFFFFSNIiL tiSSQG 
KGvWLI 


5999 


19900 


A 


6051 


308 


45 


ADLSAEAL*TRREWDDIFKVLKEK/NKG 
QP K I hi P b KliS L» 1 NE / NE I KS FPDKQ KL 
RE FTTRLVL * EMLKG I LHM E AOGO YL P S 
*KHTKV 


6000 


19901 


A 


6052 


2 


124 


I FCVLVETGFHWVAQAGL* LLSSGSPPA 
S/DLPNAGITCVSH*AGITCVSH 


6001 


19902 


A 


6053 


I 


330 


LC*PGWSR\FLTS*SACVGLPKCWDYRR 
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EPLHLVFFFFFFFFT* I YLKGFYLNPRK 
IFKNFFPLLKNPFFGVPPKIQFSPIPNL 
PSLWPNMVFFKRVLiGEKI FRGFFPP 


6002 


19903 


A 


6054 


34 


293 


SPILFLLCLRHQSGCHPSWTAVAQSELT 
AASNSWAQAILPHSWDYRHAPPH/LNLF 
GWSHAALPRPDSNF* PQAILLPQPQVLG 
LQA 


6003 


19904 


A 


6055 


319 


217 


YHFTPTGMAISKNRQKTGDNKCC*GCRE 
IGMLVCC* WQYKMVWLL/WKTVWHSLKG 
I K/L * ELP YDPAI PL * G IQRVSWKP I PH 
SSEG 


6004 


19905 


A 


6056 


341 


151 


VHAGGSPES/REFETNLGNKARL.HLYKK 
* ISQMWWCIPAVPATHAGPSLHFNLYYS 
PWLFQCDY 


6005 


19906 


A 


6057 


2 


558 


FFFLRWSFTLIAQAGVQWCNLSSLQPLP 
PRFKRFSCLSLPSSWDYRHVPP\RLANF 
FVFLVEMGFLHVGQASLELLASGDPPSG 
DPDLR*LAFQSLGITGVT/HHAGQIFLF 
/CLETESPSVAQVGVQWCNLGSLQPLPP 
\GSE*FSCLGLPSSWD\YRHALTHPANF 
\SIFS RDGVS PS WPGWSQMPDLS 


6006 


19907 


A 


6058 


320 


21 


EICMTTNHPSVSNTIW/SKMTLASVYLT 
LLPISKTSKTSGLPASKEVNRYLGTCAI 
RYFVFYFI IMLFV+RQGLTMLPRL VSNS 
WAQVILPPWPPKVLGLQA 


6007 


19908 


A 


6059 


294 


50 


CYFSLSFGL*QFLSLNQSFVTLSSHCDR 
DTFE * SWS VI / PG iGVYSS * E * TRGLAM 
MPRQVYNSWAQAI LMPWLPKMLEVQT 


6008 


19909 


A 


6060 


616 


354 


ERVSPCHPGWSAVALSQPTAALTSW\VK 
QSSELSLLNSWDYKCMPPLLANF\KFFC 
RNE/SLLMLPRLTSNPWAQVIC?P*PPK 
VLGLHA 


6009 


19910 


A 


6061 


214 


11 


AASTRPSAWQPPLLGSEEPLCPATTPSG 
RCTQQ / LH * ERAM M TMAVL WNR KG G KVG 
KRLRNRLVAMSV 


6010 


19911 


A 


6062 


349 


54 


QSKSAFPKKRINRISDSSEGYGQSKLKT 
F WKGFT I LDAI KN I CN SSEE VK I ATLMG 
V/WDDFEGFRTSVKEVAADVC/ETARQL 
EVEPESVTG*QQSHVQP 


6011 


19912 


A 


6063 


332 


3 


SQPASGQISKVTQSTLRCLKMSVQQVKS 
CYKCTATYLCAKFNSHEFRSSGVCENPV 
SYFYTHSIRSKIL/WPGMVAHTCNPSAL 
ESQGGC^IT*GQHCETSMVNMVTPRVSRA 


6012 


19913 


A 


6064 


174 


416 


NENELFFCFFFXI FFFPXFFFFFFFFFF 
FFFFFFGFFXFFFFFFFFFFFFFXFFFF 
FFFFFFFFFFFFFFFFFFFFFFFXF 


6013 


19914 


A 


6065 


826 


571 


DGSHSLA\RLECSGVISAHCNLHLPGSS 
DS VASAS * VAGTTGTCHHARL IF/ WFL 
VETRFHNLGQAGLELLVIHPPWPPKILG 
LQA 


6014 


19915 


A 


6066 


1 


284 


GTSFFF*NRVLFCCPG *RAVPQ \ S WLTA 
ALTS WAQGSS *NHRLKP/HMLS FFNFCK 
QGLTMLPKGALNS WAQAI LQ PQ P PM * LG 
LQACT YAWS LGP 


6015 


19916 


A 


6067 


36 


384 


VSKNKPIHLFLFFFSKTKSWSVT \RLEC 
SGVTSAHCYLRLPGSREGAASAL* LSAT 
TWLCKVALPLGEDLGALCRNPWGAGVKG 
SSRES PPLGEARGGELPWGQVR I RLWGR 
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6016 


19917 


A 


6068 


393 


116 


PQPIFPRY*QNKDPNL*KTF*HKYMPKC 
PAK\LIIFIEKGFSRFGQPGL*HLSSSN 
Lir'AJLiAoyoii^lbljlJNFUvjKAKr Fiji? c r t 
FFSLLLLSF 


6017 


19918 


A 


6069 


53 


280 


FPRFCFC/ LLESESHSVA*ARVHWRRLG 
PG/LKRFACLKLPSNWNYRHTPTHPANF 
CIFIRDQGMTL*PSWS*T\PDIK j 


6018 


19919 


A 


6070 


3 


377 


HEEPQGCATVPGEFFKFLWRHGIjITYPR 
LVSNS WAQAAL PPQ * METH 1 1 FS IGRLH 
MVK\ RQYWSGAVAHAYNPRLLGS * GGKA 
A*AQEFETSLGYWIRPCLHKNLKNSPGT 
VAHPCGSSC 


6019 


19920 


A 


6071 


337 


3 


RESQAFFS KKKKKGf TKMWFEQRCERG 
KGARQQSHLRKS I YV * E IAS AKALRRHM 
PGVFQKQ PGG/ W PG WAHT* NPRTLGGR 
CGWIT*AQELETSLGNPEKPCLYKKYKL 
V 


6020 


19921 


A 


6072 


1007 


628 


FFLIiAHSPHSVAQAGVQWGDLGSLQALP 
PGFMPFSC1^LPSSWDHRRLPPRLANF\ 
FIFLVATGFHRVSQMVLIS *PCDPPASA 
SQSAGTT/GARPKVF* IFVYMYFGVRKH 
SILMSMPQHDLFGYKI 


6021 


19922 


A 


6073 


27 


364 


MDLPSVSI FDLRSLWADISRATSTLFWL 
LFSWNCFLHLLSFNLFLSLGQK*VCCR* 
HIQSCVT*QWRY\WSGLVTHACNPSTFG 
SQGGWVASV*EFETSLCNMAKPCLYKKY 
E 


6022 


19923 


A 


6074 


2 


68 


ARACSHGTVALTASCFPLQLDYKCLQYS 
FVCFFFFETKSNFVT\RLKCNGPISVN* 
NFNLPGLTRSQA*ASREAGTTGTCYHA* 

*■ TT T TIC? OTOT 
* IKbPSSTRIl 


6023 


19924 


A 


6075 


330 


49 


KRRFALWQAGVQWRDLRSLQPLPPVFS 
CLSLP SSRD YRCPAS / RPS * F FLELLTS 
GDLPTSASQSAGIIGMSHCAQPPLIYFL 
FQIKVILFFS 


6024 


39925 


A 


6076 


2 


267 


ARGVEVKvGKTQKFLNMIRRTLVHCWWK 
CKLIQPLF/WRILPKYKEELPYHLAILP 
LGIYPKQMKSISL\KYICSYI*IVDHFV 

RSFCIGTFLLL 


6025 


19926 


A 


6077 


379 


236 


PPPPQTRETPPPGHVHKDPPPPHHPYPP 
PQEPQPRTPVPNTHPGEEQNHELFNSSL 


6026 


19927 


A 


6078 


2 


290 


RRSLPSSPLEDGRLS IAPGLTWEPPSHP 
PPQ*RSPPPPKKKKKKKKSPPGPKF/HK 
FKTGKPEKPPRVL * RGHWPPFKFF * IJCP 
GLPFF*GTKPPKV 


6027 


19928 


A 


6079 


1241 


1487 


KMYCIFFVE*KPCLSCAIFIRLLKWS*P 
TVCNSISGGPPHG* IMP^I IL* FLFLET 
VFHSV\TRVGVQWPDHG3LQPGPPGLK* 
SSCLTASQSTQIIGVSHHAWPPSITFEI 
FKRLAYSMS ADGW * KS P PMKLDLGALKK 


6028 


19929 


A 


6080 


333 


1 


XjCHHVQLSFKFFVEMEVSLCGPGWPFK* 
FSHLGLPKCWDYRHDPPHLAFFFSLS\ * 
FFCRDRVSMLPRLVLKSWSQA1 LLL* PP 
KVFFLFVCFLRRVS FHHAEP YAVS S C 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=GIycine, H=Histidine, I=Iso!eucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tnreonine, V=Va!ine, 
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X=Un known, *=Stop cod on, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


6029 


19930 


A 


6081 


2 


239 


ARG KY I YWRDKI S FCCPGWNA I I /RSLQ 
P * TS E L KRSSR I NR PNS WD YRCMPSHLA 
NF* IFRRDGGLASLPRLGTGWLKS 


6030 


19931 


A 


6082 


323 


185 


S ACLGL P KC / WDYR REPL CLVS R * VLNS 
* AQAI CLPWPPKVLGLQA 


603) 


19932 


A 


6083 


296 


2 


GERILEECIYARSHRAEITSLHCSLSVS 
K/HSVAQVWKWPDFCSMQSMPPRFK* F 
SCVSWSSWDYGLPPPCAGLHIWRRNVT 
LALQGLPTTSLALV 


6032 


19933 


A 


6084 


443 


1 


QKPK**KTQTWSSISLQSRKMKKKILSV 
IT F FKT * GEEEEE KEKKRK\ KEE * EK*K 
EPSQKKEVEEVEVEDERRGRK* KVDMEV 
KE EE KD KGEE KS GEEQ * KES AE AAVRE * 
VDS/SELVTFAVLPGDSSEEEQGMMVPR 
A 


6033 


19934 


A 


6085 


328 


I 


SNILI PKAQVTKH I YSLETCS PDYPG I S 
QAIKANFSPCTVG* LPSNWAYRHLPPCL 
DNF* IFCRG\RSILCRLISNSWbQQVKL 
QEFPQFLGCHARFQGAGFLSHCLV 


6034 


19935 


A 


6086 


299 


46 


LDDLHLLSLCLSLFRDRVSLCYLGWSSV 
A* S * LTVA\ * TPVILSLRLK* SSHLNLP 
SSWD YRWSNS* PQGLLLPRSPKVHLLL 


6035 


19936 


A 


6087 


266 


48 


GIEIVRQMRSRTGADKVIYIVQSPLIiKM 
QKI /WLGTVAHTCNPSNCGGRSGRIT*G 
QEFKQSSHLSLPKCWDYRL 


6036 


19937 


A 


6088 


2 


185 


ARVTMLPRLISNS\GFK*SSHLGLRKCW 
DYRCEPPCPAE I S F I SSGYLTRSGI AGS 
YCNSV 


6037 


19938 


A 


6089 


520 


297 


LG YR D FLHLS F P I S / WNYRC LP S CPA/ N 
FFILVETN* FHHVGQAGFEHLTSGDLPT 
SASQ \ S AE I TGMTYRAQPAAGS 


6038 


19939 


A 


6090 


388 


187 


SQHLGKLSREDCL/RLIVPDQS \GQKSK 
TPSLKK*F*K*AWVW*HMPWPAIQEAE 
VGGSLEPRKSKLQ 


6039 


19940 


A 


6091 


188 


329 


TLGLKRFFCLSLR* FI YFFFRDRVLLYY 
PGWKAVGIHRRDYSMLYSLELIiASKDS/ 
SCLSLRSGWDYCVWLIFFGCM*CKRQDL 
S I IP WQRSLIGQ 


6040 


19941 


A 


6092 


124 


342 


RPAW P T W QNP I S AKNTN I TWA * AAR VAG 
IT\GTCQRAQVMFVFLAEMGFCHVGQAG 
LYPKI I IIjYDKIiSHYKCSHLQMRKALET 
YSGIYSLFTIYLV 


6041 


19942 


A 


6093 


169 


2 


SH*PFTSSTIRSIWT\GAVAHTCNPSSW 
GE*GGRMA*DQKIETSLVNMTRPHLYPR 
A 


6042 


19943 


A 


6094 


2 


324 


FFFLRWS L/ DS VAQAGVQWRDLGS LQAP 
PRGFTPFSCLSLPSSWDYRRPLPRPANF ' 
FYF * * RRGFTMLARMVS IS* PRDLPALA 
SQSAG I TGMSHRARPAVY I LTSTSN 


6043 


19944 


A 


6095 


76 


340 


IjRKKFCFSVYPFIHRWTFEIiLPIiNDFLM 
NFYFRIVFRFMKKLQR\WPGAVAHAYNP 
STLGG*GGRIS*AQEFKTSPGNIVRDFI 
STKNM 


0044 






ol/yo 


J4y 


54 


KQSSHRSLL.SSWDCRCVPPCPVNFFSFC 
RDE/SLAMLARLVSNLWDQALVAEAGVS 
VWARQE 


6045 


19946 


A 


6097 


346 


184 


LTMLPRLVSNS\GLK*SSHFGLPKCWDY 
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residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine. 
G-Glycine, HHHistidine, l=Isoleucine, 
K=Lysine, L=Leucine, MNMethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
I— l nreonine, V=Valine, 
YV = Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, ^possible 
nucleotide insertion 














RHEPPCLAKISLYITKFFHILSLPI 


Ct\AC 

oU4o 


1 OA/IT 


TV 

A 






OCA 


TTWnPQPT.QT DTQWnVCDPDDAH/taMCX * T? 
r ivy V oV_lJoJj±r J. oWl^XoKliJr JryrlrtlMo \ Jf 

LVETRFHHVYSYVP 


6047 


19948 


A 


6099 


3 


225 


GFKRFSCLSLLSSWDYR/HPAPYLANF \ 
* FLVEMGFHHIGQAGLELLTSGDPPTLA 
SQSAG I TGVSHHAQPL I IF 


6048 


19949 


A 


6100 

« 


1129 


457 


YCVSSKYRRIISFSLIA*LLLTKLGS/V 
QKVGVQWRDLG * LQPPPPG \ FKLFS CLS 
LPSGW\DYGHVPPHPA*FCVF*WEMGVS 
PCWLRLV\LNS * PSGGSRPPSASQSAGS 
LQGLI \HRARPEH*F*TS*PLGEGKTLD 
EI YI KTKTFYQFCI VCLKLH* RLMEFAL 
VAQAGMQWHDLGSLQPLP PQFKWFS CLS 
LPSS * DYR YWLLYHPALELKEAPSFPES 
PCTH 


6049 


19950 


A 


6101 


3 


296 


EIIQALLITIVLGLYRTLLQAS*YL*SA 
FTISDGI YGSTFLAA\TGLHGLHVI IGS 
TFLTICFIRQLIFHFTSKHHFGFEADA* 
YWHFVDED* LFLNG 


6050 


19951 


A 


6102 


1 


749 


RHEGGLFKTI LL I * TSLTIGS LALAGLL 
FFTGFYSKNHIIE\TANISYTNA*ALSI 
TL I ATS LTS AYS TR I I LLTLTGQPRFPT 
LTNFKENN \ PALLN P \ 1 1 YVAAGSLFAG 
FLITNNISPASPFQTTIPLYLKLTALAV 
TFLGLLTALDLNYLTNKLKI KSPLCTFY 
FSN I LGFYPS I THRTI PY \ LGLLTSQNL 
PLLLLGLT*LEKLLP\NTISQHQISTSI 
ITSTKK\GILKFYFLSFFFPLILTLLLI 
T 


6051 


19952 


A 


6103 


377 


1 


SKAFGP?GF*APYGLKAHCFPPGFRRGV 
WAPSGFWARPPIGYPFGALIGAPVWPSP 
GGPPKPGPP\GPFGVPKLGGKRLGIGGP 
FPGSPGFLTTPGSKKKKSTKKGXPQKKI 
TPLDTSCSRTIVM 


6052 


19953 


A 


6104 


1001 


531 


FFLRRS FTLVAQAGMQWCDLGS LQPPPP 
GFKQ FS CFGLLS S WD YRHVP P TLANFS V 
FLVEM\GFSMLARLVSTS* PQ/CDPPAP 
ASQS AGI TGMSHCAQLPKPI FNYL* AID 
SVTTSISLARDWPNSSENRLLGYMGKPS 


6053 


19954 


A 


6105 


376 


117 


CSEWSAVSRSQLTAAPNS* \ VKQSSCEL 
SLSSCWN/ YRCQPRHLANFVLFCYVLF* 
RWALAMLP ILVLNCWPQAVFLPWPPKEM 


6054 


19955 


A 


6106 


689 


433 


LCHPGWSAVVQSWLIATSPSLV\KPSSH 
LSLLSSWDYRCTSPCPANFFVFFY*R*G 

p™+ 'DT3T 7 T A TNTCr»T7\na TT Q"DD T5TD V"17T f\ 
oiPJ" ItKIjIjVvINo WAyAl IjFlrKlrr'lwJjLjljy 

A 


6055 


19956 


A 


6107 


2 


176 


AREYPANFFVCLF FLI LCRDG / DFTMLL 
RLI SNSWLKRSDRLSLPKW * DDRREPPH 
QAY 


6056 


19957 


A 


6108 


328 


69 


LYIFSFG* ERVLFCHPGWSAVAQSQLTA 
ASTFRTQVLSLIGRWDNRHVPPHLADL/ 
SFYRDGVSPCCSG* SOTOAILLPWPPKV 
LGFQE 


6057 


1995$ 


A 


6109 


111 


329 


FLEMGSHSVAQAGVQWRDLGS LQPPPPG 
FKRFSCLSFPSSWDYRRPRPRPA/NFFV 
FLVETGF\TILARLVSNS*PRDPPASAS 
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X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














QSAGITGVTYPRPALTQNFSNLTFGEEN 
RKPNPEDSCVRYLLLSIRQRSPTALLHV 
ILFQQMIQLVL 


6058 


19959 


A 


6110 


345 


309 


ASL I FV* LVEMRFLiHSG I SL FNSGKNGP 
TAFSQDPPG\SSVSFCKAIIIIINKINK* 
IQKVTVIQIEVQWHNHVSLQPLNPELNR 
FSRLSLLSSWDHRHMPPHSANF* NFKN 


6059 


19960 


A 


6111 


332 


3 


RKVLVCPPGLSAVAGFQVPARSAFQVHP 
LFLEAEVAVSPGCAPAFKPGPQGPF* KK 
KEEERKKKGRRKEEE/EEEEEEEEEEEE 
EEEEEGEEGEEEEEFYFLRAGKVKSPRA 


6060 


19961 


A 


6112 


498 


191 


STDRVLLCQKARVQWCDPGSLKPPPPGF 
K* FLChSLPKS WNYSHAPPCLANF\ * FL 
VETGFCHVGKAGLELLASSDPPALASQS 
VEITGVSHHAWPMQEYFQI 


6061 


19962 


A 


6113 


1381 


172 


RNIDSKAILSKKNKAGGITLPSFKLYYK 
ATVTKTA* YWHK/NH/DVD * WNRIENLE 
MRPHTYNF\LI FDEPDKNKQWAMDSLFN 
KSCWDNWPAIWKRLKMDAFLMPYTKINS 
RWIK/DLNVKPKTVKILEDNPGSTIQDI 
GTGKDFMTKTPKTI ATEAKIHKWDLI KL 
KSCTAKETIKRVNRQLTEWEKIVANYAS 
EKGLKS SIYT*FKQI* KK\NH P I KKWAK 
DMNRHFSKEDICGQQSY/VKK/ CPASLI 
IREMQI ETTTRYHLTPVQMAI I KMSENN 
RCW * G CGE KNML I H CW WKC K / L.KLVQ PL 
WKTVW /RI KTE I PFNPAI SLLGI YPKE * 
KS / C / CYKDTCTRMF VAA\ CP S VTDW I K 
KMW Y I HTREY YTAI KRNKTDWAWWLMPV 
ILALCEAEVGGSLEVRSSRPAWPTW 


6062 


19963 


A 


6114 


3 


155 


HEKKI SQVWWCMPWP/TI LVJWLR* EDS 
LSPRKLRLQ* S YDCTTVLPAVJATE 


6063 


19964 


A 


6115 


47 


368 


STPTPSAFLCLAF/YRQRWLCWPDWGA 
V\ IQHTYSLPT* LSILKQD* PLNLLGT/ 
WSSRRVAACPENQ I YRRNGVS / MLPRLV 
LNSWTQVI LLPWPPKV/LGLQA 


6064 


19965 


A 


6116 


1 


592 


RQRI FLLECGGAISAHCNLHLPGSSNSP 
ASVS * VAGTTGVRHHAQLI FVILVETEF 
HHVGQDGLDLL/NLMIHPPRPPKVL*LQ 
G 


6065 


19966 


A 


6117 


1 


306 


LWKTLWQFVLKLNI LLPYKPAI VPLGI Y 
PNKMKI YVHTKTCTQICIGALFI IAKT* 
KQPTCSSIGEW/IKKLWYIQTMEYYSTP 
ERNEPSSHENTWKNLKCI 


6066 


19967 


A 


6118 


2 


284 


QTGVHWHDQ\NLL* PSTPG\ SRDS PGS A 
SQVAGTTGMHQHARLNFFSFSFFLSFFF 
FFSLGKTLGLGLEFFPRWLQMSGPKKIL 
RLYLQNPKNKG 


6067 


19968 


A 


6119 


2 


315 


SRVAGITGVGHHARLI FVF/ CFFFFFLE 
RDFLFVPRVGFQGPWLG*LKFPLPGLPL 
FSGLTLPKTGD * GAPLQPRVNFW I F* KK 
G\ V * YCG PG WFE I LDLRGS P P F 


6068 


19969 


A 


6120 


42 


392 


LQWRNLCSLQPLPPVFKOFSCLRIjLSS* 
dyr\happrlg* fci * srrivflhgwvr 
AGSR1STS* PQDGSAPPQASQLAGITG\VS 
HHHTWAPHFLFFKNSGCSTQNR*QVHKV 
GDKETS 


6069 


19970 


A 


6121 


2 


202 


VHLSDQKQPKCP * M I EWIKKMV7HIDTME 



916 



WO 01/64835 



PCT/US01/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 

NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
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YYAAI KKDKFMS F \ETITLGEETQEWKT 
KHRLLSLIRGS 


6070 


19971 


A 


6122 


355 


161 


FIXYLFFFFFFFFFFFFFLFFLSFLFFF 
XXXXXXXXXLXVFTGMIS*LVJVCLYVYIVI 
NVYM*IYMF 


6071 


19972 


A 


6123 


1 


133 


LC* PGWSR\FLTS * SACVGLPKCWDYRR 
EPLHLVFFFFFFFFTL 


6072 


19973 


A 


6124 


3 


212 


HPGLPSSWDYRCT*PCLANF/SCCCC*R 
LGLTMLPRLASNSWTQVILLLWLPKLAQ 
NTSLINNFYIDQMLK 


6073 


19974 


A 


6125 


156 


2 


ALLTHAKNKAL \ WLGAVAHDCNLSTLEG 
* GGWI T*G * EFN I S LGNI VKHHLY 


6074 


19975 


A 


6126 


345 


34 


HPSTIHRVFCSL*S*SSPQARNNGSCLS 
SEHFGITGVNKHTWLLFFKKNY/CILIK 
I ETGS * YVAQAGLELLSSSNPP I SGS ES 
AGFTGMSHGIRPVS PSLKWS 


6075 


19976 


A 


6127 


2 


328 


ARAYLPLCFF IIFFI* VSVI YF I WLL * G 
L I * LNKRYLAHPK\ Y* LNVS YSVS FCKT 
tf FFPRHKEKAI IQLGWAHACNPSTLGG 
QGGWIT*GPEFETILTSMEKPRtiYY 


6076 


19977 


A 


6128 


1 


341 


GTRKFSDYRGKPPHPAKNRNLLFSIFFY 
FETLSCSVLLCEVTSSQL/CNSFHFPDL 
PSSNDPPSSASRAAETTGACHHAQMIF* 
FFVETGPDWS*TPG/LLAVLPPRSPRML 
ELQA 


6077 


19978 


A 


6129 


264 


2 


TAI S ENKVASRSEHRRHRML * LQV I / L I 
ALQHRSQRDNLSH* KN* KKKMLGSVAKA 
CNPS TMGGQGGW IS* GQQFQTS LSNMVK 
PPRA 


6078 


19979 


A 


6130 


1392 


1157 


LSLPKCWNYRRE PP CPALCCPGWS * TPR 
LK* SS PLGLPKCWDYR/ R* TTAPGLFF I 
S KSSLRS PAYS CVTVCTLQLSHH 


6079 


19980 


A 


6131 


383 


108 


MRLCSQPHHHVDFSVLETESCCVTQVGV 
QWHDLSSLQSPPPRFKRFSSLK*LGAI.L 
LLPSS WDYRCTPPHPAN/ LCRF * HGPVC 
LYMFGEKVTP 


6080 


19981 


A 


6132 


307 


1 


LKTLPALCELESHSVSQWRNLSSLQPPP 
LRFKRLS YLSLPSS * DHSHP *AHQANF\ 
C I FSRDRETGFEHVGQENLI MLI VQQS S 
GLGFS I FATLLAVMQLV 


6081 


19982 


A 


6133 


351 


66 


RWSFTMLHRLLLKS*LQ/CDPPASASQS 
AGMTGMSHHTRLHNLF* TANNLTQSHTK 
FYSFISLTPHFAINVTNYIFLYYTSINI 
VCIFLSFKFYARI 


6082 


19983 


A 


6134 


2118 


1392 


FFCKGK* IFIQCPDWGG/DLCV*QEKIQ 
KKTYHTGKDKYV* LF* KVKP ITKI * RQ I 
HTSGNTDAQLSYKENHDQGH* QAKNFKF 
FLFFWFYLRRSL/NSVTQAGVQWRDLG 
SLQPPPPGFKQF/SCLSLPSSWDYRRPP 
PRPANFC/ 1 FE*RWGFTMLARMVS I S + P 
RNPPTSASQSAGITGVSHCTRQEFKVKT 
GYHFHLLPSGCLFLFYVFYLYACLLYVS 
VYYPLRQQESRKI SKIEMFLTTWH 


6083 


19984 


A 


6135 


383 j 


47 


HhblMr YYYLiSHYWYLYYFLLHLIPSSP 
KIFFFFT*KIIFFLIKFNKVF*NLSF*K 
KIF/IFSYTF*FWPP*NFF*KPPPFFFF 
FFFFFFFFFFFF*QTLFLGGCGYNTKljR 


6084 


19985 


A 


6136 


2 


196 


TRPSILVTALYYLYIFTTTQWG* \LTHH 
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X=Un known, *=Stop codon,/=possib!e 
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nucleotide insertion 














INN I AP S FTRENTLM F I HLS P I LL LS LN 
PDIITGVCS 


6085 


19986 


A 


6137 


700 


433 


FLRWSSASVTRLE*SGAISAHCILCLLG 
SSYSPALAS * VAGTTGTCHHAQLIFVFL 
VETGFLHVGQDGLDLL/NLM I CPPRPPK 
VLVLQA 


6086 


J 9987 


A 


6138 


377 


298 


QMKWKLLLLFRYRVL»LC/HPWSAVA*LQ 
LTAASN\FGLK* SSCLSL/MI/TMLVLNS 
WPQVI LLPWPPKVQEL* AI AAYTI KMWV 


6087 


19988 


A 


6139 


1 


199 


L YTT Y FVTKTLLLTS LFL * IRTG\YPRF 
RYEQLIHLL * KNFLPLTLALLI *HVS I P 
ITISSIPPQT 


6088 


19989 


A 


6140 


381 


28 


STPNRLIERQEDPPES *NTNPPGLPRKT 
PSFTKNPKINLARWGGPENPPNPEEKPR 
KVPESPKRKKP*TKIRPLPSSPGDPTKP 
P\LKKKKKKKKSWPATQEAEMGGPLKA 
RSLRIi 


6089 


19990 


A 


6141 


1 


209 


LKLLASSDLSTLGSQSAGNIVGMSHCTQ 
PKINMGILLKLVYRFNINPVXTPAGLF/ 
TKKKT * KTE I DKR I LKFI WKCTEHR IAK 
QSLKRTKLEDFLLNFKTYYEATEIKTS+ 
YRFNINPVKTPAGLFKKKKHEKQKLTSG 
S 


6090 


19991 


A 


6142 


3 


282 


QGIFI/TKRVKNF*RGKFKTLVKKNQGDP 
KKGKNPPGPKIGKNNFGKTPFWAKKI*K 
FNL I \ PKKT P P F F FQKLKKTGVKFF WAP 
KGPKGFLSKK 


6091 


19992 


A 


6143 


382 


158 


FFF*DSLAVSAPGVQ\WHDIiCSL\RPLP 
PRFRRFWCLSLLSGWD\YRHVP\QYPAK 
I CI FS RDR VL P I LAKASRNVS F 


6092 


19993 


A 


6144 


185 


316 


PCLANF* IFFCK/NGGFSMLPTLGLNSW 
AQG I LLPKPLKVLGLSA 


6093 


19994 


A 


6145 


489 


231 


SVA* AGVQWHDHS *LQPQ/ PLGLKQSS C 
LSLSNSWDYRHMAPHPTNFLKKFFVETG 
VLSCCPGWSSTPGLKGSFCLNLPKF 


6094 


19995 


A 


6146 


179 


1011 


GTILNPQSLPFKPFFFSTLIFLKTVLLC 
CPGWVSNSSGSSCRSLPKCWDY*HEPPH 
SAL * TFPWPFP * LVKDTSIHSTQLLKAE 
IFSASLDAAPSPTPYIRTHPSSGLQRWL 
LDAARSPTPY I RMHPASRLQRW CSTCTL 
LPSPSYHTPA*AMTTASSLISPFFPLPP 
PAHFSQCRMTFCLFVL/ clfemesrs va 
QGWECSGAILGSL\QPPPPGFK/RGFSC 
LSLLSRW\DY\GHAPPCPG\NFCIFSRD 
G\VSLCWPGWSRTPDLWHPPRPPKALG 
LQA 


6095 


19996 


A 


6147 


348 


3 


GYHKTINWHCCMSVFANLTCGFNAIPIK 
/IPAKYFVDPDKLVLKFTWKGKRPRIAG 
TILKEKKVRRLILPNFKTHYKATVIKIG 
W*W*NNRHNHWNQIGSPEIAPHKYSQLI 
FDKEA 


6096 


19997 


A 


6148 


22 


228 


THRVALFVRT*NVJKQPKCPSTGEWLNTL 
CYIHTIEY*YYSAIKRNKLSIHVKT\WI 
DLKG I VLNENNQ I 


6097 


19998 


A 


6149 


406 


279 


RS IQGSSHSHAS A/ S * VAGITGARHHTQ 
VI I VFLVETGVMALTY 


6098 


19999 


A 


6150 


406 


10 


MRYHFTPDERLKSKGLKLEYW*GSGETR 
SLIHCWLEYKMSHPLWK* /TVWLFPIKL 
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G=Glycine, H=Histidine, I=Isoleucine, 
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NKNVP YGPAI SLLDIL* EK* KHMSTKNT 
YK* MLQALFIITKNWLLSIKRRMTKQIM 
VYSNRNYYSTIKNELPIQTRI 


6099 


20000 


A 


6152 


367 




PTPWEAEAGGS FKPRR/LQ/ CPN*PPMC 
S CT PAWAPGS PP VS KKK KMQ FHRRMGHV 
SSAWWPCGADVRNLRAGGYVRHRERFPG 
LPWAAGGRDEVIRCLTVCTACGEDHPGS 
HLAGPRGGGFP 


6100 


20001 


A 


6153 


441 


38 


ILNPGEAITSEKYVQ* FNEMYQKAKRL / 
QVNRKGPVLLS YNA* PQVAQPVSQKLNK 
LGHEVLSHRPYSLSLSPTDYHFFKCINI 
FLKEJCRFYNQQElAENAFQEFTES * SA/D 
FCTTR I NTFISHWE KYVD YNRS YFD 


6101 


20002 


A 


6154 


308 


121 


GES FPAEKTFPKENDYRWPS FFQWSS VL 
/CKIHGLLMLPRLV*NSWPQAILLPWPP 
KVLGLQA 


6102 


20003 


A 


6155 


1 


420 


QNLQPHWKTLRQF/LYKVKHMMroSPAI P 
LLD I T *GRCKPC P YK/ES I C/RMFIEAL 
FI IAKKLKLSKCGSAGE* INKV*DI YTT 
E Y YS AI KSNKQLTHTTTWI KLKS ITPS E 
*SQT*ETACPTMLtjIWHSRKGNIIVTEN 
RSMF 


6103 


20004 


A 


6156 


77 


415 


TLGLKPTSSSHLSIjSKCWDYRQEPPCPT 
SEANFFFFFGGEFPFWSPS*LKQFFFVE 
MG\FTMLPRLVLNSWPQAHLILP/ASAS 
QSVGITGKSHRAPPQKLTFFFFLGGNFP 
FGPQAGCPGAESNYPEP*TPGLKAFFGL 
NLS KCLKYGDS PTAPGKLE I FFFKWGGV 
SPSCASRVLNPQPR 


6104 


20005 


A 


6157 


436 


206 


LW Y I HTME Y YS ALKKNAI L I HT I \ ST WM 
SLENMLNEIKQENHKRANI *FHLYEIPR 
KESRMEVNQGLGRRGIWEFTVy 


6105 


20006 


A 


6158 


416 


50 


RFFTHPPAGEFFSPPPPKNIFFPPPP*K 
FWGGGGKASPPQKKFPPLPPPQNGVLCF 
PPHKKK* FFPPPEKTVEPGPLHVGQRPP 
QS\PFLCYPPLSFFLYFCYF I FFPFVFF 
HFLPFFFFFIR 


6106 


20007 


A 


6159 


443 


145 


FFFFS IRFLFWWLYQG* EYLCFFMT * 
* IGVLLFLFEICEFFFFLLVGILFGMFL 
/YFFATASPGLFFFCFFFIFFFFFFFFF 
FFFFFFFFFFFFFFFFFFCEKS 


6107 


20008 


A 


6160 


322 


2 


TQDKNKRHRARH I YKRARKS ARD * RSL V 
HR*K* KVQKRNEY * PEGTLHTC I ALSQG 
NN I * DA/ HSQE * W I TPVI * TLWEAEAGA 
SRGQEFKTSLANI VKPDAWANAW 


6108 


20009 


A 


6161 


421 


57 


FCEMKFTPGNNAVN IV* PTTKDLE Y / W I 
NI /DKTVAGCEKTDSNYERSSTMGKML\ 
SNSIAWYTEIFRGRKSQLMWQISLFSFK 
KL»*QPLQYSATIALIRQQPTTSRQDPSP 
AKRLRLPEGSGDH 


6109 


20010 


A 


6162 


333 


41 


GAWACHLHVGAALNAPTPGLSRFHEEPG 
WSCRTAGGGSRLELSVPFVQMGSCHVGQ 
VGFFCIFCR/DRGLAMFSRLVSNS*TQG 
X CP P W P P KV LiGJUQ A 


6110 


20011 


A 


6163 


2 


423 


KI NNT KSQCFENI LKIE KTLSNLTKKE \ 
REDTN Y* YWE *NMD I TVESTD I KR 1 1 RN 
TT\ETLHKFDNL 


6111 


20012 


A 


6164 


434 


5 


I CHI FFS I S KLIxNILR P VSKLFFS INFI 
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nucleotide insertion 














FGPELYYL* IHFNP* *GKCLSFLFLQNI 
CIGFFFNSFMQI+SYFLRISLFIPVSPI 
LLG\FVQIFIVSINILYLSI*LRLFFSF 
rhShrhr r rYYYTLSFRVHVHI VQV£?x I 
RIHF 


6112 


20013 


A 


6165 


1238 


1017 


S TGSH S VTQ AG VQLHNS SS LQ PAP PR LK 
*SSYLSLLSSWDYRRTPPQPAN\FCIFG 
RDKVSPCCPGWSRTPEFR 


6113 


20014 


A 


6166 


504 


210 


FFFKTRSHDQGSLQP* PLGI>KGSSCLSR 
TLPHS WD/ YRV/ PPPHPAKKYKNI FIFF 
VETE FHHVAQAGLKVLDSSNS P VSASQS 
VGITS I SHCAQPILCQF 


6114 


20015 


A 


6167 


258 


65 


YKTKNTTSIFSLPSSF*TQILTF*NTYY 
YYY/CRDE/SLAMLPRLVSNSWPQVILL 
PWPPKWELQV 


6115 


20016 


A 


61 68 


1 


156 


LARMVS IS* PHDP P APASQS AG I TGVSH 
CAQLFFFFFWRQPRFFLRAGGQGRDLGK 
VKPLLSGVRPFLS * TLPTEGRKSGPPTH 
FG*VLRE*FGGSH\PVSSLW*QSAGITG 
VSHCAQLFFFFFWRQPRFFLRAGGQGR 


6116 


20017 


A 


6169 


3 


285 


HASAHASGNILSYVGSFISLTAAI\LII 
IMI * EAFASKRNVLI VEEPS INLE * LYG 
CPPPYHTFEEPVYIKSRRIRRSLIDQDA 
ARKE TQQGAAD 


6117 


20018 


A 


6170 


645 


15 


LPFIYCSPCRNFNSLFWQERDSNNPLG/ 
I SLPHSDKITFNP \ YYT\ IKDALGLTSL 
HSLL KDLHS FPHETS * GPQAYYSL/QFT 
PLNNPSPHARPEWIFPYCAYHNFSGSGP 
LTNLGGVL\ALLL\S ILHPKQ* FPHPPS 
YPNKQRH*YFGPLRPITFIDFLGRRPSF 
I SNRESGGQPVS YP FTI IGQVAS VLYFT 
TILILIPTISLIENKILKWA 


6118 


20019 


A 


6171 


409 


25 


YTTPGGKI FFKSNPGSKLFPPPPF/PPF 
FS PFS P * KF FFS P KPLNFFGGFFP FFP P 
PKKRFFSKNPPLVFFSPPFKEKVFPFPP 
PLNFAPPRVFLKAPPPFFFFFFFFFKGG 
E VKWLS EALDCKS KDRG 


6119 


20020 


A 


6172 


73 


52 


VQLLKCR I H I F* DKVLLCH PGWSTVARS 
♦IjIATSkSSCIjSPPSSWDYRRVPPCLiAS 
F/SF*YIAL3SnSI*A*AIILPQPSKVLGL* 
AVAHAYS PSTLEG 


6120 


20021 


A 


6173 


325 


470 


I FTFFFLIiAFI * FFCRNR/ SLALLPRLV 
LNSWLQAVLPPWPPKVLGLQT 


6121 


20022 


A 


6174 


2 


228 


TLEDNSTIPCKTIQSSSCTPWYVTQKAA 
CITLLSM* PKKLWYVYTKSCTQMFIVAI 
FI I AKNWKQPRCLSVSE* INKLWYI * TM 
E*YPVLKRNELSSHEKTW/MEM* IQLLI 
ERSQS*KAWCLHKKLHTDVYSSYIHNC 
QKLEATKMS FSKRMNK 


6122 


20023 


A 


6175 


3 


408 


RYLTLAI RDMQI ITPVSYHFVPSGDYGQ 
KDNNKCW * RG* KMGS LMHC * W* CKMVQP 
LWKAAWQFLK/DVKRRAILFLGIYPREM 
KTYvHAKTCMHIFTIiALFIjYVHLNALIS 
VMDWMLGHP I EEISRDSYSCVCLL 


6123 


20024 


A 


6176 


2 


237 


WVW WL I P WLAAWEAEAKGLLELGS s RL 
* \WTMPLHYSLGNRARLCLKKKKKKKFK 
VFPPQKKKKKKKKKKKKKGGAL 


6124 | 20025 


A 


6177 


387 


154 


AWQWRDLSSLQAPPPGFE*FSCLSL\W 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, (^Phenylalanine, 
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DYRHKJjIjTSGDLPAS ASQSAG I TGVSHC 
AQPLSLFVFFLTNQRFVAALS 


6125 


20026 


A . 


6178 


1 


239 


FFFFEMECLTVSQAGVQWYNLHSLQPLP 
PGFKQFSC\LSLPSSWD*RVPTSRPAKF 
/ CVIF*DGVSHCQPGWSAVVQPPLiH 


6126 


20027 


A 


6179 


325 


50 


RKFPLTWRRVS FFS LLRS STDWMPTPHR 
HIGGI I CVTQSTDSNIGLIQKHPHRHTQ 
KNV* PNV\WHPMAKSS*KPKLTITVYPL 
STWHTYTSL 


6127 


20023 


A 


6180 


157 


338 


SQHFGRPRQEACSSSEVRDQPGQQREFP 
HLYKN*KEITQWWHMPPEAQDY/LEPG 
RSRLQ 


6128 


20029 


A 


6181 


3 


273 


HEGQSKLETFWKGFTI*EVIKNIDDL/W 
RGVKIPILTGVWEKLIPTLINNFEVFK? 
SLEEITEYWEIGRELELEVEPKDVTEL 
LQSHEKT 


6129 


20030 


A 


6182 


2 


359 


YYPAIPLLGIHPK* KNSIYQMHTCTHWF 
TAVLFT IAKI WNQL * CSSMGEW I KKM / W 
MHTHTHTHIHTMEYYVPIK/WEIISFAA 
TWMELGVI ILTEI IQAQKKTKG I FSHLY 
VGAKIFDYM 


6130 


20031 


A 


6183 


3 


211 


NPGKLSVSINQDCFPWTPCPFQMGRCPT 
LLF\SFYGGQAI PL I TPSVGTQ I FRLKM 
/PEVTHHFHSFP+EPLTAAALNQPSLPL 
S LLVDF /RDRGLFCHLGWS AVD* SHLTA 
ALNS\LVTQLSCPSFLSSWDYRRMLPCL 
GNF* IFRLKMLKSLTIFILFHESH 


6131 


20032 


A 


6184 


707 


384 


LQI FF YNRVKCVKNKSTQNQFY I FSR I Y 
MHANRHG K YE I / W 1 1 KKMPMREG * AVQL 
RMKWKI KKNTH I KLG VWHG YNL.STLG 
GQGGRIT*AQEFKTSLGNIARLHLYT 


6132 


20033 


A 


6185 


720 


363 


FFFWGRSL * T ILF*/QTGSHS VAQSG VQ 
WHHHSLLQP*TSGLKQSSHLSFQSSWDH 
RCAP P RVANFFLD FCRDRVCVAQAGLEL 
LASSNPPTVASQSAGIT3GSHRTWPFWD 
ILGIMYWGC 


6133 


20034 


A 


6186 


439 


66 


KPKTLGSKKVNSNPRVPPGK/ PWGVGPI> 
GPGP*MGGSKP/S*NPSPPPPLFKTRFG 
FFPPG*RQGHYSGFLQPPPPGGKTFFPP 
SPPKKWGHKKKI PFQKKKKKERKKRNI I 
GSWTTSETVGNPAPW 


6134 


20035 


A 


6187 


83 


287 


NKMFKIFRFRF/CGFLGQSHSVTQGGVQ 
WHDHSSLQP*TPGLKRSSLPTCWDNRHK 
PPHLVIiFCFYYRL 


6135 


20036 


A 


6188 


290 


609 


RRMDDPPFFKYFFERRVLVCHQGCS*V\ 
K*SSHFSFPCSWDHRCV/PMPHPAN\FN 
FFHTDKSLTMLPRLVSNSWAQVILWPLP 
P*VLGFAGMSPCPQPPSQFCIEAV 


6136 


20037 


A 


6189 


2 


371 


RYIFEKVFAKYSELTNDSLSILSGKSGI 
LWKKRLSLFS*Q*SYKWFSLGAIYCGMK 
* KRFLGTS lb/ FTE * VETGFHHVDQGGL 
ELLTSCETLASAY*SAGITGISHCAESQ 
STLKKYI PGSTI 


6137 


20038 




ojyu 


531 


205 


KWKr/UlVn.ynOVyW/ nUUuOiJU cLjcxr^ C 

K* FICLRLQRSWDYRCAPLRPA\NFFVF 
LVETGFHH FGQT \NLQLLTSGDP PASAS 
QSAGITGVSH*ARPWFIIFTALLYK 


6138 


20039 


A 


6191 


52 


375 


NRCE TS FL FFFFE KKFWFCP PGGGE WRN 
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IjJtv * i v J U. f \ L>t? FG J_ris.il, t £> C Jj 1 JJK.KGGN * G P 
FFPLQENFGVFRKKGVPHCNPG* FQTPD 
PRELARLTSPRGGDGGGRPPPRVK 


6139 


20040 


A 


6192 


3 


158 


FFFLLLNCGLKKETG/WLGVVAHACNPS 
TLGGRGGWITR * ELKTSLANMAEPY 


0 J4U 

./ 


20041 


M. 


6193 


2 


54 


bbthL ESHSVAQAGVQh CDLGSLQPPPL 
GLKEFS CLSLLS S WDYRHTPPHPANLFV 
F/SFLVETGFRHVGQAGLELLTSGGPPA 
SASQSAGITGVNHRAWPEVFKK*RVSFC 
KPvjWbAVv 


6141 


20042 


A 


6194 


1 


323 


FILPSSWDCMCAPP/RPDWVLYFLVETG 
FHR VGRAGLQLLTAS DLP AS AS * GAG I A 
DGVSLTQCSMLPMLECRGVILARYNLHL 
PAACLGLPKCRDCSLCPAATPSRK 


6142 


20043 


A 


6195 


619 


243 


SPETGS CSGAQAGVQWHDLGSLQP * LGL 
K*SSYLSLiLSNV7DYRCA/PPRLANFLFF 
VQAGS ChIiAQAGIjELWSS chlpalvsqn 
AGITGVSHHARLAVAWKRTPLYGLPSRI 
SNLCVLLKFLFFYLY 


6143 


20044 


A 


6196 


385 


19 


GGK FGAQE I S P PGVQLFFRLH FLSNWGF 
KRPPWPPG*FF/VFL+KRGFPPFGRWP 
PSLFP/SDLPPPAPPKGWVSGLRPCPRT 
FFFFFFFDKEC/RliSCPANFCIFSRDGV 
SPCWPPWSRTPDLQ 


6144 


20045 


A 


6197 


498 


437 


ENLQSSINLVDKAMAWFERIDSNFDRSF 
TTVKMPSNLTS I INACCGE I FHERKS * S 
IWQTSLLSYF* KLLQLAPPLLATTILIS 
KE P * TS R / QDRL LLAEG * D 


6145 


20046 


A 


6198 


79 


394 


SVKLLALFFFFFGENFCFAPQVELKGPN 
LG*LNPPPPGLKNFSGLT/LPKKWELGG 
HPPPPSKFFFLFF* KKGMAP F S QGWFKT 

r pt ivt/™vt?7\ r~*T rnx T\zr /~~*t\tt nnpr<T^ 

LAjLJsQhACuTLPKGGNLiRREGP 


6146 


20047 


A 


6199 


2 I 


313 


SASHSAGIIGESHCAWPIMPVYSLVLNF 
FFFLEKKPPFVTQLEGQGGNLG/SMEPS 
PSGLKGI FCLGPPKKWELRAPPPCPGYF 
WFF*KKGGLPIRPGRFWKSVP 


6147 


20048 


A 


6200 


62 


732 


AFQQARKGTQ\GRLSSPAAGALP\PVLA 
PSGPPLPA*\GSSS\PASVAPYPPPQAT 
G/SPPTPPGELRRGGWSKGGLDHPRNTF 
YCWESSEP/PPN*GQS*APGGVNSEGRG 
Q/LPPSSIARCVP*TPSHVPPHPrjAFPP 
SLPHPGPASRAPP/TGSSDLSASAVSAP 
GREGHP *GTRKVPQGAGRQKPGHRGAS * 
GSSCSKRTLSPLSPRQSGFQSRCCFHPP 
RQNKSRC 


0145 


20049 


7\ 

A 


6201 


424 


7 


KNMKi<SPISCVTRDPQIKITMR*HYTGI 
KMANV/RLTGNTKC* *GYGAKGPLIQC* 
WECKIV* PLRKTAWQFSQS /LKI V* /PY 
DPAI ALPG I QLC * KCTS \ KKNCM * I F I T 
ALFIMAKKKQKPRYPSIGEWKYKLLHL 


6149 


20050 


A 


6202 


371 


144 


RWALPRRTNFV FFVETRSLHVAQGGVK 
LLGS VS PP ISPS PSAVT TG VS YCAQP YG 
HPr JiLSFSLIEYEYIYIjDFF 


6150 


20051 


A 






tin 


GliSDPLTS ASRVD YRGMPP C PRNF 1 1 FD 
M * FI * CVCVC/KTWGLI VLPRLVSN FWP 
EVLLKPGPSKVLELQV 


6151 | 20052 


A 


6204 


15 


221 


KASRKMQIETTLRYYLSPMRLAKMQMLI 
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NTS CW / RKSQETRTLI C * W * EGKM VQ PL 
SKGIWPYLIQAEYP 


6152 


20053 


A 


6205 


23 


371 


KVACTCM F I T VL FT I VKL * NQ S RCP S TD 
DWIKKM/WLCVCIYSSVCVCLYVCIDR* 
I ERHD I Y VY IME Y YLALK * /DEIMFSAA 
TRVKLDAI I *S * ITQNVKCCI VSTHSGN 
LIMVST 


6153 


20054 


A 


6206 


113 


417 


S AVTLLV I YLNDLKT YVHTKTCTQMFTA 
ALFIIAKI*KQLRCPS\LIWIN*C*KEM 
SYQAMKRSYARNLWSSKKLWRNLKCLLL 
RERSQSEKSAYCVI PTI 


6154 


20055 


A 


6207 


510 


167 


GLGQAISPSLPKSQGIGAYGFGFFFFFK 
R*DLVCCPG\SLNLVASSQSSCLSLPKH 
WDHECEPPCLAFFFFLL*RQGLAMLPRI 
LPLLKALASGGGMAKKKCIRYFSMKIHL 
QAQ 


6155 


20056 


A 


6208 


418 


40 


S YTHTHTHTHTKTHTH IP* * LTGS/RLQ 
GICRKGSRKHPRGliSQLLP 


6156 


20057 


A 


6209 


1 


258 


SGFLLS*VPSVRDPTGNRTV*LTWQPLP 
APLEF\CPRLSD 


6157 


20058 


A 


6210 


229 


310 


DFSEAQKSRQCGGG I SNVNHPGWLKPV 
IQHF * DGVLLCCPGWSAW* SQL? * LPR 
LK*SSQLSLPR/SWDYRHVLQ\HPANCF 
VFFVEMS IiAMLPRLVSNSWALA I LLPQ P 
PKVLDDWLEHHTWMI YITYATTTLP 


6158 


20059 


A 


6211 


496 


1 


CCQES FE I LGS I NS FGPGLKKGGVKNMR 
PGAGPTLNKRPFSDNPLQPGRRGGCFPF 
VSQRGSQKDRVPPKWFFFFFSF*DRVSI* 
WPPGPSARKPSWLPVASTSR\V*QSSCP 
PLWFPGPPTMVSYFFFF\KRQDLPMLPR 
LV*NFWPQVILLSQPPKVLRLLQARA 


6159 


2006O 


A 


6212 


3 


340 


VTAGI I I I FMI*EAFASNRKVLIV*EPS 
I * SLQV/ * WRGLLLLGSPPPYHTLEEPG 
YVTSKRKRKESNPPKLASSQPHGLHDFF ' 
KKKKRGGPFKRILEGAPFGGGWGHREFF 
P 


6160 


20061 


A 


6213 


60 


493 


KSQQNFFCFETESRSVTQAGIQWHHFSS 
LQPPPPGFKVISYL/SLKSNFSGYP*DF 
FCNGPPCS*GPIRFTYYICVIPSSWDYR 
RLPPHPANFCVF\ IKTVSPGWSQTPDLR 


6161 


20062 


A 


6214 


414 


1 


LCNGRPGLLVSPLPLGEELIPCLPR*KIi 
GPPRVL*\KGGPFL*SSFAPFF*EMESH 
SVT\RLECSRAILVHCNLCLPGPSDPPP 
S AS * VAGR I ARAQKVEAAVSHDHDAALQ 
PEQHSETPSQQTNKNNQTKLTQKYPF 


6162 


20063 


A 


6215 


2 


403 


SLLLPLVQLPLRLCFFFFLKRGFNFGAP 
DKREWGEFG\*LKPWLPG*NNFSGQTLR 
RSGD* RKTPHGGVNLVF * EKTGFSNLGR 
GGLKPPPSGNTPAWPPKGVGITGGTPPP 
P P VFC FLG F C ENTNLRAKKKF I 


6163 


20064 


A 


6216 


2 


388 


KMEQTLWKTDWEFLTKLNI L YP YS PAFA 
DLGIYPNELKT/ C/ CYTNTCI*KSIAPL 
FIVARTWKQPGCPLRDGWLNKLGCIQAL 

QEDRFSLGVQEQPKHNG2M 


6164 


20065 


A 


6217 


363 


302 


NKE FKTTLANMGE P P FL * KRGF S \MLAR 
WLNSLFHDSPPPASQKAGIPGVSPPPR 
PLFFFFFLLETVSHYVAQAGQLISNYDL 
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nucleotide deletion, ^possible 
nucleotide insertion 














KLPSCLSLLDTWDYRH*PPYPVFKLLNS 
TYKAC 


6 J 65 


20066 


A 


6218 


576 


6 


RAVLRGLLVLLLTKHTFFLFP PRQSLAL 
LPRVGVQW\HDL\GSL\QPL\PPG\FKQ 
FS\CLSIPSSWDYRYPPPRPANF\*FSV 
ETGFHHVAQAVLELLTSSDLPALASQSA 
GITDLGQS2TIHGGSGLDKPPDHKVDKG 
PTEERACLLS YSAGRKYKEYH I LPEQGS 
EPPHHGDI LS I S FQAASH IAQTPPAHT 


6166 


20067 


A 


6219 


44 


496 


GFRVDPRVRPRVRLNS * PRDPPTSGSAG 
I TGV/ S SHHARP * N I FT* KLG I S FLVFW 
GFLFLFFVFCFFFFWKGGFLLVAQVEGK 
/ WPKFGLI EPSPSGVKEFFFFNP PEGWD 
\ YGLAPP PGVI FWVL I KNGVS F F CRAWV 
QTSGLSWINRLNLP 


6167 


20068 


A 


6220 


1 


421 


QRGRLTAI SRETS FTLVAQAGVQWHDLG 
SLQHLPPGFQGFPCLSLLST\TGNCSVT 
QAGVQWCHLGSL* PLPPGLKPSSHLSPS 
SSWDYRL* KPHQGSD*VCVT\GAASPEP 
NTMTAWNKPPDIRMKETLLLPPSSSEKM 
L 


6168 


20069 


A 


6221 


3 


36 


AYCIVKPSP*PLTGALSALLMTSGLAM* 
FDFHS I TLLILGLLANTLT I YQ * WRDVT 
RESTYQGHHTPP VQKGLRYG I ML? ITSE 
AFFLAGFF*AFYHSRLGPTPQLGGDWPP 
TG I TPLNPL* VPLLN \TNE* LA *GVS IT 
*AHDP 


6169 


20070 


A 


6222 


895 


13 


ANSWANN* FELATQ I AFSGR I PLTDSHL 
GLLKAHVEAPI \ AGL I VLAAVLLKTGGL 
MLYTASHSFSNPLTK\HIAYPFPVLSLI 
EA* FITS SHLPSDKQDLKS \ LMAYSS \ I 
SPHSPS * LTPILI QT?\ W\ S FNGAGQFS 
IIAHGTYHAYITYSA*ANFKITERTSQC 
ACINPAAQGLSNS * LQLIGLFE*LPRQA 
\SLTMGLTPPLINPTGENCSVASNQRFS 
/ WTQ I SLLLP YRDST I LGSQALYSL Y I F 
TTTQWGSLTHHINNI KPSFTRENTLMFI 
HLSP I LLLSLNPD I I TGFSS 


6170 


20071 


A 


6223 


68 


405 


HS YALVYNP VGIR VKVHLSKC I L YTWH I 
SNSSQEVCEMPNLNLKRVPTTLAPRF*N 
IS *G ISQLGAVAHTCNP3TLGG* GRW/ 1 
QQLKTS LANMAKPHLLLKKKKN F PN FPK 
RG 


6171 


20072 


A 


6224 


591 


283 


GTSTQAGVQWCNCGSLQPPPPGFKRFSC 
LNLPCSWDYRRVPPHLANF\ *flvemgf 
HHVN/QAGLELLTSSDPPALASQSAGIT 
GMSHCARPRVAF*GEKNHRSL 


6172 


20073 


A 


6225 


302 


173 


GFFSFFLKC*DR/SLTMLLRLVLNSWAQ 
VILPLWPPKVLGLQA 


6173 


20074 


A 


6226 


417 


81 


TQTRGPPPPPFLGGPPSGPERTVPPPRG 
GVPQTGSRFFLFRQNPGPALFPKN/EGG 
F* KKFGPPLPPF* KTPFFFTNRGNPCPF 
FVGENPQAPPVFGGFNPFPRKKKKRNDF 


6174 


20075 


A 


6227 


41 


474 


TDATETFHFTSKH\HFGF*AAA*YWHFV 
DW*LFLYVSIY**GSCVPHPA 


6175 


20076 


A 


6228 


503 


28 


AAHAPPRVRKVIGH*AFFTQRCTNSRH* 
IIKFPKDKDKARILKGAREK/RTTYNGA 
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nucleotide deletion, V=possible 
nucleotide insertion 














PIWLAVNFSVETLQ/A/RREWHDMFQMP 
KEKT F YLR I L YPMK I S FKHERE I KTFP D 
jfJjKVkiur ISSKPVLQEMLJtoWPGARVHT 
GNPTLLGG 


6176 


20077 


A 


6229 


418 


175 


VSPLSPLKILFFPKGLKFWREGGPKCPP 
PKKRVLPKK/PPAMF* IPPPKRKKSKLP 
f FKKl WPfyvjlr ivbt'Fif'Ur irrbfrr 


6177 


20078 


A 


6230 


45 


308 


ACVRLKH*TDN*QPNIYNQP /HKSLLP* 
KKKKKKKKKKKKKKKKKKKKKKKKGGGG 
r KKKFGGG P I S WGGii KI X F FF FGGGKKN 
TLGIF 


6178 


20079 


A 


6231 


440 


209 


IQHFRGPSRRGLLGPEARASPGQHRESR 
SFLKMQP\NKKFS*WJWCTPAIPASPKV 
EARESLEPGRRRLHLTEALQPG 


6179 


20080 


A 


6232 


419 


2 


SSPPPPGGKFF*KKPGKKKFSPPKKKRV 
FSF P \ PPKKFFFS PKA*NFWGGGGQKF P 
PPKKGVFSKKPKGVFKTPPKKKKKFFFP 
PGENWGPPRVF*KGPPPFFFFFFFFFFF 
FLS FNVN I LFKP VQAPCLT 


6180 


20081 


A 


6233 


448 


112 


QSARITGVSHRTRPMLRTSTPLSATDRN 
AGQKI SKDIADLNNNEQDHRDI YRNSTQ 
* QQNTHS FQAAI RNKI DP I LGHKKMLTH 
/ CKQS DQTWNKLE I SNKYAKPQKIKHT I 
K 


6181 


20082 


A 


6234 


2 


351 


MDQGI I/SDFESYYLRNTFCQAIVARDG 
D S S DGS GQRKIjKT F WKGFS LLD AI KKTE 
DSW/EEIKISTGWKLIPILVDDFEGFK 
TSVEE/VADVMDLSRELEL*VEPEDGTE 

liljQbHSQT 


6182 


20083 


A 


6235 


3 


252 


FHTYNSLLVCHSNLCYQNSLLFFIIH* / 
CI /HFHS * AKVS I WKHTYTHTHTHTHTH 
MHTHFFQATDRKI CVNVYMFS VAYIHKG 


6183 


20084 


A 


6236 


3 


294 


IITF*LPQLSGYIEKSTPYECGFDPISP 
ARVP FS I KFFLVAI TFLLLDLE IA\LLL 
PLP* ALQTTNLPLI VMASLL* 1 1 ILALS 
IjAYBoIjQ1n.GJjD * Ail 


6184 


200S5 


A 


6237 


381 


121 


P VPAS F* FFFF YRDR \ VSLC YPG WSQTP 
GLKQSSCLGLLSHWDYKYDPPHPAFISF 
LLSS S LLWVL * RP VVLLCATVPAFLD I R 
EQ 


6185 


20086 


A 


6238 


395 


62 


PFFPIPKFFPWGPNFFF/PLGPPPFLGF 
FPGSFAPPSHFSPPD*NFPRPPKNFPPG 
GDLFPQKKGDWFFPKKRGVFFFLGGFSK 

tfcnPtiiiPCT PDnvwDDVT v/^* id v ircr^'C 
r btiljiri'r oJjtaFyKJvJvXr J\J. AtxFAlVr Lr 


6186 


20087 


A 


6239 


1066 


761 


S FFLRV7S F I LVTQAGVQWCDLS SLQLP P 
PGFK* FSCLSLPSSWDHFJRPPPRLANFC 
IF/M*RWGFSMLARPVSNS*P/CSDPPT 
PASQSAG I TGLSHRAWRLVFL 


6187 


20088 


A 


6240 


390 


27 


GEL PT P I YLAP P AS P KGG V SGG S P PS RV 
KFFPLEVPPPPPPGC*PGPFFYIWPQGG 
ARVP * QQI RFFSLRW \ PPRPTRPKNPHL 
Q VGP LR I CLS PE TGGG VI KVFRGS LLKK 


6188 


20089 


A 


6241 


366 


1 


KI KNFDRP I /TSKGA* S W/RNLPTKKS 
PEPNGFPGVFCPTFKKKLTPVF/SKLFP 
KF*KEGILPNSFF*SSLALIPNPFYPSK 
KKICKTKKWYRAISLMNIDEKIHYKY* QT 
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1 — inreonine, v=valine, 
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X-Unknown, *=Stop codon, /-possible 
nucleotide deletion, \=possible 
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EFRSTLKVLYTITK 


01 07 


OA AO A 

zuuyu 




0242 


23 


439 


lAdbRFthrr r h i*r r LJjFPPPGCjNPSF S 
FLKKKKKGGGGGENPPNPPGKAPNPQGV 
GVPPFPPPPVFQKNPPPPFFF*KNPPPS 
r'JN.ot'r'vjtjJt'r'r r r Flr\iOvjOijPPF± / rlfii 
KKKGGWGGPPFF*KTNPPPKGEITPIiVG 


6190 


20091 


A 


6243 


20 


261 


WAQEILKKFSCLSLLSSWNYRCAPPHVA 
N\FVFFVETGSYHVAQSGLELLDSRDPM 

r mT?CT4U7\nD'tJi?r i * tt v r P/" , i?or\OTT 
Xoiioiixxrtyi'xir -LJ_>I lvjH*oyK±ij 


6191 


20092 


A 


6244 


376 


76 


LLPRLECSGVIKPFSHLSLPSSRDYRCV 
/ PTMPNF*KFFVELKFNSL*R*GLTMLP 

LTDF I TTFKG YRHGDTY 


6192 


20093 


A 


6245 


222 


233 


KS F YFALI FE KMFCYVKKS QVGLVFFS K 
VKVLVRPFFPMVIKKNQP/FNPL/RFFF 
FFFQDRVLLCHPGWSAW* S KTLS 




20094 


A 


6246 


2 


358 


EI IiiTir QSYY* ENTFI\EAITAIDSDSSD 
GSGQSKWKSFWKRFTILGAI KNICDL*D 
*V/KISTLSGVWKKVIPSLMDDFEGF\K 
TS VEEVI ADWE IAR* LELE VEPEDVI E 


6194 


20095 


A 


6247 


3 


322 


PDKSLLQTPTPGFRRSSHLGFSSSWDYR 
LEPPHSVCVCVCV\FCVYIYFFFFLEKT 
GFGFVPQAGLNLGVKR I FKA* PPGGVE I 
PGLTHTPGP WGGP IKGKGANPAP 




20096 


i\ 


624S 


1425 


1280 


SjjDFQGIiALPGWSAVV * SCLTEASTSR\ 
VKRTSHPSLPNTWVYRKALPRLANSVYF 

lj^KWLTlaIMJjPKJjVjt>Ni>* iyAIlA2PWPPK 

VIAL * ADEEMKAKGHISTKWGGQTLNPQ 
PDCGATSVYSLP IRKFQ 


6196 


20097 


A 


6249 


189 


336 


HAPFPVSVISLSCQKRRR\WLAMVAHTS 
NPSTLGG * WGW I A* GEE FETSQ 


6197 


20098 


A 


6250 


2 


300 


FLPSFMGFWPVLGHGRASTVTPSEPLSL 
PtliijPAEPrLiFFFFSLFRKRESIFGA/ P 
GGLEGGQFNFLEPS PPG I KE I FWPNPPR 
GWELKIWPPPPINFCDF*EKTGFTOVAR 
aot WT.* r P(^r2T.T7r , <^ , rM?Mt: , T uric nnnr vt? t 

FWPNPPRGWELKIWPPPPINFCDF 


6198 


20099 


A 


6251 


1 


198 


LTGRL/SKCEWSWRFDVQLKDLKRWQN 
TGGKSLGFFF 


6199 


20100 


A 


6252 


11 


214 


SLRIiKSSSHLSLPSSWDLRCELPHLAN* 
KKKKKL/ CKN/RSLTGLPKLVSNSWV*T 
ILPPRPPRALGLRA ' 






7\ 
*\ 


02j3 


A AO 


3 


JxWjr viji>pyvi£j^jv±JAJLovji>r JNPfc»FPGEK 
IPPPQPPEEVGPPGGPPQTRGNLGI/LG 
N*GGGGGF*PPGAGKSPAPGPQKG\PFG 
KDKREKPYGPPPKLFFFFIiRQSLCFAQA 


6201 


20102 


A 


6254 


362 


78 


VGVN/MTFFPQHFLGLSGIPRRYSDYPD 
AYTT+NTVSSIGSFISLTAVLIMIFII* 
EAFASKREVISVSYASTNLE*LHGCPPP 
YHTFEEPTYVKVK 


620? 












ALFL I AKHWNQPTC/ PHTME YNS VI K/N 
EQT I DTTTTWMDLKC IMRHEQRQS * KKS 
VLVCFHAADKD I PETGQFTKERGLMDLQ 
FHVAGEASH 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine. l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline. 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y^Tyrosrne, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 


6203 


20104 


A 


6256 


2 


305 


QWCNHSSLKSNPGERETVSSLKLPKAWN 
PRHMLLCPTTSFILRWGFIYVAQAGFDT 
RGHASL/LA*ASPSVGITSKSQRVRPVH 
DIACFHRESTMFSAENFN 


6204 


20105 


A 


6257 


335 


3 


GPFQSPPPGVTPISCFTPPKKGGNRGPP 
PRRGKFWI FKKNGGSPG* KRRVLFPDPG 
VPPPRPLKKGGLKGGAWAKI FFFFFLRQ 
SHSFTQAGMQWCNLGSLQPPPPGFK* F 


6205 


20106 


A 


6258 


219 


2 


PTLSSQKDTKAQRLYSFPLTTTKDERLV 
TLVS/WPGWAYTFIPSTLGGQGRWVT* 
AQKFETSLENTVKPPSC 


6206 


20107 


A 


6259 


306 


117 


SFELLGSSDPLISALSSWDYRCAPSRLA 
NF* FFVCVEMGSSYIVQ/SWPQ* ILLSW 
PPKVLRLQT 


6207 


20108 


A 


6260 


572 


269 


ETDILSVAQARVGGLHLGSLPAPPPGFT 
PFSCLSCSS *DYRRLPPRPA/NFFVFSV 
EMGF\TVLARIVSIS*PRDPPPSGLP*G 
ALGLPGVSPLRPALCLLLK 


6208 


20109 


A 


6261 


357 


116 


FAQAGLELMGS S S PLAS ASQS VGI TGMS 
HCA\RLLLFFCGGRW*GG/LK* FSCLSL 
LSSWNYRHEPPHPANLCIFSFNTISY 


6209 


20110 


A 


6262 


3 


337 


HERHERHERHEQ S AE ITGVSHHAGGG I G 
LLNF FLFLFFEAGS CTVT* T/GV* WHDH 
GPQ*PQIPRFK*SSHLSLHSSWDYRHVP 
P*LGLQACATMPG*FFIFCRDPISEINK 


6210 


20111 


A 


6263 


515 


150 


FLRWS FTLVAQAG I QWCGVS S LQPLPPG 
FKQFFCL,SLPSSWDYRCPPPCPTNF\*L 
LVEMGFHHVGQAGLELLTSGDLPASASQ 
SAG I TGISHRARPGTLFF* AANGGGKQV 
FLLLRVLNGL 


6211 


20112 


A 


6264 


J 


296 


GTSFLPSFLP/LLSFLKFLNFFQIVSHC 
RPGWSAVA*SQLTAPSI\FGLKPSSQLS 
RLSS WDYRCPINFF\SLPMLPRLVSNS * 
PQAILPTQPPKFLGLQA 


6212 


20113 


A 


6266 


298 


182 


LHFFEVSKIIKLIRAEAKWCL*VLFTIA 
KIWKLH* * ICKYLSIDEWIKKMWYIHTV 
E YYSAIGNKE I / PVI VTIWI YMENIMLS 
EIN* *QKDKCMIAIiL*GIYIQ 


6213 


20114 


A 


6267 


402 


234 


PPPPLFKNPPPPPFKNPPPKKKKIIFP/ 
SPPVNLAPPKEF*KGPPPFFFFFFFFFF 
FF 


6214 


20115 


A 


6268 


125 


17 


KGQAQWLTPVI PALREA/EEAGG* LEPR 
SSRPAWAI 


6215 • 


20116 


A 


6269 


1109 


454 


GDSAGAEGTMENFTALFGAQADPPPPPT 
ALGFGPGXPPPPPPPPAGGGPGTAPPPT 
AATAPPGADKSGAGCGPFYLMRELPGST 
ELTGSTNLITHYNLEQAYNKFCGKKVKE 
KLSNFLPDLPGMIDLPGSHDNSSLRSLI 
EKPPILSSSFNPITGTML/VRLPPPHWP 
VAGAVSSDAYSASQEEE*AQAQTEPYPG 
S CPPRNT I * FRSQEEEKEKRRGS 


6216 


20117 


A 


6270 


111 


86 


WGITGVSHCAWSRNGTLTHCWWECEKV 
/QPLRKTV*QFLMLKI\PYQMI*PLLGI 
Y LR KNKAY IHMKTLK * MFS LF L FE I AK / 
TWKQPKYPSASEWIN*LDSFSLYWLGWS 
RTPDLK 


6217 • 


20118 


A 


6271 


2 


222 


RSCESNSQTSACYKSFYLYPLKS /GGFK 
I LERM + P I Y FL I PYWTI FNQRYYFYIW 
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ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
f>=Glutamic Acid, F=Phenyla)anine, 
G=Glycine, H=Histidine, I=Isoleucine, 
JK=Lysine, L=Leucine, IVNMethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Va)ine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possib!e 
nucleotide deletion, \~possible 
nucleotide insertion 














NLKKTRGIYTVLYKMYKSF 


6218 


20119 


A 


6272 


787 


470 


LPFFFSFFETGSHCWRWERESLGVAQVG 
VQ*CDLRSLQPPPPRLKQSSHLSLPSSW 
DYRCAPP* LDKFFFCXFCRDGFHHVAQA 
GLKLLSSG\WLPASVSPSAGTTL 


6219 


20120 


A 


6273 


2 


322 


FCMAIAAIDSSSSHGSRESKLKTLWKRF 
SILDAIKNTCDSWEEV/KVST*TEVWKK 
LIPTLTDDFQRFKTSVEEVTA\ELLWE 
I TRE I ELEVELDDVTELLQPHDQT 


6220 


20121 


A 


6274 


175 


406 


KNSLGGPNLTGEGKKITFSL*NFFFSIjE 
ALIFVGRLSSFVPPPKIRFFSKNSRNLF 
/CPVGEFFKTRPPVNFFF£LP*RKSYFF 
SFPG* IWPP*GIFLKAPPFFFFFFFLRE 
RREVSMLPRLMSNSWVQVTPPPWPPKVR 
VLLGIVDVQ 


6221 


20122 


A 


6275 


2 


181 


PRVRGRVGHRKGKKSFVHCWWQWKLL* P 
LWKTVWRVLKTLK\ I ELPYYP/ATKLL* 
SYPLE 


6222 


20123 


A 


6276 


397 


47 


PPPSKNFFFPPQKDFFLGGLPKKSPPPK 
K\FFFKKTPKKNFPPPSFFKKPPPPAFI 
KTPLKKKKKFFPPPQKFGPPPKFFKKPP 
PLFFFFFFFFITTAP* KITKS I PYFYFF 
TKTFYL 


6223 


20124 


A 


6277 


2 


264 


ETESHTVAQSGCSGAI IAHRSLNLQGST 
NPPTTAS * VAGTTDACDHAWLMF * FlA V 
EMGSHYVAQAELELLSSSDPP\PQSPKI 
LGLQA 


6224 


20125 


A 


6278 


3 


204 


IRCYRKI FYERKHQWLQQISLLS YFKKL 
PQP\PPL*ATITLISQQSLISRQDPPPV 
KR* * LTEG * DDC 


6225 


20126 


A 


6279 


57 


306 


I NKANRDKL/GT I DFLEFLTI KVRKMQN 
TKTESKKRN* ES I LHVE*EDTGHVRAAA 
VCRDMTYLGEKLTDEEVDK\ I IREAKYW 


6226 


20127 


A 


6280 


419 


18 


SSSS PKACMFVYE WVCTQKCVW I YEYMC 
VFVYV* LVLMCCMNVCACM\C* SACV* M 
CVICECVCM*GLSVICEGVCM 


6227 


20128 


A 


6281 


459 


235 


FLRNGVS CCHPGWSAWQS * LTAASNS * 
AQVILPSSWDYRPVTSHPAKTHNFYMQ/ 
CGNKKMCVTHS I VI FSLLQ 


6228 


20129 


A 


6282 - 


505 


470 


GVP*GQEFEPSLSHMGRPCFFLKCKINP 
AWWQKLF* PRRQKFRLTQI /LALAPQPG 
QQKVNFFS/NKKKKKNPNNC* IALR 


6229 


20130 


A 


6283 


206 


374 


GGGGGKKKN'WGGEKKRGGEKKIFFF*SP 
PRFFFSPPYKKKKFFFPPPFFFPPPNFF 
FYPPPPPFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFLNHFF\FF*LYVIIV 
L FMGHS D I IjMHVHNV 


6230 


20131 


A 


6284 


431 


269 


Y I KGGPPPEI ETFLI FF Y YI YL 1 1 FI FF 
FFFFFFFFFFFFFFF*FVCLKYLIFNHI 
QRF*TYISIPHFMLLT^\YTLKLFTFCIC 
SLFTI I WFYIFV*L* YCCTVKALCNSL 
LVSELDKY I KNNYWCKQ 


6231 


20132 


A 


6285 . 


298 


44 


YLFFFFF*VELVFFLFCMFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFEFKNKVSFNS 
CFSLSLFKLYFLKYNFIQLQGNLHFNNI 
I 


6232 


20133 


A 


6286 


386 


1 


ANLH2 KVNFWNFTLLNFSFLPVWFGFH 
FLW*RVYLSHRGFYVF?QNFLRIFFFF 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProJine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tbreonine, V= Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /-possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














SF*VG\FFFLFFLLFFFPFFFFFFFFFF 
FFFFFFFFFLCIAI VFY* IWYISYFTKY 
IEEPRTRGRTRGRTRG 


6233 


20134 


A 


6287 


190 


134 


LEFGRNRFQFSRMILRGSGLQWRNYADM 
VKIAKT* YLRNTFHKVIAI IGGDSCDGS 
GQNKL KT FWKGFS \ LDAING I HNSQEE I 
KMATLTGVWKK* IPVLKDDFEGFRTSVE 
E/V/ TADMVKI AKT\LELEVEP\EDVT* 
LLERILF 


6234 


20135 


A 


6288 


394 


2 


RRQASLSCGGLHPVRSSRLLCLPKQAWA 
MAGAPPPASLPPCSLISDCCASNQ*DSV 
GVGPS KPG VGYHLLVHRFLS PLSKRS I R 
VGVARFSRCRLSPLSLTRKGN* LP\TPC 
ASRVRQCLTLLQLAHGALHP 


6235 


20136 


A 


6289 


233 


58 


DRVS L CRPR WHWS RTRLT VASTSR / LQ 
AILPPQPPE*LGPQASANRSPRSPKLTG 
LRG 


6236 


20137 


A 


6290 


385 


48 


TLQYLCTPTSTV* KKKKPEN I KYW *QCG 
ANRTLFQ * WQVCKVAQTLWKPVWHYT VL 
NLNTRIVYDPGMPLLiGTNNNAYMCCIC\ 
YQKTFTRMFIAALFEITPK* KMHKYLTV 
VG 


6237 


20138 


A 


6291 


273 


36 


RNGRWTPCGGR* IHPS*GTDPVYLPVPG 
TbSSSQASNSEPLLKAPGSRPXPWPTDL 
GYKDPRGPPSNPACRRRPSLFHSD 


6238 


20139 


A 


6292 


394 


5 


AKIYFSINPLVFSKQ*FLKNVFPNILHL 
QFFSN INSLMLI KV * ALLQD F PLVHNVY 
NKICDTS KDTTTLFMYLMFVFKMKTFFS 
LKAYI /WSGTVTHTYNPSTLGGRGMWI T 
*GQEFKISLANMVIASLY 


6239 


20140 


A 


6293 


536 


190 

! 


RRSFALAVQAGVQWRDLGSLQPPPPRFK 
R FSCLSLPSSWNYRHAPPRPANFVFLVE 
TGFLHVGQAGLELLTSGDPPV1*ASQK\A 
GNFTGVSHRVQPD*GHGGSRLSSQP FWEA 
KTGGSPEVRSSRPA 


6240 


20141 


A 


6294 


2 


238 


HEADLGTTQVLTHPSTTAMYFVHYCQPP 
*ILYGTINT*PPWHKNPIKIKTP/SPC 
LQASTAINPQLSHINC\NPKDTPH 


6241 


20142 


A 


6295 


2 


447 


F FFLMGV S LFCPGL / WS AMGA I LG SLAT 
SCLLGVQVILLCLSLPSSGKLQVCAYPH 
LG+FLYF*AEDGGFSPCWPG*VSNS*PQ 
VIHLPWAPKALGI TSMSHCCPACFHCTF 
FRHFRQNPHLNLFFFLEGLGLSPPPPSQ 
PLEGGKGPGN 


6242 


20143 


A 


6296 


2 


133 


AEERERERERERERE \RERERERERERE 
R*VERERGGVPLRTYR 


6243 


20144 


A 


6297 


3 


244 


HDYSS FLAPPALGYMRMPLHPAYFI LFY 
FI S/ FYLFFSQDEVLIiC* LGWSQTPELN 
QASCLGLQKWWDYRCEPQCLAECDS 


6244 


20145 


A 


6298 


1 


361 


GTRRSAMLARLVLNSGPQ/CYLPALASQ 
RAGITGVSH*GCPFFSLLKKRWRSHCVA 
QAGVQWFFRG*TPGLSCFSCSLPSN*DY 
KCAPLCPP*DTVF*VRAVSSEVPLTITY 
RGEGAGVWA 


6245 


20146 


A 


6299 


3 


348 


HERHELEELIKNLFFFFFFGKKIFFFFP 
GGKNGGPF* FIGAFSFRPQGIFLPYPP* 
KW\DWGPWPPCRGDFWFF* KKTPFS 1 3UA 
RWLNSPPWGILSPYPPKGVRFPGGTPR 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Giutamic Acid, F=Phenylalanine, 
G=Glycine, H-Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
I— 1 nreonine, V=Valine, 
w— i ryptopnan, Y— lyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possib)e 
nucleotide insertion 














PGPV 


6246 


20147 


A 


6300 


1 


436 


VDGEFSCLSLPSGWDCRRASPCPANF/S 
V FL VG I GFR YVG QAGLELLTS GDIjLAS A 
SQSPGITGMSHCTRPRKHH* IANSSSVL 
GDFRIDYFQ I SRKAQWI TLGYNPWAESH 
LLMGAGLRVLPAPRLM I SQVDLL PATHP 
SSGRRG 


6247 


20148 


A 


6301 


370 


70 


P KG * S TLPAS AS RS AG I TAV I L * SHCT S 
PPM*LRFKHFTHSSTSTPF/CFFFFGKK 
ILI FVPQPGGKGPNLS * LKPLLSWLRKL 
PCLTIi*RGGDYPGFPPPP\SYFCFFIKK 
bv* PCGPGGJNS*PAbASRSAGITAVIIj 


6248 


20149 


A 


6302 


1 


285 


KHFFL.FTYLFRDRVSVCYPSWSTWE/S 

T * C TVJT T 7\ 7\ CM / 1^1 PCV-tDCTTT C T T /T_TT_T 7\ 

LL VDFNFL* RRRLI RQ PRL VSNS WAQAI 


6249 


20150 


A 


6303 


194 


351 


GTFICIKKPVQ/WLDTVAYTCTPSTLGG 
RDGRVT*AQEFETSLGNIVRPHVYKK 


6250 


20151 


A 


6304 


354 


211 


YRHMP/S*LADFFIFCRDSLFMliPWLVL 
NL*AQAILLPQPPKVLGLQE 


6251 


20152 


A 


6305 


1 


346 


GTS S LQL I SLHS S PTAS YP PKHLHTRLT 
ELLSVSLMAHVLSVSPATSS/PILQPHP 
IYPCTWIMFTHSSGSGKPSCQTHTKTYT 
HTHTRMCAHGSGHATLPP*HPLHPRHSA 
YLLW 


6252 


20153 


A 


6306 


395 


177 


WLNTLWY IHTVDYFLAMKRSDLLIHS PW 
MNLQGIMLSEKRQ\SQR*HW*FYLYNN 
VEMKKS QKWRTVLWLPE 


6253 


20154 


A 


6307 


2 


718 


GVLPYWPGWSRTPDLV/HPPTSASQIAG 
ITGKSHRALP\KFAFLTSSQVMLTLLAH 
GTTAWFSKLS * F * EPPMTCAVFS WTG 
SCHINSFRIQTCL*YYSVFTSCFYCSMI 
EKLTTGIS*QKEHELPFSQSSDIFKLAY 
LI FFVJ/ CWEMESHS VSQ AGVQWHNLS S 
LQPLPPRFK*FSCLSLPSGGDYRCPPPH 
MANFCI FSRDGVIiPC* PGW\S * TPDLRM 
DP PGIiRP r QSAGDLiTGMEA 


6254 


20155 


A 


6308 


377 


67 


PPPPKIFFFSQTLFFFWGGCPQIAPPPK 
KFFF* KNPQGFFFPPPKKKKI FFFPPPK 
FLPPQKFF*KPP/LSFFFFFFFFFFFFF 
FFFFFFFPTI * I FYSMSFLP 


625 D 


20156 


A 


6309 


1 


362 


QTYAKMHVTI I FFGVK\LTFLPQRFLGL 
SGMPRRYSDYPVAYTT*NILSCVGSFIS 
Juj. All I Ij 1 1 r M I Rt AFAS IRQELI VEE PS 
INLE*LLGCPPPYHTF*EPGYLKPKRKR 
KESSPPKLV 


6256 


20157 


A 


6310 


2 


411 


VLPTLWEAGARGSLEARSSRLQ*AMIT\ 
HCTPPWAI 


6257 


20158 


A 


6311 


59 


334 


THACGNMIjS YAGS F I SLTAVLLI TFM I * 
SA\FASKRKVLIVKEPSINLE*LYGCPP 
P YHTFEE PVYI KSRQKRKESNP PKLVS S 
QPHGLHDL 


6258 


20159 


A 


6312 


3 


22 


KPSP * PLTGALLDLLMTSGLSM* PNFHS 

QGHHTPPVQKGLRYG I I LF I TSEVF\ FF 
AGYF\WPFTHSSLTPTPQI *GHWPPTGI 
T PLNP L* VP LLNT FVL.LALG VS IT* AH D 
P 
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G=Glycine, H=Histidine, l=Iso!eucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V= Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, A=possible 
nucleotide deletion, V=possib!e 
nucleotide insertion 


6259 


20160 


A 


6313 


11 


182 


INKMW* I*VRG* YSVIKK\DEILSFMAT 
WMEVEDILLSKIREAQKVTHPVFSFISG 
S 


6260 


20161 


A 


6314 


3 


421 


IQASTMAGAPPQAS LLP CS L I LDCCASN 
S/RGSVGVGPSEPCTGYNLLVCLLLRPS 
EKCSIRVGVT*FSRCRPSP/TRKGKSPT 
PCTSRMR 


6261 


20162 


A 


6315 


1 


111 


DKGMNVFMPANTASVLWPMDQGVILNFK 
SFYLRN\TFSKYISAIDSDSSVGSGQSK 
FKTF * KGFTNLDAIHDS WEE I KI ST * TG 
VGKKL I \ PTFRDDFKGFSTSVAEVSADV 
VEIGREL*LEVKPGDRTELL* SVLWPMD 
QGVILNFKSFYLRNNIF 


6262 


20163 


A 


6316 


440 


312 


TSSFSLSFF/CQDG/ SLTMLFRLVSNS * 
PQAILPSWPPKVLGLQL 


6263 


20164 


A 


6317 


5 


404 


PR YM I VKLLK I /ERNKILEAAREK*HFT 
SKGKAI WI IVDLSSETMEVRKNLHNFQV 
LKEK/NKTWEVRI LYSAKI S FQKKGKI K 
I FS DEGKLR * S FTS RPTLRE * *RKFSK* 
K*NDRRRNLKTFGRKKEYPKELKP 


6264 


20165 


A 


6318 


394 


182 


MFIVA/VFSKAKEWRQPKCLSTDEWV/Q 
YM* YIHTMKYYSAI KSNELVI HVAA* I L 
FENTVLSERSHIWHGSHI 


6265 


20166 


A 


6319 


1 


378 


APSDFFLFQSLNKPLNCTHFSSVHNVKK 
TSLT* LNSQNPQFLRDEPNGWYHCL/QK 
YLELDGAYVKK 


6266 


20167 


A 


6320 


286 


314 


NRCCFSLPTQVKCFNLYLGI CCVCDWGY 
YRWLNF *NP I F * FI LRQFHS VAQAGVQW 
RDLGSLQPLTPRPANFYILVEMGF/TPL 
LARLVLNS* PSLLKYKN 


6267 


20168 


A 


6321 


23 


397 


QKTLNMEKEMNIYIHKM* ITPNRSNI KR 
SSLRHITVKFSKVKDKQRI /MXLAIEKQ 
P IT YKETTIRLSEDFSAETL /NARR* WD 
DIVRYWLKKKKKKKKKKKKKRGGGLLKN 
SLGGPILTGGGKEIF 


6268 


20169 


A 


6322 


411 


2 


TEPPPPGGNF \ L * TKPGKKNFLP PKNLG 
FFSPFSPLKFFFSPK\P* ILGGGGAPFC 
PPQKKGFFPKI PRGVFFSP P / YKKKNL I 
FPPPVILGPPKVFFKGPPPFFFFFFFFG 
QSGQVKLKSPKCK 


6269 


20170 


A 


6323 


395 


280 


MLARLVSNS*P\CDPPGLASQSAGITGM 
SHHVPQVNHF 


6270 


20171 


A 


6324 


3 


359 


QHYLVFRETQIKAAVRCHYI PTGMAKI I 
KTGSTKC *Q* CGAAGTLIHCWEECR\LK 
QPLWKTAW* FLI * MLCD*TVPFLQI YPT 
EMKM C VCRMTC \ RVFLS V L F I I AS S WKQ 
PKYKKLVN 


6271 


20172 


A 


6325 


3 


174 


FFFETGPNFVNQAGGQGHDLG*LQPPPP 
\GSRI FPA*TSRVAGITGLNHHPHPLGS 
IF 


6272 


20173 


A 


6326 


265 


13 


TASPIRI I ITSQDVLHS*AVPSLGLKTD 
AIPVRLN\QTTFSDTRPGV*YGQCSEIC 
GANHS FMP I VLELI PLKIFE IGPVFTL 


6273 


20174 


A 


6327 


32 


259 


^»WCt»T PTOf /tt 7>tt a 7 V 77>I_T ET» IT 1 ->t T /VT •* C T»T/^\ f»7 

Ci iJbnl>L.loJjCvr«iWi £nrr *vil w c>wyw 
PKYLSADEWINKM*YIWTVEYYLTIKIS 
EVL/SATTWLGLEWIMLSKRS 


6274 


20175 


A 


6328 


1 


228 


PGLNKCSHLS WDHRHALQCPANFG \ YFV 
ETRFTVLPGLVSNDPPALASQSAEITGV 
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X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
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SHCAQ P * VCFHYCLTFFS LQ 


6275 


20176 


A 


6329 


346 


1 


LPKKKKKKISRGLFPFQLPKKKKDNEKS 
LTFSSETMPGRRQWSKIFKVLRGKKQPQ 
QPRIISLAKLFLKSEDFL/ETQKLREFV 
ASRPALQAMLKEVLi*REGK*YRSLFKKG 
RALK 


6276 


20177 


A 


6330 


349 


77 


HTTPKKNKHN/RKKI KLDVLRYNKVDDN 
SKMSGLH*QCPSDECDAGVFMASHKQ/F 
DRHYCGKYCLTYCFNKPEDKAQHCGSLE 
PRSLRPAWAT 


6277 


20178 


A 


6331 


319 


46 


RPRRPETK*ALNK* KDI/PMCSWKORHN 
TVKMLILSKLINRFNAITIKISTE\FHR 
NKLILAIIWKCKGPRIAKTT* KGIKLEE 
LLYLISRPP 


6278 


20179 


A 


6332 


496 


•309 


DRATAF*RQSFTLSL/RAGMQWHDLSSIi 
QPPPPGFK*FSCPSLQSSWDYRCPSPRT 
G * F\ CI F \ VEMGFHHGGQAGLELIiTSGD 
LPASASQSAGITGVSHCTWPN ILNWK 


6279 


201SO 


A 


6333 


3 


173 


HHHSRL/ IFFCS VLGTSSHYVA*AGLEL 
LGSSNPPDLASQNPGITYGSHYAWPDI P 
VF 


6280 


20181 


A 


6334 


339 


153 


FSPPPPFFKGFFFPPPPPPF/SF*PLRG 
GGPPFFFGPPGVRPPPSPWSWCSPPPV 
VLASFFFS 


6281 


20182 


A 


6335 


32 


331 


QLLNHSVIVFLLSFDHSYFLRNLSLFLA 
FKL I GVKNS * Y I I LS FRYLLHP * HLLNF 
FFF / CKRQGLPVLTRLVSNSWAQG I CQP 
WPPKVRGLRAWSPGLSL 


6282 


20183 


A 


6336 


226 


177 


DGVSLLLPRL\GGAISAHCNHDLPGSGD 
SPVSAS * VTQLQYYFCICLGLFTFTTRL 
S A YLGTG KLTTAL PNT PGLLD S 


6283 


20184 


A 


6337 


302 


59 


KMLLNTIMCNRE Ih/RKRKS PRVHQTS S 
VS YFKKLPQPPQHSAASTLI SQQPSSS K 
QDPPAAK* L* FVEGRLI AFPSNNVF 


6284 


20185 


A 


6338 


197 


28 


AEKVRFSSAVTWHLGSHSL* * SKKNL/W 
LGMVDHNCNPSTLGGRGGRIA*AQAFET 
G 


6285 


20186 


A 


6339 


46 


317 


EVPSS ISSQISLFSGDFSS VPLEELSNL 
FFMFCPEHIIVTCERVTSNYS/YYKWKP 
EFSYP* IF/CFKMSLTMLPGLSLNSWAQ 
VIIPPQPPK 


6286 


,20187 


A 


6340 


4 


276 


QTTIiRGTTVQITKNLSSETMNASRCEKK 
NL*SAWGWGMWENNYQPILYSAKISFKN 
GVEIKTFTN/KQKIKN/FITSITLV*EI 
LKEVLQAEG 


6287 


20188 


A 


6341 


485 


2 


G VWE FLETE S RS VAQAG VQW CD LG S LQA 
PP\PGFTPFSRLSLLSSWDCGSPPPHPA 
NFFFVFFVETGFRHVSQDGLDLMVS IS* 
PREPPASASQSAGITGVSHRARPKHSKI 
LIAGSRRS IQMFPVKFFQKLAWENFQN 
TKDFLNPYSEILLHTSQIHSSC 


6288 


20189 


A 


6342 


2 


339 


EDLRVKLKKEGYSN I S YI WNHQDPMLN 
SNGSVTWALLQAS * YLCILQAS * ASSQ 

EN QTD WJ TLLNGS KDD FL I Y DR CG CL VY 
HL 


6289 


20190 


A 


6343 


181 


12 


RWGLPSGWDYRSMPLHPVNF+ 11/ CMRQ 
DFTVLPRLILNSQPPVILLRWPPKVLGL 
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ed end 
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ding to 
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residue of 

peptide 

sequence 


Amino acid sequence (A-AIanine 
C=Cysteine, D=Aspartic Acid, 
E^Glutamic Acid, F=PhenylaIanine, 
G=GIycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possibIe 
nucleotide insertion 














QA 


6290 


20191 


A 


6344 


3 


295 


YWHLSCF*FLAITNKAAMNIRLLWTCAS 
I FPR*RSRKE MAGS *GS /SLFNFVRNCL 
TIFSHDCTTL/HS/HQQWGGV/PVLPFP 
HQHLL*SIFLVFDILIGI 


6291 


20192 


A 


6345 


334 


8 


KPSNPGGEGP P * KKGPKVFS PWGKKENF 
SPKKKKXI* CPQKPKGPPWGAPAFKKKK 
NF*NPRGSPQL*KKRGIIGPPTLIPPSS 
QRPPPKKKKKK/GPPKGALLAFGGIKFF 
FFFLEKNFLFFPKGKTLLAPFFKGGLPL 
PG 


6292 


20193 


A 


6346 


68 


256 


WVA\ P VTHTCNPNTLGGRRGR IT* AQE F 
KTS LGN I VG PHL Y KNLKKKKKKKGE KKP 
GGGFSPL 


6293 


20194 


A 


6347 


385 


60 


KKKKKNRTPYDAAI /PSLGTSSKEFKSG 
SSRDICTPMFIA/AITKRWK*LKCPSMV 
NI / VKKM WFI HMMQ Y * SAKKKE I LS *MH 
XjKDVMLSERMLSQSQKKNFCMILLIWGI 


6294 


20195 


A 


6348 


234 


61 


CHR I EH I KMVNL / WLDVWHTCNPSTLG 
GQDTW IA* AQE FKMSLSNTAKPCLYKKY 
KN 


6295 


20196 


A 


6349 


2 


358 


FLS LG I H FNY C L VT TESKLS KLRLTY F F 
FFFLETEFPFCSPA*RPW\PNLG*LKFP 
PPGSRETPALTLPRTWNNRRAPPHPANF 
GFLRKNGV* FFGPGGFKTLDLGGPTRLW 
FQKGGNN 


6296 


20197 


A 


6350 


346 


27 


KMV* SLWKAVWQFLIKLNIH* PYDSAVP 
LIG/IYPT*MTT* IYTKP*TQMIiAAVSF 
IMAKNTRNESNIFNWPIDKLWYIHIMEY 
**AFKRN*LLINTKT*MNLKC*LF 


6297 


20 198 


A 


6351 


36 


158 


XNSVPSQE*L YYL \RQENRLNSGGIGCR 
EPIWCHYTPAWAT 


6298 


20199 


A 


6352 


2 


146 


RSSHLSLPSSWD YRRAPP/ H/ LANCC I F 
CRDR/SFTMLPSLVLNS*IQTILL 


6299 


20200 


A 


6353 


74 


303 


EPSICGQIYL£FSIYKIQK*/WPGAVAQ 
ACKTSTLEG*GGWIT*AQFKTSLGNMMK 
PCVFSKKKKKKKPHPGGGGRKK 


6300 


20201 


■ A 


6354 


3 


210 


NNKKIWTLKNLKFFRDRVLLCFPDWSTA 
VHSRLTAAS\TLGLKQTSCLSLVSS*DH 
RCMPPCLALKRKKT 


6301 


20202 


A 


6355 


2 


627 


GRVGGFIiVPMRDLAEASRGQQGQVKLGE 
MKPSVSCVCGGV*CISKFKLKN*L\WPG 
TVAHTCS?STLGGQGGWIA*AQEFETSL 
DNMVK\PCLY 


6302 


20203 


A 


6356 


584 


166 


RQGHNVAQAGVQWRDLGSLQPPPPGFK* 
FFHLSLPSSWDYTRVPPHLANF\ * FLVE 
TGFHHVGQAGLELP TLSDLPAWASQSAG 
I TGVT/HQARLQLL * KELINPSTLGPTW 
ELLEAWGRCPSVVMGDDLALNCFHHFSV 
S 


6303 


20204 


A . 


6357 


448 


202 


FTTLVSECI*DIHDSWKDV* ISTI/TGAW 
KTWLI PTLVNGFEG \ FKT* VEEVTADW 
EIARESELEVEPEDVTKLLKSHDKS 


6304 


20205 


A 


635S 


3 


237 


DRSSL* P* PPGLK* SYYLSLPSGWDHKR 
A*PHPANFCLFTF/V*QRGLDLLA*HDP 
PASASQGN* ITGMSHRAGPPLTF 


6305 


20206 


A 


6359 


49 


242 


SSYLSL*P*PPGLR*SSYLSLPSSWDHK 
HA* PHPANFCLFTF/V*QRGLDLLASHD 
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sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=Glycine, U=Histidme, I=Iso leucine, 
K=Lysine, LF=Leucine, M=Methionine, 
N=Asparagine t P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknovvn, *=Stop codon, /=possible 
nucleotide deletion, ^possible 
nucleotide insertion 














PPASASQSTS ITGMSHRVWPPLTF 


6306 


20207 


A 


6360 


508 


223 


FVTQAGAR+LDLSSLQLPPPGFKQFSCL 
SLPSGWYYRHVP/STPIVF/CIFSRDGF 
HHVGQAGLELIjTT * STCLGLRSAGITGL 
SHRAQQERGFEP 


6307 


20208 


A 


6361 


2 


595 


AIWG+GTHFLIPWVQKPIIFDHRSQPH 
NVPVITGSKDLQNVNI I PCILFGPVTSQ 
LPR I FTRIGEDYDERVLPS I TTEILKS V 
VARFDAGEL I TQRELVSRQVSDDLMERA 
ATFGL I LDDVS LTHLTFRKE FTEAVBAK 
QVAQQDAERA/RNSLATAGDGLMELCKL 
EAAEDITYQLSRSWNITNLPAGQSVLLQ 
LPQ 


6308 


20209 


A 


6362 


1 


259 


LES FYFLLYYYK* SS * PLRLKRLILFI F 
ETECCPVA\QLECSGTISAHCHLCLQGS 
GNSPAPAYRVVGI TDSFRI THCILTCI P 
SE 


6309 


20210 


A 


6363 


60 


221 


NTT KLGMVAHT CNPSI LGARGGR IT * V * 
E F KTS LDNT VK\ P CTP P KKS LKI KLN 


6310 


20211 


A 


6364 


I 


393 


RPRRPQTGVQWQNLGSLQPPHPGFKRFS 
CLDLPSSGDYRHTPPCPANFSGFS/HHV 
GLKLFDSSDLPWPPKFVGIARHESHRKC 
T*RSRRGRRIADPLER 


6311 


20232 


A 


6365 


19 


85 


CINSKIMQSSR I TTLCKMVQPL *RALWQ 
CLI KLNI PHP YDS AI PLLHFYQCK* RHM 
NPHN \ TFTSMF I TALFI TAKN * KQS KCL 
SIGQWMKK**CIHWQ*NSGI**HE*IN 
SKIMQSSRITTLCKMVQPL 


6312 


20213 


A 


6366 


2 


298 


PPRLAKVQKFLLFFYLVF* DRVTLCHPG 
WTAW * SQLTVTLAHRNLRLPSGWD YRH 
VPQRLADFVFL/CR/DRGFSMLVGLVSN 
SRPQEV*ATSPSQSLLS 


6313 


20214 


A 


6367 


67 


636 


QDNKKGI R YLFVYFRQS FALVAQAGVQW 
HDLGSLQPLSP*FKQFSCLSLPSS*DYR 
HAPPRPANFL/ QFLVETEFHHVGQAGLE 
LLTS GD * PAS TSQGAG ITG VSHCTGPRI 
RFLKTFSHLEAPGGFPEKRQPLRSERLQ 
FRAFPQTAPCAHAYLLGCVSHLEMGCRN 
IDIPELSFFLCPLQCLIYRGKSP 


6314 


20215 


A 


6368 - 


3 


251 


RSRH CTPAWAPLSTQDHFYVF YFFLHFL 
IGSLFIFACFVSSYFIEFSYCL\YFFVD 
S FG F YR YTMK * NLNSNYF KL S AS I LPQ 


63)5 


20216 


A 


6369 


333 


393 


FKKLVLriAKAKYCKANFFFQGVPVASKN 
/ TGDLWEEPGWSQTPDLR* SALLGLPKS 
WDYRHEPPPRLANFCI/FLVETGFQHIG 
QAGLALLT/S/GDPPASASGALE*LGGG 
SCL 


6316 


20217 


A 


6370 


36 


369 


R PTRPEMRFHP VGQAG/ PRS PGLK* SAC 
LSLSKCWDYRHEPP/ V *PG*FSFFTYIYI 
YFFFLFFSFSFLEEMGFHKWKAGLELL 
C*NDPPASASYSDRI IGVSHQARPFFFF 
FF 


6317 


20218 


A 


6371 


2 


375 


DIQMIDRNIKN/CSTSQSIRKLQIKRIM 
* YHLTS VRMAI TKKTTHKC * RCGKK\LH 
C * WQCKLVQPL * KTIQRFVQKTENNINI 
+ PSNSKN/WKSAC*RDICTANCITT*FI 
RAEHGGS \ PCNP S TLG 


6318 


20219 


A 


6372 


75 


372 


KTGNSQKKKXKKKKKGGPPLKKPSGGPNL 
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Predict- 
ed end 
nucle- 
otide 
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ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, Wsoleucine, 
KHLysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
1-1 nreomne, V=Vahne, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














PPGGAKKSFPLKGGQKKPPGGFLEKNPP 
LGGAPL\GPPPPPKKRPRGKTKP*GG*R 

r~* T/'XTn 7\ V "DT»T T PWTT VO 

Vj ivW FAiS. lr W J_»vjtJ\ K. 1 JjJvP 


6319 


20220 


A 


6373 


605 


350 


I STG I CLGMF I AALFTT AKL / WKAT * PS 
FNR* KKKMCY I YASYSAI KK\DA I LSFA 

T TMiTIV/l'KTT CXTTIUIT P P T P rtT1 1 T/TWOr«MI T fT T 

1 1 WrWLbNIRLSEISQTHKDKCCMISLI 
CAI 


6320 


20221 


A 


6374 


167 


378 


KKSQGGTNIKPPAITKYLSQMGG* KKLL 
TGFLKXKSLLGGGAGVTILLT \WRTFST 
KINDPDGGGDH I LSQ 


6321 


20222 


A 


6375 


994 


747 


VSLALSPRLECNGVI S AHCNLR FPGSNN 
SPASAS * VAGAHHHTQLTFVFLVETGFH 
HVGQAGLELL/NLVI RPPWPPKVLGLQA 


6322 


20223 


A 


6376 


1956 


1592 


HE FEMES YSVTQAGVQ*HDLGLLQPLPP 
RFKQLSCLSLP / RYWDYCLAPPCLANF\ 
* YLVKMG FRHVGQ AG LKLLAS S DAP AS A 
FQSAG ITGMSHRARP KSAFKIQAI IGEG 
GVSELHRKINI 


6323 


20224 


A 


6377 


94 


402 


KTQELQNKISHCLIRRVDRKM/WLCTVA 
HTYYPSNLGGQDGRIA*3LEFETSLSNI 
TRPRLYRKKKKKKNVFQAWPLWSNFLRR 
LRREIRAEKSF 


6324 


20225 


A 


6378 


463 


146 


GKLGPPKGSLKPPPPFFFFL*R/P/HPP 
PRGETPFFFKKPKKSPRGGGASPPFPPL 
GR VRP EN F PP P RGQG S P * PKRG P T PALP 
PGGQKETLFPKKKKRKKKKKYMRLI 


6325 


20226 


A 


6379 


3 


207 


LEYQVAQTMCI ISSRIQLVAEKASVPFV 
DKTVLS \ TMGQL + EAKAEDGFFVPG* YG 
LAVS PLKFQSTRP 


6326 


20227 


A 


6380 


328 


70 


RPRRRLGLPRS WD YRHEPLHPDQLLVE I 
M S HYVAQAGLKLL \ I TGVSHHAR PH / YR 
i* *LGHL*SLVRRPWTISYSFLCLGFPMW 
EVSD 


6327 


20228 


A 


6381 


13 


77 


VI FLLQPPK* SSCFS LPNRQDYRRVPP Y 
PANF*FS/CRHSLTMLPRLLF+TGLQAC 
ATI PS 


6328 


20229 


A 


6382 


342 


77 


KGQGFYPHLFFFFFFF * KDS PSLPRVEC 
CGE I SAPCNL * L FG S NDFP /AS HA I VGG 
JsJjPKyKKGAr ro*GGGr r r r r *M*KHSV 
LNSSRN 


6329 


20230 


A 


6383 


3 


208 


RGRVDKIFYLFFETDSQ3VT*AGV*WLN 
PvjSIjUPljPPLjr K* F S * PQVI \TCLDLPK 
CWYYRREPLCPVY 


6330 


20231 


A 


6384 


277 


50 


LVLKLGRCKTSS F I * N FKGPQI AKT I LK 
XNKAGSLT \ LPNFKTNYKAT VI KTV* YW 
HKH*YRHGFISGLSILFH 


6331 


20232 


A 


6385 


2 


354 


AAASTNFRDR/IL/LCHPGWSAVAQS *L 
TAVSTLN*AQGIG\SSHLSLLNSLDYRY 
APPLLAVLLCCPGWPQNSWPQVILLSQP 
PKALGYKLe-FFCPAPJjLLiNFFFFFKNRL 
WFCCPGWSA 


6332 


20233 


A 


6386 


2 


188 


FFKYICY/CINMYMCFYIFINIYSLIYL 
* IHIYKYIFAYIFIKTYu* IYICLYIYK 
YI FIYYR* * IYICLY I YKYIFI YYR 


6333 


20234 


A 


6387 


2 


241 


WATKCR*G* RRTGTLTGG* WDCKASGQL 
*KTFQQFLISITY/PYGPAISLLEIDPS 
ELKTYVHTKTRT * LF I AASFI I TKN 


6334 


20235 


A 


6388 


142 


429 


YC VC I YI Y I FL YL F I F FCTFFT Y S S LHL 
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Predict- 
ed end 
nucle- 
otide 
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correspon 
ding to 
last amino 
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residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=GIycine, H=Histidine, I=lsoleucine, 
K-Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Gtutamine, R=Arginine, S=Serine, 
T=Threonine, V=Vahne, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














L I F S F FF FLERG VC FS P RGGGGGKYLG L 
WRFFPPE*IKKNVYIYA\QQYYSAIKQN 
E I LS FAATWMELEAI MLSE I RQAQNDKH 
YMISLICGIKR 


6335 


20236 


A 


6389 


468 


250 


VHRRTPPCPIIFFFF/CKQ*VLPMLPKIj 
VSNS WPQATLLPWP SLTVEFQALTPGVJK 
NFKKMFLMVMSWSIKKR 


6336 


20237 


A 


6390 


616 


1084 


LCLCQYAGISVQCPDCVFQFFHNVDSTY 
EDYLRKLKTSLS FFWRRSFALVAQAGVQ 
WCDLGSLQALPPGFK*FSCLGLLSSWDY 
RHPPWHPANFVLLVEMGFIHVGQAGLEL 
LTSGDPPASASQSAWITGMSHRTQQE\Y 
FLNLKFSTSFGELLCLK 


6337 


20238 


A 


6391. 


415 


200 


SPPPRFFFFCPPKKNFFFPPPRSKNFFF 
*RGPPFFFFFKKTPFF/YFFFKPPFPPR 
/ RFKNPPXI FFFFNRGPKNPGGGGGKKK 
PPGPGGFSPLWGPPPPFKKPPPPPKKNP 
PLFGKKKKKKKKKKRAAPR*NPRARGVF 
FFPPPPLGFWAPG 


6338 


20239 


A 


6392 


3 


117 


KLKPSSH \DY * VGETYGKDI 7SRGKDKP 
IAVCKTFSPLS 


6339 


20240 


A 


6393 


3 


204 


DRVSLCRPGWSAVA*S*LTATSNS* \ VK 
QSYKLS I ?*GS WDYS CVP\ PPLG* LVIL 
LRQP PKVLGLQA 


6340 


20243 


A 


6394 


501 


187 


AGVKWYEHGLWQPPPPGLKRSSHLSLPS 
S*DKRHEYPCPANF* KIFF\VETRSHYV 
AQTSLEFLDSSNPPTSASQNAGI \ *GMS 
HCAQPMQT FSLVKIGTNFL I F 


6341 


20242 


A 


6395 


3 


393 


HACGAIMNSKPHAVTQYHTPLLM* SVLI 
TT\ VLLLL FVT I LNDG I TI LLTDRNRNT 
TLFYPSGGGDP I LYQHLF* FSGHAGG* I 
L/LLPGFGTISHIVTYYSGKKEPFGYIG 
MV*AMISIGIiLGIIV*AHHI 


6342 


20243 


A 


6396 


101 


135 


FDS P T PG I VPGT WQT I RSLNVYLTRLTT 
SFSLMPS * FLFI FFCRDG/SLTILPRLV 
SNSWPQAILSPQPHKMLGLQA 


6343 


20244 


A 


6397 


395 


187 


SSYLSLLGIWSYRHVPPHPANFLFFCRD 
GWYFIFFC\KMGAMLPRLVLNSWAQVI 
IjLL * P S KVLGLQA 


6344 


20245 


A 


6398 


35 


334 


QNETLSKTKKIKNKKKKKKKKRGGPLKK 
PLKGPKLTRACKKKPFSLKG / WRKKNNL 
GIFEKKPYFGGGKKNNKPPTKIKSFKEK 
KKF*REKGEKQPQNPGC 


6345 


20246 


A 


6399 


464 


146 


HIQIREFQRTPTRMSPKTSTLSHFKIKK 
QLVMNKRSFIKQWIS*QKPCQMARRD*H 
NI FTVQKGTNCQ I R \ I LCPAKLS FKNER 
EIKPFLGKYKLREFITA*FIFKG 


6346 


20247 


A 


6400 


723 


632 


FFN*DKI LLCHPGVJSAVARSQFTAASTF 
RVQ\QSACLGLPSSWDHRHIPSCLATF* 
* KRG FAP LPRL LSN FWAQA I L P P W P PDV 
\LGLQA 


6347 


20248 


A 


6401 


522 


46 


FFPPFLS FFFFSKI PKQSLGLVFYF*NK 
VWVFCPGWRAIR* FPFPA/APPLGSKNS 
/ bCLNLiPSl WCjYRI SGMvPPKAH* I PPE 
IFFFFFF/CRQSLALSPRL/WQWQVQA 
I LLPQPPE * LRLQVPGRW 


6348 


20249 


A 


6402 


103 


261 


NIPFTCLIKKKKKKKKKK1CKKKKKKKKK 
KKK/IK* KKKKNVWGGWKKKLGGD 
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nucleotide 
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ing to first 
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residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
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peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E^Glutamic Acid, F=Phenylalanine, 
G=Glycine, H^Histidine, I=Isoleucine, 
K-Lysine, L=Leucine, MNMethionine, 
N=Asparagine, P= Proline, 
Q^GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possibfe 
nucleotide deletion, \=possible 
nucleotide insertion 


6349 


20250 


A 


6403 


146 


384 


GFLPSLSGHKELSLFFLFETESHPVAQA 
\DLGSLKPLPPGFK*FSC\SLPRSWDYR 
RVPPCSVNFCVS PY* PG* SQTPDLR 


6350 


20251 


A 


6404 


3 


395 


ETESHSVAQAGVQWRNPGPLQPPPPKPK 
RFPYLSLPSSWDHRHMPPCPANSAFSVE 
TGL/LHAGQAGLQLLTPGDP PT * TSQSA 
RTTGASHRARPSLKYFLI SFVI SSLLHG 
LFRSVLGSMYLRISQVFFCC 


6351 


20252 


A 


6405 


30 


205 


CRLTRPAEFRTRKTHT\REKPYECKQCG 
KFFSCKSNLIVHEKTHKVETMGIQ*VMW 
LFL 


6352 


20253 


A 


6406 


3 


444 


ARKITGY*ISKTIVFLYT/DELAEKEIK 
KAIPFTIVPNK*DN*ELNLTKQVKDPHK 
EDYKILMKETEQDTK/WKDIPSSWIRII 
N I VKVAVLS KAI YRFS AI C I KI RVTFKT 
ECHWKKKIWIGKVAHACNPTTLENQGRR 
SLEARSLRLS 


6353 


20254 


A 


6407 


480 


307 


QHDSPSAKKPPQKPDKS/HPSSRS/HPF 
PPPPPC*TPPPEF*YPPPKGEWGARPPP 
QG 


6354 


20255 


A. 


6408 


923 


833 


KAMKHFLIKTRAFFRHVTAHL I DQS I VK 
TELLYVTS F I AI FALLWWYGKE PML / WP 
GAAAHTCNP STLGD * GGRTA* AQEFKTS 
LGS * NKTKP * CLS AV 


6355 


20256 . 


A 


6409 


382 


152 


ECSGTLLAHCNLCLLCSSDFP/SLFHNV 
SLSSNSSS*VAEIIGLCKHVWLIFVSAK 
LFLGRRVRNLFSPTPKNMFSK 


6356 


20257 


A 


6410 


553 


173 


CLLI IKNTLPAKKKKKKKKKKKKKKKKK 
AR/WPP * K* RAP * NPVKTPGGS F 


6357 


20258 


A 


6411 


38 


562 


FLRQGL/DSAAQAGVQWCNHSSQRAPPF 
GLGRS SH\PASLAGWSAVA* SRLAATST 
FRVQV I LLSQPPE * LGLQS CS I TQVGVQ 
QWRDLGSLQPPLPGFKRLSCLSLPSN*D 
YRRLPPRPDSS I TFSRKFFQS PQARNL P 
PTPELKPQQPVPAPSPSELHLSWSSGNK 
CHFLWFLCP 


6358 


20259 


A 


6412 


414 


144 


FFFETESHSAAKLEC\ISAHCWLHFPGS 
SKS PASAS * VAGTTSVCYKAQL I FVFLA 
DTGFYHVGQDGPDLMFCPPRPPKVLGLP 
RLPLICF 


6359 


20260 


A 


6413 


3 


455 


AEAGGLLEPRSSRLQ/CSQP*SHPCTPA 
WAT 


6360 


20261 


A 


6414 


462 


162 


TQGPRLFPHPFGAPGGPVPLGWEVRPPP 
APPGKPPFPPKPQ\NSPPWGGQPSIPPP 
LEG*ARKSP*PPKGLVPVTPKGPPPPHP 
GQENPTPFPKKKKKKVN 


6361 


20262 


A 


6415 


499 


0 


HFTSKHHFGFEAA\A*YWEFVDW*LFL 
YVSIY**GSH 


6362 


20263 


A 


6416 


187 


236 


MCHINGIMQFFFFFFFENEVFFCPPAR I 
GN*R/WPNLG*LKPPFPGLTLFSCLTPP 
RSWNKRPPPPARVI FLFFNKKG VFPLLP 
GRV * I PHT PE PLATTNLL S VT I DLLI LN 
VSYKV7DHAVFFFFFF 


6363 


20264 


A 


6417 


158 


412 


SSDPSRRSCLFSF/ CFFFLEREFFFAPR 
VEWRGPNFCLLTPPPPGKKHFFAPSLRG 
GGDYGHLPHGPVI FGTLRKAGFS PCGPG 
GS*PPPPGKKHFFAPSLRGGGDYGHLPH 
GPVI FGTLRKAGFS PCGPGGS 
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SEQJ0 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E>=GJutamic Acid, F=Phenyh»)anine, 
G=GIycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W^Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, ^possible 
nucleotide insertion 


6364 


20265 


A 


6418 


457 


11 


HTGQSPYRCSECQK/AFTQKSAITNHQK 
PQQKGRKSMQ*LRI *ESFFD*SHQLRHN 
RSSMNTMIVMGPFGTLVGEKPYRCNEYF 
KPSFLKISLVYRRDLTRRI IGQAHWPMP 
VISALWEAEAGRSLEVGSSRPAWPT 


6365 


20266 


A 


6419 


103 


376 


S ALS L I LG I T FFF * I FLETGS H F I AQAG 
VQW/ PIHTCDHS I LQPQTPGLKPSSCLS 
LLS S WD YRHVP P VPRTS ACFECT I LRSG 
SVAKTVLWKAKAGGSLEVRSLKLACATQ 
GAPCLTKKKCDPQD 


6366 


20267 


A 


6420 


463 


6 


FPPNPHKKQNQQTP \ PPKKAPQQQKYTA 
DP/QDPPGNDPKAQAYKKS PAATYPAAS 
SSPSSPSS/RPPSTAAHVLRVPL*GSPP 
GLVGNPRPPPPGGAQHPQPPDTWQKRPA 
HHGLQSLSASPPG IS* EPQLAPRNSPHW 
TRDL 


6367 


20268 


A 


6421 


1 


479 


RGFAVLFSAVE FPSSS KSHLDL>ETSPKA 
YKHGAPCS APGS LQAS LPKGRNQRNPFS 
NFFVLF*DKISLCCQGWSAVIQS*LTVA 
STSPGSDDPPTTPPPPHAPDTWDHRHVP 
PHPAKFYI FYRDR/SFTMLPRLVSNSWA 
QTOCLSSLFMLLPSLIiWSLSP 


6368 


20269 


A 


6422 


11 


242 


FEIIFSAVEFGWSDHGSLQTQPPG\SSD 
PPTLDS*VAGTTGLHHHAQLMF*LFVEI 
GSPGLKWSSVSASESEPLYPA 


6369 


20270 


A 


6423 


488 


335 


SCQLRPWNES IGQRSGIRWLTPVI PALR 
EA/EEAGG*LEPRSSRPAWAI 


6370 


20271 


A 


6424 


454 


229 


RDPPASASQSAGITGVSHHARL/VSLFL 
IVM*AEGCYLSGIWRTRLKMNPPQ*RKE 
RQIQVKSTRSDIFFFYFLFL 


6371 


20272 


A 


6425 


1575 


1309 


FESESHSVT\RTECSGTIS\AHCSRHRL 
GSSDSPASASRVAGITGVHHHAQLIFLF 
L»VELEFHHVGQDGLELLT\*VIHPPWPP 
KVLGLQA 


6372 


20273 


A 


6426 


276 


437 


CFHPPEPWDQQWGPPPLGNF* IFWKKRG 
PPFVPQI ILNPWPQKNFFAWPPKTLG 


6373 


20274 


A 


6427 


445 


240 


R PRRADHL / RSGV * DQ PGQHG ETLRLRK 
IQKISWVWWHAGTLWPFWERDGSHRLSA 
RTQMLGSWACSPGF 


6374 


20275 


A 


6428 


114 


450 


LLLLLLFESESCSVAQA/PSAVL/WILL 
GSL*PLLPGFKQFLSLSLLSIWEYRGLP 
PYPANFFKFLVEMGFHHVCQAGLELLTS 
GDLIiASASQHSGI TDVSHRAQPVTI FLK 
GLS 


6375 


20276 


A 


6429 


380 


224 


LRHCLFTQAGIQRQEHGSL* P *SPGLKQ 
SSfiLSLPSSWNY\GVKLPCPNKFQH 


6376 


20277 


A 


6430 


509 


228 


PHNKGRVS PP PPQNFFS PPSRNLGGG WL 
PKXPPPKKRVFSKKPPRGKKK/PPP*KK 
KKNFSPPPFFGPPPKI IKKPPPPFFFFF 
FFFFFFFFFFFF, 


6377 


20278 


A 


6431 


3 


117 


HS WES I + KRGACFN I FIAAQFTIAD I WN 
* FMCLSGDE * IKKLGL I YT/MDCYWGVK 
KKETMPLATTWIELKVN I LSEIGQA* KD 
KYHVF*MNR 


6378 


20279 


A 


6432 


417 


271 


LSFCKTKFRNSPQPPLPCSFL*NDRK\WP 
RWVHTCNPSTLGGRGRWIT 


6379 


20280 


A 


6433 


385 


145 


TPGIKQSSCLHLPKCWDFSCEPPHLANL 
NF/CKR*YIPIVPNSKV*TFSQEVFSYT 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
(^Cysteine, D=Aspartic Acid, 
I>Glutamic Acid, F=Phenylalanine,* 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, k=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














WPHSSSFTGKEVI ILFPKECPGMFK 


6380 


20281 


A 


6434 


65 


383 


LFPNLLKKKKKKKKKKKKKKKKGGPIKK . 
KP *GDKI KTRQKKKNFFPKRGGKKNQPG 
KFKKKNNFWGGKKMEKPP \ KKKKTCKEK 
KKI * KKKGEKKPQI APEKKKGS 


6381 


20282 


A 


6435 


3 


454 


siqrtpvryytk* tlprhi vtrlskstl 
k/ekkkkkkflraptekgqippkkip/i 
kltvnf*kks*rrekwg\pifnilkkkk 
f* pki fypakltfi regkkkffpkkqgl 
gkfvttraplkki ltgvlnlkmkkgylf 
pqntlkylahnhp 


6382 


20283 


A 


6436 


424 


410 


NFFFPPQGGGGGGFFF*DKPPPGNPSSL 
FFFFLGGGKKRGPPPPPKG/QTPPPGGL 
F*KWLGGKKETPKKFSPPPPFFGGGGKI 
FF*KPPPPPGFFF*KSPPGGGGPPSVKK 
KKEKKHKKKKKKKKK* R 


6383 


20284 


A 


.6437 


3 


214 


E I DCCAAVWKQPKCPS ADE * RKKMWYLY 
TMEY C S ATNKN E I QS F / VNNWKE VE 1 1 M 
LS\KKSQAEKEKHGIF 


6384 


20285 


A 


6438 


83 


420 


QSFP * I S CRKYS / WLGAAAHACDANTLG 
GRGGRAAWGQKF KTS LGNLE PGP KTADP 
RTSHGCELRRSSYTYALTHGFSSTPETA 
RPAPLLPPPPPPTQREDYKEEDLYDDLL 
NK 


6385 


20286 


A 


6439 


163 


289 


KM YCY Y YY/ C * DR VLLMLPRLVSNLWAQ 
EILPPWPPKVLRLQA 


6386 


20287 


A 


6440 


459 


431 


DKRIRTRITPLAIRKRQIKTTV/RKQLQ 
SHYMVWKETTNNK/ CGIFIHSWWECKM 
VQLLWKTVSQVFKRLDLPYNIiATLLI FI 
CQLQIKTLSSPKYLHMKVHSIIIKIRKK 
Y I KKNLKCPSTDEW 1 1 KI * C* S IHTMEN 
ERA*K 


6387 


20288 


A 


6441 


478 


36 


VFGCRVILVEKLFLCE PLRVT FLRLGKQ 
LEKSL/WNFHDLELNISLHKNQPNRCVH 
QWRIG*RKRS*CSE* *R/TMQFINSNLT 
LIN* I PSCHTAMDI I I IAQH/ ITHTVEY 
YSAI KKNE IMCFPATWVELEAI ILGEIT 
QKQSQIWRSPP 


6388 


20289 


A 


6442 


464 


206 


LVQRSGIHLPYSSTVPL*GL/HQKNEAD 
VYRNTRMFTAALFLIVQNWK*HNYLQTG 
EKINKLPYIHTMEYFSVIKSNQHSIT 


6389 


20290 


A 


6443 


414 


216 


CYIYTCPLTQQSRSWVAT*ASSKEEWIN 
KLWHIHTMD*Y/SSNKKNKLQINTTWTN 
LKSIMLTEISLT 


6390 


20291 


A 


6444 


7 


411 


S FYCLHS FAFSRVS YCWNIQYIAFLDWL 
IiSRSNSLHVFSW\WD/CKMV*PL/YETV 
WQFLTKLNILLLHDSALTIiLGI Y* K/DM 
KSYVHTKMCTQIFIAALFI IAKTWKNL* 
CS S VGE WRNKLW YLQTMSYSS S VLKK\N 
ELSSHEKTORELLRERSQSKKAIYCIFQ 
QYDTLEKAKLWRQ 


6391 


20292 


A 


6445 


257 


419 


KSLPFFF / CW * ETGSHS VTQVRVQWCNH 
IALQPQTPGLKQPSCLSLQGSWDYKFHF 
TQACPY* RP 


6392 


20293 


A 


6446 


20 


417 


HQYCFASFFFFNNFFKTESHSVAEAGAQ 
WHDLGSLKPLPLRFK* FSCLSLLSSWDY 
RCMPPHLDNF\ * FLVETGFHHVGQAGLE 
LLTS DD P PALAS * SAG I TGVSHRARP VE 



939 



WO 01/64835 



PCT/US01/04927 



SEQ ID 
NO: of 
nucleotide 
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nucleotide 
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amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine ; 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, KHHistidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, JR=Arginine, S=Serine. 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosme, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LFLLRRQGLRLLQN I G IGRQ 


6393 


20294 


A 


6447 


371 


3 


VIGCQFL/RKTKRYLPYDLPIVLLCIYP 
R/EKKNLCRH*TCTSMFMTALFIITQH* 
KYPKCHS IGEWLNILWHIHTMEYYAEMK 
RNKLLIHEKTYMDLKDILFSGGQRVYLK 
SSILS IKCIRRQE 


6394 


20295 


A 


6448 


46 


332 


LGEHLRQGPI ILKVCQVILTHQQYFESQ 
WS I MALS NRK YNANCR YNLKCPNS Y I KK 
EML/WPGAAAHACNLSTLGGQGRHIA*A 
QEFKTIAKPCLY 


6395 


20296 


A 


6449 


2 


381 


LCSLVFTH I CS I FliF FFF FFKDRALLP / 
H/PCWRAGP*SYLTAASNSG\VKQSSNL 
SRGKPLCRANFGSF\ CFWFWKKGGPTMW 
PRLV*N\ I L.LPQPPKVRGL* AAAHAGGQ 
GHDHTSLRPQTPGLKQSSNLSRGKPLCR 
ANFGSFWFLVLEKRGSHHVAQASLKQSS 
CLSLPKLGDYRHEPSHRANMFNFRSLPG 
LRKADCLYPY 


6396 


20297 


A 


6450 


2 "1 


136 


KSVGCWRGCG* I GAL/ LHCWWECEMVQL 
LQNAWQFIiKNCKIVPHY*A/IPHLGIH 
PNALKTATQRN I CAPM FI AAL CATAKRQ 
CK/PKCPS IDKT* YTHTMKYDSA* KKCW 
LLARVW I NWSFVHC WWECEMVQLLQNAV 
WQFLKNCKIVPHY 


6397 


20298 


A 


6451 


314 


72 


MLPLTNGS CRTKVSKDIEGLSNS INPLP 
LIGIYRTPFPKTAGTPFFFFFLV/PNGT 
FTKIDHI LGHKTRLNI LKM I * I LKYI P 


6398 


20299 


A 


6452 


54 


350 


LRSFCSALFCFETVLLCHLGWS AVA* PR 
LPAGSSNPPPS/CLPIPSSWDHRLVPPH 
QVFLEAFFFFFFF/CRKGVFFFLPGGV* 
NSGVKGTPRPNPPKALN 


6399 


20300 


A 


6453 


368 


172 


WLTSVI \QRFGRPR£EGHLRPGA*DQPG 
LR* STHLRLPKYWDYRRESQCPARP V I L 
KSANMFHLG 


6400 


20301 


A 


6454 


1 


365 


NILKLNNKLLNNL\VKEEVAKEIKK*LV 
LNKNENTTYKDLWNEAKTVLHL*H*VKF 
IALSA* RIVSNQCSK*ASTLRNHKKKSN 
LNPRLI *QQKSITLKREKSMKPKAGSLI 
RSIKLIIRLT 


6401 


20302 


A 


6455 


364 


1 


P2TRSCSVWPRLGGVQRQDKGSLPPLSP 
RLKQFSHLSLPSS +NHRGMPPCPANLLF 
KFYLFIYL/FCLFVEMGSHHVSQAGLKF 
LSSRDPSALVSQS VATTGMSRRAWQDI I 
LQQKQAATAA 


6402 


20303 


A 


6456 


91 


300 


GHARHFGRPRWGDCLGPEVRDQPGQHGE 
TLTLQK\ KNKN* PAWWHTPAVPATQEAG 
ITGVSHRARPCPPN 


6403 


20304 


A 


6457 


2 


195 


RFSCLSLPSSWDYRHEPPRLAIFFFF/C 
FWNFKKRGGFPLWARWSNS * PRGIGLL 
WPPKGWGLRA 


6404 


20305 


A 


6458 


522 


202 


THNWRPSLQI FTAAL FLLAKKW KQP KC P 
STDEMIHKMWYVHKMKY*SVIKRNEVLI 
HS A\ KWMNFENMLNEEACWERPH I V * FH 

T Vt , MC'D\fr , Y , Pr!T? p PVCDT \T\T\T 

JjlCiiyioKvoivlvjil IiwKJjv V V ! 


6405 


20306 


A 


6459 


377 


161 


EMN I KC P S TD K* VKKRY / LY I HTM EHCS 
ALKGNEVLI IHAMTWMNLANIMLSEKSQ 
SQNTTYSMITWKCLE 


6406 


20307 


A 


6460 


92 


484 


LLQACFLRTVL ID YKMWAEKGMVGMCVL 
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ing to first 
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Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, JF=Pheny lala nine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
1 = 1 hreomne, V=VaIine, 
\V=Tryptophan, Y=Tyrosine, 
X-Unknown, *-Stop codon,/=possible 
nucleotide deletion, ^possible 
nucleotide insertion 














MRNMTVII I FRDRVLL* HPGWHAWRSQ 
LTSA/ VKRSS CLKPP KC* DYRCE I LCQV 
E IAFTHRSLFNS FLKRSMRVILLNYWKD 
P b iiG F CS YD STGMALLRNF S 


6407 


20308 


A 


6461 


483 


192 


MGLMLILRANFQSTKKKMICLLKKSLTF 
LNCIVLSNWPCTFNMYIFVCVFAVYKLH 
VWAE \ PVAHACNPSTVGGQGERTA* AQE 
FETSLGNI IRPCLY 


6408 


20309 


A 


6462 


451 


98 


KQVWQLL* KQVWQLLBSLNMKLLYDliAV 

lnlgaypreekthvhtktctqmftaapf 
i \ i kk* kknfs i dmkyeerdkmwt i hvt 
qhyliik*sid\yaptwmnlenimlsed 

SQTQKTTHCI ISFI 


6409 


203 10 


A 


6463 


395 


67 


flrfwcthhwss/dilypvcsllslnrs 
hsfprvhksplyhsyalcphslaptye* 
ahkmf /rlvnflflvetgfrhaaqas ps 

LPSASNPF FS ASQS AGTTDASHRAQTK 


6410 


203 ll 


A 


6464 


485 


220 


FFFETWSHSVTQARVQWCNLGSIA KPLP 
P\GSSDPPTSAS*VAGIAGMSHAPTHEI 
PS I QARQQKLQGE KK I ELLSLPTSAS QV 
LRLLA 


6411 


203 12 


A 


6465 


1 


371 


ISIPPKKYLRINLTTYVQGLYEQNYKTD 
KRNQGRSSKWRDGPCSRIGKLNIRMSVL 
SNLIYSFNVILtVIVPASYFVSIT/RLIIi 
KFT * KCSRPR I VPLFHLLLPLLVF FVCS 
FILCRPGASLVK 


6412 


20313 


A 


6466 


489 


175 


YKI I E I SRC I K/GLL* LWRTTHLFS H PH 
WRRGMAYEKNADFRLGTVAHAFNPS TLG 
G*GR/WNQGFKTSLANWKPCLY 


6413 


20314 


A 


6467 


2 


367 


FGTETLGQAG I KGDPGE KS H WGEG LHQL 
REALKILAERVLILETMIGLYGS*LTEL 
PAAAGSAGAPGQTGHPPGSHHLA\EPEL 
GSGAGPAGTGTPSLLRGKRGGHATNYR I 
VAPRSRDERG 


6414 


20315 


A 


6468 


472 


260 


TTLGGAGNSG YRHI QPKVANYLFI CRDR 
/SIiILLPRLILNS*L*AILLSRPPKVLV 
LQT 


6415 


20316 


A 


6469 


88 


435 


RGVLLHMLiSCLPLCKTCFCSSFTFKHH* 
TS FS FS FFFFLLGN KVS FYCQAEGQGPN 
LG*LKPLPPGLKQFSCFT/LPEKWE*QT 
EPPGPVKRHFGLSYFPLTRG 


6416 


20317 


A 


6470 


34 


106 


EFAI PFLTIYPI ELNTHVHTKI CI *MST 
AALSTR*KSPTCPSM\INKMWYIHFMEY 
YSAISKGRNKANVHQWIHKM*YIHTVEY 
YSA I KSNEALT YATTW/ MNLEN IMLS ER 


6417 


20318 


A 


6471 


390 


92 


DFLIDAINHNLKVFFYS*FISMC/HPYN 
*SLSFSNIQKSLIKTMINVSLSFKY*YL 

TTTDn ^ VT/mr^xXTAn a/it mxiT vit%t t *.t ^ y 

I YFS * KKTF*N* RQGLTMLPRLLSNS WA 
QAILLPWPPKVLELQV 


6418 


20319 


A 


6472 


414 


167 


HIHIKTCA* I FSAA* FI IAKK* KPKCPT 
ADEWi.NKMWYVHl l£/YYSAIKK\NKLLI 
PWMNLENIMLSTGNQTQKATYCMI PFI 


AA lO 

04 iy 


10320 




o4/5 


5 


244 


A x X XjJjO JL J. lUnLr tr CtUX 11 if vi t\l IjLiXj 

TSLFL * \ I RTAYPRLR YDQLI HLL* KNF 
DPLTLALLI *HVS IPITISSI PPQT 


6420 


20321 


A 


6474 


12 


224 


IASGRPFFCYDVCV/VKRLCRHSIVIIM 
* KKKKKKKKKKKKKKKKKKKKKXKKKGR 
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Predict- 
ed end 
nucle- 
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location 
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ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D^Aspartic Acid, 
E=G)utamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Jsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknovvn, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














GGALKKKP GGE P LF PH 


6421 


20322 


A 


6475 


417 


155 


FQSFTENKMLS SS ITCSSEI FGERKSQL 
M*RPSLL/SY*EKLPLLASTTVISQRPS 
AL* VQ PL/ PTAKNL * LTAGS KGHYYFIiN 
SKVFSN 


6422 


20323 


A 


6476 


1 


131 


RTRP PTRTRTRGRWLTPVI PALREA/ E E 
AGG* LEPRSSRPAWAI 


6423 


20324 


A 


6477 


830 


513 


WNS VKESLPADTWFLPFLGPLI D I FLLL 
IFALCLVNLLVKFVSSRLQ* FHVKMMIiA 
QGFQPIPSSDPEDKDILPLSLLEQVS\G 
ILPLQC*AGSMPITSAGSSYRR 


6424 


20325 


A 


6478 


399 


196 


SKQSSHLSLLSR*KYRCAPPWWAFF/SF 
FLINIFCRDE/SLTMLPRLLLNS * PQVI 
LLPWPPKVFGLQA 


6425 


20326 


A 


6479 


1 


411 


NTGIMPANTVS ILLLIDQEVLVTVKTCY 
LRNTFCGFSDGSGHSV\QTFWIGSTILD 
AIKSL/R/DSREDGKISTLIGIRKK* I P 
I LMEDLVGFKTSQEEGTVDGLNLA* ELQ 
VEGPRDETELL+ PQDQTCMDEKFLLMDE 


6426 


20327 


A 


6480 


381 


162 


AG FTHHNFFFL F * DMVLLCHPG YS A WQ 
S WLTVASNS W\ VK* S SHLSFPKH * DYLQ 
AVHFRMASNS PGLYLLYQ 


6427 


20328 


A 


6481 


379 


85 


SILFFLFHPINFFFFFF*AWI FI IPPPH 
\RAFK WRWGFFLADSWF I KRLPQYY 
FFFFFPGFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFLSVC 
GG 


6428 


20329 


A 


6482 


355 


117 


PKVKIFP?PPLPRKWDPRGPP*S?GTNF 
S I FFLVEKDF PHVAQAGLKLLDS S S TP S 
VAS\QSV* ITGMSPHAWLQVLLSL 


6429 


20330 


A 


6483 


402 


97 


TAFHLLD I YHKS * KIMFTOQPLYKCPLF 
IIAKN* K* LKCPRRADKQIVA/ WCIHTI 
EYYSARKRNVLLKYTRLWMNLKS IMLCK 
RSQSQDVTYYVIPFICHS 


6430 


20331 


A 


6484 


714 
i 


365 


NGFALVAQAGV / Q WCDLGSLQRLP P IFK 
QVSCVTLLNS*DYRSMPPHPANFCIFSR 
DGV*SYWSEWSLTPDLGLPKCCDYMREP 
PCPANAKS FTVEETHVTYQAYETMLLSD 
HDESG 


6431 


20332 


A 


6485 


408 


146 


LKTETQTETCPPMFIAALFTLV/RRWNQ 
PQCLSTDEWISQ I * Y I CIMN INSDI KRN 
EALKHAVT CMNLKNMVPFI S PTEKGKKK 
EKKK 


6432 


20333 


A 


6486 


243 


11 


GMSQYQEYSKSNFFLSNRVLLCHPGWSA 
VAH/SRLTEALNRFK*SSCLGLPKLWDY 
RRVPLLliAQNATR *NSL* SCPA 


6433 


20334 


A 


6487 


403 


161 


DMVSPRWPVWSQNS \SLK*SAYLSLPRC 
WDYRREPS CQAS RNTS IGRGNRKNQWNT 
CMYLLFI L I PLKKYMCNGRVMEYFKT 


6434 


20335 


A 


6488 


292 


24 


DWRGRMAI PLKKGSKC*QGCRNTGLLRY 
* W/MECEVAQPLQEMV* W FLQRLN I KLS 
YDPDF/PLLGIYPKE*XAGT*TDICMPM 
LVATLFSG 


6435 i 


20336 ! 


A j 


6489 


3 


203 


SS WDCRCI PPCLANFLICCRDR/ SLTML 
LRLVLNS * TQAVLLPWPPKGMPGLLFVF 
YGGIVLSYVNIS 


6436 


20337 


A 


6490 


413 


30 


RGP/YQPGFLGFSPPNGLGFG*KVFIjPL 
P P P PRKG PGKKGS PRARGFPGQAPP KKE 
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NQRGGGFWSRGRLAKKKGFWFG * GPAKG 
KKFFFFLGAGVWGLKVLI P I KKKKGFPP 
PPLFFFFFFLQTIHPIRG 


6437 


20338 


A 


6491 


101 


335 


SSSFLGFPKYWDYRREPL/ WPS / LWQFF 
RLNIHLLYNPS I P I LAVY PREMKTCDLI 
KSCMRMFTVAVF\ * I TQTWKQPNVF 


6438 


20339 


A 


6492 


532 


127 


STVFRARQLPCLHEPGRQSSP\PVPTSI 
LYT\PLPFPSSRYQQNGPCLSSPKLSPS 
PELGTQS FPVHKS FS PCLPS PKPLTVPP 
ALPSTGPSQSL*QKHS LAS LLHTHSHTP 
PAPSCNPESWGLDATAAISTCFYG 


6439 


20340 


A 


6493 


399 


1 


LCPAPRGGVYRGR/ ESSLSWDGLHSVQA 
SWPLCLPTPASAMVDAPPPA\SLPSRSS 
ILDCCISSEQGSVGMGPAEPGAG*NLIjV 
CCLLRPLEKCSIWAGVSRFSWYSLSQLiP 
LARKGKS PNPLS FPNEKMPRPVS 


6440 


20341 


A 


6494 


319 


75 


KKIWGPLKKKFGLQTRRPKKWFFKRPPP 
FFFFFFFFFFF/HVAI FLLFCRDRG\IiA 
MLPRLILNSWP*RILQFWLPKVPGLQA 


6441 


20342 


A 


6495 


333 


190 


LFFFFFFFFFFFFFFFFFFF*GFFFFIF 
YSFRDGVLLCHPGWSAAMPS 


6442 


20343 


A 


6496 


1 


299 


LPRLEYSGTITASNSWAPAILPPQliIK/ 
YNFFCCC*GRVLTVLPRLVLNSRPQIVL 
PPPPPKSAEITGVNHHVWPKKI ISNBKT 
THCSCSALIAKQQ I SS 


6443 


20344 


A 


6497 


51 


266 


PVNVILPSSWNLRHSPPCPARFFVLFCL 
LFFGKN/RGLAMLLRLVSNSWP+VILPP 
QPPE * LGLQVQATMLSS 


6444 


20345 


A 


6498 


366 


78 


LIAHCHS SNNKSLL I LDNFNI YH YGPSN 
TVTS*F\*MSSLMILLSSP/C*GTSWSY 
LFYALKKIFCKD/RGLAVLPSLV+NSWP 
QWLLSWPPKELGLQV 


6445 

i 


20346 


A 


6499 


37 


396 


FPRPTRPKSSSPAKATQQNTISFKKKKK 
SKQKKGNTVI I IRVKNQRKRKQKN* KLK 
Q*N*EQ*KINKISKPLAKIiI/KKKRVKT 
*IANIRNERSSITT*STDIKRLIWKYY* 
QL 


6446 

': 


20347 


A 


6500 


706 


394 


VPLLLPRLAVQWHNLGSLQPLP PGFKRF 
SCLSLLSS*DHRHTPPCPA/NSFVFLVE 
TRLHHAGQVCLKLLTSG /DTPSSASQSA 
GIKGMSHRTQPIEILFYSVFF 


; 6447 


20348 


A 


6501 


403 


216 


ILNVLSFK/1WILY+HYTTVLRTVRPLSP 
R * ATVTCT YTSGWPEAAEDPQKMTFPHC 
DLFLPHPN 


6448 


20349 


A 


6502 


377 


116 


LLPY*EN* FLMYLPPICCLPFDFGNSGF 
CHQKKNFFFFFKTKSLSVA\RLECGGAI 
PAHCKLCLIiGSSDSMKNFLNEAEFPNlA 
FNGh 


6449 


20350 


A 


6503 


3 


401 


CVQHLRAS WS SRWAVJAWRAPHSEQ PAGP 
AAPGNEGLS IRASGCGGCTGYPSS AS PP 
ALCSIS\PRP*LPSRRAGLATCSPPCLS 
LPPPPWAPVRPEPPRRVPPPAPWRPVPS 
TTQGLRSASARHRTGRQIiHLQ 


6450 


20351 


A 


6504 


406 


55 


WPKGFTPPRFPRKEGSP\PPFPKVFILE 
TRQPGPQTKPPFFPPIFKLTRPGGRPP* 
FPPPQRVRAGKPP*PQSFRVPGAIFPPP 
PPPLKGHPPRPGFQKKKKKERKKRPTRK 
RGGT 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, l=Iso leu cine, 
K=Lysine, L=Leucine, MNVIethionine, 
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W^Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 


6451 


20352 


A 


6505 


3 


281 


FFFCDRVLLCHPGWSAMVRVRSRLTATS 
/SFLGSSDSHACLSSSWDYRHLPPLLAN 
FC I FLKRQGFTML P R P VSN P * AQAMRLP 
QPPKVLGLQA 


6452 


20353 


A 


6506 


1 


247 


LGSLQPPPPG\SRHSPASASQGAGIADM 
SHRTRQSIVFLRTNNKPY* IENMMITIS 
CTTAYKYEILRHSFNKICARSIiYEKL 


6453 


20354 


A 


6507 


34 


47 


FDPNLTPRHIRI ILWAVNKERMLTASRK 
KKQII *RSSNLSGDRLLNEEASWEP*CK 
KSMQETIQATEGWNDIFKVLKEKKNL\H 
S KILCPAKLSFK YEGE VKAFPHKPKWRE 
FTVTKP I LQKC * R ENQRT FDLTQ I 


6454 


20355 


A 


6508 


403 


106 


HYVPPPSPPS/YFFFSPPGKRGFFPNPF 
I LVS PGFFPSPLF * KPPPEFFFFAPLKK 
IFFSPPRELKFFFFKGPPPFFFFFF 


6455 


20356 


A 


6509 


389 


230 


LSFFLFFRDKVSLCNPGWSAV/VTFELK 
RSSCPGLPK/SQSHCTKPQPGF* KYIYF 
SFFIK\KR*SLTTLPRLVSSSWDQVIHS 
ALWHPKVLGL*AVAHAYNPSTLGCQSAE 
WIT 


6456 


20357 


A 


6510 


101 


419 


ISFFFFFFFFLKKFFFFPPGWRGGGEF* 
LPQTPPFWVKKIFFPPPPKVGGFKNSPP 
PPFFFFFFKEKGFSPVCPGFFKTNPKK/ 
SPPPPPPRGVGLGGWPPPPGWDP 


6457 


20358 


A 


6511 


41 


376 


EWWCETRGGAGQGRAHGAAGGATGRVIiA 
IQARKRRPKREKHPKKVSCSVAQAGVQW 
RDLVLLQPLPPGLRVSLCHPS / WECGGV 
VSAH * NFCLPV/ 1 KDGVLPCCAGWS *TP 
GL» 


645S 


20359 


A 


6512 


2 


332 


WEKVKIATLTGFWK* LI/SGSSRTLMGD 
FERFKTSVEKVIADWEIARELEIiEVEP 
ENVTELLQSHYKTPGTLAFSRLSPDVSL 
YSQLPETRSHEWQAAS KIQPRKASTSQ 


6459 


20360 


A 


6513 


3 


294 


FFFSEMESLSVTQVGTOWHNLGSLQPPP 
PEFKQFSCLSLPSSWDYRHAPPHliA/NC 
FCI FNRDG VLPC * LGWVSNSHLG FTGMS 
HCTQPLLALSFLHG 


6460 


20361 


A 


6514 


1 


526 


FRLGRKPR* GGVW * PVWSRGEPGSVGAE 
AG/RS * SAPRRLLHHPAAGLATGLSASG 
RRSARWKMERASGLS PGGGLGATSRQMS 
PGTQLANPPDHGDKDCLGRISPGSGKQI 
QAAGQLPGPPTSIiAPAQGRLRSLTPWGL 
QTPEHSEPEGIGHLQAATEAVLPHSTQN 
LITKRNLM 


6461 


20362 


A 


6515 


747 


451 


QAGVQW*DLGSLQPPPPGFK* FSCLSLP 
SGWDYRYRPPNPANF\*FLVEAGFHHFG 
QAGFELLTSGGPPASASQSAGITGMSHH 
TPLGVHALVGENIQ 


6462 


20363 


A 


6516 


444 


3 


PGGGGPFFPPPWGGKPGKIS * / PPGGGG 
SNKPKLPPCLPPGGPK/GNPPQKKKKKK 
GFGPGWAHACNPI I PALWEAEAARGLP 
PDAGRYGPGGRREAVRREAGPQECQDPE 
GQV*GDPRGCGAAHAPARGHPRKA+ RGA 

DuooruLir 1 IS. 


6463 


20364 


A 


6517 


257 


575 


ASSVTWAGVRWCHLGSLQLLPLGFK* FS 
CLSFPSGWDY/ GHAPPGPAKFCIYSGDG 
VSLC* PGVISQTPHLR 


6464 


20365 


A 


6518 


337 


389 


SQHFV* I S CLHDMPALASQSAG I TGVSL 
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nucleotide insertion 














RARL IFIFR \MDSCYVPQAGLE/ PPELK 
RSSHCGLPKPWDYRHEPQAQPVCRFIRA 
TKCAYVSTQALSPAPVQPGMTPPAKRGL 
PCQ P PGS PC PS L CHHQLAG TPP YE 


6465 


20366 


A 


6519 


61 


323 


FMWDRVARKGS IEGASLELPLGGSVGGH 
GSC I SMWGALGHSS\THV* HTHTHTHT 
HTHTQTNTHTPSSDSKKSSILDKEEAEG 
CMTP 


6466 


20367 


A 


6520 


399 


101 


KIQIKATMKY/HNIPIRMGFFFKLSSQG 
/C**RCRK/IYTLPCHCQ*KCETVWPLWK 
SVW / RVFLKLNIHPLHS PE I PLLGTYPR 
/DMKT * TCM * MFTATL F I TT YS 


6467 


20368 


A 


6521 


1 


276 


RTRGFEGLRFFLGFFGTVLLCCPGWSAV 
E\QSWLIAASASCLW\VK* SFCLSLLRG 
WDYRCIPPCLANF* *R/CGSWAQVILPP 
QSPKALGLQT 


6468 


20369 


A 


6522 


280 


277 


GDDPS CSMGGGHCLASRSE * GS VGVGPS 
KPGAG\YNLLVCYLLRPLEKRSIRVGVS 
QFSRCHQSWLPFDRKGNSPTPCTYWVR 


6469 


20370 


A 


6523 


2 


327 


S LFS S * D / YRH I P PHPGNFR I FCR / SAL 
TMLFRLVSYSWPQVILLPQPPSVGTTST 
SHCPQPCIMFLKADLNFVRCQFRCAIiAP 
EP Y I QDP FLRDDVAFPCLGASAP WHIiA 


6470 


20371 


A 


6524 


298 


414 


PG PQS ETLTKKKKGGGRLKGSNFTPAGM 
EGN IFF/ CFFLS KSHSVAQARVQ W / P * W 
LTAGSLQAPPPRFKLFSSLLi* VAI YAS P 
TLLSS*DYRCTPPRLANFCIFPRDGVSP 
CRS CKAAVFNLLDTRDW FHGRQS FHRLV 
AGDGFRM 


6471 


20372 


A 


6525 


376 


180 


KCKKYPGGVAHACSPS*DYRHVPP/RPG 
YFLHF*QKHFGQAGLDLPTSGDPPASAS 
QS AG I TGVSHRTQPQLNYF 


6472 


20373 


A 


6526 


415 


86 


LYLPYDLAI PFLGMYPRGKKKCAHTKI C 
TRI FTWLFI IARK* KRSKNPSTREW/ 1 
KMWNIYTVESHSVIKRKELLIHASMILV 
H* / WM I FEN I I LSDRSQMQKI TYCMVP F 
T 


6473 


20374 j 


A 


6527 


391 


166 


FFFFFRDWFSLSHPGWNAE \S *IiTVISN 
SWSQARLIFKFLKLRQGLNLLTRLVSNS 
RLQG I IiSS WPAKVLGLQA 


6474 


20375 


A 


6528 


544 


176 


TMRYLLTPVGMAI I KKAK\NT* C* *GGR 
E/IGTL/LQCR*ECKLVLPLWRTVWRFL 
KKLAIELPYAPPFLFSLEYGLFSLGSTN 
CAQLRS VYRKAVQNHQGVSGE CAGFQDS 
SHSAPYAI PTARQ 


6475 


20376 


A 


6529 


392 


139 


TYVKMHCK/WLGMVAHAWNPSTLGGRGG 
QIT*GQASTKLSHSWGLSRESRFWIxECY 
SRSCSFCSCRLPADCTSLTDSKQ 


6476 


20377 


A 


6530 


362 


87 


NGPPFFFFFGEIDKLILILLWKCK/RPK 
I AKTVFKKKNKVGGLI *HYFKAYYKGTV 
I KLVQFWHKVRKTDQ* SRRES PETELH I 
YVLWARDSA ' 


6477 


20378 


A 


6531 


1 


346 


APTRPRLKKTLSKLKTTNLQLPYDPAFS 
r Jjo v j r'KlLrln.l "K \ri\KJLCv "rlr MAoJuF 
MI VTTQ/NVPRCPLMGEWLNKWR\ * FIT 
* YHTGMKNT\ VMHKTTWiVINSQR I MLNE K 
SQSLTY 


6478 


20379 


A 


6532 


3 


294 


FFFFRDRVFLYHPGWSAW* S * LTAALS 
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. PWLKRHSHLSLPGS *GHRMH/ PPHLGWF 
FV\ FCLFL* RQCLAMLPRQVLRSWPQVI 

Jf IjF W Jr Jr l\J\ v ijVjJ-»y " 


6479 


20380 


A 


6533 


2 


280 


PRYLVRNPTPTLGSCVTVDK* LSLWASV 
LYIYIYTHTHTDIYRKIYAF/IFVCVYI 
YIFI * KHIYYKEQAHAIMDAEKFQDLPS 
Ao W K Jr o r\J_i V r» 


6480 


20381 


A 


6534 


96 


183 


DQLNVFHRKLKRYDTYTFTVALLARHGG 
SC/LSLGSS*D*RYTPPLQTN*K*FFL* 

LEA 


6481 


20382 


A 


6535 


2 


241 


GFKRIDSNFESSVGKVPTN/TIFCERKS 
LSMHQ I / SLLS YFKKLPQLPQCS AATTL 
\Io'Ui'i iJbi\yDFFFUKJjUFAEiUoJJljI 


6482 


20383 


A 


6536 


485 


97 


PTTRPILLLGFYTK*LKS\NKSDSCPPM 
FIATLF/ ILKCPSTDKWLKKMWYKHTTD 
YYSALKKGNHA/YM*QQMNliEGIMLSEV 
SQSHEGQLLHSTYVRYLK 


6483 


20384 


A 


6537 


411 


57 


Lj FP S H I KTrVLS w qs cfaktsg PQTP s f f 
FF* LFFF/RDRVSLCHLS WSAV/VSWLT 
VASTSQAQACLSLPSSWDYRCPPPFPAN 
rbKKlji? IXljFKJjVbJNoKAyblLPFWPSiv 
VLGLQRRGI 


6484 


20385 


A 


6538 


438 


201 


IYPKEMKT+VHTKTCI*MLIATLFTLAR 
KWKQTKCSSTDEWINKWWHIQ\TMEYYL 
AxKRNEAQLEHGEIIMLSERGQSQ 


6485 


20386 


A 


6539 


466 


131 


SFPKKGGYRAPPPPPPPFFFFFFFFVEA 
VLSLCCPGGSQTLGLKRSAHLVLPKC* D 
HRHKPPCLASN/SFFNRHRMMTNNCLMC 
IKSWCLCI INPFYHLDSSLLWMLSLI YR 


6486 


20387 


A 


6540 


419 


50 


PLPFFKPGPKIF/CIajAPKKKI *VPT/L» 
REI KLFPLKGPPLFFFLNKGPLCFPGRK 
GRGVLFFLATPKSWGQKIFPPSKEGASF 

pfffpffsw*rwdltmlprlvsnswphv 
ilpsqpprvlglqv 


6487 


20388 


A 


6541 


1 


341 


SIIKj. AREKuLVT Y * GTS I RLTGDF * KY 
I/SAKILQVRRQ*NDIFKVIK/EKNCQP 

DTAVDHV7 CTJTMPDlrTVCPnhTVAVT rjTjri 

Riy J FAKboi/XjNhKKXKbr PNKyKJjRJir 
ITARPDLQElMI KG ILYLETKE * YLPS * K 
HMKV 


6488 


20389 


A 


6542 


409 


290 


NCKQPTCPSTNEW/ IKL* YICTTEYYSA 
IKRNKLQLHTTT 


6489 


20390 


A 


6543 


32 


397 


AM I V FXjQ AS1jGWk£>kKTJ-j& LKJvkKKKKG 
EKETLPGGST * KEKVRKKQNNLLV* KTH 

KTJHPG *HGKTPFLQKKKKRGALIjKGTNIj 
TARGRQSKTF 


6490 


20391 


A 


6544 


402 


0 


LSWTQ*CKPIAPATGEGGTGGSLEPRES 
RLWCAM I APVNA/NCTPAWVT 


6491 


20392 


A 


6545 


1 


365 


RKVTSLPSTSRCLS ILPHFSDGPKPVKP 
LQ * * LRCS ICSAS LPTHLHTARAAGLLP 
\ LAS S TAVNQN * SRELGACVKWEHSGI K 
YGLF* RD/RGLTMLPRLVLNSWPQAILP 


6492 


20393 


A 


6546 


1 


331 


I STFKS YLRN I FS KAI AAVDSNS TEGSG 
QG*LKTFWKGYTIIDAIKNIYDLWEEGK 
TSTLPGVWK\LDDFEG\FQTSVEEVTAH 
WQ1AKELELEMDPEDVTKWLQSHDTN 
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X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 


6493 


20394 


A 


6547 


1 


417 


TRPKINKTKNLLFERINKIKKP*VKLTK 
NKNKNKTPHITKIRNESEAFTIEKLKS I 
LRE \Y MNNCVYNLL YANRKPRQMDKFLE 
RYKQLKLTQEETETLTD\YITSKEIKFV 
I*NFSTK*S*GPNGFNSEFGRVLWFTPV 


6494 


20395 


A 


6548 


410 


288 


AHFLFFC I NK \ S LTMLPRLV * NMWP * VI 
FPPQLPKTLGLQV 


6495 


20396 


A 


6549 


1 


216 


IDRGFTMLPQLVSNSL\LKRSSHLSLPK 
CWDCRCEPPCPACFHFLSECNTCIKNQL 
HVTLFFPNT*GLTSEG 


6496 


20397 


A 


6550 


479 


146 


VPGFKQFSCLSLPNS * KYQRAPPCPANG 
FAFSVETGFPHVGQAGF*FPTSGDPPAS 
ASQSAGITGLSLHAWP\ENL* KLRLHAT 
GLRLHDHVPLKVPNYLKVCSFKFEIADV 


6497 


20398 


A 


6551 


42 


484 


SMRWNSMNVTIMPANTTS I PQPTDQEVI 
SVFNS/YYYYYRYCKATAAIDSNLSGGS 
RQSKLKTFWKGLTI I YAIKNI CDS WEE V 
KIST* TI * KKVI PTFMGDFGGVKNSIK\ 
E ITADVAE I AR * L* LEVEPRVGWLTPVI 
PALRKAEVG 


6498 


20399 


A 


6552 


304 


74 


HTGKAAKKKEC / WRGTVAHAYN PNTLG G 
*NGWIT* GQ* FETSLINMVKPRFYSCIC 
I SGYTHKVIHRI I I CKGKKLE 


6499 


20400 


A 


6553 


432 


62 


FWAGNS RMQQ WCTD T/ VS AH CS LDLLG S 
SNPPASAS * VARTTGARHHARLML I TMN 
E KPLG AR Y CAR I FFENKLG I YPNES VPT 
KTNANWS KPRSC F SHDPDPGHW I YLMS 
SGLIVNKIGKVK 


6500 


20401 


A 


6554 


2 


174 


YCNLCLPGPSDLPTSAS*VAGTTCMCHY 
TWLIFFCI \LVETGFYHVAQAAKLFIYS 
I 


6501 


20402 


A 


6555 


3 


268 


DAWADN I KCR *G YGAI * TL I HCWWECK \ 
W*ATWWFLIILNMLLPYNPVIMV\PRTC 
IQIFMSTLFINTKTWXQPRCHLVGEWIN 
CGTCR 


6502 


20403 


A 


6556 


3 


402 


DAWTDN I Q CR * R YG AI *TLIHCWRQC\K 
W*ATWWFLI ILNMLLPYNPI IMDHK\TC 
LQIFMSTLFIDTKTWQPPKCHLVGEW/I 
KLWSMPAKEYHSWLARTEL/ IVP *NT*R 
RLSCT* LWEKSQYEKATYCWPTV 


6503 


20404 


A 


6557 


396 


2 


KKQNAFQSPSFLPRL*TPKITILISFHN 
I Y I YTKTYI YIHVC I FTYIHTHIY IFF/ 
RDRFLLCLPGWRAWQSQLTAASTSQAQ 
GI PG* FFFK/ SIFCRDW/ S I TMLPELVS 
NSWAQAIPPPWPPKVLWDYRCGS 


6504 


20405 


A 


6558 


393 


175 


FKSQGLVLLPRLEFSGMLIAHCSLKLPG 
FSDSLNSA\SQ*LGL*GCTATPGPYIjLIi 
IYAGELGLNXRSISSK 


6505 


20406 


A 


6559 


401 


188 


SPPEPFILGGPRVFPP?PFLKPPPRI*F 
V7GPKKKKNF/CPPPGPKNFFF*KGPPLF 
FFFFFFFFFFFFHRSS 


6506 


20407 


A 


6560 


37 


343 


YEPAWLLDI YPKELKT*AHRKTCTRIL 
IAS PFKTVKMWQP PD/ CP * ADE / WDK*T 
ACLQTVE * CSARKRSELSTM/ IEMWRK1 
KCVALSERGQSEQAACYVI PTL 


6507 


20408 


A 


6561 


792 


141 


LFPPPP/YLSTSPNAGA*TTSPLPPQFL 
YHS PRRQQSGTKPPFFS KK/ PEPPSP/S 
PNKFYPGPWWGVSPFKF?PPPQKS*GPK 
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C=Cysteine, D=Aspartic Acid, 
E=Glutaraic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
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X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, V=possibIe 
nucleotide insertion 














KSPLPPEAP/PAPPLTQDCPPFVSNPGG 
KKKTPP/SAKKKKDPTPS/PSPPPRH* I 
GGPPGFSPPPLFKTPPPKLILGPPKKKK 
IFPPPGGKNFFFLKGPPPFFFFFFFFFF 
FSLLFPVGVWSRWLRSEAADLCGECYSS 


6508 


20409 


A 


6562 


1 


356 


FFFNFIFIYLF/NFLRWSLALSPRLNGV 
QWSDLGSLQAPRPRLMPFSCLSLPSSRD 
HRRPPPRLANFLYF* *RWGFTMLARMVS 
IS*PQ/CDPPASASQ\STGITGICHGTR 
LIPRNPMIA 


6509 


20410 


A 


6563 


416 


84 


LSLPSSW/DYRRPPPCIANFLYF* *RRG 
FTMVSIS*PHDRPTSASQSAGITGVSHH 
TQP3 ISFLKSFFFTLSNRFNRVCIDSEL 
LLQCKDFYLQKFNLHIAFQEYTHCKGVH 
Y 


6510 


20411 


A 


6564 


2 


51 


FWREMQIKTVIiRYFSLKLVKIKK*DST 
FFWKGC/GETGNLIHKANW/R/SI*QYL 
KKL YVHFLF VP V I P PL E I YL KNKK I LS T 
K * LFAVF I VTE KC * KKKRCG * T 


6511 


20412 


A 


6565 


2 


511 


VRW/SCAGL/LHTAV*IPGRIITSEKCA 
QQVDETHRN/LQRLQPALINRKGPILLH 
NNTCLYTAKPMLQKLNELGCEVLPHLSY 
SPDLSPTNYFFNHLDNVLQGKHSHTQQN 
AENAF * EFVKF * SMDFYATG I NKL I S /H 
WQKCIACNGSYFD 


6512 


20413 


A 


6566 


3 


248 


GDI CT\SMLVPALFTVPK/ TWNQPVS I S 
K*TDKMWYIYPVEYYLAIKKTXFC/SFS 
ATWMILEDIMLSKPGTER* ILHVLYLCG 
R 


6513 


20414 


A 


6567 


3 


258 


WRDHSSliHS* PFGLKWFSHLSPLSNWDY 
RBVPACPANFEKVL*K*GLIMLPRL\LL 
SSFPEW* ILSSRLPASSFQSVGITHMNH 
CS 


6514 


20415 


A 


6568 


392 


166 


GGGGPPRGGGLEPPGPPPFFLKNKK*/R 
PPGGPGPLFPPPGGGREEKFFYLGGKNF 
H*TRVG?PPLSGGKKKKIRG 


6515 


20416 


A 


6569 


234 


411 


PGAVWHAYNPNTLGGQGER IS* GQEFKT 
S/LGNIVRPHLY 


6516 


20417 


A 


6570 


1 


175 


FLQHV*NKFQTMSDQIIGKIDDMNTLID 
LEKNVKDL \ L ITQTE VEELEGENKI P I L 
PG 


6517 


20418 


A 


6571 


392 


140 


LPSS WDYRHVP PCPANFVLLVETRFLHV 
SQAGLELPTSASQSAGITGVSHQARP/E 
* *NS IRKGATTI QRDLGPVLQVR YLVRC 


6518 


20419 


A 


6572 


2 


228 


R TCTQMS VATLFL I AKNWKQPR CPS I GE 
W INKL * Y I LTME YYLVT KK\ND?S SHKK 
TRGNPKCI LLS KRSQS EKA 


6519 


2042O 


A 


6573 


412 


100 


ALiHSHPLGGPGQGVPKSRIPNPPCPQGG 
TPFFPKNPKLTRPGFARLPFPPLKKLKP 
EKFPNPG/MPGVPLN*IPPPVSHPGVQP 
N FFS KKKKARG FCLVLNPS S S 


6520 


20421 


A 


6574 


406 


257 


NKREKKTRKMLDSIVIRKRHIKSQMRCY 
SAPIQMAXV\SNRQSPKFWVRCRQTISL 
*HC* *ECKlVQSMW*TJbQKF LPAIVFLC 
I YHTEKLRFTQQP /CT* I FLAALLMIV/ 
RTV/KRLVCPSMGEWLSKLW*VNLSFSV 


6521 


20422 


A 


6575 


370 


68 


EQGSVGMRPSEPGPGYNLLVCR* LRPLE 
KRTIWAGVS*FSRCSLSRLPLARKGKSP 
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NP\CTS*VRFSNGLS 


6522 


20423 


A 


6576 


1 


259 


IDLIDVLRPSL7VT*AWQWCNLGSL* P 
* P PG \ SGDP S AS ASRVAGT I GQHHHRWL 
IFVIFVEIWFRHILVYFHTAIKNYLRLG 
NL 




20424 


A 


6577 


418 


218 


KGKTRLGR W P PCNPSTLG \GQGGW IT* 
GQAFRTSLANIVFPLM 


6524 


20425 


A 


6578 


1 


401 


EKAGCSGSYL*P*HFX3RPRQEDHLTLGI 
*DQPGQCSENLSLQKNF/TNI SWA*WPG 
IVAHACNPSTLGSQGRRIT 


6525 


20426 


A 


6579 


3 


356 


HWDYRHSPPFPV*MHFFIWLPGCHSL»IiA 
S PNLTGCFFS I S I TESLLS LF I YLETRV 
*DLQCSYFWERRG\SLCCPCWSRAPGLK 
RS CCLRLPSS WDYSRLTPPHLANYC I LG 
GEGESS 


6526 


20427 


A 


6580 


411 


158 


IGSVAQARVQWRDI J SSSDPSAN*APPGL 
KQSSC\PASLPSRWDY\NKCTPPDLANF 
YIFFVETGFPYVARLADCFYPPPL* KMT 
GSY* VFTMCQI PH* AFYKF* FI * FRVGE 
DKNSLPAWQHRETIiSLGKKYKN 


6527 


20428 


A 


6581 


399 


218 


GGAELiKVPK*GGGPGSPFLPPPLQKPRP 
KGPLWPGG*GPPRPPGEAPSPLKNKN* / 
PLPG/AGPPPVPPNWGGGFYPRG*GFG* 
LPLGP * TLARGKQRDP VFKKKKXN*NVN 
KKKAKL*KS*QFFFFFLKTGSRCFPRAK 
VQGPNGSYPKPQPRG 


6528 


20429 


A 


6582 


392 


214 


GSSDPPNSTSRVAGTTDEVLPCSPGWS * 
NSW/VSK*SACLGLPKCWDYTRHPQKLA 
RSNF 


6529 


20430 


A 


6583 


390 


257 


PKRPPPPAFFC* PK\SLGMLPRMISNSVJ 
AQAI I *RWPFKVLEKQA 


/CIA 

6530 


20431 


A 


6584 


65 


251 


HYFNSFNFFLMGSSNHPTKNPG* FFFFL 
TFYKNK/ SLTILPRLVLNSWAQGILLPW 
PPKGLGL*AQGILLPWPPKGLGL 


6531 


20432 


A 


6585 


2 


143 


RHMPPCLSSF/SYFIj*RRGFAVLPRLFS 
KSWVQVILMPWPPKVLGLQA 


6532 


20433 


A 


6586 


142 


334 


GRWLSQLLILNFTTRSVLKQMSW/MACV 
SETWPA/ STLGGRGGRIT*GLKEFKTSL 
QNMARPYLYK 


6533 


20434 


A 


6587 


485 


377 


RTRLSCRFFILSLI FRFGNFYCFTLKF / 
H**VFLLSFVFYC*GHP/FEFIFKVFFI 
I FFHFKVFI CVFKYLLFFF I L I ASTFYF 
FVNI FSFKKFVSVMFVL.LIEMFYYGRFT 
IPLR*F 


6534 


20435 


A 


6588 


241 


251 


NSLFTKKKKKKGGALLKEPILRAGGGEG 
IIFPKGPQKSIPGPGFKKAGREKPGPSP 
*NWFF*KGPPFFFFFFCKE/RSFTMLPR 
PASNSWAQAICLPQPPKWGL 


6535 


20436 


A 


6589 


429 


54 


YFLFLPSLPTFFPSFSSFLPSVLSFFLP 
SI/SSFLPSLLFYILSSPPLSSPPSSLF 
LSFFLSRVSFCHPG * SAVAQSRLTATS I 
SWAQAILP/CR/NRDFTMIiSRLALNS*P 
QVIRPPRPPKVLGLQA 


OJJO 


on/in 






41 1 


137 


AUlOI i riHCSLHLLGPSDPPTSASRVV 
GLQAHHHIQIaAC\F*LID*IiID*LNVEM 
ES RHVAQAGLELLGS S DPPTLASQNAE I 
TEMSHCIF 


6537 


20438 


A 


6591 


412 


196 


QPQP PGAKKTSGGG FP I R/ WGHQP P PPG 
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AKKYNFFFFF FW * RQALAMLPR WPNCQ 
T*AIILPWPPKVL*L,QA 


6538 


20439 


A 


6592 


28 


400 


IMPNCFPKWLYITFFFFFF*RQSL/DSV 
AQAG V Q * tiSLiGS TATSAFhGFK * F5 CP P 
AFLE*LDYRHVPTHPTNFCVFNKD/MGF 
TMLARLVSNS * PQ/ CNPPASASQSAG IT 
GMSHRAQPHLFVLKMEKPPPNFLFILLM 
NQSIHIYLLNFYRF 


6539 


20440 


A 


6593 


414 


23 


QxHLir W 2 PPGr bPrPr rKWKPKlr r J-»OA 
LKKNFFFPPPRHKNLFFLKAPPFFFFFK 
RSFFPPGWCAIGLFSLQPPPPRFK*FFC 
PoIjPKKWG iKPJjPPKPGK_t / ur r r r rv ± 
LVENGFHHVGQDGLDLLIS 


6540 


20441 


A 


6594 


91 


274 


I LHWS * PLGNYHL / WLGWAHARDLSTL 
GGRGGRI AGGQDFKTSLGS KQGS I SARK 
KEKKKK 


6541 


20442 


A 


6595 


391 


251 


HLKRCSTSSvIREVQIKTTTTCHFSE*K 
R/SEINMCW*ECGQTELSALLIRC*\WD 
SEMVQPLWERKEYLKQISTESSWAPAIP 
LtLdJl ^P^liKCJAoiJoJjKJ\GWvjlKPJaPPiiJb 
ASFLLGSYI 


6542 


20443 


A 


6596 


465 


268 


I LL CHLG CS AVAQD \ L»TL CS LKR S SHL T 
bPSSWDbRRGPoHlil KrF/C* ^RGbNlli 
PSLV*NFWAQAI> 


6543 


20444 


A 


6597 


3 


229 


CHGSPQPCPPGLK*SSHLNLPTPK*L\D 
YRGMPPRS ANFLFL * RWGRCVAQAGLEL 
PUoooXjPAoAov-PKyP vijjt r 


6544 


20445 


A 


6598 


122 


418 


IIVNELCHIMCFFFFFFFKQNLLGTQAG 
GQGG KNNG * LKP KP PRLKGS P WLTLPGG 
WNYGGFPP/HPS* FFFFFLGKKGPYFCP 
r A\jVj Jvf r JM Li Jxvjlr rrrA 


6545 


20446 


A 


6599 


1238 


930 


FETDS CS FA* AGVQWRNLSS LQTPPP \G 
SNNSCASAS /RI AG I TGTHHHARVI FAL 
LVETGFHHIGQGWSRTPGPQAIRPPQPP 
KVLGITGVTSRARAISKHFV 


6546 


20447 


A 


6600 


398 


78 


PSSPFYLFIFRQSFPFFARVEEKGGHPG 
F PN P P P S GLKKF F YAPAP PG S KAPGQG P 
PA/RFYLFIFRQSFPF/ CCPGGGKRGPS 
GFPQPPPFGVKKI FLCPSPPRK* GPRAG 
P PR PRK I FFF FFS S L FFFFF VR WG FTML 
PRLVSNSGAQAI CPPQPPKVLGLQV 


6547 

• 


20448 

- 


A 


6601 


3 


473 


FPKGKKKGPPPPAGG/ QLKNPSKKPNKK 
PGAP PRGPRGPKKGGGGLA3S PPQS PQGG 
EKWTLKGGPNLPPPRGGLGGARPPPKIG 
GGPLFPKGGGPKGGTPPPPLFFFFLRRS 

lri\LioJrKijJ2»\_-£>lj.M..L * -"V -Loo/ or* rAKVoL 

PS PPAPTRPTGLTVS PGSN 


6548 


20449 


A 


6602 


398 


156 


NGRKHTDVKPEELPSWVPTRDLTPTGTA 
GAF/Q/RGYYGYYNKYVTVKKGS IAGVS 


6549 


20450 


A 


6603 


380 


77 


PFPKKSPPL*NFPPFF*KKKVPPPPPGK 
LiPLibr Jul* r f r J? r / KUKvbJjCKPGWbAIYiV 
HGHGLSPVNFCISTRNRAL/SMLPRPIS 
NTWTOAI R P S * P P KVLGLOM 


6550 


20451 


A 


6604 


352 


204 


KRGPKI * NFFFG * RQSLHMLPRWPNS W 
AQMILPPWPP\KVQGIiQA 


6551 


20452 


A 


6605 


631 


357 


F FF * DRVLLYH PG W S AAAR \ S LLTTTSA 
S WV\KQFS CLSLPSS * DYRCAP PCPASF 
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C I FSRDRGFTMLARMVSNS * PQVI RP PG 
PPKVLGLQA 


6552 


20453 


A 


6606 


3 


397 


VLHKLTQRCSAVPLRAPTAFLE I GTQTP 
KRVWKCR * P/RBTQNNLLDKEQD FKTYQ 
KATVTKAVWCQHEDTIEQWDGIESTEKK 
FHTQRAGGRDHSHSDSVRKKRKESLFFF 
LKRGLILFPNLEGTRGGISTP 


6553 


20454 


A 


6607 


323 


35 


YLKTKIGKPGGLAALQFLGFRPSKNTQL 
GNRFMCTHTHTHTHTHVHPHTSQYLYPH 
RSISFCSYLDSl/SP*VLTESSNSNPTP 
LKCC*HFPSPCL 


6554 


20455 


A 


6608 


439 


252 


LVPPLFFPRSFLLFSPPKKRGGPKPV*F 
GPPRAFYPPPF/V*RGPPPFFFGAPKKK 
IFFSPPPG*KCFFF*SPPPG 


6555 


20456 


A 


6609 


3243 


2602 


FFFGRDGVLLCCPGWSQTPRLKQSSSLS 
L\PKC*DDWH/RATSSGLQVFLSICCSI 
I S I FFMPS I CDKSTKQ FSN/ PQRAY I LV 
G * R * V I YLGWLAFL FVFLRQHL /NTVA 
QAGVQVJCNLSLL* PPPARFKRFSCLILP 
RSWDHRRLPPRPARFFVFLVETGF\TIL 
ARLVSNS* PRDPPASASLSAGIMGVNHR 
TWPILVFLHPSVFFKVLQLFYL 


6556 


20457 


A 


6610 


81 


395 


TWAALKLYSTCIFSSSETEPDVSQSQIT 
DTTACIMEPGLFW/ YRSIIWTL* FFHFF 
WRDGG LTLLTRL VWNS WAQ V I S L PG P P K 
VLGLQAS ATAPNS F I LIREG I S 


6557 


20458 


A 


661 J 


1 


126 


GRLRQEWHLNPGGRGCSEPRLHHCSPAW 
A I ERDS VS KK/NKNGLKRCFHLRKKTLK 
GSKRIW*PGGRGCSEPRLHHCSPAWAI 
ERDS VS KKKKMD 


6558 


20459 


A 


6612 


389 


67 


YLFMYLF* / DRVIiIiCHPGWSAVAQS * LT 
AASNSWPQVILLSNRFKSWTVLLLSSKI 
PVVQGSTPCPRCYSWHPKCLQLQGLLGL 
DCSSPKYPRGPSLAAFRSLPKWHV 


6559 


20460 


A 


6613 


1 


384 


RCFKVKDLKMRPFWNIQVDPKSNDKCPC 
KRLKRRGHASLPPWEKRRRPGKDGGRGW 
NSAVTSQGT/VEATRSWKSQGSVLP*SL 
*RERSPV\ELAHF*QRIiAMLPRLVLNSW 
PQDISPPWPPKALGLQEP 


6560 


20461 


A 


6614 


385 


174 


CLRLS ALKKPKLNS WY* K*LYIME Y* SA 
LKT\NKG I LS FAI TWMNLDD I LLNE I S Q 
TQKDKYHMNSLICEI 


6561 


20462 


A 


6615 


1 


370 


MNPRAHLSFLLRRKERRILWLKALGHIM 
/ FTV* GLTG I ALANS SLY I VLHDT Y YGV 
SHFHYVLSIGAGFAIIGGFIH*FPLFSG 
YTKKKKKNIRGGGTKNFLLCPALVQREK 
VNPFFALTNLWG 


6562 


20463 


A 


6616 


3 


342 


YRHEPPHSAI FAQLYKL* K* LI YALKTG 
KVYVYKLYLNKAI KSQ\ S YNRASLSLT* 
* PRVKRKRS * PGTGAHTCNPSTLGSQBG 
L I A* VQE FETNLGN I AKPCLHQKKKKGA 
PV 


6563 


20464 


A 


6617 


1179 


915 


FFFETESCLVTQAGVQWRDLGSLQSPPS 
G \ S SDFQL I FVLLVHGQAGLELL I S * S F 
ALASQTAR I TGVSHHAQLVWSDF IKS IN 
LFCCL 


6564 


20465 


A 


6618 


3 


390 


HASVKMELPYDAAI PLPSI YPKELKARF 
*DLCIPM/FITALFTIAXRWRQLNCPSV 
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C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, l=lsoleucine, 
K==Lysine, D=Leucine, M=IYIethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon,/=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














V I RS WGR EGVKE S CLMGME PH S AR * KS S 
GDTFHNTENC Y* TVHLKI VSGKF* *L*C 
VLNYVGFFYKNKNIKRLS 


6565 


20466 


A 


6619 


392 


49 


QFLFERLFSSYSGYVSGVKMLGSTEVQL 
FPRKCEMKSCSVWPRLECSGGISAHCNL 
CLPGSRDCPASAS* VAGLI FY/l FLVEM 
GFHHDG/RELVLNS*FQVIRPHQPPKVL 
VLKV 


6566 . 


20467 


A 


6620 


193 


282 


TPGLK*SSCLSLQNSWDCIWDYRRLPLY 
PVNCFCC*FVCFVC/lFVEPGSCIAQYC 
LELLASSNSPMSASQSTGI IGMSHGACL 
ILT 


6567 


20468 


A 


6621 


383 


8 


LNDTVS TFF F * DRI SL \ CPS WS AVARSQ 
LKAASTSWVQAILPPQPKTNLLGSSDSS 
TSA* DQSPGFK+ SFCFRLPGSWDYGCVP 
PFSANF*IFCRDKLSL/WAQAILSPWPP 
KVLGLQALWEAE EGG 


6568 


20469 


A 


6622 


395 


42 


HVS STS FQGDELHLRIVALPLP \PLKNL 
PLPLSPRSL*C/HSRGWTVLPPSRLTAT 
SLPD/SSCLSLLSAWDCRRAPPHLSGFC 
WSAVA* SW\SLQPPPPSRLPWPPKVRRL 
QPLPSRHPV 


6569 


20470 


A 


6623 


3 


327 


PSPPSRWNPKDVPPNQENLGI FFFFL* T 
RDLATLPRLVPNS * TQVI /WPPKVLGLQ 
A 


6570 


20471 


A 


6624 


395 


47 


KNRALFFFPPPRKRGFFPKPLIWGTPRF 
FPPPPF *NPPP YFFFGAPKKKFFFS /S P 
PGGKIGSFKRPPPFFFFFKQNFFPVPPG 
GGPRGDFGSLQHPPPRVKGFSCLAP 1 1 F 
FSSF 


6571 


20472 


A 


6625 


100 


267 


LSLSKKANISKFWLGT\VAHAV\DPSTL 
GA*GRWIT*VQGFETSLGKIVRPCLYFC 


6572 


20473 


A 


6626 


409 


126 


PSSSFSSSSSSSSPSSSSSSP/PFFFFF 
FFFS FW* KWGVSLCGPGWSQTLGLKQSS 
CLSLPKCWDYRNKPIiHLAFLASLILFTF 
QEWDITD 


6573 


20474 


A 


6627 


506 


323 


REAPGTLI PCWQKYDMIKAQLVWQFLRK 
PNIYLPYNPAIPL*HTCPRKINTQYT/C 
FITKTCTRMFI AALFI IPQNWKQPT* P F 
CPFTIISHSCSPRDNWF 


6574 


20475 


A 


6628 


330 


117 


DVCPRKI YTLVLQQLCI \RMFTAILFS I 
AL* FCKPMCTS ILEWIKKL* S IRTTEYH 
LVMRRNKPiiLYSMPQM 


6575 


20476 


A 


6629 


396 


139 


FFSPPEKKGVFPSP*NLGTPGFFPPPPF 
KKRPP * FIFGAPKKKK/HISPPPAEKFF 
FFKRPPPLFFFFFFFFFLFFFFFSAFKL 
KV 


6576 


20477 


A 


6630 


79 


405 


VSGRNTALESYLLASSA\RNKCSSNCKI 
KKMTARLDVLAHR CN PST FGG * GGR I C * 
AHKFETSQGHI I RPCLYKKKKKGRPFKR 
TQFS PARWQGNGFFWGTLKNSRAGF 


6577 


20478 


A 


6631 


2 


417 


SFIFPCKFLNQFISFYKNVCHDFDCNCV 
ES KDQLG / WPDAVAHTCNHNTLGSQGGW 
I A* AQELKTNLDNMAKPCXiY 


6578 


20479 


A 


6632 


74 


417 


KTVFRYNGWAL*ASENRISLADLLLPSF 
HVLLCLPKAG\PNFPLPSS IKKFFDLPD 
LIRSHKTHISEGVLFPFP/RRKEPRTER 
PRRI + TDRSCWVSSLSLIVLDHAFFVQS 
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QFYM. 


6579 


20480 


A 


6633 


740 


776 


CSFFLKKQTRSRAAVQAGVQWCHYSSLQ 
PSPPDLKQPPASASQTV/KD*GAYFWCC 
FFL F LKRWGL / NS WQAGVALIj PDL P VV 
RRLVIPLSLWAVEPALWAAAQL*LI*/SP 
CPLASQSPRHQTGQSPKQKCFPDILGLH 
LALRSSFSPGLERPTLRLPPGLASFHT* 
ATWLSWPLPYK/ PPKRVKMYLVKADRPE 
PFL*GTRGRKSVMQOTHAKTLQETLYSG 
PEAQAAASVTDEF 


6580 


20481 


A 


6634 


2 


296 


FFFLRQSFALWQAGVQWHDLSSLQPPP 
PRFK*FSCLSLPSGWD*RHAPPRPAIFV 
FLVEMGF \TMLATMV\ * I S * PGDLPTS A 
CQS AG I TGLS YRARP 


6581 


20482 


A 


6635 


407 


212 


GRPR*TPPPLGKKKKGAPPPKTKKKKNF 
LGGRDSYSQKKPFFF* RKKTRPGGGGPP 
GIPPPLKG*AKKKGLPRGPKFPVT* /NK 
PPPPPPGGNNQNPFPKKKKRKSTR*I*K 
IRISRFILYFFFFFFLERGFGCSPRGGG 
GGAFLSYGKLWPPG 


6582 


20483 


A 


6636 


20 


223 


PCKKLiPQPLQPSAT\TLISQQPLTLRKE) 
LLPGKRL* LAEGSDDSYQFIiVAKYFKIK 
GLSFGRNKLANI 


6583 


20484 


A 


6637 


866 


647 


FFLDRVL/ LCHLGWSAVMRS WL/TAALTS 
GLKQS SCLSLPKLRDYRHEPQHQANNIS 
♦SSWSDSNMQPRLKTTL 


6584 


20485 


A 


6638 


398 


26 


KGAPPPNI FFFFLFNI FFFFFFFFFFFF 
FFPINFLQCLLYYRALHFNKVKVINNF/ 
PLNFCAYYHPL*NM*FSYISFQNLYKFF 
LI/FGVFNLSGIYFNVL*IANF**G*LI 
LPGT I YLKLC Y FTTN 


6585 


20486 


A 


6639 


1 


341 


NWDYRHALPCSANFFILVETGFCQLPRL 
VLNSCPQVSHKA\KFLFIFFFLFFLEEG 
GFVGPPRLEYREGSLQP*LPKLKGSTCL 
TLPKG*N*RGEPLPPARSSFFFISKFFI 
ET 


6586 


20487 


A 


6640 


391 


118 


NAKFEKSFSMGKIL*NSLAWYKKILCD/ 
RKNRSMQKTFLD* * YYFKKLPQPL* PSA 
NTTLIGRPALTSRQGLPPAKTL* FAEGF 
DPMILSLF 


6587 


20488 


A 


6641 


352 


285 


KKKKKGGGGGAYSGIPTF/CFFFLNGVF 
LGPAGVQWQNQRSLQPPPPGVKLFFFLS 
LPSH*GPRHGPPRPGiMFIYKNF/CFF/V 
RDGVLLHCPGWSQTPGLK 


6588, 


20489 


A 


6642 


433 


64 


TGNFFLDS LFYFNPGPLGGGG I LKG FFS 
FFFFLETKFFFFPPAGGQWRYFSSL*PP 
PPKFKQFFFLSM/WPRPSKFFFFFFGIL 
VEAGFHRVAQAGLKLLSSGNLPTSASQS 
ARITAKSHRTVF 


6589 


20490 


A 


6643 


424 


11 


PPSIFLAPPPSRGGGPPPPPKKK/QSPP 
KKKKKGGGGGPKNPPPKKKGPPKKTTGF 
KKNPPQKXKKKNSPPKKKGGPPANK*KP 
PPRPQPFYFLTFY YKI FIYI FLLKFLLF 
YILFFFFLKSSKQGRPLAI 


6590 


20491 


A 


6644 


411 


2 


PPLRIWLASPPSKIF*PQPI J HI/YFSPF 
PP*KFFFSLKPLIFLGGFFPKFPPPKKS 
FFPKNPQGVF*NPP*KKKIYFSSPP*NF 
APPKIFLKTPPPFFFFFFFFFVEMGSRY 
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K=Lysine, L^Leucine, JVI=Methiomne, 
N=Asparagine, P=Proline, 
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W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














IAQARTRERTRGRTRGRTRG 


6591 


20492 


A 


6645 


34 


381 


AKLGTRGAAKQEKEIKVTRI*WKEIELS 
QFTEYTQKNL/NVTDKLL*LVYMFSKIT 
GYKVNLKNINCIFHKHTNQQQKILKMPF 
1 1 A* K/ T I K* L * INLTKDTQDLYAE I YQ 
TLKAEL 


6592 


20493 


A 


6646 


17 


448 


HFYAKLGTRASLSCSGLHPV*ASQPLYL 
PTQASAMADAPFPARLPPNSLI S YCCTS 
NEQGSVGVGPAEPGAGYNLV*HLLRPLE 
KHS I * VAVSRFSHYSLSRLPLARKG\NP 
QTPCASWVRQCPTLLRLALHGLHPLSKQ 
SQGD 


6593 


20494 


A 


6647 


462 


336 


SLRLPGSNDSPASAS*VA\GITGPCHHA 
RLVFVFLVEMEGLGS 


6594 


20495 


A 


6648 


447 


62 


KDDIWIAITHMNRWSISIjAKRKM*IKIT 
MKYP YTS I KMA/ I KQTNKN * KKS S S I KC 
W * ECRTMRTL IHCW * * HITG* PLWKTW 
KFI I KLNLYLP YDPA\TCTQMFTA/ ALF 
IIIKY*KQSKYPSSGYCVPA 


6595 


20496 


A 


6649 


3 


1492 


LAYARAFLDDTNTKRYPDNKVKLLKEVF 
RQFVEACGQALAVNERLIKBDQLEYQEE 
MKANYREMAKELSEIMHEQI /CPPGGED 
ERLTEFPSHLQRHQWDSNKHNGSRDDQL 
VFGRV I TS HGP C VG T C FV I C KLRMLS KA 
NHWGDRAQ / ERTKGKGRER K* RTTLFLN 
R/RFYRSWKKVHI FF 


6596 


20497 


A 


6650 


495 


170 


GAAEFPASQSAGIAGVSHHARLEEFFSL 
DSGRVMRGTEVLKYLGGYTITI * S IFFF 
FSFF\WPGLGSLQPLLPGFK\LS3jLSRW 
DYRCPAPHLISL 


6597 


20498 


A 


6651 


9 


222 


QTTRPERRNSGDPTHLSL*SSWDYGHVC 

hhawlif*k/cfsggiallprlv*nswa 
qavlppwplkvlglqv 


6598 


20499 


A 


6652 


241 


833 


kktplamwnpwpgp pqi phpltklvqma 
g/qengclsghdlesirpaglhnsarge 
vlglssslgkelvflqeeldlseihipe 
aqevemasghfaflhvpvpdgrapycka 

SLSASSSLSPTPPEDTAISSLRPPSAPE 
MLTQHGAQEQVEDHPG/ Q*QPSPHSQSR 
PSPQKDPQALVIASLRSRTERKQASHGG 
TRPG 


6599 


20500 


A 


6653 


1 


398 


F F FD IGSHSLTQAAVQ * HDKGHGS LHPR 
LPGFK*SSHLSFPES*DYSRHTPPRPAN 
F/EVETEFLHVPQAGLMRCKLLPAT/AT 
SKSARITSVSHHVQPEVSIKW*NNISNL 
I KCFLLFYKYKC I F INTKFVFTV 


6600 


20501 


A 


6654 


489 


282 


PAS / WDY / RHMP PLLANACAFLC * RDFS 
MLPGLVSNWWARVI CLPWPPRVLRI QAR 
ATTPG 


6601 


20502 


A 


6655 


953 


908 


SQSCSVS \RLECSGAISADCNLCLLGSS 
YS S TSAS * VAGTTGARHHTQLI FVFLVE 
MGFHHVGQPGLELLT\ *VIHLSRSPKVL 
GL* APS SARGQNS VSQ 


6602 


20503 


A 


6656 


412 


44 


R I S ERRKS LGAVHTKDKKYSSSRVRTSN 
PNCARSGASSKQSFLFGKKRAPHPGIKS 
F\KDALLWDGITGVHHHA*QIFVRSRE 
R* IRPVI PALWEAEAGGSRAQKIETILA 
NTVKPRPSLLRA 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=G!utamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, ]=Isoleucine, 
K=Lysine, L^Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S^Serine, 
T=Threonine, V=Valine, 
W=Tryptopban, Y=Tyrosine, 
X=Unknown, *-Stop codon, /-possible 
nucleotide deletion, \=possible 
nucleotide insertion 


6603 


20504 


A 


6657 


3 


241 


FFIFLEARSHSVIQVGMQWCDHSSL*TQ 
TPGLKES \ PASS FHRAGITGVSHHTYLF 
L*GH* SHSQGPHPHDLITSKRPCS 


6604 


20505 


A 


6658 


467 


42 


FFFFFFKQGFGLFPQLEGKEGI PGPCNP 
CPPGANDSPPSTPLGNGGPGGPPPNPGN 
FGIFGI *GGEGGF*TPGSGKPSGPGSPK 
\ I P FCWVIRERPHGPPPRPFFFFLRQSL 
CFAQAGVQWCDLGPLQPPPPRFXQFCLR 
NS 


6605 


20506 


A 


6659 


2 


286 


KNLKINWT*WLMPWSAT/WGGSLKPGR 
LRLQ*AVIAPLHSSLGDRARPYLSVSQY 
LYFSLSLSLSLSLTHTHTHTHTHTHKIE 
LLGCVAEI IHFF 


6606 


20.507 


A 


6660 


3 


249 


LSPRP*CGDTISAHCNLKLLGSSSSPAS 
VSQVAG I TGACHHAWLI FVFS VEMGFHH 
AGQDCLHLL/NLVIHPPQPPKVLGLQA 


6607 


20508 


A 


6661 


3 


1507 


PEAPVPFLDSNQENDLLWEEKFPERTTV 
TELPQTSHVSFSEPDIPSSKSTELPVDW 
SIKTRLIiFTSSQPFTWADHLKAQEEAQG 
LVQHCRATKVTLPKSIQDPKLSSELRCT 
FQQSLIYWLHPALSWLPLFPRIGADRKM 
AGKTS PWSNDATLQHVLMSDWS VSFTSL 
YMLLKTKLCPYFYVCTYQFTVLFRAAGL 
AGSDL ITALIS PTTRGLREAMRNEG I E F 
SLPLIKESGHKKETASGTSLGYGEYVIK 
ITL/SSSTDLWTHEIDLHRNSL*NRDSN 
CSNF/LREQAISDEDEEESFSWLEEMGV 
QDKIKKPDILSIKLRKEKHEVQMDHRPE 
S WLVKG INTFTLLNFLINS KSLVATSG 
PQAGLPPTLLS P VAFRGATMQMLKARS V 
NVKTQALSGYRDQFSLEITGPIMPHSLH 
SLTMLLKSSQSGSFSAVLYPHEPTAVFN 
I CLQMDKVLDME WHKELTNCGLHPNTL 
EQLSQ I PLLGKS S LRNWLRDY I YNWRS 


6608 


20509 


A 


6662 


326 


1 


IMIFLFFQTGSRFVA\RLECSGTITTKC 
SLNLPGSSNP\PTSAS*WTHCRTRVIL 
NFSVDPGSRHWKGGLELLDAILKPWPP 
KVLRTRGSTPEFRSGGSGVRIRGFN 


6609 


20510 


A 


6663 


414 


105 


QGKVGFKVGLNPLSRLLHFFKQKLFVLS 
*KGQVKKLSLKIPFPPGGFFSPPI*NPP 
P *NKPQRFKNPFMG FT/ IRKKI L FLAFK 
STKSGG*NPPSK/HKKI*FSF/CFWETE 
SRS VTQAGVQWHYLSSLQPLPPRFK* FS 
FCKPHKRVFKPLGLVLWGG ISNRRRKKT 
SGGKRNFKA 


6610 


20511 


A 


6664 


1 


407 


GCVPWPAAREAEAG * IA*TRET/RLRW 
AGIVPLHSSVGNKSETLSQKKKGAPSKK 
SLQGVALFLSQLGKLPPFLQRKKRPQRG 
GGRGQLAPGYNPPAGKTLVGPGPHLSFC 
SKKAWKELRPWKKPRGFWFLINST 


6611 


20512 


A 


6665 


1 


280 


FFFF*HRVSLCCPGWSTDLHLLPP*LAR 
FKRSSHLSLPSSWDYRHTPT/HPGFFVC 
LFVFCRDRVS FVLPRLVSNFWAQAI LLC 


6612 


20513 


A 


6666 


408 


89 


WLPNSLILPN*NYFVSYLSDRWSLCRP 
GSTAVAQ * ELTA\ A * TLGLKRS AS FSHS 
HSCSLPSSWDYRCTSSHLANFFFKRWGF 
TILPRLILNSWPQVILLPQSQRH 
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Amino acid sequence (A~Alanine 
C=Cysteine,D=Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, K=Arginine, S=Serine, 
i— inreonine, v— valine, 
W=Tryptophan 7 Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


6613 


20514 


A 


6667 


484 


208 


I TWSAAGL IHYSFLNPGE PMTTE KYAQ 
EIDD IH *NLQHLQPALVDRKG P IFLHDH 
A*PHAAQPK\LQKLNKLGCRVLPHLPYS 

PHT.T.PT.T 
irXJ U li itXj ± 


6614 


20515 


A 


6668 


495 


227 


PAFVKVPPEKDQRQNSTHAGRNHLPMEI 

HCNLKLLGSSDPPTSASDYRLKPWHLAP 
SSFSMS 


VUlJ 




j± 


oooy 


Z 




PKVLG * QAGLGT PGLK 


6616 


20517 


A 


6670 


403 


224 


ILL*P*PPGLK*/FLPSSWDWRRASLCL 
SQSAGVTGVSRPGRFIiIRTVL 


6617 


20518 


A 


6671 


406 


162 


RSYAQAGTQWLDFSSLQP*PPGLIKQSS 
HFSFSQSASK/RHPDNYCI PCRNRVYVG 
QAS LELLGS CDL PAS ASQ SAG I TGS CL 


6618 


20519 


A 


6672 


453 


32 


WSATLHTSVWCACATRCVGMCMTLGV 
UGCECVvV*GCVCTS VN* \HTSTGWSEK 
V*LCVCGCV* LCVAEC3VIAFVSRQCVCDP 
VCCLPEKMRLSPISVA*VCAPMKVSPCP 

OAT CT?T UT T CTMTVTMVr'UOT OAMAMT T? 

L 


6619 


20520 


A 


6673 


378 


180 


NEVIRSYKVSLFSPRIEKWINK/ I/KWY 
I Y PME YDS A I KRNE VblHATT * RKLEN I 


6620 


20521 


A 


6674 


378 


94 


P VKN RRTKELH I KD VCTPML I V I LFM I V 
K/S*KQPKCP* IDEWMNKMWYTHMVEYC 
S VS KKEGSPF I C YNEDITLNE I S HKKTN 
IYNSTYIRYLE 


Oozl 






oo75 


I 


"3 OA 

380 


VLbJj*VIjCRHMSCFQNCSL*r SF SFWFF 
WRQCLPLSPRLECGGPMSAHHNLRLPGS 
S DS CAS AS L VAG I RGACHHAR V I FVFW 
EMG\FHCVGQAGLELLT\*VICPPWPPK 

V lA?Jjy v JjA 1 v lr L*y F ri X 1? 


6622 


20523 


A 


6676 


424 


135 


SHRWGVKCFKYN I FFFRD \RV+ L CCTGW 
S*TPELK*SSCLGLPKCQDYRHETLRVS 
Y* F * NS I * Q IG VLAMVSKL VLNS * AQAI 


6623 


20524 


A 


6677 


408 


91 


HTHFVWPPVL IP** PMR* VLLLTPNTNF 
FLRDRVSPCHPGWSVEAQSQITAASN\Y 
GLKQFSHPSVPKCWDYRRELPCMVSLI I 
FIYVHLHMYVHARKEKPEYPL 


6624 


20525 


A 


6678 


1231 


217 


PPYFNNPLTFPQKRFFKGDKTLKKWL* I 
REI I LQNRGIG * KTDS IR*MGKTEAPLK 
EKFLLGTALQPSFLPYMLGHFKEE*GEE 
F PGTHS WP FPS FPVWGLGLSVNHRGIHG 
PRVSLQTWGPGQEAMECECGNHGTEDTG 
\GAVLQPG*VT*GFHTPTSEPGLVAPPC 
TCSGLKAEA\GFLKLPGKQG\SEGAIjPP 
* SPLPSACFPELGLRRLGSSPGLMIHPG 
WGLFTSPLiGLKIIjGPRHPAGPAIjPQIAQ 
RPSLKFR *RRLGWGPELLPAQTCGAQSL 
GPSTAPLLGQGPWGLG*HASRWGTLGLA 
VAPGNEVPSLSPR* PRGAGPLHRHRVPV 
RSWG 


6625 


20526 


A 


6679 


1 


178 


RGWDYRRPLP/RPG* CFVFLVEMGFHRV 
GQDGLYLLTL*SSQSAGITGVSHRARLQ 
PPIF 
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Amino acid sequence (A=Alanine 


NO: of 


NO: of 


eth 


NO: in 


beginning 


ed end 


C=Cysteine, D=Aspartic Acid, 


nucleotide 


peptide 


od 


USSN 


nucleotide 


nucle- 


E-GIutamic Acid, ^Phenylalanine, 


sequence 


sequence 




09/515,1 
26 


location 
correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


G^KJIycine, KNHistidine, I=Isoleucme, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, ^Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaJine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \-possibIe 
nucleotide insertion 


6626 


20527 


A 


6680 


3 


135 


KSEHLEREPIIEQLSTMFF/TAKYGWHL 
RGH * HRCQKKP E P P EDR 


6627 


20528 


A 


6681 


425 


107 


KGPPTPS/PPPPSPVFPTEKIIjVPPPFF 
PPPPFKKPPPQNIFWGPKK/IKNFSPPR 
GEKNFFF*TPPPKYFLGAKKKKNFSPPR 
GEKNFFF 


6628 


20529 


A 


6682 


424 


284 


CFLQPGP \ PGS KGAS PPS FQDNWGLRQA 
LPP P /RES FFLFLFLFFP I VQTG FHHVA 
RAGLELLTSSDLPASASQSAE I TGVSH* 
A*PFP 


6629 


20530 


A 


6683 


1290 


449 


PLSAPPPGDPSP*KSSPRAPNLPAP*PP 
NLS PWPPKCPRPRR * PASS / PP * LPVP P 
KSPS PPV\ PWATEASRGHQPQ I T*ELG 
SLVLRMTEKGTWG+GIPDSIMPLPRRRS 
EPRTCRE*GDTRWRWRRRRRSTGAVSTF 
SPGFACGGGSGA/ PAHPGEQKLGASARP 
DE/ PVCPAAWVLLPSARWAS PAAAPPP 
tiPSQPVH*RFKP\S?PY*PP\ALPPLSP 
FEPSAPPHTGSVPY*QGVPP/SEGCPST 
SKPIiISVRPPVTINGCIiPQSQPPKTHTV 
SRLYEEN 


6630 


20531 


A 


6684 


425 


40 


EWASLKN* HPP/GANQAVEAVEAPTHCW 
*GCKS IEPLWKTVWQCLDKLNI CLLYDS 
TPE * MSEHVYQAACKKSFTAVLFMI PPN 
♦KQPSCPSTGWLGQAWWLTLVI PAPWEA 
KVGGSLEPRSLRPAWATG 


6631 


20532 


A 


6685 


2 


107 


SWLTAASTSQ\VKRSSRLSLPSSGDHRC 
MPANFFFLKDDVSLLPRLISNSWTQAIL 
LSLPTKGLEL*AHCSLNLPGSSDPPASA 
YQWGTTGACQLIFFF 


6632 


20533 


A 


6686 


12 


224 


TPPGSTHASGAPPCLANIKFNY\Xi*RLP 
RLFWNSWPQVILLPWPSKVLGLQVWATV 
PSPFLIFDRDIDREI 


6633 


20534 


A 


6687 


404 


2 


TAARG 1 1 YGGP I KKLLPRKPADVK/ SGS 
FKTAAPFFFFLGTRTLTEFLFI IIQN*K 
QPKRPST/DRINKLWYISTIEYYSAMKR 
NKLL IHTTAWMNYKG XMVSKRGQSQKS I 
FSRTKTGFGRNHISGCFLYPLPLRL 


6634 


20535 

■ 


A 


6688 


410 


163 


PRGNTPPGFSPPEKENQKVFNPPPGNIV 
*KKGGPPPQGAQKKGPPY*GPPLSFkPP 
KKKKKKGPPPPPPPKKNPPKGNS/SFSP 
KKKKKFFPPPPKKKGF*GGPPLKTLFFW 
GGGKKFFFFFWGKKEFPLGGFFFGGGGG 
GGPFFFFFFG 


6635 


20536 


A 


6689 


205 


405 


KSPPSPGGPPLIGKLEIGGAPKPKNWPP 
PLGPKGENPPIALGGGGGAKNHPPNPQF . 
ILLFLGGKNP*KSPPSPGGPP/ANRKIG 
NRRGP KTQKLAPPLGAKGGKPP PCPRGR 
GGGQKPPPQSPVYS PFFGGKKPL 


6636 


20537 


A 


6690 


3 


199 


HGSLQH* PPGFKSSSSL5LPSSWDHGHV 
QPRPAIF/CFFLILEKRGVSPGFPGGPE 
TPGLKGLDLL 


6637 


20538 


A 


6691 


S 


350 


S VCLGLP/ RCWDYR YEPPHLA* LAFLCQ 

I INNV\G*AGLELLTSGDPPTSASQSAG 
I TG VSHCTH PYC YTF LTREI TTP KE VK I 
GLCM 


6638 


20539 


A 


6692 


373 


197 


LPSSWDYRFMP*HMLFFK\FLFLLFCRD 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E^Glutamic Acid, ^Phenylalanine, 
G=GIycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine; 
T=Threonine, V=Valine, 
\V=Tryptophan, Y=Ty rosin e, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, V=possibIe 
nucleotide insertion 














E/SLTMLSRLVSNSWAQATLPPQSPKVk 
GLQV 


6639 


20540 


A 


6693 


369 


100 


PMLARMVS IS* PRDPPAS ASQS AGITGV 
SHRARPIYFFK*R*SHAGYSAVPVHRHD 
PTTDKHNS FLFCFLQTGSCFWQVGMQW 
HDHISL* PRTPGFKQYSHPSLPRH*DYK 
PTPP /HPS* FWQ VGMQWHDHI SL 


6640 


20541 


A 


6694 


12 


182 


FLS S WN YR C V P PHLAN* KKN F F F YTDR / 
SLAVLPGLVSNS * TQAVLPLQPPKVLGL - 
QA 


6641 


20542 


A 


6695 


402 


248 


SSWDYRCAPPCPANFYKIL*TGS/LLC* 
PRFVSNSWDQVILPPWSPKVLRGV 


6642 


20543 


A 


6696 


303 


309 


VQE F * DR I LLS P R CHPGWRAVA * S RLTA 
I SASRLKARPSHLSLLSSWDYR* VPPGP 
ANFLHFFCR/DRGFAMLPRLVSNSWTQV 
ICLLRPPSVGITGVGHRARS ILLD 


6643 


20544 


A 


6697 


3 


387 


PCLANFF/ VFFVETGTHYAA* CGLRLLG 
SSALPV*TS*SAGIIGMSHCTC/LQITL 
LKTESHS \ VAQAGV QWHDLGS LQPLPPG 
FM* FFCLS IQSSWDYRRTSPRPANFCI F 
SRDEVLPCWPGWS*TPDLR 


6644 


20545 


A 


6698 


1 


287 


LWFLYQGNAGITGI SHRAQPNS FVFFKD 
- RVULCCPGWSAVA\HSSLTVALTSQAQV 
I FLPQPPG * LGNFCI FSRD / RGFAMMPR 
LISNSWAQVIRPP 


6645 


20546 


A 


6699 


412 


3 


FEME S CS VARLE CS GT I LAH CNLHLLS S 
SSSPVSASRVAGTTGACCHTWLFFFFFF 
V/FLVETGFYHVAQMGLNNWAKAILPQP 
P*VLGLQARITTSSPNSSFTREHTSGSG 
VG YPAKFHS I VLLLLYLLFHLLCGVF 


6646 


20547 


A 


6700 


393 


88 


QKLYPYLTPYAKVN*QWIKNLSVTAKNI 
* FLGENTG VNLHS FRF I KGFLDM / TPKS 
ISTKRK*KIKWSSSKL\KNFYALKNNIK 
KMKRQPTKYEKIHLTLNNWV 


6647 


20548 


A 


6701 


494 


239 


KAGSHS\VSRLECSGMISAHCTFCLLS* 
NNPLTSAS IiAAGTTGVHHHAWLI FVFSV 
QMGFCHVTQAWVLI FDCGFESPINIKYN 
LL 


6648 


20549 


A 


6702 


2 


379 


RPTWVTWQNPASTKKKKNFFFFKTKSPT 
VPQAGGQGGNLG*RKFLPPGLKRFSGLK 
RPEAPKGQSCSLMGGGLCPLLPP/WSGT 
PPQR 


6649 


20550 


A 


6703 


220 


47 


TGFSGSQIN * EK/ I WLGTVAHTCDPSTL 
GGAGGQVT* GQEFKTS RHEPLHLTVLCS 
LF 


6650 


20551 


A 


6704 


80 


414 


ICYYFIVFAIYLNQFFSNCYRSFTFFFC 
PFETESRSAAQAGGYNPTLTLLHPPPLG 
IKQFSCLTLPSS\WARS*KPPCPTWGNS 
VFVGI YKWKLGGGGA IRRVQTCNQT YSG 


6651 


20552 


A 


6705 


69 


416 


NVSKRNNIHNIKKTPVYFKGFLLYCCKA 
EVFYVLDNPFI FYFLFYFFETHCVAQAG 
VQWHNLRS LQPSHP \S S SDYRCAP PHLA 
NFCRDGVS PCWPSW/S *APGLKLIHPPW 
SPKVLE 


6652 


20553 


A 


6706 


373 


75 


HAQLTFKFYYFRNTFCKAVASTKSNSSD 
RSE * S KLKT FW KGLT I LDAI RN I S DS L\ 
EEVNMSTFRGV*KKLTPI IMDDFEGFDF 
TRESNHRGARHGGSRL 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P= Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V= Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, V=possible 
nucleotide insertion 


6653 


20554 


A 


6707 


331 


379 


K* KINK3SJ* KGLT I LDAI KN I CDSWEE VN 
IST/LTGVWKKL/lTLIDGSEGFKISVE 
EGNAGWEIARELEIEV/EPEYVTKLLQ 
SHKKT 


6654 


20555 


A 


6708 


578 


152 


LQQHNPGSLQPLSP\GFKRFSCLSLPSS 
WDFRHAPPCSANFFVFLVERGYHHVG*A 
GLKLLASSDSPTSASQSAGITGVSHRAQ 
PL I GFKVNS VKKS VS VPTRR I LKGNMSM 
YEQL 


6655 


20556 


A 


6709 


392 


146 


FFETESLSPGLECSGTVSAHCNLGSSDS 
PALPS /RLLGSSDSPAPAWLYSFKL* I 
Y/FHKTCNYVLKINILLHVAI PYSFSWS 
C 


6656 


20557 


A 


6710 


9 


393 


KSKPCN*DKSQSYRL*LFAQLFTF/SNF 
FYFLETF* ESHCVI QAG IHSGTIMAHHT 
LKLQGSSNPPASAPPSSWDHRCGPPPLA 
TFLKI FL* RQWAHYVSQARDNGLIMFPR 
LVFELLGLKRFLLPQPS 


6657 


20558 


A 


6711 


34 


387 


QDFGFLSFFLLFFFEKGLGGPTCRAGPE 
SNLME \GGPPGLKQ FSGLTLPRTGNNGR 
ASRSRAKFWN* KGKGGFPGPPVELETFT 
LRG P PHLSSPKGKNNGGGPKPDFFKKVG 
KKAPNP 


6658 


20559 


A 


6712 


1 


188 


I OAW W Y TP WPAT W EA* AGGLR \ E PRRS 
RLQ*AVIVVLHSCLGDRVRPCLNNKKGS 
FPPQPLW 


6659 


20560 


A 


6713 


377 


98 


SLGLKQSSHLNLPSSWDYRPVPPCPRNW 
FVPVP F FF\RE * GLAMLPRL VSNS * AQV 
ILPPQPSKVLGLQARTTAPGFFSHLLKT 
LVPNNVNLNI 


6660 


20561 


A 


6714 


353 


229 


SSCLGLPKSWDYRHEPPHLAYFYY* *RQ 
ALV/MLPRL*Y*FLDVNM*LPLPFHVAP 
SHLLNSVGMV 


6661 


20562 


A 


6715 


371 


64 


ENHGDCTPAWATRAKLRFKKKT/ CYQMF 
ITTLFTIAKRK*LKCPSSDK*INKMWYI 
HMME Y/ YAIKRNWTLVKLNQNPYAL* KV 
LURE * KQNP* S VRNI FKTHI 


6662 


20563 


A 


6716 


462 


232 


HFNLITIKIIPFF/C*DGGLTMLPRLVP 
NS WAHAILLPQPPK* LGLQARACKL 


6663 


•20564 


A 


6717 


425 


48 


GFGGPPFKKARFGFLHPF*TPPVLRPRP 
PPVFFGGFKASPPKKKTPK/PPGGGGPP 
P YFHP FFLSPFFP I FFEKEPPPFFFWRG 
LGPPPPPFKNPFFFFFWVEGGKFLPQKK 
KKKFFQS I STRSVIHK 


6664 


20565 


A 


6718 


3 


413 


GTFLTKTS LCGTLL Y * VI S YMCV * FI S * 
\SYLVYIIYFKDICV/LLYMYIFYIYTC 
VHISVCMYMYVCMYVCMYKA* PANLKTF 
CTQKHLAWGWVQWL I S VI S AL WEAEARE 
SPEVETNLAQLGPTLSLPRKKKKKKKK 


6665 


20566 


A 


6719 


408 


71 


SSWDYRC/LPPGPANFCIFSRDGVSPYW 
PGWSRTPD * RSLPGGDVMPALVIRHVPE 
LLKKQHSSVRSSRPAWPIW 


6666 


20567 


A 


6720 


375 


57 


S VI REMQ I XTTM K Y PF I PTGL I V I KKTE 
NN/RGWQRCREIKACIY*WWGCKMTQLL 
QKTIW*FLKKLNIEL/PYDPAIPPLGM* 
LRELKTCLPRLGVVAHACNPSTLRG 


6667 


20568 


A 


6721 


391 


164 


RISETCT * MFPAALF I V/ DQNNPDKWTN 
LMWSYTHTIODYSAIKRNEVLLHSSTWM 



959 



WO 01/64835 



PCT/US01/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
VSSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
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N=Asparagine, P=Proline, 
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T=Threonine, V= Valine, 
W=Tryptophan, Y=Tyrosme, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














\ENLENLMLSKRGQSPKTPFG 


6668 


20569 


A 


6722 


381 


278 


SSLQPQTPGLKQSSCLSLSNS \PPRFID 
* SISLYRQGLTILLRLLSNSWSQAI VLP 
GP P KALG L* AS KKFLYSN I 


6669 


20570 


A 


6723 


2 


363 


QRSLEHRPCKKC/CGLW*VS* KVQYLV 
YKESSSFKNLLRRIHCNFLFSGIVFHF* 
KLLRTPRSFCLCGLHI P INFT I F* I ki e 
KFKKC* PGTMAHPYNPSTLGGLGGWIT* 
GQDGLDLLTS 


6670 


20571 


A 


6724 


352 


58 


HGGAIGARFFKVFNLI *FNLVCKDE / S F 
SLLARAVINSWPRVILPPWPPK\WGFQ 
AKT PL PRLKG FLE S LPS LKKKKNLAYAR 
TITAQIGLLVCFIDQQ 


6671 


20572 


A 


6725 


383 


54 


KSSPFFFFEQGLLCCPGWSAWSPLQPK 
CPRLKQFSPLS / LPSN*EYSCTPTTFSL 
HVCVSIHKYI*YIDQYIFKFF/ cr/drt 
LAMLPRLVSNSWSQAILPP* PPKVLGFQ 
V 


6672 


20573 


A 


6726 


3 


343 


LEV * ARATRQE * K I NGT * I 1 KYKKVKQY 
SQNDMILYIQNPKDSAKKLPELVNKFSS 
FKVWRQNS WFL * NSNKKS /EKEI KKKL 
/ PFT IAS * R I KYLG INLTKE VKNLGRAQ 
WVTA 


6673 


20574 


A 


6727 


3 


615 


HTSQGNL* IQYHTPWHPPSSERVERMNQ 
TLKSHLTKLVLKTRLS WTKCL? I ALLKV 
RT VPQ KEVGLS P CEML YR LP YSHFT VD I 
PTFETKSQFVKSYVLGLSSTFSSLKAK/ 
ELFS TDATLGS S PGTS AFSLGDD VL IRS 
WEEGKLKPAWEGPYLVLLTTKTAVQT/D 
NKKMQTHHTPVKKAS PSSKS CAI VPRP I 
PTKLKIKKKKA 


6674 


20575 


A 


6728 


2 


228 


SGAI / SVHCNLHFPG/SSSDSSASAS * V 
AGTTGVHTMA* L I FVFLVETGFHPISQA 
GLELL/NHMICPPWPPKVLGLQL 


6675 


20576 


A 


6729 


98 


359 


KVE YTVLLF*GFKG KFPTLPL YLSFS S F 
LKNHL I SNFFLNR VLLCY PD WS WG * FT 
AHCNLNLSCLSLPSTW\DSRCVPTGAW1> - 
RFI 


6676 


20577 


A 


6730 


10 


428 


RSTRTDYAALREVLPDPRRCRPRTGAWS 
NGRRDRPGAGHSGPKAEAETRETLEKGL 
LFCCPGWSAVA* PRPTATFASRAQSLTV 
SPRLECGGWSAH*NF\SFHFQRWGLAV 
LRTLVLNS WS LALLLLWPRKVLGLYALA 


6677 


20578 


A 


6731 


425 


158 


SQTHDLR * STCLG P KCWDHRCAP PRPAL 
FLF / FNKC PMS I R K I ENAQ * LPG I * K KN 
FCRDDHLPLLSRLI LNSWAQGI L* CRPL 
KVLGL 


6678 


20579 


A 


6732 


431 


22 


RDRVLLCHPE*SAW* S*LTTVQNSWAQ 
GILSSI*DYSCVP\LPVNF*TFFKSVYY 
YFIIFIFEIYFCNSKALGL/LGVSHRAR 
PPRLVLNSWPPQVILPSQPPE*LGLEAL 
ATAPAI PRNSNLVWHQYS FKS I PDDL 


6679 


20580 


A 


6733 


44 


414 


AIYDLPFLLIDLYPMELRASIRAHTCT* 
KrvAl LFxVTKRLNQPrCPSTDE*QNKM 
WSL/YIMNYYPAI KRNEILIHENIMLCK 
RS QS QKAHTE * FH I CE I SR I S KSAETED 
ELWATRWGKGRE 


6680 


20581 


A 


6734 


375 


49 


QRLTYHCRFS YNTASNKMRLS * TQMHQK 
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N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaJine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possib)e 
nucleotide insertion 














sacsmcpgrl*gvhamrpkvlmrlsrsh 
/varpiaatcaq*vcdrik/r/ slliee 

QKIWKVLKAQTQSQTAKFKNSTFFRNK 
N 


6681 


20582 


A 


6735 


381 


88 


SLFPRLETPGLRPSFFFFLFSLSH\PGW 
SWV* SRLTTALNS* ARFSLLSWCVPLH 
QAHFF\ FFCRDR /SLTMLPRLVLKS WAQ 
AIPPPEPLKRSGLQV 


6682 


20583 


A 


6736 


384 


293 


AKLS LLS GWDCRCMPPCLDNC VCLC / RV 
F VCR * GQAMLPRLVSNS WTQVI LLPWHP 
KVLGLt*AQDQEPLWGRSGPGLPQAASG 


6683 


20584 


A 


6737 


24 


300 


NILSAFLCLS*P*H*YGLVIL*NVPQFR 
FLPLFFPMQMYTHIFFKIRIRPGWSHA 
YNPSTLVGQGGR\SHEFKTSLDNIARPR 
LYQVQNKIR 


6684 


20585 


A 


6738 


2 


353 


IKCW*KCKITYTLSKTWQFPIQLNIVL 
PY I PA/ 1 PLLG I HTREMKTH I YTK/ T/ C 
MQMFGWLFI I AKI VFQLVNR* IKL*SI 
HKVD YHS E/I RNKQLYEAT * LNLKCTVL 
SE*S*TQKTT 


6685 


20586 


A 


6739 


2 


336 


I EAT Y TNP * HRH \ VHTPTHVHT / H VYTH 
NGTHTHTARAW7PAPPTSLILPSPTESP 
PALQI PNLPFGLSSSPKPAGSSQGEPEE 
GGCQRRAKNRVGHRAEKWETELCAGDMN 
L 


6686 


20587 


A 


6740 


3 


385 


KKQATLLWI IHLATTFS VE I * QSK I EVH 
DIFKVLKEKKK*TTFYPKITNPGKIAFI 
HEGKI KL»Y/ IVK* QLRDFINSGLVLKAM 
LKRVL * S KRKGH * QTIRNLRVQNSLVTA 
NTQPNTEYCNTVIWGN 


6687 


20588 


A 


6741 


406 


155 


TGSCSVTQA* /CWSAVIVHCSLDLPGSS 
NPPASASWIAGTTGMCYHAQLPSFNLCL 
QFYCMGCVFSRFYLRFTFGLQSYTRKQK 


6688 


20589 


A 


6742 


I 


215 


LPYSPDLSPTD/Y/HFSKHLDNFLKGKR 
FHNQQDAENAF * KF I ESQS I DFYAVR I N 
KLISHWQKRVDCNGSYFD 


6689 


20590 


A 


6743 


376 


109 


YHMEWNLYSLKGSSYLRLPSSWDYWCP 
PP * QANF* K\ FFCRDKDLAMLPRWLNS 
*LQAILLLRPPKAEPLCLLGQFQFRRES 
GWN 


6690 


20591 


A 


6744 


382 


148 


NIPQFMFDILFQ*LITLSCPASF*SKFP 
LSSLYFI Q*GR/WIrGTVAHACNPSTLGG 
RGQR\ * A* DQELETSLAKKARPHLi 


6691 


20592 


A 


6745 


403 


209 


HIVINRI IRKYCA*HYANKFNNLDKLDT 
FLE \NTTCKNLTQEETENLNSPVS VKE I 
EQGKLCSDRN 


6692 


20593 


A 


6746 


366 


40 


QGSVLS I LNVKI VFWFKTGCVHFLFPLN 
FLSLFQNYVCSHTS / YPLSLIPHESLWS 
FPPSFNHSS*Y*HFLKTN*MPDAMAHAC 
NPSTLGS *GRWIT* DEEFETSLANI 


6693 


20594 


A 


6747 


392 


57 


APKKKGGGVYPLYPKKSGRKKKPEG / P P 
PKPKKKLKTPGRKNDAKDIKPGGAKFFF 
SPKKKRGAPPGVPPRLF* KPPQATI *GG 
PKKKKKPFQEGGKKGAPPKAPPXjFFFFL 
YF 


6694 


20595 


A 


6748 


376 


140 


LEDNI / G YSNQNAGAGRVS * INTPKIKV 
S\QSKIAKWDLIKPKRLCAAKVSMNRVN 
RQPPEWDRIFALASAD\RGFLNKILG 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

V*4*cifill0 fkf 
I cMUUc Ul 

peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 

Iao4 fimiiln 

usi jiuiuu 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, HHHistidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
i i iir cuii in Cj v — valine, 
VV=Tryptophan, Y=Tyrosine, 
X=Unkno\vn, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


6695 


20596 


A 


6749 


540 


172 


FFLRQS L / DS VAQAGVQWRHLGSLQAP P 
HRFMPFSCLTLPSS*NCRCLPPCPANVL 

QSAGITGMSHCAQVVNTLSLHAFSCDKV 
PQNDLL/WPSKSPCGTLSHEKACKDNVF 
TTWAQWLMPVI PALWEAEAGGSRGQE I E 
TILANTVKPHLC 






7\ 


O/Di) 


i 




WP i-)iiI\.ILO x KilllUoVlMl vCl'li J. w JJJjUi X I V 

SKAVAGFEKIDSNIKRSFTVGKMLSTSL 
H/CYREIFHERKSOSMCOTSLL^YFKKI. 
PQPPQPS ASATL I VNI KAKI SAKRIKLT 
ECSDDC 


6697 


20598 


A 


6751 


404 


231 


SPS/PPPPYFGGPPGFFPPPVF*TPPPE 
Y FFR A P K K K KNV P P Pfifi KW F F F RKTi P P P 


6698 


20599 


A 


6752 


77 


371 


I TCP VLYLTS RNTKTKKY YLDCCCFW P * 
WNNRDQTYTjHLFFF/CFIKKNFFFQPKR 
d APfJfi FP T MPT P P T ON P P P P P po po 
EKRAPPKTLCFFFFF 


6699 


20600 


A 


6753 


354 


169 


ETESHSVTHAGVQWHNLGSLQPPPPGFK 
RFSCLSLPKK/WFSVKLN/PNT*SFCYH 
ILKALII 


6700 


20601 


A 


6754 


1 


442 


CICRTAFELTGRATRSEKEERKKERMYD 

iV IVES i\ Jv JVE> JVlLv7 JL Jj J_j iYrvlJ Vj y ijijijXjOKJ\l->£» 

RKEGRRQTQDE FGGQNRLSTVKQGS * PK 
TfPTf*nTn \ hto APTHFPnv Fv^nNncrtfFF 

KARHDS * QPQA* RTQSRLQQEGRKLWKR 
GLVSRRK 


67ft 1 




A 


O/ J J 






FF*P WOT .TMPP Kl .VT .N^W AOVT ~ .T .PPPT . 
E*LG*QA 


6702 


20603 


A 


6756 


3 


223 


GLKQSSCFCFLSNEDCKHVPP\HRLFFL 
WP*VI PQPWPPQMIiRTjQA 


6703 


20604 


A 


6757 


418 


244 


FSLLSS WDHRYMPPRMATF/ S FVLFCLR 
FGCT.TPFT.RPV*N c ?WTOATFPO^PKT ) T 1 f; 

uwvu i i\r dimt v lion lynic rs^ori\4JiJ\j 

LQV 


6704 


20605 


A 


6758 


46 


256 


KbKNKTEARHS S S ACL* / LPG SWDHRYL 
P PRLANF * TFWRDRVSLLP KGVSNLWAP 


6705 


20606 


A 


6759 


391 


55 


FIFFFFPHSSHGVSPCCPGWSRTP\GSS 
GSPAS ASQHAG I TGRNHHTRLQDI Y * LE 
/TGQTVTLQFSVLAIDTTLTKRLSWS ILC 
CYNKIPETG 


O/UO 


zUoU/ 




O/DU 


$11 


*> 

L 


FflT?PPnFQPPPVT .TfTPPPWT.FT .nPHTf TTKT 

infppp/welnlvplkgppffffflr*g 
i^mlprlvstswaqvillpqhpkllelq 
vhamvlssrtshsprsrslsprfsqrll 

PVSCNPAOPSPMA 


6707 


20608 


A 


6761 


2 


259 


LTIA*KNFIFLGIt*QGlCQIPKDYQTSL 

R* tkdnlnk/wdtpcswtgglntvfm^v 

LSKLMYRFRAIQMKTTSGFPFFFFWKKG 
IFF 


6708 


20609 


A 


6762 


1 


169 


FFETEFRSFA*AGV*WHDLGSLQPQPSG 
\SSDSPASAALFFFFFFLKKKIFFFPPG 






A 




i 
i 


4yo 


MATCT P KH I NTQ I CVS FLLS L KLS E FEQ 
MSIFKKGENELALGTGNIRPYDSCPGNR 
P/APFATSPAPRHPSGPSWRPPTSGGVP 
ARAECPASPNPLPPGFCGQSPSRGRKEE 
* LRGESTSTLAI F* * APNRYHS F* LPRQ 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 

- 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline s 
Q=Glutamine, R=Arginine, S=Serine T 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














P * PHGD CP ARKKKKKPND PRLQ PTVT 


6710 


20611 


A 


6764 


1 


229 


VLLLCFFFFVL*DRVLVGHPGWSSRLYL 
KTKTQKKQNNI FVCRVGG\LTTLPRLVS 
NS WPQMVLS PQPLNTALGLQ V 


6711 


20612 


A 


6765 


580 


273 


WRQSLTHSVAQPGVQWHDLGSLQPPPPG 
FKRFSCLSLPSSWDYRRAPPRPTNF\ * Y 
LVQTEFHHVGQAGLELPTSSDLPTSVSQ 
SAGITSVSHHAQLDMIFLR 


6712 


20613 


A 


6766 


220 


239 


AQEFKAAASTTALQTETLCLKKKXKKKG 
GVSLGGVSSNFRTGGGASVTSPSIF/CF 
FFFF * DRVSLSGVQWCSLQ P * TPGLKRS 
SSLSLPSSWEYRHVPLHPA 


6713 


20614 


A 


6767 


89 


387 


FLSPSGSQKEHLQIQLRSTFSFFFFFKR 
EPRFGRVGGQCPNLG*WKPPPPG*RQFS 
GLTLT/ SNWERWATP PP PTNFW I FKKKG 
NSPWWKRGAKI FGPWDP 


6714 


20615 


A 


6768 


397 


223 


IREVGWI1WQHKDPCE/KWNCSVY* h* 
*WIHKPTH£)KT\HRNTHTHTHTHTGMHS 
HTQK 


6715 


20616 


A 


6769 


3 


367 


SWKVCS KRP I PS PS LFF* KHPRVLLRAD 
GT* * VS LATL * HMQLS * AGS VKEAE P RS 
NAPPHKATFLFLFFCTD/SSLTMLPRLV 
LNTWPQAILPPCPPKVLGLPARANSSSL 
SVFKLTSFCC 


6716 


20617 


A 


6770 


490 


260 


FFETESCSW*AGVQW\LELLTSSDPPA 
SASQSAWITGMSHCARRCGTHYHLCERF 
LVSFVTALLSQIiLELGLAHSRC 


6717 


20618 


A 


6771 


418 


130 


PSPRDLNLFPLNGP PLFFFFF * DRVLLC 
CLGWSAVASN\ FDLK*STRLGLPKCLDY 
RHEPLCPAMWDTLSSIj*EIFGLFCHRPS 
LPAPRTGPGSQ 


6718 


20619 


A 


6772 


311 


314 


FAVGELFLLLHHVGHACLPFCHDCKFPE 
ASPAMLP Y/ QPVEL * AT*TGI EVLTPGC 
RFQAVFGLKVGFHQGPTPVYLG I 


6719 


20620 


A 


6773 


465 


283 


CPSLPSSWDYRHPPPRPANF/*YF*YHV 
GQPGLELPTSSAPPASASQSAGITGMSH 
RAWPK 


6720 


20621 


A 


6774 


410 


91 


RDHSKLQPQTFRLKQSSCLSLPSN*DYR 
HMLPCLAGWLAGWLACQTDRQTDRQTDR 
QTDRQTDR * TDR * IDSRDR/ SLAILPRM 
FSDSWPQVNLLPQ\WPPKVRRLWA 


6721 ; 


20622 


A 


6775 


409 


253 


Li I FAFF \ CRDGGLAILLRLVSNS * PQAI 
liPPWPPKVLGLQARAPGVRPDLS 


6722 


20623 


A 


6776 


3 


278 


FFFFSLHLWAPIAFFFFGAISAHYNLPI 
PG\SSDSPASASQVAGTIGACHHVQLF\ 
VFLVEMGFHHVGQAGLELLT\ * VIHLPQ 
PPKALELQI 


6723 


20624 


A 


6777 


79 


442 


RDEDRGLERRCV SACSTPQLGNASTT * G 
CPGVSDPSLHILF/CFFKEN/ SCFAPQA 
EGGGHNLG*LEFGPPGLKQFSCLTPQRI 
WNNGGAPPPPLYFFFLKKRGGSPLWPGW 
A*TS*PRS 


6724 


20625 


A 


6778 


77 


417 


HRGLHLSFFLF/ S FLFFFLKRAALLWPR 

Trr>!~'T2/~'T>i t i»n ttvt^T't ct dphptoitat nc*P 
tiSj {jK\j W JL W ViNV_ JL Jbb l»Kowt3 J- ^ KALif o * K 

AGTKGGRPHPTLFLVF* KKTS FSQVDQG 

GIDLWARGNPPPGNLKWFELQKETLCPG 

HIF 


6725 


20626 


A 


6779 


62 


235 


IQELRIREGKCLAYEHTARNGKSQDLHA 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

etb 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cy stein e, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
I— threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 

V' IT—I. ~ 1 i "11 

A=Unknown, *~Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














GLSCLCCCCCCCC\CNCWP*STRKFFTK 
PE 


6726 


20627 


A 


6780 


908 


566 


SKMEFRSCCPGWSDLGSPQPIi\PLGFKQ 
L\S CIiSLP\RAWYYRHAPTHPANF\ * FL 
VETGFFHVS QAGLELPTSGDLPTSAS QS 
AGIIGVRHKAHPECVFQKQLSEANLRRI 
WNQED 


6727 


20628 


A 


6781 


11 


483 


TTALLKAVRH/ G *HLS LQRLLLS F \ VCL 
C PAPRGGAYRGRQAS * S CGG LHPVRASR 
LLCLPNQAWAMAGAPPPTSLPPCSLISD 
CCASNQRDSVGVGPSEPHAGYNLPVCRF 
LSPLEKRSIRVGVTRFSRCRIjSPLSLTR 
KGNSLTPCASRVSSASPCS 


6728 


20629 


A 


6782 


225 


589 


SRRSEPGNFRSCLGVGVTCCMCLSLSLP 
SRS FLRPLFLSLS VCVFVCLGTRVPCAP 
EGG FLARR P F F WS AS P RVS AWVLW PVGN 
RFPGGSGLGVCEGLGNVGI \ WSEPQGFS 
FPPHPEQPLC*AGSRRALPNQGQR\LPG 
AHRLPAGGCPQSFKKWVTPVALCPHRE 
M* PQLDAGTEKEAGKGMGQACLSLKGWP 
SWPSHPFDTPPRMPVWAWPPRILPGLW 
PLL* PPAGRNARRKKPETDRRGRSAGSQ 
PQACADGVRVSLPKSGALVMSRTGLPVC 
PWAALS PVARSRGDQNPATSGAAWGWVL 
PAVCVSLSLSLLALSFALSFSLCPSVCS 
CALGHVCPVRRRVGFLHVGLS SGQPLPA 
SLPGCCGRLAIVFPAVPAWGSVKAWATW 
ASASEPQGFSFPPHPEQPLC 


6729 


20630 


A 


6783 


458 


170 


PCIVTASASQSAGITGVSHRTRLLVNFS 
KVGS VT I SLG * EVGLRHWP /N I / GRLDK 
TG P GE K£)H F I LHAP QS AP S VYHCLGN TQ 
ELPPLSTAMISQPKR 


6730 


20631 


A 


6784 


1 


192 


FFFFG* I LPLS PRAGVQWHDLGSLQPPH 
PGFKQFSCLSLP \RG*DYRACATRDGIiI 
FCNFRLGD 


6731 


20632 


A 


6785 


3 


246 


SLCVCQYYCRCACV*VPKQVSG*VTMGI 
/CATGFVLIYLCG*QCLCG/C/C*EDPF 
YCEQIiCVHVCS CVjJGRG WS VCV CI SRLh 


6732 


20633 


A 


6786 


408 


128 


GPVPYKARG*SPFWPPGEKPSFPKNPKT 
PPGGGGAVLFPPPWRGRPENSPPPGGKR 
C/yPJU* Xr r JjPPHPGHKIQGPPPKKKK.K 
KISAEFLSW 


6733 


20634 


A 


6787 i 


I 


418 


LTILGVMKNI CDS WKEVKISAFTGAWKK 
L X PTIjTD/ E Y *GFKASVEEGTAHVLETA 
RELEVEAEAVTEL/ lqshdqt 


6734 


20635 


A 


6788 


3 


237 


FFFLT* VLTIV/ PG*SAEVQS *LTAAST 
S*\VTC*SSHLSLPSSWDYRRMPPHPANF 
Fr F r G K J^b L» 1 JLjH PGRGGP S IaPT P 


6735 


20636 


A 


6789 


282 


3 


PHPDNF \ RFLV* TGCLHVGQNGI1AI1LS P 
GY P PTLAS PSAG I TGVSHCTRPKRYF * M 
VLSNKSLNRFGLCTVLPTCLQAPRRTG P 
PPATVLGRGT 


6736 


20637 


A 


6790 


62 


328 


FFRDRI LFCKPRWSAVIQ \SWLTEASKS 

Gj\ T V+C<3T3T/^T»DK'P&Jnr , DtrC , DT HT TXKFW 
Yi \ ± J\. ooKlialir Jx^WlJwullSlr Jj^LlLJWl X 

Y* CRQNTLPI TLIGCFRVSLPYAKYTGK 
FWGAEG 


6737 


20638 


A 


6791 


67 


353 


TNKHTYFLEIiNKREKPQLTVFFFF* REL 
RGLILANGNFRLRG* SNSPA* PSKVAGI 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=A)anine 
C=Cysteine, D=Aspartic Acid, 
E=G)utamic Acid, F=PbenylaJanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, LHLeucine, M=Methionine, 
N=Asparagine, P=ProJine, 
Q=Glutamine, R=Arginine, S=Serine, 
T^Threonine, V = Valine, 
W=Tryptophan, Y=Tyrosine, 
X— Unknown, K — Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














TGPPHQAGLILVFLEKTGFNYVG/RRGP 
PPPPPKELALQA 


O/Jo 


nc\cic\ 




ofyl 


1 A A 

/44 


382 


MTT7TnfTMJTVPVT VT UTVT nrtPT /nQUA 
1*11 r If Jf Xj X Xjc -L X JjrC^O J-i/ VA 

QAGVQWHNLSS LQAP PPGFMP FS CLSLL 
SSWDYRRPQPRPA/NFFYF* *RRGFTVL 

AWPKLHIFKL 


6739 


20640 


A 


6793 


372 


129 


RVLPCHSGWTAWQS*LIAVSTSQA\KR 
SPHLSIRHVPPHPA/NFKQFFCRDGLIM 
LPRS VSNFWAQAI LP PQPPKVLGLQA 


6740 


nf\CA 1 




6794 


one 

395 


25 


yfLjjjr^r J. irKiSJvrUor olVvr Avbor / r r* 
PSL*FS/PLSL*VQEMVSICCPGWSAV\ 
TPGLK* SSCLSLPKGRGYRCAPPHSANL 

WPPKGPSLRACTSKI 


6741 


20642 


A 


6795 


412 


176 


KKS* / CLiATPRSENWVL*NGPPLFFFFL 
*RRDIiAMLPRLVSNSWAQVMLP?WPPK\ 
VLGLQAGATMDHSCVQYSHSFAPS 


6742 


20643 


A 


6796 


444 


95 


JNr VKWLyor LivJovjUI rLnb jyyr LiLi/ Ft 
PCQH* IMSVFQILTTVIGVYMHLV*ICS 
ALMTFDLFMCHVLI CHLHI FFGEVS VQH 
FSLLLFLFESGS I YWQAGLHLLFSSNP 
PASAS 


6743 


20644 


A 


6797 


378 


732 


NSRDYRHLPPRLANF\*FLVETGFRHVG 
QAGLKLLTSGDLPALASQTTGI TGVTAS 
PSLDEGVFKALPPPVGTTLCLPPWTTGG 
PFLGPAL 


6744 


20645 


A 


6798 


444 


1 


PLCFPPGGGGFLSSHQGFPPPPPKRKGS 

\7C a. vvxrtct t dtd/^* d pd t? trc r E , 'K'TrMT?'n , T? *• o o v 
V o/\J\J\.J\l>l J. fJtAj.fr rUrf t J\J\JNr r r kjir 1 

PF/SEPKKARWGPPPPGPGFSPPRSLVS 
PLGEKKKKKKIPPPETQPPSFGA*WGKK 
RAPPPGPRAGGRNSFLEGGRGQGRTGGS 
VREFRAGR 


£HA C 

6745 


20646 


A 


6799 


462 


83 


Tt?T3T T5 D 7\ T PT C* OMT57VCT* VT)UT?T f TTT? / 

CLPFLSFFFS/SFPFSFPFLFPFPFPSF 
FLIPFLSYSLCCSGWSTVA* *GLTIAS\ 
TYGliKQSSHL/SLPASWFPCLTN* FCKI F 
FRDGGLTMLIRLAYLFLNF 


6746 


20647 


A 


6800 


433 


185 


PPPGGRLLtVEKTRGEGQLKPGQPGRQAG 
LGMRGNPGKAQPRPASGAPEMQLPARPP 

SVAAAGEVSKSAPDSGLMGNSMLVKKE/ 
AGGGGEPQANQETEN*EG*PMSPESGAL 
IiLTSPAAATLPLPVPPQRRGFAPPSLFL 
APQP 


an An 
0/4/ 


2064o 




CQC\\ 

ooUl 


one 
395 


oU 


/ MS I P * HAFAWTC/ SSFLRGGCLGA* ST 
LL YGGYMFS YLRNKQT I FQS S CCIVNGE 
QPRMRNRLFS * FIQHLVFSVFLIRAILE 


6748 


20649 


A 


6802 


423 


98 


RGVPY*PGGLQPPPV/REFFFFFIGRVS 
VLQAGAAAPTCRAQS ILPGSWDYRHLHA 
NMSS * FLYFV*RQVFSMLPRLVSNSWDQ 
AIHPSQPHPALFFTCWSVKPLATL 


6749 


20650 


A 


6803 


396 


194 


S LMKLAH V * VLHTDGGFMH PD S AI S CHD 
IFDSLHLTGEGSAKICRALYELIMQLME 
EHP\GEKQTTIA 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D^Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=IVIethionine, 
N=Asparagine, P=Proline, 
Q^GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan. Y=Tyrosine, 
X=Un known, *=Stop cod on, /=possib!e 
nucleotide deletion, \=possible 
nucleotide insertion 


6750 


20651 


A 


6804 


373 


192 


PALNPWAQANLPHQTPALSSNFFYF/CV 
EMGLTVLPRLV*NSWPEAILILLPWKPK 
VLGLQA 


6751 


20652 


A 


6805 


370 


242 


KKGTSKFLFFPSYSFFSKFWYQKKVKIA 
FFFF* DRVSLCHPEWSAVW* S * LSAALT 
/CPGLQGSWGYRHVLPCLANFS\FFC*D 
GVSMLSRLENWPFCLF*PVFQPGQHGNS 
ISTKK 


6752 


20653 


A 


6806 


464 


182 


DGPHRFEPPISS * FLPPWPPQKLE I KG* 
APLAKQ I FFFFFFL * RDI VLLCC * GWGT 
VYS VYSWVSSRLTAAS \TAVLKQSSHLG 
LLSSWDYKHHK 


6753 


20654 


A 


6807 


3 


228 


YTCAGFFRRQSLCHLGWGAVARSQLTVA 
LTL/VPPGLK* SSNLGLPKCWDYRCEPP 
HLACVNFNTVKDVCFSCILS 


6754 


20655 


A 


6808 


506 


173 


NLHLPGLKQFSCLSPPPRSRDYRRAPPC 
TANF\ * FLVEMGSRHVAQAGLELLNSGD 
S PAS ASQS AGTTGVSHHGQ I S LFKQILT 
STRI S EVI FLTLLLRIIiHDLPCIjKIjFSL 


6755 


20656 


A 


6809 


676 


397 


VSSCPRDQPGKHGETLSVLKIQKLARHG 
GGCL*SQLLRSLR*QTRLNSGVGDHSEP 
RLSHCAP/ AWATEQVSVSRKCKKRQLKQ 
KERIPEWVRS 


6756 


20657 


A 


6810 


367 


31 


QVFQELYSQDLFIYLETLSLCHPG*SAV 
ARS*WTAASDSVJ\VKQSSRLSLPSSWD 
YR* VPPYSAHFDVNSS FLTYCTS PGLTQ 
KIEDSWLGTVAHACNLSTLGGHSGQIT 


6757 


20658 


A 


681 1 


469 


251 


LLSSWK/YQVPPPHLVFIKIL/CRIGRE 
TASCCI * PRLWNSWPQVILRPQPPKML 
GLLSATMPGPISVFCFCFCF 


6758 


20659 


A 


6812 


16 


462 


E I I SYCGFNLHFSNDYKXKERKRERERK 
KRKKENIGE/HI *DIGVGSDILDLIPKA 
WTTKLNIDKWYHIKL/QKLLSS*RVAKT 
NKLKS FCTAEE 1 1 TRVKRQPMEWEKTFA 
SQTSDKGL I SNTYKELKQLNSTETNKQK 
NNFDLLSADKG 


6759 


20660 


A 


6813 


492 


243 


S S WDTGCTPSRPD \ I FVFLVETGF PMLA 
RMVLIS * PRDP PAS ASQS TLVDTSRYKV 
YQNIHQQVNMF * NYGWCFFD I FTASVGD 


6760 


20661 


A 


6814 


32 


308 


EYTLPHPYLITGPPPVPKPETYSHS * YF 
LSPNTQCIQXPAPRTLNTASSTCLSRRV 
TARPVPI YQDPSLEPLQLPPNSLTLPTP 
LRHSPCWV 


6761 


20662 


A 


6815 


97 


274 


GRFLELKVKRGSLGLGMWKAYNLSTLG 
G * GGR I P \QGFKAS LGN I AKS CLYKNKK 
GAA 


6762 


20663 


A 


6816 


74 


376 


DFFIVYSVFTHLVNI KDVKDVLYKSLKY 
FFAIYRMGENIYKLYILGGINIQNISME 
YYTA/IKKD* ILLFAATWM/RLEDI ILS 
EISHEQKS\KRCMFSLICGS 


6763 


20664 


A 


6817 


1 


1127 


MRVTAPRTVLLLLSGALALTETWAGSHS 
MRYFYTAMSRPGRG E PR FI AVGYVDDTQ 
FVRFDSDAASPRMAPRAPWIEQEGPEYW 
DRETRNMK \ASAQTYRENLRI ALRYYNQ 
SEAGSHTLQRMYGCDVGPDGRLLRGYHQ 
DAYDGKD Y I ALNED LS S WTAADTAAQ I T 
QRKWEAAREAEQLRAYLEGLCVEWLRRY 
LENGKETLQRA\DPP KTHVTHHP I SDHE 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenyla)anine, 
G=Glycine, H=Histidine, I=Isoleuctne, 
K^Lysine, L=Leucine T M=Methionine, 
N=Asparagine, P=Proline, 
Q=G!utamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














ATLRCWALGFYP AS. I TLT \ WQRDGEDQT 
QDTELVETRPAGDRTFQKWAAVWPSGE 
EQRYTCHVQHEGLPKPLTPERWEAI LPS 
PTS P I VGHCAWAGAVLAS WWS SGAWAT 
VDV*EGRSSGGKGG\SYSQGWRPSRPVP 
QGFWMCLSTALKKP 


6764 


20665 


A 


6818 


34 


382 


STAGAIHVCMCMCVHVSTCVCAKMCVRV 
KF* I WD I ETGAAQLS CCVLVAQTEI I YL 
QERNSSLSERECVCAK\CVCMCMCVHVS 
TCVCVCVCV* V* KRHNWPGMVAHVCNSS 
TLGG 


6765 


20666 


A 


6819 


1061 


305 


FFLR WS L/ DS VAQAG VQ WRDLGS LQAP P 
RGFTPFSCLSLPSSWDYRRPLPRPANFF 
YF* *RRGFTMLARMVS IS * PRDLPASAS 
QS AG I TGVSHRARRRNTI LY * HLNFV I K 
SHKPPDI VFRDh * GPRLHLENP VL *GAK 
SHSSCPWLNLSPPPPHAGSHCLSSSSPR 
MGARQDLLFNASSHTGLFPFLQLNQTAS 
RPTSQVPKSLCFPLPWVQPFALPWVRQD 
EGRASVNSPQDPPRVPSCAGFTHSCNLS 


6766 


20667 


A 


6820 


610 


248 


ERRSHPVA*AWKWCHLGSLQPLPP\GS 
SDPP\T+AS*VAGTTGVRHHARLIFVFF 
IETVSHHVAQPTGF*TPGAQAIRLPQPP 
ENAR I TDVKPLCLAHFCYS KHF IMVPAP 
CISHQKVFH 


6767 


20668 


A 


6821 


3636 


3332 


TCLGSHATPHCGHS FSSCKI LLF * FFVF 
EM*SHPVAQAGVQCRDLRSLPPPPL*FK 
RFSCLSLPSSWDSRRPPPRLANF\CI FS 
RDGILSCWPGWSRTPDLR 


6768 


20669 


A 


6822 


563 


146 


APGLSFNCYIPPAPAFMPGELTLFEVD/ 
NRSIFPTEY/TIHILISSEDILHS*TIP 
LLGLKTDAIPRCLNQTTLTTGRPGLYYG 
QCSEICES/N/HSFIPIVLELILLKYFK 
T* STSTLKYHCKAIQHLPFKLKI DGVYT ' 
SLQ 


6769 


20670 


A 


6823 


399 


201 


KAEPPPAKSWSKGGYVLPRS/ IRLDKVK 
WALEPDDIAVLNFMKERHPQSKS * TILQ 
KWIQPFSPIVQK 


6770 


20671 


A 


6824 


323 


323 


LPTVIVIILSFYNFFFFSKRLIRTLSFI 
CMYVCICIYVCIYVCXYVCVYIIYVCIY 
/IYINISF*NDRISFFFMLE*YVCVYIH 
EWIKKI 


6771 


20672 


A 


6825 


396 


122 


LPPGGHRPAVPGGREMGPSRGNPGNPGR 
F*KKGPPSPGGGPAPGGPPSPEG*ARKM 
G*PRGPSLP*TQIPPPGSPP/WGPPGVT 
PETLVGFKKRGHPPRGGGPPPGVPLPRR 
AKPEKWVNPGGQAFHEPKSPPRGPPRGA 
KPETLS PKKKKKV 


6772 


20673 


A 


6826 


414 


64 


RGVEGGAACI FKGLS VAGKPG / YLSAMR 
SCFRRMLLFKS *G/QVIATKHMLPTVLR 
MVGDPVANVRFNVAKSLQKIGPILDNST 
WQSEVKPI LEKLTQAQDVDVKCFAQEAL 
TVLSLA 


6773 


20674 


A 


6827 


2 


519 


KTKTCLVEEIS*ITGRFLERINKIDKPIj 
VNL* RKK\ VQVLN I RNEKVDVTTDRT * K 
\ IIKEHYGQLYANKFN/IDEMDKFLKRD 
KLAKLTQEEIENLNI PVSI *KFLI *NI P 
KMKTP/GPDGISREL 
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Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=G)utaraic Acid, F=Pbenylalanine, 


sequence 


sequence 




09/515,1 
26 


location 
correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysme, L=Leucine, MHMethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Va»ine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


6774 


20675 


A 


6828 


413 


114 


GGKRGPGP I PHP PARGPQ * GPGHPAVQV 
PEPPGLSFPHFRGGFPSSGGCQRGEGRK 
ERRPSWTGPSWMAQEGRTPAARLEGAVL 
PG/APGPEAPGTGTTMSAPTWPEPP/G/ 
GRQVQDVPAFPEGSARQAR * HEE * LHWK 
EILL 


6775 


20676 


A 


6829 


479 


261 


PCFSFLSNEDCRHAPP/RPV* F/LVERG 
FRHVARAGLEFLT*SGLPASASQGAG I A 
GVSHCAWPLQVGIBASLQY 


6776 


20677 


A 


6830 


492 


313 


HPIGWPVNWGRASAVGBFRDWKLFYPEC 
E I RTMGGREHGQS PGAQRTFFQLLLSFX 
VESK 


6777 


20678 


A 


6831 


378 


1 


SNGICLQASSASPNKIVPSTSLKVCSN\ 
VRVDGSRNPSVAFPPLTPNLFAVAPSIS 
SGMG I ET I P I QG YR VDE KTKKCY I P F \ V 
*ANRHSPSGVYNINVHALVSGPPLSDDS 
GVNKPQMKHQHCSA 


6778 


20679 


A 


6832 


374 


76 


QLVEKWTGNVKSMTDRVLTMRCELRARL 
*ALRTPGTWNLINDQIGMF\KFSGLSPK 
QVECLVNEKH I YLMPSGR INVSGLTAKT 
LGYVAAS IQEAVTI IQ 


6779 


20680 


A 


6833 


369 


56 


LKYLGTS VENRYADQAKWLS P VI SALVJ * 
FE/ CGWIARSEDRDHFG*QSE I PS/ PAN 
FLFLVEMGFYHVDQAGLDLMTSGNPPAC 
NTGMSHCTRPGRTDVLAILCSDP 


6780 


20681 


A 


6834 


457 


142 


DCLALSSRLDLQWCDLGSLQAPPPGFKQ 
FLCLSLPSSWDYRCLPPRPANF\+FLVE 
TGFHHVGQAGLKLLTS SDLPAS ASQSAG 
ITGLSHRAWPDSS FEGS S S S 


6781 


20682 


A 


6835 


407 


35 


TAPAI RGVHLRGRG WTVYGAV * AVP PAS 
I PA YPG WYQ YG F YGADL YG \DMQHSDM 
RGLMVSQHSPMPHT\LQQS\SVTVMAGC 
TQPSHTMPLPLPLVY\ELALWRVYAEVA 
TSYLPPAEVMLAPCK 


6782 


20683 


A 


6836 


1 


428 


GKTLNPPSLGKSFGLIGLPPENDWPPDV 
SVPRAAFPLRGPRSVQSWREMEQSVAQ 
LLL*MLTFTPHKRISAFRAL\QHSYLHK 
DEGDPE 


6783 


20684 


A 


6837 


3 


612 


FKGSGVFNRKKKLEEHPLWQDVALGTAL 
CPQLPSLPGWGVI E WPRRAACLPCNPQL 
CPHCCGPQEAQYPL*AGGQ*GQHQEKPG 
A\GPHPPSGPQVPSAPGEAMTAMEGRKL 
SPAVAQNPPQRKNTLPRGPLPRGAPTTT 
P * SLRPR * TP VGAGLAAP I RSHHPGCSG 
QEALALC PDRPHKDRGVRRGAGGPRGRR 
VQLGGIPP 


6784 


20685 


A 


6838 


3 


412 


YRWGFTMLTRLVLSS * PR/ CDPPASASQ 
S VGITGVNHRVRLFFFFFGTGPHS \MAQ 
AEGQGHNLG * LQPQ PPGAKGTLPRTGNF 
KGGPHTPPNF/CIFFGKNGVSPYCPGWD 
FGAKAILPPGPPKELELEGSPPAPGKVF 


6785 


20686 


A 


6839 


419 


26 


EETSRLWSIGQCPLLIQTAQVLSFTSGH 
GRHREHRKPTDS PLPSACPAP /RLTPPP 
SMCPASHSLPSSQCMPGPTLTPPTACAR 
PDTHSPPPSVCPAPLP/CRPVCPPPRSL 
PTPA*TKPRTQLPSSMYCGRSS 


6786 


20687 


A 


6840 


365 


66 


LLERQIGRSVYGPSSKSSI FDVPSLPKR 
GI KLVFCLCHTS S WDTGVHYAPPA* SWH 
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peptide 
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Amino acid sequence (A=Alanine 
C^Cysteine, D=Aspartic Acid, 
E^Iutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














TWHPRPRQVGQ/AGFELLTSCHPSPLT 
SQTARIAGVSHRARPKFM 


6787 


20688 


A 


6841 


445 


188 


GFQ* FS CFGLLGGWDTRHVPPRLADFC I 
F/M*RWGFAMSARLS SNS * SAPSASQSA 
GITGMSHCARPATLNSPICGTSLPLGSF 
FF 


6788 


20689 


A 


6842 


3 


412 


PQQS PQLTTVTAPRPRALLMGTTMANLQ 
TRVS TSHLQGPRLTS ALTYQMAQVAVAT 
PPWSVLAWVLSARVTVLP/LN/VNGI 
S WIRQLQKAAGRQP P S PHQTRNNSLE * 
VASNLSKFLILEVAELFCHVLTFFSYQ 


6789 


20690 


A 


6843 


113 


410 


NHKCNVN F YNG PRS LE KRL I TLP I LH Y F 
LNFILCKSSYLTLLNLKLQTOLFCK/YE 
M/ESHFVTQAGVQWHDLGSLHPKQSSYL 
ShhSS *DYRGMPPHLAN 


6790 


20691 


A 


6844 


2 


357 


FCPLARI LQSGRFAVLPWNSGLVRGRRQ 
EGPHGAFTREHT/SLRAPQPRSENERR/ 
RIACALRAAHQASGWRLTPRAPRSPSGC 
HCPPAEPTSR*EAWGARRKGHGCCPAPH 
QPAPQDIS 


6791 


20692 


A 


6845 


392 


279 


HFFFPCRD/KSLAMLTRLILNSWA*VIL 
LPQPSKVLGLQV 


6792 


20693 


A 


6846 


3 


275 


VNLVDKAAAGFERMDSNF\ETSSTVGKM 
LS SS I AR YRE I FYE R KS PSMQQTLFLSH 
FRKLP*PF*PIATTTLIY*QPSTSRQDP 
LPAKRAR 


6793 


20694 


A 


6847 


1 . 


454 


FFFFETQFHSLPRLECSGAISAHCNFCL 
LGS S SS PAS ASQWG I TGMKHHAW I IF/ 
VLFLVETGFHHVGHAGFELLTTCGPPTL 
AS IMCILVYIiLVFQHLVSLKNLKGQNAS 
FYFS PFGQHLSGFVKCSI WCCGSLSGLE 
GHFLPPHSPPLK 


6794 


20695 


A 


6848 


13 


148 


GSAWRHTPAG\ELLKKLRHRYGLNPEGG 
GCTEPIWCYRTPPWTTD 


6795 


20696 


A 


6850 


204 


30 


T I FSRQVLR I QNVLSEKLRVSTVYANNG 
S \ VLQGTLWASVYHGKILIGTVFHKTLY 
CEL 


6796 


20697 


A 


6851 


2 


429 


EVWASGITGKAIMAMPIVIQPKNPSSYP 
CRRQFPLQLEAKEGFQLLTEKFQKHGLL 
I P YN /TP I IiPVKKSNEKYRLVQDLRI IN 
EAWPIHPMIPNSYVILPQPPDAQWFPV 
LDLKDSFFCISP/VDPSSQFLFAFE 


6797 


20698 


A 


6852 


3 


160 


SGWDHRRHTQLI FMYS FLFFCRDR/SLP 
MLPKLVLNSWAQVILLLQPPKVLGL 


6798 


20699 


A 


6853 


2 


243 


ACMILAAAT 1 1 LRW \ PLRY 1 1 L I WGIN 
KFTKKLRNPYSIDNNELLDFLSRVPSDV 
QKVQ YAELKLCSSHS PLRKKRSAL 


6799 


20700 


A 


6854 


24 


458 


SRAARRAAGTTRRSSHTGCRPRPADPGA 
RCLPRGSPHPRMDPPESP/PSETSPGPP 
PMGPPPPSSKAPRSPPVGSGPASGVEPT 
SFPVESEAVMEDVLRPLEQALEDCRGHT 
RKQVCDDI SRRLALLQEQWAGGKLS IPC 
KEENGS 


6800 


20701 


A 


6855 


3 


183 


CCDPPVSASQRAGITS VTY/ Q/ W PS LGS 
LQPPSPEFKRFSCLSLLSSWDHRHPPLR 
PACKVFC 


6801 


20702 


A 


6856 


2 


242 


GNHPYARKLA\AQRRASTVSSVTQVEVD 
ENAYRCGSGMQMAKDSKSLKTHQTQPGI 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=GIycine, H=Histidine, Wsoleucine, 
K=Lysine, Lr=Leucine, M=Methionine, 
N=Asparagine, P^Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, ^possible 
nucleotide insertion 














RRARDKAL \ LVANGKRLWC I EGS VC 


6802 


20703 


A 


6857 


54 


401 


EGERLQKFMHVLKTHTHTH / AHTHERAH 
THTHTRTHT PRSGMHRSNNRNLAKNL FK 
IFFFLSSSPPSLFFLIIFPFLSLSLSSL 
SLSLYKHTPLC I LS LRTEHLHRMGALLG 
GHILE 


6803 


20704 


A 


6858 


1374 


579 


CENPSRSTRPGQGPLLLPHQLPFH/RPA 
PSQSSPPEQPQSMEMRSVLRKAGSPRKA 
RRARLNPLVLLLDAALTGELEWQQAVK 
BMNDPSQPNEEGITALHNAICGANYSIV 
DFL I TAGANVNS PDSHGWTPLHCAAS CN 
DTVT CMAL VQHGAAI FATTLSDGATAFE 
KCDPYREGYADCATYLADVEQSMGLMNS 
GAVYALWDYSAE FGDELSFREGESVTVL 
RRDGPEETDWWWAALHGQEGYVPRNYFG 
LFPRVKPQRSKV 


6804 


20705 


A 


6859 


36 


269 


IALGSMDVLTILMFNEHSKNEIKRTIPL 
KIATR/ IKYLGINLT\RGKKDLDTENYT 
TLIEEIEDDTNKWKDIPCSWTGV 


6805 


20706 


A 


6860 


47 


888 


TLRARALQARPRTGS SCTAATWTS /SGA 
SQHSLRALSWRRLYLSRAKLKASSRTSA 
LLSG FAMVAMVE VQLESDHEYPPGLL VA 
FS ACTTVLVAVHL FALMVS T CLL PHI EA 
VSNIHNLNSVHQSPHQRLHRYVELiAWGF 
STALGTFLFLAEWLVGWVKFVP IGAPIj 
DTPTPMVPTSRVPGTLAPVATSLSPASN 
LiPRS SASAAPSQAEPACPPRQACGGGGA 
HGPGWQAAMASTAI MVP VGLVFVAFALH 
FYRSLVAHKTDRYKQELEELNRLQGELQ 
'AV 


6806 


20707 


A 


6861 


3 


391 


NKI S F FCRD WG h PML PRLVLN S WAQV I L 
/LISTCQPPKVLGLQA 


6807 


20708 


A 


6862 


322 


175 


GCRC VP P PRAS F K I FGR V / RGLD I LP KL 
VSNSWPQAILPPWPLESLVLQA 


6808 


20709 


A 


6863 


700 


294 


YVLRQGLTLLPRL/ECSSAI IAHCSLKL 
LGIiRNPPTSASRVAGTIGACHHTRVIFI 
VFWIESCLLAQAGPKLLGSSDPPAIiAS 
QSAGIAGISHHAWPPYYFYWVLSSTIPS 
VLSLHSHLI LATILRGGDVTSQTDK 


6809 


20710 


A 


6864 


371 


77 


RSMFAN\NLVYDTSDSDDYHLLKVLEEG 
IQTLMGRLEVGSRRTGQILKQTYSKFDT 
NSLNHVALLKNYGLtLYCFRKDMDKVETF 
LRMVQCRSVEGSCGF 


6810 


20711 


A 


6865 


273 


34 


DYLPTYLKLFRLLFF/ CLI IETESHYVA 
QAILELLSSSNSPTSASRSAGITGISHH 
TQPNFSNNTLLKPYLRLSYALLFA 


6811 


20712 


A 


6866 


1 


190 


GAKGMHH YTWLVRY F / CCVEMRSHYVTW 
VGLE/ PPALKHSSCLSLPKCRDYRHEPP 
HLALFCFSF 


6812 


20713 


A 


6867 


506 


293 


E KPSNGHKKP YTW I S I AI LF I IAKTWKQ 
S\PRCPSAGEWINC/WYTQTMEYYSMLK 
RNELSNCEKT 


6813 


20714 


A 


6868 


2 


116 


LFPFCRDR/SLTMLPRLISNSRAQGILL 


6814 


20715 


A 


6869 


10 


141 


GSAWLFFIFCR/DRGLALLPQLVSNPWL 
QAI LLLWPPRVLQLQA 


6815 


20716 


A 


6870 


2 


603 


APTFINFPAKGKPKRGDTYHLQVRGFSA 
EQIARWIADRTDVNIRVIRPPNYAGPLM 
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K^Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
J— threonine, v=valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LGLLLAVIGGLVYLRRSNMEFLFNKTGW 
AhAAJjCF \VIiAMTSGQMWDHIRG\PPYG 
P KDP HKGHVNY I HGS SQAQ FVAETH I VL 
LFNGGVTLGMVLLCEAATSDMDIGKRKI 
M CVAGI GLyVLFFS WMI/S I FRSKYHGYP 
YSFLMS 


6816 


20717 


A 


6871 


375 


1 


GFPGFPVP KFG I PAPVAPQWP FPP PS F/ 
PQGPPPSPGGPNPNGKNPFFPKTPFFLK 
NSPQKKKIFLKKFKRGPP\GPNQKPPFK 
D£ KJCKRGIjKPPKKGGPLKKKaKKKKRSN 
YSLKKIYSPFGFHFF 


6817 


20718 


A 


6872 


255 


23 


GRVD PTSTSQNAG I TGVSHR / GVQWRDL 
GSL/QPLSPGFKRFSCLSIrSSSRDYRLV 
/PFCIFSRDRILPFWPGWSQTPHLK 


6818 


20719 


A 


6873 


5 


157 


YLLFTSYTHAHTHTHTHTHTQFLLLviTS 
GLS \SCCYI FSLSGPNTKCNM I F 


6819 


20720 


A 


6874 


351 


132 


S PRRCS AYGVA/ 1 YLFLYFLINLLSLYS 
IFYGFAPNSFLCEVQEPSLG \SGWEPLS 
GNHFSSIKMKIAHREVNM 


6820 


20721 


A 


6875 


99 


337 


AGSFFQKKGKKKISRVNRQPIQLEKIFM 
NCASDKGJjVPRI YKELESAR \KYQSI PS 
KVG 


6821 


20722 


A 


6876 


69 


306 


Y VSHHLSFFF/ CFFGGGEKALFFCPRTG 
IKWPQFGLLEPSPSGLKQFFGLNPPETL 
EYRVFFPPPGKFFFFFFFFFFFFF 


6822 


20723 


A 


6877 


335 


169 


WhDLGLiLQP P P PG \ SS DS CAS ASQ VAGI 
TGMHHSFSNFIYLFCYFEKVFLTFTLD 


6823 


20724 


A 


6878 


2 


237 


ELGYKVLPHPPYSPDLSPTDYHFLKHLN 
NFLi \ FHNQQCAENAFQVFIESQSTDIjYA 
TGINKLI SHWQKCVDSNCS YLN 


6824 


20725 


A 


6879 


2 


199 


RGRVGRQFQKEIEELKKKLEEGEE/VAP 
SSSSSSSDPiilSEPDISGSEEDDDEEGE 
VGEDGEKKKKK 


6825 


20726 


A 


6880 


1 


206 


FFFFE TBS C YVAQAGLELLG LGDPPAS A 
SQWGTTGARHHAQLIF/VFVFLVEMGF 


6826 


20727 


A 


6881 




123 


WY IHTME YYS ALKRRKFLS FATTWMNLE 
\EIMLSKISRRRG 


052/ 


20725 


TV 
J\ 


6oo2 


15 


1 AC 

105 


byL»li/ WRLiKQKWKIjNRGGGGCSI? PLWCH 
CTL 


6828 


20729 


A 


6883 


223 


1 


AASTFLFPNLKNSLRG\SIiRTFSSVTNV 
RKTALTWIjNSQD± QFFFSRRS LTLVARL 
ECWRSWPAASRRLGCL 


6829 


20730 


A 


6884 


193 


3 


QGEQDFLPPCLPNFFNF \ F FCRDTVSML 
PRLISNSWPQAVILPEPHKVLELQARAT 
TPSHVFIF 


6830 


20731 


A 


6885 


2 


355 


RLLTSSDLPASASQSAGVTGMSHRARPR 
NF/CILPNLSPCNTHSPLSLPQPLAPTI 
LLSLWI Cb/ SLD I S / WDYRHMP PRUUCV 
FFFGDGVSQ\YRPGWSAWPFAASTSGV 

t?7\TT 7~1 n/^NTl V)V 

bAJLliPi'yPf J\ % 


6831 


20732 


A 


6886 


1 


151 


HSVAQAGVQWWDHGSLRPGTPGIiKQ/ FL 
PPLKQSSCLSLLS SWDHRCATLY 




2U/33 




Ooo/ 


3 14 


OAT 


WPPKVLGLQE 


6833 


20734 


A 


6888 


334 


86 


QNRSTFKRGM/MGWAWWLMFVI PATWEA 
EVAAKIAPLHYSLGDRARLSKGGMKCTS 
I FCS KS AKDCQTLP VKKKI S LPTL PS I 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Pbenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon,/=pos$ible 
nucleotide deletion, \=possib!e 
nucleotide insertion 


6834 


20735 


A 


6889 


345 


152 


QHAR I MS VF I VETGFHH IGQS GLELMT / 
S /GNMHASASQSARITGVSPHARPSLPY 
YKCSKKQLCNE 


6835 


20736 


A 


6890 


2 


169 


ARECSGAITAHCSLDLMGSSDTFTSASQ 
VAGTVGVTIMPRYIFK\FFVEMESHCIA 


6836 


20737 


A 


6891 


361 


184 


NQSSCFSLLSSWEHRLMPPHLANF/ SFF 
DRDE/SLTVLSRLVSNSWTQAILPPRIF 
FENS 


6837 


20738 


A 


6892 


3 


330 


HESAAITGVSHCTQTFFFF/CETGPRFV 
PQVKGQGHDLSSGEFGFPRLRNWPGLTL 
QNAGNTGTLPPHRTNFGDFRRGSFSPCG 
PGG FGTR DLGG F PH LTG KRDTGLDP P 


6838 


20739 


A 


6893 


2 


348 


ARAFFFWNFFCK/NRGLSMLPRLALNSG 
AQGILPPWPPKKLGLRALPTWPKTGSSG 
LEGHSSL YPHHSLI FPHHCSKFCFIKQ W 
RL F I LR I TENQGLTPLNAAKP AE P VMNE 
ARTQA 


6839 


20740 


A 


6894 


340 


240 


CRD/RSLRMLLTLVLNSWIQAILLSWPP 
KVLGLQA 


6840 


20741 


A 


6895 


155 


) 


HLCF WAG \ PVAHS CHPGTLGGRGGR I TR 
GQE FETS LLGPGTQDVLAKCSRA 


6841 


20742 


A 


6896 


454 


294 


PSS WDYRHA/ P PMHNWPDFFCRDE / Sh 
PMLPRLVWNFWAQLILLPWTPKALQLQA 


6842 


20743 


A 


6897 


49 


333 


I YI CLSFYQS 1 1 YLYDKQKDVLSTVSHS 
VSLDLPGLRFSHLSLWDHRLILPWLASS 
NN FF\ CRNRGLLVLPRLVSNS W VQAI LP 
PWLP IVLGLPA 


6843 


20744 


A 


6898 


434 


324 


FFFFCRD/RSFAVLPRLVLHSWTQAIRP 
PWFLKVLGLQA 


6844 


20745 


A 


6899 


127 


365 


LPERRRYVS I SGYR YHLPQSLL F FYAQH 
ATFNNELW \LGVI AHACNPSTLGGRDGR 
MALGQEFEIGLDS I VRLHLCNKXK 


6845 


20746 


A 


6900 


208 


1 


FFFFFWILVETRFHHVA/RAGLELLSAG 
NPFTSPSQSARITGIWFFVWFSSNLSQF 
CYAHSFFTCWYSRA 


6846 


20747 


A 


6901 


1 


162 


GTSGTRLFTIGKRWQQPKCPPMAELINK 
MW\ YTMEHYS VLKRKEIFCTNLTQLL 


6847 


20748 


A 


6902 


120 


313 


DKVL/WRIiRQENPLNPGGGGCSEPRSHH 
CTPAWATRAKLYLKKKKNFKGTLSNWYS 
FRRVANTNLQMQSGSNYMQI 


6848 


20749 


A 


6903 


340 


234 


IFFCRER\SILPRLVS1TWPQAILLPRP 
PKVLGLQV 


6849 


20750 


A 


6904 


1 


327 . 


GTSGTSR I LP WG PG C I RTPELKQS AHLS 
LPKCWDCWNYRHEPPVYSFAMLIIKLFF 
ELHLSVILWFLLLANPQTQTESKTFSKT 
/ PRLVLNSWPQAVLPWSLKALASQA 


6850 


20751 


A 


6905 


532 


359 


RDG/ DLVWWPRMVSMPGFKQSSHLGHPK 
CWDYRCEPLCQDFSYSIITWFPPSREL 
RR 


6851 


20752 


A 


6906 


270 


3 


GGTTVYPRKTKT/ WPGAVAEACNPNILG 
GRGGWITRSGLPVSTKNTKISRAWCCAA 
WLATPEAEQENRLKPGDGDRDHHGQHG 
ETPSLSC 


6852 


20753 


A 


6907 


1 


312 


GTS WKKMLPTLMDDFRG\ KTLVEELI PD 
WEIAREVELE\MEHEDVTELLQPHDKT 
WTDEELLLTDEQRWFLEVGSVPVEDAV 
NIVEMKTKDLEYYISLLIKY 


6853 


20754 


A 


6908 


331 


61 


FRHVGQAG / LQKLLTS S DLPAPASOS AG 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSIS 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
iast amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H-Histidtne, l=Isoleucine, 
KNLysine, L= Leucine, MNVIethionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W-Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














ITGMSHHIRPKLILRXKSMRAKLDSRQT 
RLQPPWEQGLLEEQHEVYNVYMAHVSK 
SSSRKLS 


6854 


20755 


A 


6909 


419 


199 


SAVQDGV\SWCDLGSL\QFPLSRLKQFP 
CLSFPSTtSfDYKCAPPHPATFCIFIRDGV 
SPCWSDWSQTPGLK 


6855 


20756 


A 


6910 


341 


69 


EAEVGGLLKARSLRPAWAIQQDPISILK 
KSFTLGGQGGWI TRSGDRDLRASASQSA 
GTTGVSHRAQ\LIFAFLVERRFHHAGQG 
GLDLLTL 


6856 


20757 


A 


6911 


174 


356 


KTYLMPLRSSLRPGMVAHACNPSTLGGQ 
GGWITRSG/ESRPSPVLVNRLTRCQANT 
VKPCLY 


6857 


20758 


A 


6912 


391 


3 


SPPPPPFPPSFPPFPPQNFFFPPGASSF 
FGPPPP/YFPPPPKKSPPPNPPPPFFFP 
PPLGKQFFSPPPPFFFPPPGFPFPPPPF 
FFFFFFFFFFFFFFFFFFFFCDKKYFIC 
LMICMLFFCKYILNFCRYS 


6858 


20759 


A 


6913 


43 


331 


I RDT YS FSLSAPAIAPMLGTKG VLLAVA 
SLGSRGASLCVFVCVCLCVRIH\ IGVQA 
SGCVCVCAC/CVCVa/RVCVCVCPHWPA 
SLLTKDLQCLPFTV 


6859 


20760 


A 


6914 


250 


11 


TPNLWGLFLVFCFEM / EVH Y I AQAGLE / 
PPGLKPSSSLSLLSSWDYRHVMPHAAQI 
CISVFSLAQKGVPPSNWILITHYSIKS 


6860 


20761 


A 


6915 


532 


53 


SQMALHEGFLLLFLKEKELHGKFSHPPP 
SAPNSSMSKWFYFSETESHSVAQAGVQ 
WCNLSSLQPPPPGFKQFSRLSLTSSWDY 
TYLPPHLANFLFLVEMGFLHVGQAGLKL 
PTSGDLPALGSQS VGI TG VSEHTR\ PRF 
LFKIGPLFLFSKSMSQDEAR I 


6861 


20762 


A 


6916 


2 


373 


WHTPWLATHEAEAGGSLEPRRAKLQLA 
MIVA/NCTPAWVT 


6862 


20763 


A 


69 J 7 


1 


270 


GTRQSPRLSLLSSSCDHRRVSPHPADF/ 
SFFYFYFFCR/DRGFTMLPRQVLIWAQV 
MLLSQP PKLLGLQ WG VS HGARP PHCLG 
YSWVYNKP 


6863 


20764 


A 


691 S 


2 


367 


RCAPHCPDNS SFFK/ RVKTGLT I FPRQT 
SNSWPQVILLPWPP/KVLGLQA 


6864 


20765 


A 


6919 


3 


354 


HEVN I VETTAKDLE YS INLVDKTGFEKI 
DSSFKRRSTVSKMPSDSITCYREIFCER 
KSQSMLQTSLLSYFKKLPQPPQPSA/TT 
TVI SQQPSTLRQGPPPAKRLRCAKGLND 
FQHFWL 


6865 


20766 


A 


6920 


366 


41 


STSGEDAVHIVEMTTKGLEP/YTAVPGF 
HPSSERRS WGNMLSNSVTCYRGI FGER 
KSQC/RQTSLLPYFN/ KPQPPQPSATIT 
LI \SQQPSTSRRDPLPPKRLQLTESSND 
H 


6866 


20767 


A 


6921 


3 


368 


QLLMGLHLEPATLDLLQLKT\TIQGGSH 
S HHI TA I P \ PS \ FTRENTLMF I HLS P I L 
LLSLNPDI ITGFFLLKKKNKPKNHKNRG 
PSYKRTPEGPLLLLWAQKTLPPKGGLY 
KIAPLRLFGEDS 


6867 


20768 


A 


6922 


3 


91 


FSVLPRLV\QNSWPQVICI .PWHPKVLGL 
QV 


6868 


20769 


A 


6923 


334 


91 


ERCGKCAHISEIFLSSLDYIYIFVFY\F 
FFLFFFFFFFFFFFFFFFFFFFFFFFFS 
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SEQ IB 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acta 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=G)utamic Acid, F=PhenyJalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutaroine, R=Arginine, S=Serine, 
1 = 1 hrconine. V= Valine, 
VY— I ry prop nan, Y^Jyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














QFKFNTKSKEKMFNKTMKKVQVTF 


6869 


20770 


A 


6924 


375 


37 


TKAGVQWHDHGSLQPQPPRLKPSSCFSL 

psswdyrhapp/rp /alflflfglvwfg 
lvflveigscyiaqaglkllgssdpptl 
vsqsari igpthhawptltts i qctirg 

PG 


6870 


20771 


A 


6925 


647 


1054 


CFWKAHIQAGETNN IQEGSERFHFCPAD 
E VYVHSGPAVEDQ PQRRS FAL / SAQAGV 
QWCNLSLLQPLPPRFKQFSCL\SLPSSW 
D\HRHVPACLAKFC I FSRDEVCHVGQAR 
LELLTSGDPPASASQSAGITGMNHHV 


6871 


20772 


A 


6926 


3 


204 


DAW ETTG FGM I Y D S LDYAKKNE P KHR \ L 
ARHGL YE KKKT S RKQRKE RKNRM KKVRG 
TAKANVGAGKKK 


6872 


20773 


A 


6927 


1 


112 


PTRPRTRGVASVLYFTTILILI PT\ ISL 
IENKILKWA 


6873 


20774 


A 


6928 


1811 


478 


DRARSPRSERA\RRSSSRLRQRDPSLGR 

RRLRREIRPGLPES5PRPPRPPAALTAD 

Q PP PRRLS ESRGGGGMSE \ AGE \ ATTTT 

TTTLPQAPTEAAAAAPQDPAPKS PVGSG 

APQAAAPAPAAHVAGNPGGDAAPAATGT 

AAAASLATAAGS \EDAEKKVLATKVLGT 

VKWFNVRNGYGFINRNDTKEDVFVHQTA 

IKKNNPRKYLRSVGDGETVEFDWEGEK - 

GAEAANVTGPDGVPVEGSRYAADRRRYR 

RGYYGRRRGPPRNYAGEEEEEGSGSSEG 

FDPPATDRQFSGARNQLRRPQYRPQYRQ 

RRF PPYHVGQTFDRRSRVLPHPNRI QAG 

EIGEMKDGVPEGAQLQGPVHRNPTYRPR 

YRSRGPPRPRPAPAVGEAEDKENQQATS 

GPNQPSVRRGYRRPYNYRRRPRPPNAPS 

QDGKEAKAGEAPTENPAPPTQQSSAE 


6874 


20775 


A 


6929 


324 


203 


LI Fv FLVETGFhHVGQIXSLDLL / NLVI R 
PPRPPKVLKLENP 


6875 


20776 


A 


6930 


3 


364 


HEASLTQTRTTMTHCSRTTGCSTASGRT 
WTRSRHSCAWCSAALWRAAVASRCPSS/ 
I PVTPPQCLSWP / WKVPLQCPPALS 


6876 


20777 


A 


6931 


2 


165 


GRVGFCS VAQAGLQWHDLS S LKPL P P \ G 
SSDCPASASRIVGITSLAFHFLNNVF 


6877 


20778 


A 


6932 


1 


370 


TELRSYSPSNGAISSQCNLCFLGSGYS/ 
RSSTSQVAGITGA/R/HHTQLI FVFLVE 
TEFHEV\AKAGLELLTSGDPPASASQSV 
G IAGVSHCAWPHVY I LKE IMLGNLVRCL 
MS I CHHHTE SICLFL 


6878 


20779 


A 


6933 


1 


441 


RRANTPNNPEAPPN/QKKKKKKKKKKKG 
KNPPGGPPPFFKTPPPFKNGGGSPPGKK 
KGGEKKRGGKI FFPPPGGKKPPPQKKKG 
GGGGKKI FFPLFGPGGGGNFFKKKKGPP 
PFPPRKKKKIFPTGGGPKKKKKKKAPPK 
KPRGFYFP 


6879 


20780 


A 


6934 


518 


340 


PKPKNFPS/PSPPKIPPPKKKVFSKKPP 
GGFNNPPHKRKKYNFPPPEKLGPPKEFL 
KRPPL 


OooU 


zU/oI 


A 








V VIjVV,/UVV»VLV\,V V Lv v — Lr.OJJLA.O J. J_l 

HRTGV 


6881 


20782 


A 


6936 


486 


111 


PTRAPTR PAPPHLADFL I FCSDR/SLIK 
LPRLVLNSQPQMILPLWHPKWGLQA 


6882 


20783 


A 


6937 


1 


264 


YSNLRDRARSCLSLPSSWDYTCSWDYKC 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


JVI 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning » 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenylaIanine, 
G=Glycine, H^Histidine, l=Jsoleucine, 
K=Lysine, LHLeucine, MNMethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
1 = 1 hreonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
XMUnknown, *=Stop codon,/=possible 
nucleotide deletion, V=possibIe 
nucleotide insertion 














ASLRLANF/ SNr CRERGLAMLLRLVSNS 
WAQVILLPQPSKVLRLCLAVSVFFGEKQ 
GQCS 


6883 


20784 


A 


6938 


2 


407 


TTGTCRSVWLTFVFLVVTGFHHVGQAGL 
ELL / R P PQ P P KVIiGIiQA 


6884 


20785 


A 


6939 


422 


65 


LKGPPPSFFFFFFFFFFFFFFFFSP\LP 
YILYFDCASNACLFFKNNTWRYIKRIFF 
SPTPFI PFFHEVENVTSLS YMVFQYCFL 
c if 


6SS5 


20786 


A 


6940 


462 


209 


LSEF WD YRREPPLPAKFAI FMLCRDRGL 
DTL P S LVS S / S WHQA I LAP Q P P I MLGLQ 
VHSAYKGFCLPVLTFEEVIGNRPLGLLP 
P 


6886 


20/87 


7\ 

A 


6941 


491 


299 


LAEYSPESGPTHASAHAS / SnvQMVYSR 
ISCKEELLLGRTSPSKKYNMMTVSG 


6887 


20788 


A 


6942 


429 


256 


FSHLSLPSSWDYKHLSSCPANFCIFVQT 
GFHHVGQA\GDPPASASQSVGVTGMSHH 
S 


6888 


20789 


A 


6943 


932 


661 


FKTGSYSV\TRLEGSGEI SAHCNLRFLS 
SSDFFHLSSQVARITGSSQYARLIFVFF 
VEMG FAMLP R VGLKFLGF KPLH P P \ RT P 
RVLGLQiii 


6S89 


20790 


A 


6944 


381 


55 


PASLPPCSLISDCCASNQRDSVGVGPSE 
PGVGYSLWRRFLSRSEKRNIRVGVTRF 
SRCV/LSPLSLTQKGNSLTPCASQVRQC 

T "A T" T T*> T 7> T T/"*l TV ^irriTTT/I T\ T\ T*1 fill TT.Tf T n T T TT^ 

LALLRIiAHGACTHWPAPTWHSLVR 


6890 


20791 


A 


6945 


816 


513 


FFIFIYFFEMESLLPRLECSRMISAHCK 
VCLPRSSSY/ PASASQVAKITGA/TRHS 
RLIFCVCVLVERRFHHLSQAGLELL/NL 
VIHPSRPPKVLRLQGVTRTA 


6891 


20792 


A 


6946 


383 


234 


LI PS LMDNSEGLKTS VEE VTADWK I VR 
ELELKVKPE/NVTDLLQS \ YDKT 


6892 


20793 


A 


6947 


423 


60 


LNPPPPPF\YKPPPKKKKFFSPPPLKFG 
PPTKFFXRPPPFFFFFFFFFFFFWHGTF 
PLNSLVGPPKVEGWVSWAQLPRAVGPPL 
PNPKWGLNKPPAGGLNKP IHFPSLNGSG 
EATPPLIPWC 


6893 


20794 


A 


6948 


1 


327 


MRLLAAALLLLLLALYTARVDGS KCKC \ 
SRKEPKIRYSDVKKL \EMKPK \ YPHCEE 
KMVI ITTKSVSRYRGQEHCLHPKLQSTK 
RFIKWYNAWNEKRRVYEEAQGLRRRIG 


6894 


20795 


A 


6949 


952 


796 


FFLHFLVEMGN FLLVGQAGLELPTSGDP 
PALASQSAGITGMSHRARPSVSS 


6895 


20796 


A 


6950 


364 


6 


I TTHKN YFLSRTATPPKEGLPQRKKI FL 
FFTPIFIFFVGFIFWSGGGRGFLVFFFV 
v v r r r r r r r r £ FFFir r tTrFFf Fr \RVK 
TFIGFKVLFFKINVFFFLIRAVPLDYMN 
HALTIFQW 


6896 


20797 


A 


6951 


418 


266 


ISFFFXXXXXFFFXXXFFLLFFFPFFFF 
FFFFFFFFFFFFFFFFFFFFFFN 


6897 


20798 


A 


6952 


459 


7 


PSYLYKPGPSQPNRRRHSVP/HIRSNKD 
PPPPEVMKKWGPPTPQKKKYFPPRKKKL 
GGGGGPIYPPKKKVFFKKKPGGFKKPPK 

FFFFFFFFFFFFFFFFFFFFFFLIGEKA 
MLLDMGA 


6898 


20799 


A 


6953 


325 


143 


LPSSWDYRRVPPHSAHFKFFCRN/RGLA 
ML PRLAK PR FPS I S I FLNHPQ VS E I LGA 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Abnine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Pheny lain nine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T-i hreonine, V=Vahne, 
YV= tryptophan, Y=Ty rosin e, 
X^Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














TRIiAR 


6899 


20800 


A 


6954 


585 


306 


KTESHSAIRSQCS\GEISAHCNLRLPGS 
SNS PAS ASRVAG I KGTHHQ VQLI FVFLV 
EMGFHRAGQDGLRSPALVICPPWPPKVL 
GLEAWPALHC 


6900 


20801 


A 


6955 


2 


155 


FLVETGFHHAGQAGLELLISGDPPTLAK 
S\DSQDVRITGLSHRAWPFLDIY 


6901 


20802 


A 


6956 


3 


346 


DAWPDAWGK \ RNWPPHQANF YTFVE/ TG 
FT/HVGQAGLELLGLSDVPTSSSLDAGI 
TGMSHCSRPRPEISSRDVHTrPGALQET 
WHTPDLASS IRSLRCRLDCPHNHCPLDK 
DGVTL 


6902 


20803 


A 


6957 


1 


224 


QCGRIRSRKDARKGNVQI IPREGRRRTS 
PRYLFPVTGQETDGWG\MGLPAWLLKTR 
STSPPRSPRLHRRRGATSW 


6903 


20804 


A 


6958 


102 


344 


GLLKP KS L KLQCT I ITPVN \NHCTAAWA 
T 


6904 


20805 


A 


6959 


3 


296 


KEEEEGEGEGGGGGG\EEEEEEEEEEEE 
EEEEEEEEEEHAEFTGKVESITQCKLLL 
CHLSLRVEAG I THLLCFTVRAVTRSQRL 
LTKRRNGSVGHTSE 


6905 


20806 


A 


6960 


3 


172 


RLGLPKCWDYKREPP/RPGLELLTSSDL 
PALVSQSDGIKDVGHDTEPQDFYTLGQD 
P 


6906 


20807 


A 


6961 


2 


224 


LALLPRLARSGTI IAHCSL\NRSLRLGL 
SDPPASASESTGTTGMSHCSQPAKLLST 
CIRNSVAETKYYLMFKCK 


6907 


20808 


A 


6962 


2 


330 


KGTLRRQVTYKGTLIR\LVADTLAETFH 
ARKEWDDIFKVLKKHCQPRILYLAKPSF 
KNEGES FSQRKC I TTRVARS KMAKGVLH 
IiEVKDISTIIKQMKIRSSLVEQIHKW 


6908 


20809 


A 


6963 


1242 


929 


ETGS CS WQAWQWCGHG YLKPQPSHAH 
/DDPPTASS IAGTTEAHYHAQL I FKKFF 
NSLCCPGWSQVSWRILGSSDPPASASQN 
AG I TGMSHGAW PQLS I YRKGN 


6909 


20810 


A 


6964 


I 


378 


KFFLQLFLHSFIYEHFLFFSFLFFFFFL 
RPS FAFVAQAGVQWRDLGSPQPLPPRFK 
QFSCLSLLSRWDYRHAPPQSANF\EFLV 
ETGFLHVGQAGLELPTSGDPPTSASQSA 
GITGVSHRTRLAFY 


691 0 


20811 


A 


6965 


393 


43 


VSPFPLKNFYFSFTPKTFVGGGGPSGPP 
PQKRFFFHTPNSPFFPFPPF/YKSGPRP 
GFFFTPPPEKGKNFP\PPLRLGPPPVFF 
TR P P P P F FFF FFFE RQRTCS FLH FLWR 
PLFRFYL 


6911 


20812 


A 


6966 


225 


2 


TRVECHGVFFGSCNPLLPGLKEFSAPAP 
RGSGNSGPFPPPR/RNSFFFFFSWLLE 
MGFHHGQADLEILTSSDPP 


6912 


20813 


A 


6967 


8 


341 


FVSVPP I LHYLFKI FFFFLRRSLCLPPR 
LECRGTISAHGSLGLPGSSDSPS/STKN 
TKLPLRGGTCLGPQLLGLLRHEKTWTLG 
GGSGSYPKSGKRAMALQGGVQNKTCVPH 


6913 


20814 


A 


6968 


29 


362 


DYTCKHPHSSELKKKKKKKKKKKKKKKK 

PFKGGFKKPPRGFFEKNPFFGGGNLGPP 
PPPKNKPPGEKKNFLGGRGQKNSFLSAW 


6914 


20815 


A 


6969 


391 


17 


VFFFFFLKKFFPPPDKKLFFPPFPLKIF 
F/SPPNPFFFLGGFSQISPPPKKGFFPK 
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IPPGV /YFSPPFKKKNFFFPPPLNFGPP 
RVFFKGPPPFFFFFFFFFFFFFFFFLGL ■ 
RGNAGDCNGYGDVSYK 


6915 


20816 


A 


6970 


374 


268 


IMLPRLVLNSWPQVILLPWP\PKVFRLQ 
VRAPVPG 


6916 


20817 


A 


6971 


614 


315 


FFFGDALS PGLE CGGAVLARLQA/ CLLG 
SHRSPASASRVAGTCKRPPPRPSGFFVF 
LVEAGLHRVGQDGLNLL/NLVI RPSRPP 
IVLGLQGMSHHARLFYFL 


6917 


20818 


A 


6972 


861 


528 


FETESRSVTQAVVQWCNLSSLQP/LPPG 
FKRFFCLSLPSSWDYRRTPPCPANFCIF 
SRDGVLPWWPGWSRTSDLVIRLPRPPQC 
WDYKREPLRIAKSRHPWNHLPEQSVEHF 
K 


6918 


20819 


A 


6973 


274 


119 


PPN\FVFLVETGFLQGGVKLSTPGDPPP 
PASRRAGITGVSHPACPGKEYIFKN 


6919 


20820 


A 


6974 


370 


194 


KITNFAKLFGYILPYSHYFWCLEKNRNR 
/ SLTLLPRLVSNS WAQAFLLLW P P KVLR 
LQA 


6920 


20821 


A 


6975 


2 


239 


ARGPCSS I / SDCCASGEQGS VGVGPAEP 
G/MGYNLLACCLLRPLEKCS IKAGVSRF 
SWYSLSRLPLT\RKGNPPTPCISQVR 


6921 


20822 


A 


6976 


361 


270 


PQWLVPVIPA/LLETEVGGSLEPRSSRP 
AWAR 


6922 


20823 


A 


6977 


363 


48 


GPPKEMCPYPN\PRPYECNLFGKRVFAD 
VIKFKI LRW\ I I LDYTQVDPKSNDECP Y 
KKQKRRGHRQIKRKSQADQGGRDWSGAT 
ARNAKMITSVGEDVRKRKPLAYC 


6923 


20824 


A 


6978 


9 


163 


AGYNFLVCC\LPRLLEKCSIRMGVSRFS 
RYHLSWLPFARKGNSPTSCTSWVR 


6924 


20825 


A 


6979 


408 


212 


I ETGFCHVAQDG / LELLASDDP PASASQ 
SAEITDVSHRAQPHSAFLPWFSSKIFAY 
GSVWLVFPHS 


6925 


20826 


A 


6980 


395 


254 


LIFVFLVETRFHRVGQAGLELLTSSDLP 
SIASG\ ITGVSHHSRPSSAF 


6926 


20827 


A 


6981 


408 


2 


KVSFFFPKGPRGKKKNGAGGFSPFFPPF 
LGGGTKNFFYPPIFSQGGGKK/QKPPPP 
EKKKNPPNPAFWGPPLVLKNWGGRPGGP 
LDPKVSNFMGAPPFLFGAKKKTFFS PPP 
PKKKKPQKKKGGGRSRSRTSRTRG 


6927 


20828 


A 


6982 


2 


302 


RKHLPPH \ PVI FVFLEQATLRHVGQAGL 
ELLTSSDHRASASQSAGITGVSHCSLPA 
TSFSTPPFSVAKMLRGTQKMKVPLIGPL 
PQP IRLVAGH YF I Y I T 


6928 


20829 


A 


6983 


2 


375 


RGRVGGPVGGPVGRTFI FFQAGSDSARI 
LVLFPYCTLPHPLPGLFFFFFKTGIKSW 
VLLNPSPRFVIFLEKKIFLPQPPFF/HI 
LS PPHPGGAGGGNPTARKI KGLS YRDPK 
EGHKRGWAHTPTP 


6929 


20830 


A 


6984 


1 


146 


PRPANFC/ DFLVETRFRHAGQAGLKLST 
SSDLPTLVSQSAGTTGMSHRA 


6930 


20831 


A 


6985 


2 


95 


LTMLPRLECNDTIPAHCSLNS/SGSSHS 
LTSTSKVAGTTGVHHHFWLMF/PLFI IQ 


6931 


20832 


A 


6986 


340 


169 


VCS SGLSS PLLEQHKTNijIFYASGDI CT 
/ANGKSGFNQPQPFLKTFCCTHRIMSCT 
YL 


6932 


20833 


A 


6987 


346 


218 


PCLTNF/SIFCRDE/ SLTML/PRLVLNS 
WAQAI LLPRP PRLLRLQA 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, ]=Iso)eucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P^Proline, 
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X=Un known, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 


6933 


20834 


A 


6988 


345 


141 


NRPKCPSMIEWIKKMWHIDTMEYYAAIK 
KDKFMSF\ETITLGEVTQEWKTKHRLLS 
LIRGS 


6934 


20835 


A 


6989 


154 


236 


ELISFSRSVERCISSFIKVLFVETDLFT 
LSLSPFFSVLFFFC/RRWGFTMLPTLVL 
NSWPQVILLFWLPKVI.RLPT 


6935 


20836 


A 


6990 


2 


156 


FLVE / TG F VHVGQVGLELLTSGDLPAS A 
SQSVGIIGVSHRVRPCFLFVCICLV 


6936 


20837 


A 


6991 


310 


96 


KNFFFFLVKKSP/LNVAQGGLKLLGSND 
PPASVSQSGG I TGVS PRARPLNFYKEAT 
DPNTS VLKGLN I KKKKN 


6937 


20838 


A 


6992 


15 


279 


NLLCFPSISPLIGIFFFFERGFHHVGQA 
GLKLLTSGD/LRASPSQRARITAMTRHP 
H PLVCGLLNS THCDW RGR I PHWG FDLR F 
REGPS 


6938 


20839 


A 


6993 


127 


352 


KLADTPQSRFVLVCV/LIETKYSLCCPG 
GLQLliASSNPPTSVSQSATITDVNHEAE 
SQVFLNLVVPKSSKPTNTGL 


6939 


20840 


A 


6994 


323 


119 


SSSEDHKHVPPHSAMFCFF/ CLRQSLTM 
LPRLVLNS WAQVI LP PSP PKPFLDALKK 
KSSKWMSSLLIT 


6940 


20841 


A 


6995 


230 


1 


FFFETESCSVTQAGVQWCDLGSLQPLPP 
GFKQFSCLS/LPNSWDYRHALPCPANFC 
IFSRDGVSPGWSGWSRIPDLG 


6941 . 


20842 


A 


6996 


346 


121 


RDICTPMFVTALLIAKIRSPHKCPSVDT 
WIRKMWF\IYTMEYYKAIIKNKILGMSH 
HASLPVFNHMCKLKYLIS 


6942 


20843 


A 


6997 


2 


198 


SNNTWAFREPGITGMCHEAQblVLLYM 
QFHHVDODGLDVR/NLVICPPWPPKKLG 
IQAbNQPSPDI 


6943 


20844 


A 


6998 


183 


31 


MRHLKSSQIQQGTVGHACNPSTLGDHGE 
RITQGQEFKT\GNIVRPCLYKD 


6944 


20845 


A 


6999 


3 


471 


LALSCSGTILAHCNIiHLLGPSLPPTSAP 
RAAGTTGVHHKAV7L I FVFFGKDGVAQPG 
LKLSFK/VIRPPQLPKVLGLQA 


6945 


20846 


A 


7000 


483 


239 


GNNNILFLFLFFIETGSHFVT\RLECQW 
VQSSAHSQPSAPRAQVLIFVFLLQTGFH 
YVG \QAG LQLLTS SDPPHS ALLKVRD 


6946 


20847 


A 


7001 


132 


353 


YAKLGTRG FARG P WPGG FLTVAVWMS F 
AQAGINPFVCI FSNRELRRCFSTTL/ PL 
LQKI QVTKGTLLC YMREHL 


6947 


20848 


A 


7002 


2 


399 


EDAQE VELQEGKVHSESDKAI TPHSQEE 
FQKQERESAE\SELTES 


6948 


20849 


A 


7003 


2 


135 


IFQFLVEMGFKHVGQAGLELL/NLMIHP 
PRPSKVLGLQAYWHSTRP 


6949 | 


20850 


A 


7004 


2 


114 


I FQFLVEMGFHHVGQAGLELL/NLMIHP 
PRPSKVLGLQA 


6950 


20851 


A 


7005 


455 


1 


ALFPAWLWLPSSSKGGLKLSHGTTLTLF 
LLPSS TLKDLWPQEQWL/GTCNSQHFGR 
PRKADRLAPGVQNRPGQRGETPSRKKGG 
EGNWGGGGELH I YTYTHTHTYVHTYIHA 
NWTWWCVPWLAFQETEVGGPLEPEVAP 
AWVTVRPCLKKK 


6951 


20852 


A 


7006 


371 


172 


QVILCLSLPGSWDHRHTPPHPA/NFLVE 
TRFQHVDLELLASNDPPASASQSVGITG 
VSHCARPCYDL 


6952 


20853 


A 


7007 


486 


252 


PQTVFFFFFFFFFFFQALFSTFYILHVF 
FFFFPPFFPKFQFFPFP/LSLIKQGKFF 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
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K=Lysine, L=Leucine, MNMethionine, 
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Q=Glutamine, R=Arginine, S=Serine, 
T=Tbreomne, V=Valine, 
W=Tryptophan, Y^Tyrosine, 
X^Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














FFFFFCYLCNSLFIWTQYFFVNE 


6953 


20854 


A 


7008 


1 


113 


KNFCR/DRSLDTLPRLISNSWPQVTLLP 
QPPKVLGLQA 


6954 


20855 


A 


7009 


105 


383 


SLHRPAHPGLLKFGKKAIHSFIHLCRDK 
GDLAILPRLVSNSWPQVILLPWST/NVL 
RLQA 


6955 


20856 


A 


7010 


609 


348 


ESHSVTQAGVQWYDLDSLQPLPPGS\SD 
S PASASHVAG I TGMRHHTQL \ I FVLLVE 
TRSHHVSQTKFVLFDTTKFVVIFYSGKR 
TRIRI* 


6956 


20857 


A 


7011 


382 


233 


DLEI FRGKTFLHHIFPFFFFFFC/RRRG 
LAMLPKLVLNSWPQVILPKCWD 


6957 


20858 


A 


7012 


359 


199 


DLLPSLPPFLPSFIAFFLFETESHSSPR 
LECSGAI PAHCSLGDRARKRLK I NK 


6958 


20859 


A 


7014 


3 


274 


CFFFFFETGSHSVTQAGVQWRHLDSIiQP 
PPPGCKRFSCLS/LPSSWDYRHV/PSSP 
LIiVRNS S ICLLS CFEjyiT PELFDGVLAI Y 
CCVTNDLKA 


6959 


20860 


A 


7015 


372 


54 


LPSAWEAEMRGPLEPKGLRSQRAEIAP\ 
CTPAWVTEQAYVSNKTNTERP FQKIEHV 
EGLSCTTQKFSSND 


6960 


20861 


A 


7016 


548 


158 


KFDT/GATLFDGRPAVFDEA/DFPSVAV 
YLTG AE YTGEELDSDTWQAELH I EVFL P 
AQVPDSELDAWMESRI YPVMSDI PALSD 
LITSMVASGYDYRRDDDAGLWSSADLTY 
VITYEM 


6961 


20862 


A 


7018 


396 


34 


QKNE IKPTMQLHLTPARMAI IKKILKNG 
GWHGCGE/METLLHCWQECKLVQPLWKT 
tMWRFIiKEPKVELPFDPAIPVLGIQQSQH 
WESSNPSTGDPAIPPDGIQQSQPRRKVI 
KKRYLHTRL 


6962 


20863 


A 


7020 


2 


241 


FPGPTAARRRQKEDRYEEYDKVLSDKLK 
EAETRAEFAERS LTRLEKS I DDLQBEL \ 
YAHKLKYQAI SEELDHALNDMTS I 


6963 


20864 


A 


7021 


347 


3 


NDLKSLTHSVAQARVQWQ \N I SSLQR PP 
PQFKRFSCFSLPSSWDYSLLNCKFYKLH 
FADE E I ETLTS QRAQ LECKS NM Y KD I HN 
TVRSYKLKYKVPQKLERSRNRTKSSTRE 
VRS 


6964 


20865 


A 


7022 


2 


160 


LREDDRRGRGFHHFGQAGLiELLTSSD/ L 
PTSASQSAGITGVTHRARPDWS FCQF 


6965 


20866 


A 


7023 


3 


248 


DRVSSVAQSSLELLSSSNPHLGFPKCWD 
YRPNPPLPFSSRSLLFLPSFLIFCRDRV 
SMLPKIASSFW\VRPPWPSKVLGLQA 


6966 


20867 


A 


7024 


483 


143 


PLNPLKFFFFPKAFKFWGGVGP I CSPPK 
RRVLSQKSQVGFYFPPQKEKGYTFPSPG 
KFGPPKEILKRAPPFFFFFFFFLRRDK/ 
SLTMLPGLVLNPWPQAVLPLQPPKALGL 
QA 


6967 


20868 


A 


7025 


414 


81 


PHPLKFFFFPETFYFWWGVFSPFSPPGR 
KVPPQKSPGGFLLAPPKGKSFPFPCPFK 
XGPPRG I FKRAPP YFF FFFFFNRRDK/ S 
LTMLPGLVLNPWPQAVLPLQPPKALGLQ 
A 


6968 


20869 


A 


7026 


16 


218 


KFAPRCRNSARENTQNQPRKHSDTPFLK 
K/HKINWNNKLAGRGTHLWSQLLGRLRW 
EDHLS PRQSRLQ 


6969 


20870 


A 


7027 


28 


272 


EFVGVGPSEPGAGYNLIACAFLSPSEKC 
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SIRVGVTRFSRCHIjSPLTLTKKGNSLTP 
CASRVRQCLTLLRLAHGALHPLSSTP 


6970 


20871 


A 


7028 


277 


480 


LG I FHMLSDS FLLLLLFCLFVFKI FFG I 
FCRD \RSCYEVWAMLPRLVSNS WAQVI L 
PPWPLKVDGLQA 


6971 


20872 


A 


7029 


24 


153 


SVWWNSP/RSSGEFRKTGVHDADFEAHI 
IDMLAKASKIEVYRGN 


6972 


20873 


A 


7030 


2 


470 


IETPRPLWWHS AYRGRQAS LS CGGLKPV 
R/AS WLLCLPNQAWAMAGAPP PASLLP C 
SLISDCCASKQRDSVGVGPSEPGVGYNL 
LVRRFLSQSEKRNIRVGVTRFSRCV/LL 
PLSLTRKGNSLTPWASQWQCLALLRFA 
HGARTHWPAFTVWHSLVK 


6973 


20874 


A 


7031 


487 


107 


SP PAPGVAPAP P P PTRVFS KKKKKFFF F 
FLKKIFFFKPGGDPGGIFFFPGGPPPGP 
KKFSPPLFPKKGGYRVFPPPPGKISIFF 
FFFFFFFFFF/CRDR/ SLAMLPRLVLNF 
WAQ VILLPWS PKVLGLQA 


6974 


20875 


A 


7032 


576 


36 


GYTSQMGRPGRGAPHFPDRAAGQRRSSL 
PRWGGWAEALLTSQMGWRPGRGAPHIPD 
DGQPGRGAPHLPDEEQLGRGAPHLPDEE 
RPGRGAPHFPDGAAGQRCASHPTRGGRA 
EVLPTSQTGQPGRGAPHLPDDGRPGRGT 
PHLPDG\GPGRGAPH I PDGAARQRHLS P 
PRRGNCTTR I VDQ 


6975 


20876 


A 


7033 


2 


129 


YGPTHASGAMLRSCAARLRTLGALCLPP 
VGRRLP\EASRDPS 


6976 


20877 


A 


7034 


1 


287 


RLSLALVARCGVQWCDLGSPQPPPTGFK 
RFSCLSLPSSWDYRHVSPRPANFVFLVD 
TGFLHFGQAGLELPTSGDPPALASQSAG 
I\TGVSHRAWPK 


6977 


20878 


A 


7035 


3 


315 


HAS AHAS ALFY F I F FETRS RS VAQAE VQ 
WRKLG SLQ P PR F KQ F S CLS HP YRHAP PH 
FANLVFLVETG FA\HVGQAGLELPTS£N 
LASQSGGITGVSQRARPGLNF 


6978 


20879 


A 


7036 


430 


304 


VI FYLI FFCKDGV/ LTMLPRLVLNSWLH 
AILPPQPPKVLGLQA 


6979 


20880 


A 


7037 


3 


256 


FFFFNRQKDLFGNFQTCQKWHS PS PR PT 
/APHQAPKGNTSPPSLP IHLTLAS TCT Y 
P I PQAMLENWS VPS PAQAAGEAAL WHPR 
R 


6980 


20881 


A 


7038 


13 


481 


FARLHEFGTSRVI YLLARLVSNS WPQVI 
CLPWPPKL/LGLQA 


6981 


20882 


A 


7039 


455 


255 


SCLSLPSSWDYRRAPRHPANFR\FLVET 
GFRKVALAGLK YPPALASQNAGI TGVSH 
HARPEQFILTN 


6982 


20883 


A 


7040 


490 


172 


KCWQG / C/ GETGTL VHC/ WIGRNCTtMMR 
PQWQQAWQVFKKLHRELPSDPAMPLLGV 
QPGGLKIDTQT/RYMYMALLTVKRWKQP 
ECPPVDK/LDKLWYIHTTEYRVTIKKK 


6983 


20884 


A 


7041 


438 


37 


IFPNLPPKLHYFPQFSPPEKLPKPLLIK 
MAKRQSQGKVFETPKEK/SRAGPPPKFT 
TGFSAEKNGGPKPGGY I SQRVQKKXNNC 

/DFTASRPALQKILQEAEFHRTD 


6984 


20885 


A 


7042 


292 


463 


IEAISFLPFFFGDSLTGAQWRDLGSLQP 
LPPGLKRFSCLSLPSSELP/CSGRCQS 


6985 


20886 


A 


7043 | 59 


257 


LASLVSQSWMKKMWYIYTMKYYAATK\G 
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nucleotide insertion 














NE IMS SAGTVJMKLEAI ILSKVTQEQKTK 
HHMLSLRSGS 


6986 


20887 


A 


7044 


279 


14 


RTADTEIGRGSSLCWRYGSLQIQVSKSH 
NFFFETGSHSATQYGVQWKNLGS PQPQT 
YGLQQSSHCGLPKCWDYRE/LPPSMECS 
GTI SAHLSLKLMGS SNPLTVASQSAGI T 
GNSSSRLHDINYGLQ 


6987 


20888 


A 


7045 


578 


253 


AAASTFSCFFVFFETEPNSVA\RLECSG 
ATSAHHKLCLPGS SNSPVS APQVAG I TD 
AHNHIQLI FVLVETGFHHVGQAGLELQ/ 
NLVISLPWPPKVLGLQAWSHHAQPFV 


6988 


20889 


A 


7046 


3 


229 


FFKTGSLSVSQAGVQW/PNLSSLQLPPP 
RFKQFSCLSFPSSWDYRHPPPHPANFCS 
FSRDRVPPCWSGWSQTPDLR 


6989 


20890 


A 


7047 


367 


151 


LSLLSSWHYGHVPTSPANFFYF/ CVDTG 
PRCVSNSWTQAVCLPQLPRVLGLQA 


6990 


20891 


A 


7048 


370 


14) 


CQPPHPANFCI FVE I VFCHVGQTGFKLL 
TSNDL PAWAS Q SAG I WD\TGVS HHAW P S 
RFNSEVQFDYVSNKNVDFSLS 


6991 


20892 


A 


7049 


104 


361 


VTAPGLEAAFKERVWPGAVARTCGPSTL 
GGRGRRI \RGQE I KT ILANTLKPCLY 


6992 


20893 


A 


7050 


2 


325 


RFSCLSLPSSWDYRHEPPRLAIFFFFFF 
GI IRKGGFTH\ VGRGGFKPWTSGNMPAL 
PSQGVGFPGLFS PPRPVRGFFYPLPKWG 
GLFLQNGVKFKKGG S GDLFKMEKK 


6993 


20894 


A 


7051 


14 


208 


AHE I LS E I SN P LE PG WDHRCVS S CPAHF 
FVF/CYRDGGLPMLPKRVLNSWAQAILP 
WAPKVLGLQV 


6994 


20895 


A 


7052 


103 


368 


GNRVHTYMLTALFMTAKKWNKPKC/ PVS 
DEQIRQIWSVHAMEYYSAIKRNEALICV 
TTWVNLENIMLSES SQKTMGCMLRRS I Y 
MKGPE 


6995 


20896 


A 


7053 


421 


2 


HiFLP PEVARGFPKGRGRAS PS PGGFFFG 
PRGGP IFPPP\S FFPTRPWSKGEVFPS P 
TPGKG FRGFFFP P P P KGKGGVFPLFFKK 
YLGGEGAPQNKGGPVFLLWPPHSGPPGP 
PKNFFFFFFFFETESCYVYPGCSAWQS 
Q 


6996 


20897 


A 


7054 


489 


40 


IPEAPGGPGFKREDSPGANKSGPKALYP 
P KTFHPP WEGRFRS FQKGAPP P PVFF P P 
FFPLPFFFLRQSHS I PQAGVQ WRDLGS L 
QPPPPPGFKQFSCVSLPCPP\RQMGFRH 
VGQAGLELLTSGH P PDSASQS TG I TGVS 
HCAEFHRTD 


6997 


20898 


A 


7055 


3 


261 


TGSGSVTQAEVQAEVMAHCSLNLLGSSD 
LSALASRAAGAAGTCHHAWLIFVFFIEM 
GFHYVPRL/ GLELWARVIHPPQPPRVLG 
LQV 


6998 


20S99 


A 


7056 


1198 


1040 


FLWKIALLFYFKKLPQSLQPLVATTIiSS 

qqpstsrqdplpakriArlaegsddr 


6999 


20900 


A 


7057 


271 


33 


LFFFFKMESRSVARLECSGAILAQCNLA 
RLFLVE\QGFHHVGQDGLGLLIHPPWPP 
EVLGLQAQATPPSPKITYEAATVM 


7000 


20901 


A 


7058 i 


2 


194 


CYRKQi^PnioAivlNrYr LiUnF F CLFFC\KA 
LSLLARLLLNSQLQASLPPQPPKVLGLQ 
ACGTTPCP 


7001 


20902 


A 


7059 


3 


272 


NMWKLNNILLNNQWVNGEIKMKI FKNCK 
E \NGNTT YQNLRDAAEAFLKGKF I AVNT 
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correspon 
ding to 
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residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D-AsparticAcid, 
E=Glutamic Acid, F=Pheny!alanine, 
G=Glycine, HNHistidine, Msoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Pro1ine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threon in e, V= Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possib)e 
nucleotide insertion 














YIKKDYRKTCLRLFYSYHFFNKEKEYTL 
KTMIKV 


7002 


209O3 


A 


7060 


397 


129 


FWG PHKKNFP L P ARGRKLGS F KTA/ LPL 
FFFFFSFSLSIjFFFSGSCSVS\RLKCSG 
TITAYCSLEFLGSSNPPASFSQAAGTTF 
TFFLFIG 


7003 


20904 


A 


7061 


3 


351 


SCFRLLCLPKEAWAMAGAPPPASLPPCS 
LISDCCASNQCDS/VGVGPSEPGAGYNL 
WRRFLSGSEKRNIRVGVTRFSRCRPSP 
LSLTRKGNSLTPCSSQVRQCLALLRLAH 
GARTH 


7004 


20905 


A 


7062 


60 


324 


DDFVSWQTGVHWCGRGSLKPQPPRLSS 
TSRLSSPSSWDYGRVPF\NIFCRDRVSL 
CFPGWPRTPDLRGSSCPSPKWKLHTRV 
TAPGQ 


7005 


20906 


A 


7063 


481 


160 


RPGAPG / PDD FARMG PR P KGGAFTL VHP 
MPSGAAFPGLLSE P \ PHRGPRS FAARS P 
TLRYPGRAPSKGSGEDPARPPATGFGPN 
LIQDGhVhMShTENVC 


7006 


20907 


A 


7064 


65 


481 


CLCP APRGGAYRGRQASLS CGGLHP VRA 
SRLLCLPKQAWAMAGAPPPASLPPCSLI 
SDCCASNQRDSVGVGPSEPGVGYSLWR 
RFLSRSEKRNIRVGVTRFSRCV/LSPLS 
LTRKGNCLTPCASQVRQCLALLRLAHGA 


7007 


20908 


A 


7065 


392 


1S3 


GGLKP/ I PGNPGNSLFSQKKKKQPGAGG 
S PRG S P P PGGLGGG I F FG P KGLG SIN PN 
FGPAPPPGGQKKKSLF 


7008 


20909 


A 


7066 


538 


248 


FFLKQS CS VAQAG VHWGYLS SLQTLHPR 
F\KFSCHSLSSSWDYRCAPPRLAN\FVL 
LAEMGFHFHHIAQAGLELPTSNDLPTSA 
SQNAG I TGVSHHT 


7009 


20910 


A 


7067 


2 


127 


VTGQAWHACS PGTLGARAG/ W I VWAQQ 
FRTSLGIMVRPCLYL 


7010 


20911 


A 


7068 


379 


217 


KKMFCKNE/ SFTIFPRLVLNSWAQTILP 
PWPPKVLGLQVQTTVPGSYSPLSLYKE 


7011 


20912 


A 


7069 


386 


176 


KRDKVDKWLKKMWCIHIMBYYIiAIKKET 
LPFATWVNLQ\TFILSEISQEQDKYKMI 
SLICGIKMLIYRIRE 


7012 


20913 


A 


7070 * 


30 


408 
* 


NSSELGSVGVGPTEPGTGGNLLVCWLLR 
LWEKCSVWAGVSRFFQ/CTDYDGFPWVG 
KGNP P TPCASQ VRRHPALLRRTLHGLH P 
CASHBKNQVPQLEMQKSSVFCIDLTGSC 
RVELFPFGHLGSKNP 


7013 


20914 


A 


7071 


310 


414 


HVPWRSFAPAHCNLGSLQPPPPGFKRFS 
CLSLPSSWDYRCPPPRLAN\FCSGSLQS 
WLTATSASWVQAILLSQPPE 


7014 


20915 


A 


7072 


353 


243 


IFIFCRV/RGLTIIiSRLVWNFWDQVILL 
PEPPKVLGLQV 


7015 


20916 


A 


7073 


1 


276 


ELWDNFKEPNIYLKSLKERKECVCERGR 
RDRNKTLEDIMAKNFSNWMKI INPQIQE 
AQQTPN\QETGRKLALRHI tikllktnd 
KKKILKAV 


7016 


20917 


A 


7074 


362 


133 


GGGGGGGRYSPQKKGTLPLTPPSNWGPK 
GAFF PRG WFFFFFFFS \RDGCIiAMLPRL 
VbDSWAQVMLLPQPLKVLKL 


7017 


20918 


A 


7075 


3 


177 


KWS PHHS L PNN \ WL YRHMP P CPANVE FF 
CSLAVLPRMVSNSWPGAILSPWPTRMLG 
LQT 
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residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C^Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=lsoleucine, 
K=Lysine, JL=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


7018 


20919 


A 


7076 


1 


169 


PQLPSSR3STYRHVPPCWANFCIFS/RRRC 
FTMLARLFLNSWPQVTRPHWPPKVLGLQ 
A 


7019 


20920 


A 


7077 


402 


231 


PSCLSLRSSWDYRCMPPHLTNYYYYYYY 
YFVEMGSHSVTR PQDILPPWP \ PKVLGL 
QV 


7020 


20921 


A 


7078 


422 


316 


FCKDK/ SLPMLPKLILNSWVQTTHLPQP 
PKVLGLQA 


7021 


20922 


A 


7079 


58 


293 


DPI CTKS KKLKKTQ\TDKEDE ITTKKAK 
KAKSK I KI EEDGEEGKWWEEE VSVKK 
KKDKKKPWLGMVAHTCNPNTLGG 


7022 


20923 


A 


7080 


363 


123 


GVFFRDHARAFSLHTGVCAVHISRCF/H 
VLTHFKE FLFHWFLTIHTHTHTHTHTHT 
HTPSFPELGKAVPMHRIVIVPTNYQH 


7023 


20924 


A 


7081 


3 


181 


YFLL\FFWDRVLPCHPGWSKGWSQTPGL 
GQSAHLSLPKHWCRREPPHPALFLIjFK 
NTIS 


7024 


20925 


A 


7082 


453 


142 


FFXXFXFXXXXXFFFFFFFFFXPPFFFF 
FFFL KNS FP YF FLFFFFYPL FL P LFF FF 
FPPNFFFFFLFS *NFFFFFFPPPPLFFF 
FFFFLFFFFFFFFFFFFFSF 


7025 


20926 


A 


7083 


268 


92 


KNNLPSSWG\YKCSPPHKGNRFFFYIKD 
GGGLIMLLRLISNSWAQVIIiLSWPSKVL 
ERQA 


7026 


20927 


A 


7084 


413 


0 


SCSVAQDGVQWDDLGSLQRLPPKNKPFF 
CLSSTPAA\FPSDWDYR 


7027 


20928 


A 


7085 


400 


125 


KTRSFSWRGGVQL/CI /GSLKPRTPGV 
KSSSPSASPVGRNTGMCPPPQQIFFFLV 
ETGSHYAAQASLKQSSCLGFLILFLSQP 
LGEKEGPTICT 


7028 


20929 


A 


7086 


377 


38 


WPG PGFYFWGPKKK I KPFPPPGVKLVS F 
KRAPPFFF/CFFFETGSPFVAQTSLELL 
DSHNP PTS ASQS AG I TDMSHHARPMHS I 
CMIQLWKSYKRFHFISQGSRDCSSSREE 
T 


7029 


20930 


A 


7087 


1 


151 


S FCRD/KRLIMLPRLVLNSQAQVMLLPQ 
P PKVLSHHTWP 1 1 KFSFMHFNLF 


7030 


20931 


A 


7088 


405 


111 


PRLVLN \ S ELRRS AHLSLPKCWDYGPE P 
PCPAQMHVLKSGTAAHKVEDFLASWSHL 
PTHSPFPVLCSKTQISVAFFIQNLRAGQ 
RSGQRHFGKQCGSIH 


7031 


20932 


A 


7089 


] 


309 


KQIN KS SLRISGKP PNCEC VLH LG fCS VN 
KS LLE PH KLAS D P / HLCDFLE S S \ EQ VK 
SVKELNCHV\TSLYTLGALESDMAEYF\ 
, NKYTLGPGiMLAHAYNSSILGG 


7032 


20933 


A 


7090 


414 


240 


FSLF/ CKNLKKGLTMLPKLVSNFWPQAI 
LLPQPPKVLGL 


7033 


20934 


A 


7091 


460 


328 


RHPPPYLGK/FFCRDR/SLTMLPRLVSN 
SWPQVILSPWPPKVLVL 


7034 


20935 


A 


7092 


388 


199 


KQFSCLSLPSRWDY/ SALPPRPAHFCIF 
NRDGFSPYWSGWSLGLPKCSDYRRRIFL 
RAKAHVRE 


7035 


20936 


A 


7093 


18 


271 


MHS FAT Y L LVN LHLliV LCRL C P LS M Y SG 
EKSYXliijl KJKFMijWr / boLsfr IrSWSFSS 
ITLRIFLTCLTLKLSCCSCCCCCYCCCC 
C 


7036 


20937 


A 


7094 


415 


291 


GDLGSPQAPPPG\SRHSPASASRIDGTT 
GARH I MNFKNLAI K 
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peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 
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peptide 

sequence 


Predict- 
ed end 
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ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=PhenylnJanine, 
G=Glycine, H=Histidine, Wsoleucine, 
K=Lysine, L=Leucine, M=Metbionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
VV=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


7037 


20938 


A 


7095 


21 


428 


DKGLTVTQARVQW/HDPSSRQPPSPGLK 
RFFRLSLPSNWDHRCAPPRVANYYFFYR 
WGSYHVAQAGLBLLASSNTPTSASHSAG 
ITAIjFLRDTLVRFFMGSFCFPQS fgaaf 
TAQIPQRGRTEELFIFLFLNDYLFS 


7038 


20939 


A 


7096 


320 


134 


IiPL \ PKTTF YLFLNRDRVLLCCPGWS R \ 
I LS S KQS TLLGLPfCCWDHRHE PPQQAVD 
HTLDFS 


7039 


20940 


A 


7097 


402 


221 


DYRHVPPFPANF/ LVEMGFWHVSQAGLE 
LLGSSDPPNTSQSAGITETSHQAQAYTF 
IFIIY 


7040 


20941 


A 


7098 


427 


259 


PPPRFKRFSGLSLLRRWDYRPPPSCPGN 
fffff\ci FVQMGFHLVGQAKMLKLHD 


7041 


20942 


A 


7099 


422 


275 


VQWYNLDSLQPASSGFKRFSHLRLLSTW 
DYRH\ PRPANFCI FSNTKDTFI 


7042 


20943 


A 


7100 


3 


404 


HASGATWRNP VSTR VWWH I P I VPATQEG 
EMGGSLEPRK/LKAAVSHCTPVWVT 


7043 


20944 


A 


7101 


539 


336 


FETESRSFTEAGVQWRDLGSLQPPPPGF 
KR/DSPALASRVTGTTGAHHHAGLIFVF 
LVEAGVS PHWPGWS 


7044 


20945 


A 


7102 


408 


287 


WLILFFIFCRD/RGLTMLPTLVSNSWAQ 
AILLPQPPKVLGL 


7045 


20946 


A 


7103 


1074 


705 


SFLRWSFTLVAQAGVRWRGLGSIiQPLPP 
WFKQFSCLSLLSSWD\YGRPPPRLANYF 
F\VILVETGFHRYWPRMVSNSQPQ/CDL 
PAVGRPNSAGI \ TGVSHCAWPKMFLNNC 
TCTHSPYLQGIVSLG 


7046 


20947 


A 


7104 


227 


55 


FSQHKI NIQFYISLS PTH I TMS FFLCRD 
/RGLAMLHRL/VLNSWPHVILPPWPPKV 
LRIt 


7047 


20948 


A 


7105 


479 


314 


PRLLLIFFFIERGIALCSPGWSQTFGLQ 
ESSCFGLPKSWGPRKEPQPCLALKWTLQ 


7048 


20949 


A 


7106 


37 


143 


NSFF/ CKRRGLTMLSRLVSNSWPQVILT 
PQPPKLLEL 


7049 


20950 


A 


7107 


441 


106 


GAPSPASLPPCSLISDCCASNQRDSVGV 
GPSEPGVGYSLWRRFL»SRSEKRNIRVG 
VTRFSRCV/LS PLS LTRKGNSLTPCASQ 
VRQCLALLRLAHGAHTHWPAPTVWHSLV 
R 


7050 


20951 


A 


7108 


413 


207 


CCIQAGPAFFGGPPPPRWGWGGVPPPGY 
/MPSPPVFRVGLLVFRGPPPNWFliPMGL 
QPDGPGLGDPKKFAK 


7051 


20952 


A 


7109 


92 


409 


PHLPDGAAGQRCPPPGRGGGCVRAGPQL 
PPGRGGWPGGGLPPPPSRTGWLPGGGAP 
HFSDGAVARQRVSSLLRRGGRAETLLTS 
QTGSRPGRGTPH I SDDGRPGRD 


7052 


20953 


A j 


7110 


408 


13 


PPWGPPGGGPPEARGSNPPGPPWGTPLS 
PKKPKNYSGQGGGPLNPPPPEG/SGPGI 
P FS PEGKG SLNPGGQKGKIGAPS PPPGG 
PKKKPPSQKKKKGKEKKEKVIKGESSQI 
LLFKVSKI LSVNFLHLSSLI 


7053 


20954 


A 


7111 


290 


175 


IjKKNCR/ DRSHTMLiPRLVLNSWPQTILP 
SQPPKVLGLQI 


7054 


20955 


A 


7112 


1136 


552 


I LtFKEMGCEAGQRCS CPVI PITLGRPRR 
ADHLRCGVRDQPGPTWLKPRLYSNTKIS 
WAWWRVPA\ I PA\ IRRPEATRFTFNPRR 
WRLPLNLRSHPLHSHPGRQS KI PI KKER 
KKGQDRKGKGCGQNWGQKQKRRKEGKKE 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, 1=1 so leu cine, 
K=Lysine, LHLeucine, MHWethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valme, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=St6p codon, /= possible 
nucleotide deletion, ^possible 
nucleotide insertion 














MGMGRIGGKNKKEGRKERKEGRKEGNKW 
VWAELGAFPEQGKGLRRGPSEEA VGVS P 
L 


7055 


20956 


A 


7113 


570 


199 


LCLLKQA WLMAGAP P PAS L P PCGIi I SDC 
CASNQQDSVGVGPSEPGWYSLWRRFL 
SRSDKRNIRVGVTRFSRCV/LSPLSLTR 
KENSLTP CASQVRQ CLALLRLVHGAHTH 
WPAPTWHSLMR 


7056 


20957 . 


A 


7114 


466 


284 


RTRGIGWTANRRMKATQPGLTFFHLIHK 
QEGGNVGSQKGQRD \VTNSAFHWRLHDQ 
TANCLS 


7057 


20958 


A 


7115 


474 


224 


DSNFEKS STLGKMLSNS I KCYRE I FCER 
KSQLMQQTSML \ FKKLPQS PQPSATTTL 
I SQL P PTS RQN P P PAK \R LiQLlAEG SDG C 


7058 


20959 


A 


7116 


377 * 


111 


FXXNFXS FLFFSLS LTRKNWVAH I TFFF 
FFFLPTPPFFYFIFLLIFCFFFFLFYIF 
FFFFFFSFFFFFFFFFFFFFFFFFFFFF 
FFFFL 


7059 


20960 


A 


7117 


417 


148 


VPPCLANF\NFFVETGSPYVAQAGLELL 
GSSHPPTLASQSSQI TRVSHCSLLYFLM 
HSNTS I S FFS F WAL VS FLEM PS S CQDG I 
TVLAR 


7060 


20961 


A 


7118 


2 


196 


RVPP P CPA \ NFVF F VQTAFCHVAQAG P K 
LLASSNPPASASQSARIAVLSYCPLPSL 
RLRSHLDFF 


7061 


20962 


A 


7119 


2 


417 


EAGGSLEPRNLRLHCA/LVAPMNSRCIP 
AWAT 


7062 


20963 


A 


7120 


801 


450 


G P KFLG P PG VR IN/ RR PGGG G LP RVYKK 
NNYPEDRPI VLPAFFSG3GR/ GPGPHDK 
HPLPRKKQKNS PGRGGEPPFFP P ?GFP P 
GGER KNP LT PQRKRVHWGQGLAP ? FP P S 
LSAKPKLF 


7063 


20964 


A 


7121 


3 


199 


DAWGFAL WQAGVQ CCDLSS LQ PS P / RL 
AN FVFLAETGF LHVGQAGLKLLT SGNL P 
ALTSQRLRLQ 


7064 


20965 


A 


7122 


407 


164 


NVAGLTGAYLDSFFGRDRV/LTMLLRLV 
LNSWAQAILPPWPPKVMGLQACTI EVSS 
LVAQIASKRLSGVQESDEAAWIRDC 


7065 


20966 


A 


7123 


33 


192 


RWDFTML PR L VLGS KRS AHLGL P KCWD Y 
KHEP/PCCPGWSWAQRDLPTLASQSAGI 
TSMSHLHQAI PLKFLEQFIRFS I LVF 


7066 


20967 


A 


7124 


3 


399 


SIPVDWPSRAPTPRGRE3AAKQGN/YPV 
GQRN\RMAGLSPRSFCWREVSFSR/ shs 
C/ I CWVQLG KSAVLPQQSRGPGAH3GSR 
RRGGSSPPCPPLQTQLVFSPQELSMSAP 
I HKLiaV 1 LQGDL I S I GGAPf KCRLA 


7067 


20968 


A 


7125 


1901 


1574 


FFEMES CS VAQAGVQ WHDRGSLQ PL PPG 
FKQFSCLSLPSSWDYRHLPPHRANFFVF 
LVETGLHH VGQ \SGLELLTLGD / LLP AS 
AS Q SAG I TGV S H HAW AKNPC FLT S I DT 


7068 


20969 


A 


7126 


408 


234 


LPSSWDYRSPPPQANFYIFSRD/MGFTM 
LJVRLLTSSDPPALASQSARIAGVSHHAQ 
PVYF 


/Uoy 


20970 




till 


oo 


11 1 


AQAGLELLS S S DPPTLAS QSAG I TGVSH 
HIQPPFKK 


7070 


20971 


A 


7128 


3 


140 


QRWWCVPATRKAEAGESLETESSRLQGA 
MITP\CSPAWATEQDLAS 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Pbenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, LHLeucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T—Threonine, V = Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \~possible 
nucleotide insertion 


7071 


20972 


A 


7129 


1 


212 


PLLFVSPPLLSVSPPLLPFP/EPAASPP 
PSTHSLLPSVQSESLGQFLHPPDPPPLA 
FPAPLKHQDLHI SHL 


lull 










3 


KGFTILHAI KDFCDS/WEEVKISRLPGV 
WKKLI PTLMDDF/EGFKTSVEEVTADMV 

CilLiXEt JjEiK J_ill» Lie* v r\Jr Ht lui wiiji-i^o rCUix. 1 
LTDEKLLLMD 


7073 


20974 


A 


7131 


421 


201 


NFCILVD/TGFHHVGQVSLELLTPSNPP 
ASASQSSGITGLSDHSQRKSELVLSFNF 

P&T PUTT T nJTVITT ZlTrVCNT) 


7074 


20975 


A 


7132 


221 


529 


LIKFTIHSFYFIYLFIFFSGDGCFALSP 

YAG 11/ CMRHHTQLI L Y / CL VET / E FHH 
VGQGGVDLLTSGDPPASASQSV 


7075 


20976 


A 


7133 


1 


227 


KGIHFSSANDVKKKTVLTWLNSQDPQLF 
RDGLDGWYHRLQKCL\ELDEVYVEK 


7076 


20977 


A 


7134 


1242 


300 


DFVLPDGRVIP\KGIVCLINIIG\THYN 
PN\LWPTPLEVYDPFRFNP\ENIKERSP 
LAFIPFSAGPRNCIGQAFAMAEMKWLA 
LTLLHFRILPTHTEPRRKPELILRAEGG 
L WLR VE PLAGTS RGNPRAS SR PSL £ DP W 
VLRRPLPPPIFLFIiLLLTLPLLFSVMAR 
EPPSPLRVEAPGPPEMRTPPAIEATPKG 
ILQPAGGRIjKFLSGCVPIIjHQVAGHRYG 
KDKMG I LQHPDGTVLKQLQPPPRGPREL 
E FYNM VYAAD CTDGVLLE LRQYLP KYYG 
IWPPPAAPNDLYLKLENVTHKFNKPCI M 

TWIT T CZCklCY T . 


7077 


20978 


A 


7135 


1 


259 


GTSCI LASMLTRHRLRFS F/ CLFETEFC 
SVAQSGVQWYDHSSLQPQPPGLKLFSPV 

LGFYS 


7078 


20979 


A 


7136 


1132 


900 


CSGVI SAHCKPRLPGSRHS PASASRVAG 

TTPTPtrtlZiPT WCTT PT7T,\n7Mr , PT4T3A7COn 
lib ±K.ririrtK±jiy r Llrf Jj VHl v J\jr rlKVoyiJ 

GLH LL» / NL WC P PR P PKVLELHA 


7079 


20980 


A 


7137 


1 


501 


FFLRWSLTVTRLECSDVI SAHCNLHLPG 

■jOUD rViO-rir xvIj/Hj i a / Lj/lX/rt-VJ-' " V r Ju V 

ETG/FHHWKDGVDLLTS 


7080 


209S1 


A 


7138 


2 


229 


WHSHGLLWSQPPL\YLSLSSSWDYSHVS 
pppartT PT.PT.\n?TnQDVvzvnar:T.QT dpc 

Jrv- ir/ivj ijV^ijr JjVJG»±v»ofX v>Vv ( ^i-VOi_»i>l_'r<Vjo 

. ASQSTGI IGMSQPAGFWKQL 


/vol 




t\ 


/ J jy 




Do 


t T?n yci v p ppprvYNrnn pnv p t> p pp 21 p p pp 
FF/CPAKYYTMLPRLWSNSWAQAIKPP 
WPLKVLGLQA 


7082 


20983 ! 


A 


7140 


422 


274 


TMLPRLVSNS \GLKQSSRI»SLPXWWDYR 
YKPPCLVYTFFINRPPEVPKE 


/Uoi 






71/11 

/ 14] 


417 


60 


\7T«IDT?T PPT PTCT DfnOATDT T O TUT VT?TT 

KLLYHKDTC\WNQPKWPSMVDWIKEMWY 
INTMEYYTAIKKINAIFRDPLTSPWDPF 
SWRKFCLPLEIWPASYFLPGINTRSLDE 

DT T-I7 YCZVQ 
ir Jurl J. i\br>K 


7084 


20985 


A 


7142 


391 


70 


SPPPHPPRKGDYRGGPP/RPGKKREIFF 
FFFFLVDTGFCHVAQAGLKLLASSN /LP 
TSASQSAGITSVRHYIQPTLLKI IPQFL 
DILLFLKIFHSFLKSFNFNLGSFY 


7085 


20986 


A 


7143 


405 


304 


FCRDR/SLLMLPRLDLNSWAQGILbPWP 
PKVLGL 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=A!anine 
C=Cysteine, D=Aspartic Acid, 
E-Glutamic Acid, F=Phenylalanine, 
(^Glycine, HHHistidine, I=Iso leucine, 
K=Lysine, lj=Leucine, M=Methionine, 
N=A s para gin e, P^Profine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop cod on, /^possible 
nucleotide deletion, V=possib)e 
nucleotide insertion 


7086 


20987 


A 


7144 


3 


159 


HAS EK I WYM YTMEH YA\ SQKKNKIMS F V 
ATWMRLE VN I LS KLMKNEMMS VNGR 


7087 


20988 


A 


7145 


343 


115 


CEFYFLSCIYVFIYFCR/DKGLSLliLRH 
VSNSWPQVILLPWPPKVSHQAQPEFYFR 
NKQTKI DLS FEETAADLAYYS 


7088 


20989 


A 


7146 


453 


119 


RCWKGCGLWISLRTT/WREWQLVQPFWK 
TVWRCLRKLKVEPPYEPATPLPGIHLKK 
TKAVAQRDMCT PL FI VAFQHHLLKRLF F 
SRCVFLAPLLKIA 


7089 


20990 


A 


7147 


3 


105 


TIEWGEVIS/SSYSSTMANNEGLFSLVA 
RKLSRPL 


7090 


20991 


A 


7148 


295 


23 


FANIFSHSAETLQTRREWGPIFNIL/EK 
NFQ PRI SHPAKLS F I SEGE I KS FPDKQM 
LRNF I TTRPALQEFLKKE FMS FPGT WMK 
LETIIIS 


7091 


20992 


A 


7149 


1 


161 


RTRGFFYLDGVRS LAMLP / RLVLNS WPQ 
AILLPWLPKVLGSQTLAMVAAYTVFV 


7092 


20993 


A 


7150 


1 


219 


DYRCTPPHPAHF/ SIFIFCRQR/SLTMF 
LSLVLNS WTQAI LLPWPPKLLGLQACTA 
MPI PGVELLWVSGCFVKES 


7093 


20994 


A 


7151 


177 


34 


ELVSFLSFSLSLSFFPSFLPSFLPFLSF 
LPSFNPSF\LPSFLPSFL 


7094 


20995 


A 


7152 


398 


40 


MNDRVLLFHFGWNAVAQS W FTGASN \ FG 
LKQSSHLSLLCSW3STYRHTPQCPASLLAY 
VCMYVCMYVCMYVCRD/RGLTMLPWLVS 
NSRPQAILPPQPSKIMGPLLGAVAHAYN 
TSPKGSQRR 


7095 


20996 


A 


7153 


418 


179 


RNSCTQMRNEALPNSGYFPKLKYYYDF/ 
CQRQDLAILPRLVYNSWVQAI LLPWPPK 
VLGLQHEPPCLAVIQFLKGNLKIV 


7096 


20997 


A 


7154 


767 


537 


QALLCLPGWS\AVAQSWLTAASNSSDPP 
ACLPSSWDYRHVPPRQAEWSFRVSVSSP 
VKRDPGSPWRRMRPCSE 


7097 


20998 


A 


7155 


421 


301 


HAQLFFCRHW/SFTMLPWLVSNSWAQAI 
LLPWPPKVLGLQV 


7098 


20999 


A 


7156 


410 


269 


AGGPQTGSFFFF/ CYERGVSLLPRLVSD 
TWSHVILQPQKPKVLGLQA 


7099 


21000 


A 


7157 


3 


237 


MQIKA/TMLYHFTPTNMTVLGFKKNSRG 
WQARGE IGTLVHCWWGCIMMQPL WETVW 
HFLKRLNKLGVWHACSPNYSGG 


7)00 


21001 


A 


7158 


435 


211 


LFLVEMG \ FAVLASLVLTLE \ PRDSPTS 
ASQRARMTGMSKHTQPLFCLFVS KFHLA 
KKLSLIWQNLRRAGI CRLS 


7101 


21002 


A 


7159 


3 


220 


SSSASQSGGITGVSHHARPSERQ\SCSV 
AQAWQWCNYSSL I TPTPGLKRS SRLS L 
LSSWDY\GALPHPSVAS 


7102 


21003 


A 


7160 


1 


172 


FRKVLGRTGSNGLCTQVRVE FMDDTSRS 
I MRNVK\ S PVREGDVLTLLELELETLRL 
R 


7103 


21004 


A 


7161 


1 


180 


DAAPVLKATARPTRPDRWK/ IDIDSLDI 
HGETPSLQKNTNTFRAWWCI P WPATRE 
ADMF 


7104 


21005 


A 


7162 


3 


108 


KKKKTGRFKL V \ MQ YNRRFWWP TFGK 


7105 


21006 


A 


7163 


409 


110 


GQHGLTQSLPLSAPPRGAVPRWAPRPD/ 
SGLSLSPLLPTAFATTGASSANRFVSIG 
PRDGNFLNI PQQSQS WFL 


7106 | 21007 


A 


7164 


35 


371 


Y FHHLSLLLKFLLASMM VFLTVWRLLE 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E^Glutamk Acid, ^Phenylalanine, 
0=G Jycine, H=Histtdine, I=Iso leucine, 
K=Lysine, L=Leucine, ^Methionine, 
N=Asparagine, P^=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threomne, V=Vahne, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, V=possibIe 
nucleotide insertion 














LLFFFLGETEPAFVAQPGR\QGPNLGSL 
RPHFPSVKHFPGLTPPRAGTSGLDPPSP 
AIFGFFKKKRGSPCGPGRAWTPGPRGGP 
P 


7107 


21008 


A 


7165 


3 


382 


YTGPAWWSMLWLATEEAEVGGSLEPRS 
S KLQ \QPPLRHCTPAWAT 


7108 


21009 


A 


7166 


1 


305 


TAVTVDP YHSVY I EVNARPGACMNGGGL 
LNGEQNSLKRKQLVQTLNKRHGEELAMV 
DSLTLGPKLEEA/AS PPAFARVRLRARQ 
ACRRKGSRIjCWEHLVS kv 


7109 


21010 


A 


7167 


383 


55 


RDLSMVRMKSMLAI ALCFTALMGMFNS I 
FYGRWAKX.PFPPLSYIQGLSHRNMMGD 
DTTDCS F I FLY I FCTMS IR\ QN I QKI VG 
IiAPSRAATKQAGGFLGPPPPCGKFS 


7110 


21011 


A 


7168 


465 


166 


SPQGAQIKGVGFQKLRPWGP\PGFPRGK 
PFFWFKKKLPPQGGGPLCSRFSGGLSQK 
N\ SGPWGE I FPWGQNGPPPS P PGGKETF 
FSKKKKKKKAVDYLNS 


7111 


21012 


A 


7169 


2 


323 


IQGFKTLSEEVSADEVEIVRELGLQAET 
EDVTELLHFNSTACCREI FHERKRQQMQ 
QTSLVSHF\RKLLQSPQSSAIATINQSQ 
P S TLRQD PM P ATRLHLTGGLDNH 


7112 


21013 


A 


7170 


3 


509 


FGTIPIiVLCADLNSLPDSGWEYLSTGG 
VETNHKDFKELRYNESLTNFSCHGKNGT 
TNGR I THG FKLQSAYESGLMP YTNYTFD 
FKGIIDYIFYSKPQLNTLGILGPLDHHW 
LVENNISGCPHPLIPSDHFSLFAQLEL/ 
YTAFPAPSQRHPPSWQEVVKHLQRTALI 
HL 


7113 


21014 


A 


7171 


1121 


320 


SSCFQTSSHACFVTEFVPGGDPMMQIHE 
DVFPEPQARFYVACWLGLQFLHEKKII 
YRDLKLDNLLLDAQGFLKIADFGLCKEG 
IGFGDRTSTFCGTPEFLAPEVLTQEAYT 
RAVDWWGLGVLLYEMLVGECPFPGDTEE 
EVFDCIRLHGRPPTPAFLSVQG\VKFIQ 
KLLQKCPEKPLGAGDQNAEEIKVQPFFR 
PTNWQALLARTIQPPFVPTLCGPADLRY 
FEGEFHRAA\ PALTPPAPHSLLTARQQA 
AFRDFDFVSERFLEP 


7314 


21015 


A 


7172 


355 


3 


GPIjIjSMLGMNILFLTLFLLLPLSMLEGG 
VCGNF/ITNCCLEIDDNRKVIEDITAKI 
QKLAHVPVQIWKGWSPDTLFGGWFSSLG 
R F KTL VG I VLVI LG VC LT LP CLLP LLVK 
NIQSANV 


7115 


21016 


A 


7173 


184 


2 


EKKVEDDDDDDDFEMESRSVIPAGVQWR 
NL\LHPLPSGFKPPSCLSLPRSWEHRCA 
PLHQCI 


7116 


21017 


A 


7174 


1190 


407 


SLKRRPSARGSRPMSMYETGSGQKPYLP 
MGEASRPEESRMRLQPFPAHIGRSALVT 
SSSSLPSFPSTLSWSRDESARRASRLEK 
QNSTPESDYDNTPNDMEPDGMGSSRKGR 
QRSMVWPGDGLVPDTAEPHVAPSPTLPS 
TE D VI RKTEQ I TKN I QELLRAAQENKHD 

CVTDPCT7D TUliTAUTTMli AT PDinfDVCHM 

VRTSLRLLTSSAYRIiQSECKKTLPGDPG 
SPTDVQLVTQQ\VI PVCVRPFAKAAKQL 
VTITTKENNN 


7117 


2)018 


A 


7175 


3 


136 


AIHLASLKVFYRYRVLSL/ LKRLVSNSW 



988 



WO 01/64835 



PCT/US01/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenyla1anine, 
G=Glycine, H=Histidine, l=Iso leucine, 
K^Lysine, L=Leucine, M=Meth ion ine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonme, V=Valme, 
W= 1 ryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














PQVILLPWPPKVLGLQV 


7118 


21019 


A 


7176 


3 


316 


GIVKFFFFFFLTKCRTVTQAGVEWCSLG 
SLQLPPPRFKQFTCLSPPVSWDYRHLPI 
CLANFC I F \ VETGFHHVGQMGLEIiLTSG 
YPPTSAPPDAEITGVIHRTQP 


7119 


21020 


A 


7177 


529 


217 


LLPRLEYSGMVI AHCGLQLLGSSQAI LG 
ACYCAW/HNFKAFFVETKSHYVTQAGLK 
LLAS SKPPALASQS VGITGVSHCAQATL 
PFKTVDKGPSFNLFSPTPPLL 


7120 


21021 


A 


7178 


3 


150 


YTLCKSKKD IKSIYHIMT/ HLKTHTHTH 
THTHTHTHKPSTTFSYTETFIM 


7121 


21022 


A 


7179 


540 


69 


ACRGAHVQVGDGSAKFYVEDGTACGPTM 
LS LDKLCLPALAFNFSTCTDSGSRR I CS 
HHG VTAW S LGWREKLTRGDRP LLTS PG P 
ALPIG\CSNEGKCICQPDWTGKDCSIHN 
PLPTS PPTGETERYKGPSGTNI I IDS I A 
GAVLVAA I VIiDE TG WG YK 


7122 


21023 


A 


7180 


452 


279 


CIDSIYQEIYHLSISILSVYLSIYRSIY 
LSIYLPIFW\SLYLGPRFLRTRDPQLLS 
LT 


7123 


21024 


A 


7181 


501 


183 


HVRCLAFICALIGFPSGYFSIMPWT/V 
DLVG I EHLPNAYG III CVNG I S ALLGP P 
FAGW I YD I TQKYDFS FYI CGLL YMIG I L 
FLL IQPC I R I IEQSRRKYMDGAHV 


7124 


21025 


A 


7182 


446 


245 


LRVRSLALNQVSSASRYTOTGVATKLAT 
F/SLFWKDRNLDILPSMVFNSWPQVIFP 
SWPPKVLGLQM 


7125 


21026 


A 


7183 


420 


73 


DSSDDSGQSKLKP / FWKGFTI LDAIKNI 
CDSLEDFKI SLTGAWKNLI PPLMDDYEG 
FKTRVEE VTAD WE I ARELELEVEHGLG 
AVAHAYNPNTLRLRSLKPRSLRPGQHSK 
TLSL 


7126 


21027 


A 


7184 


567 


174 


FFLRDRVPS LTQAGVQ WQDHS S LQPRP P 
RLKQSSH\PALLSSWD\YRHRPLCLAKF 
LI FLQRRGLiAI / FARG WSNS WSQ / CNL 
PASVSPSAGITGESHYSQPFLLSDSQTK 
VY KTKRST I SST I TEMGCERKD 


7127 


2 1028 


A 


7185 


345 


59 


RNGSSCLWQVIQSLTLLPRLECSGVITA 
QCSLGLVGSSNPPTSASC\VESHCFAQD 
GLELMGS SNP PTLAG I TGMS H CAQ P L Y D 
LFLYEYGLSVHY 


7128 


21029 


A 


7186 


1 


209 


LSWLE IGSHS VTQAGVQWVHS S S LQP * 
MLGLKPSSRFSLPKCWDYRHEPWCLPGL 
LRLKFDPSVGGVA 


7129 


21030 


A 


7187 


3 


225 


EKTPVSDRGAKGCTESLVNRRPCFSALE 
VDETYVP * EFNAETFTFHAD I CTLSDKV 
RHFLKPIALVILNHHIF 


7130 


21031 


A 


7188 


1 


377 


ES * RNHNGDEDDSHVRAQGAC IEAKHDK 
YMADMDELFSQADDNRKKRDI PDYLRGK 
ISFELMRQPCITP5GITYDRKDIEEHLH 
RVGHFDPVTRSPLTHEQLIPNLAMKEVI 
DAFISENGCVEDY 


7131 


21032 


A 


7189 


1 


359 


SRATDHVQRQGHG* GIHNYSDWENTDWF 
RETKNQS VPLS CCI ET VSNGNGS LAHP Y 
DLYAEGCEALVGKKLLEFMMKVI WAELA 
FAAIQLLSMLSACIVMCRKSTDPAYELL 
ITVGTYA 


7132 


21033 


A 


7190 


496 


118 QNQKFPFE PGTS * LGL YP KKYKL I YHKD 
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SEQ 10 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=GJutamic Acid, F=Pbenylalanine, 
G=Glycine, H=H istidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V= Valine, 
VV=Tryptophan, Y=Tyrosine, 
X=Unkno\vn, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














P KTCM FTAALLT I E KP WNHH KH LS MAD W 
I KKVW YGTVGHACNQSTFLFWFFFEMES 
HAVAQGGVQWLNLCSLQAPAWATERDSS 
QKRPKIDPHKYIVN 


7133 


21034 


A 


7192 


1337 


342 


ISEATRTFRES I WQFQS S PREDDRNRQL 
SASGSRQSHPGPRSTPGPGQGHRSSHQG 
CAPVSSAARPGGPSFRLSEGALSRSQFS 
VLVY IP* VAAGQFVKACLGAS S PP S RL * 
GGLGTGHRTPGKEARSWPETSAPPAWHL 
LRSLCLPSRGEKLAQARSQKHLGWKWLS 
KLCPKTKYSWSGPDDSPSVLRDPEAGTT 
RWAPSGNPEEEWIjQLKAKYLQNISEVL 
SVLHLLHIKALQLHSTKPFFVNARHLLV 
GQISSALTLNNLSQILFLLLRFPFLSQV 
ERHS PLQKTGTTETNLVLFQDTVPMNFP 
SS ILLPSKYSCPALDKEASHFYQ 


7134 


21035 


A 


7193 


469 


130 


SASWDYRRPPPRPSNFLY* *RRGFTILA 
RMVSVS * PRDPPASASQSAGCAYS I YLQ 
ERFARLHPACLAPRYMASDPLSSLHHSV 
PPGLHLEHEHWL WHCLLPGQ I CRPGKLW 
Y 


7135 


21036 


A 


7194 


430 


149 


TVWQFLKQLNIELSCDPVILLLGICPRE 
IiKTYVHTKTRT*AFIVALCTIVPKQKQL 
KY P S VDS WVI KMW Y VHTMD YFS VI KKNK 
LAKHRGSHL 


7136 


21037 


A 


7195 


257 


412 


TATSNKSPLKKVRCLIWRKKDCWARGFT 
ACN P S T LG C * GRW I MRSRDRDH P G 


7137 


21038 


A 


7196 


1 


124 


PLISGWHPCNPSTLGGRGWRIA*VQEF 
ETSLGELAEPCLY 


7138 


21039 


A 


7197 


2 


120 


YGI ILFITSEVFVFAGFF* AFYHSSLGK 
KISKKESVIIY 


7139 


21040 


A 


7198 


234 


416 


GVKAEVLISPGPQDRLRALWIPIIFGI 
LFAILLVLVFI KKVAKKPTNKVGHP * EP 
GTEF 


7140 


21041 


A 


7199 


372 


3 


VH Y P VPQTGS PCC VPHPRGLS T IHPGLH 
SRYTGLGMVAHACNCSTLGGQSGKIA*A 
LEME I SLGNMVR PCL YERTNKQS KPRHM 
GITFGLTFSLTLTDKYSLI ISGFYLPQV 
LISLHPLGPY 


7141 


21042 


A 


7200 


396 


21 


GLKESHKCCAI PACLQRDPLVRVLGKAG 
F*RKSLSISTELQSLFDSPDFSKITGKP 
I KLTQVEHRAGFEvyNEDGAGTTPS PGLQ 
PAHLTFPLDYHLNQPFIFVLRDSDTGAL 
LFIGKILDPRGP 


7142 


21043 


A 


7201 


2 


148 


REPPHHSLANF* KFFVDKGFTLLPRLVS 
S S WP Q WL»PPR P prelglq a 


7143 


21044 


A 


7202 


1 


291 


AEGFRMHALVSQAACGTSGYKLSVGYMV 
QGPLTHLMLGLLISLIFLSRGSGRTWAF 
SHS * YKTSDLLPCRNRREVI EFLHYRNL 
HSH I CLS VTKTFL 


7144 


21045 


A 


7203 


262 


406 


I LARMTS WNS I KHL I LFLK* S SHLS I P 
SSWNYRCVLPHQANFLFLFL*R* SFTLM 
PRLVETVLLIQNSPTKIDSSPDDFSGKF 
YQ I LN 1 ViTbN X RRGNTSNIjIjHKNSRI L I 
PNLTKTLHD 


7145 


21046 


A 


7204 


2 


247 


DTVLERSGQGTAALPRSTSLDRDWAITF 
EQI LASLLTEPALVKY FDKPECMMAR I T 
SAKNIRMMGKPLT*ASDYEISAMTG 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S^Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, V=possible 
nucleotide insertion 


7146 


21047 


A 


7205 


2 


168 


FFF I/vTsTGFTVIiARM VS VS * PRGLPASA 
SQSAGIAGVSHRAQPALRFLMLENRMC 


7147 


21048 


A 


7206 


1 


237 


GKCAHTHTKTHTRDTHRHAHAHTHACTH 
THI i VQGCSIiSQWSQDCVKMWACH I FG V 
PRE * ASR YTS SLVYKPGCTG VCG 


7148 


21049 


A 


7207 


2 


239 


KVDSEHKLERSSGGVLGRFKGKGSGGLW 
LGNFFASRKGYSRTGFDRLNTEGSDQDK 
EDDGSESE*EYS*PMPALEPCSS 


7149 


21050 


A 


7208 


1 


405 


GSRDNMNVIL I CFPNAPKVS AESVKNDA 
QLDKYLECTAQE I I *NHWEDVPDLFHVM 
RTLPSDNMPSLPPGGELASMRTVIEAVY 
YRLNP Y I NDDTDSTSTDDMWFNCS S SHG 
VYLDLHRRVQVNFDETFSMHPEL 


7150 


21051 


A 


7209 


2 


761 


QD FG TR FATVLL F YG YFKCTRTLKGTCL 
YNAILYKVCSPRNDQFDVCYWPSFFLLP 
IKTGELLGFLVYASREKRSIAIGD*KDD 
KWPPERI IQYYG?AT*AQDSSWGYRIPI 
YMLN* 1 1 RLQAVLE 1 1 TNKTGRALTVLA 
LQEI QMRNAI YKS RLALD YLLAAEGG I C 
GKFNMTNCCLHIDDQGQI VEDI IKITKL 
AHVPMQIPDIVWKNIVKIPVLFCSVLIT 
GTCSPQSGTPCWLSRSRPSHVDPFRWS 
P 


7151 


21052 


A 


7210 


147 


390 


KTGLS DQMPRAI QLS YS YLS PLLY * * S Y 
KKKKKKKKKKKKKKKKNFFKKGPGN* SF 
WGGEKKSGPGAGLKNPGGGKNTFY 


7152 


21053 


A 


7211 


433 


214 


WGRGNALTIKPPWVPFQESETTPSPL*G 
ITLLDNPFSPIPKQPPPPH*SITVPYLI> 
SSPQPGFPHPFQAPGLA 


7153 


21054 


A 


7212 


532 


253 


KWINVFHKAAIMSDFSDRSRHSTLKAS 
E*GFTMIDAIKNICDL*KEAKISILTGI 
*KKSIPTLMDDF*GFKTPML*VTANVGF 
QRFSKKFYW 


7154 


21055 


A 


7213 


456. 


24 


DAQAFSWYSQGYFNSSQGFSISLPGDTS 
PVKIKRMGGPLFFS *LLKSTCWACSKTS 
RVPGKVLKTGLPRPR* PVYKBPLGGFYP 
RF 


7155 


21056 


A 


7214 


2 


445 


ILRDLCQRVPTWSGFPSWAMELLVEKAI 
SSAS S PQSPGDALRTVFEC I S SG I I LKG 
SPGLLDP CEKDP FDTLATMTDQHRED I T 
SSAQFALRLLAFRQIHKVLGMDPLPQMI 

fAD O Ttf T TJVTVTD TO TD T> T\ o rvn TT\f~* CT? 7\ r?/"» V + T\ 

yKblN IHWiNKKKKKDSlJGVDGr JfiAc.VjK*U 

*KDYD7F 


7156 


21057 


A 


7215 


203 


3 


VRPGWAHACNPSTLGG*GGRIA* SQEF 
ETNLGNTVRPCRNSGEVLNRKTVSIRRA 

T DUtJC TDT U 'PA 


7157 


21058 


A 


7216 


1 


285 


GKCAHTHTHTHTHTHTHTKMHTRTHAHT 
HTHMCRGAV*VRGARTVCTCGSAPFLES 
PGKEPKDTLRPLFVSQAARAFLGEKAFC 
SVYRQREPPPI 


7158 


21059 


A 


7217 


1 


386 


GDSNCEWSVLYVj. iatfgigyalgilsw 

tvicmlkrqngkpkmkskyknldatdqe 
sl*lkpaspj^gikqkglllssslmhses 

ngqtplkats pree i l 


7159 


21060 


A 


7218 


70 


379 


nilyfniffffk*flkflwvfffffff* 
vfffynffffvfflcvfffffvklkiff 
wgpfflffrfffclgffgtfqswrwffa 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=GIycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
YV=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LFPWGLMFLFPLLFFLVFF* LVFF 


7160 


21061 


A 


7219 


1 


208 


PRLAASVSPGDMLGMQVLGLGTVASACN 
CSTLGGRGMG I T * G * EFETSLVKKARTC 
FYKRLKRQKSANL 


7161 


21062 


A 


7220 


455 


96 


H YLYGSQGLSRF YFVTQGRVQWHDHGS L 
QPPTPGVKQSSSLSLPKCWDYKCSPPCL 
ARP C P FF F FL * TS S VDTGLLLNQQ P I L * 
IMTFPYPLLRKSLSFHHQIYEPTSICSH 
LLSFCYT 


7162 


21063 


A 


7221 


201 


319 


IYELWPGPVAHACNFSIIiGG*GEQIAQV 
QEFKTSLGHLG 


7163 


21064 


A 


7222 


307 


3 


NRRQAGPFPAKLGRPSPKGGFPNFFKLF 
FFKSSF*KNPKGQGWGFPPLIPGFQGPQ 
VGGSLGAPGLKPPWGTPQNPFSKKKKKK 
R SNCTDTG TT I AHRRQM 


7164 


21065 


A 


7223 


331 


141 


SETITAlSrvSPNL*DANDVPlQCEISPLI 
S YAGEGLER YVADKE FHAPVI IDENGVH 
EWKNGI 


7165 


21066 


A 


7224 


157 


314 


WLPGFRDGGEDGVCHVAQAGPKLRT*GD 
PSIKASQRAGITGLTHRTGPK*KF 


7166 


21067 


A 


7225 


2 


392 


RDVRSLQPLPPSLK* FSCLSLLSSWDYR 
CAPPTT*LIFILLVEMGFHHIGQAGLEL 
LTSGDP P AS ASQS AG VTGVSKRTR P PS F 
VLVNPIPPLLPPVPSPDGPWISSSTTSW 
RGYPEVLRWGLCSFWGCG 


7167 


21068 


A 


7226 


84 


204 


FI FFFFKR * SLTMLPRLVSNS WARVALP 
P QRP KVLG LHV 


7168 


21069 


A 


7227 


3 


307 


DRAAPRGAGRAGAVGPHRTGDWGLAGTP 
AGRGHARH I GRG P KQGRAAG QRGHLH PG 
GLLGRGGNSEP*RKQRTPGSMRRAAPLL 
STGQVWEG PAS VGDR VP 


7169 


21070 


A 


7228 


52 


1176 


GSQQPLLH I SEAQGDTLI KKAPAEAAWP 
PLIAFSSGGRLPCSHES*SAAGYKRLDD 
TAAGGPPSPGGSPRSPATAFGLHPVWGG 
QLRPRGGRGLPATGSGEAFQALTQLSRG 
RACTGPPAPGSEDAAPSGCSAARWSP*M 
AGSSSCAIPDSAASCICPQGPPWRW*SR 
APARNRCS ASSGRSRC*QLRRRR YI PGIi 
LHPVLHEALPDSWGCETQLGPPLASEAA 
PPRSRRREEG*RGSGRC* *KPSLPRSWS 
LCRNQAEAGFVS I LVWLSGLAGAPSRPA 
AAAPQIAPNQHRSPPPRLAPVPAASGSP 
WPWERDLG VLETLrTVGNi? AQDPNPSACS 
GPQSTPDTYYPVQKPKTGLRLHFHSRPN 
TIVKSVGNMCL 


7170 


21071 


A 


7229 


3 


112 


ASNLGSRGCSEPRPCHCTPAwV * E * DS I 
SKTNKQKN 


7171 


21072 


A 


7230 


103 


299 


AMS Q I RNAI P FT? I AT * R^i * YLE I QLTKE 
AKDI YNENYKIPLKE IRDVTNKWKNI PC 
S* IGAQhCK 


7172 


21073 


A 


7231 


2 


137 


SLTLSLSICFCL*RVJGLSVLPRLISNSW 
PQAIFPPwPPKVLELQV 


7173 


21074 


A 


7232 


3 


273 


LTHIVDGPNFLTTMYQCLMCVC*QELiGY 

ACRHDLGQNYIMI PNE I MQTLLDLAFPG 
YSLGR 


7174 


21075 


A 


7233 


338 


136 


RPRWEDCLSPGVQNVCGQHSESPFLQEI 
* KI SQTWWCKPWPS YQEVEGELLE I RG 
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SEQ ID 
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nucleotide 
sequence 


SEQ JD 
NO: of 
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sequence 
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26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M^Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X^Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














STCFGLPKCWD 


7175 


21076 


A 


7234 


437 


261 


. HLG FP ECWDY RHEPQHPT * FLEHFM * RR 
GLTTFPRLVWNS WPQAIFPRWS PKVLEL 
QT 


7176 


21077 


A 


7235 


41 


264 


VTVMEIKGKINLVLDI IKDI YVEGFSTY 
YF*RQGLTMLPRLDLDSWAQAILPPQPP 
E*MGLQECNHHAQLSRYF 


7177 


21078 


A 


7236 


1 


154 


GTRDFS VKTL*ARREWHDI FTGMKEKNF 
YPKKVYALKIPFKNEAETKKVEV 


7178 


21079 


A 


7237 


1 


179 


SCLSLSSS*DCR* PPPCPANFVKSFFRN 
GVLTMLPRLVSNFLPQANLLAQPLSGFI 
ILL 


7179 


21080 


A 


7238 


340 


140 


KINWVWWPVPVI PVTPEA* AQKLP * PGK 
LKFPLT* IPPLPSRLGHQSQTSFPKKKK 
KFYILSSDSV 


7180 


21081 


A 


7239 


2 


175 


QFESKPS PAWATEQDPVS KKKKKKKKKK 
KKKMKKKNSSKFKMIPFTKRQY*EGNWK 

N ! 


7181 


21082 


A 


7240 


336 


192 


QSETPSQSNQKFIF*GQGIALLPRLVLN 
SWAQAILPPWPPKMLGLQA 


7182 


21083 


A 


7241 


1 


186 


DRPGQHGEMLSLLI FCFFCRDGGLTMLP 
RLVLKS * AQVILPSRPPKALGLQDTVHC 
SKFDAV 


7183 


21084 


A 


7242 


2 


111 


VKTLNKLGIDGTWLRI IRAI F * KP IAN I 
ILNMCSFL 


7184 


21085 


A 


7243 


360 


217 


LPSRGDYKRKPPCLVNFYFL*R*GLTLL 
PRLISNFWAQMILLPQSPKC 


7185 


21086 


A 


7244 


35 


276 


VQQQRVKQGRKYYS I VLKNKKKLCNYLR 
VGE*MG*RNIGWVQ*VAHTCNPSTLGGR 
GGRII*PWEFETSLCNIG*TCISG 


7186 


21087 


A 


7245 


1 


356 


IQHTFSLL1 FGI KYS YYVLKI VS WLAAV 
AHACNLSTLGGRGGWIT*GHELETNLFF 
AVKP WVYDKNS F P PTLNLRFAKSGHLWE 
PRGKNHGPPETPPLSRVSPPQRFFPGGT 
RPLKKE 


7187 


21088 


A 


7246 


3 


293 


CKCGKAFHNFYSFQNHEISHTGSMLYDI 
RNVGK* S I FF*HHCQHKTTHRAEKLCEC 
KI CRE AFSHFGNLKVHEAGRSG S RL * SQ 
HLG WANCLTSG I 


7188 


21089 


A 


7247 


6 


387 


ASIPCLTYPSWEFL* FI*QAGHFSFSKK 
KLFI CAFFSNRDEVSLMLPRLGLNSWAQ 
WLL P W P P KVLG LQ VRATTPAKLG I FWG 
FMSVSLGQGVCLCRSLLCPLPLGPGQVH 
SGCSQSFVEKMNKEL 


7189 


21090 


A 


7248 


373 


146 


HLGKKVLI FWPQ YLALLNPPNGGVSRVY 
PRCRPLMVFFFFFFFC*GRGLAMLTRLV 
SNSWSQVI FQLQLPKVLGL 


7190 


21091 


A 


7249 


2 


184 


AILRGI * KKFISTLMDDFEGI KTSLEEV 
TAD WE IAREVELEVEVEFKDVTELLQS 
HDTT 


7191 


21092 


A 


7250 


795 


544 


EMKSCSCCPGWVQWRDLGSLQPPPPRFK 
RSP AS AS LVAG I TAACHHARL I FVFL I D 
TGFHYVG * AGLKTPEA* ATAPGP I FLYF 


7192 


21093 


A 


7251 


34 


242 


KNToLKKKr r Wv^yj^Kl\JvisLVr.Jc.r Ab WP 
GAVAHACNPSTLGDRGGRVA*GQEFQTS 
LSNIGRPRLYRKI 


7193 


21094 


A 


7252 


216 


57 


NIQVLKYVLTSYLKKKKCAFRLGPVAKT 
CNPGILGAQGRRI S* AOEFETSLS 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


IVI 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

pepnue 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D-Aspartic Acid, 
E=Glutamic Acid, F=Pbenylalanine, 
G=Glycine, H=Histidine, I=lso)eucine, 
K=Lysine, L=Leucine, M=JVIethionine, 
N=Asparagine, P=Proline, 
Q=GIutamine,R=Arginine, S=Serine, 
i— i nreonine, v— valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucieoiioe ueieiion, \— possiDie 
nucleotide insertion 1 


7194 


21095 


A 


7253 


126 


366 


INLTECQQYLDKAIFFFFLKKGLFAP*A 

PWnflRQ PA <?T .Pi PTC PPP T.Tf QttT ,TT .T . QT 

WDYGGALPCRANFGIFFFVJKKGG 


7195 


21096 


A 


7254 


2 


213 


RPTWATWQNPASTKKKKNSFFFETEFRT 
VAQAEVQGGNLG*LQSLPPGLKRFSCLK 
MPEAPKGQSCSLIG 


7196 


21097 




7255 


150 


263 


PP\rQT T DOT 7\ /""TvTO ATT 7»I4T r MT T5T DTCOn 

LCAFS * TTTRYYKYSGYCFSHPNLMLI C 
SRFTLLSSHSSLGHLTLFLLIVCFEKES 
HS C PGWRAMVLS WLTATFAS RRQATFAP 


7197 


21098 


A 


7256 


386 


93 


PPGGLPPQS I PSPRAQLKPKS PRGKKRG 
GPKPGGSQPGFWGFFFGPGPPFKRGGGL 
in wv jjvji-vivW rvvj v jvrvivjvvj i\x v iJtr V-*-* * 
RGNRS IDMKFGGS 


7198 


21099 


A 


7257 


153 


369 


KENSRLQGYGPLFSHS* FTNSCPGQVQW 
t/T/T>\7tt.at jtfPAFVPnQx/rv^ppnp^TfTPP 

LLKI QKFARCGGGRL 


7199 


21100 


A 


7258 


180 


3 


CSTLI * I RKYWLGAVAKAYN PNTLRGRG 

fZ"P T A * fcnWJ? VT Q T /2NTWVWTPT . FT .P C T>TTM 

QQ 


7200 


21101 


A 


7259 


22 


265 


QLRAIPDLKLDPLMEGKEILSRTLLGSG 
TVAPICDPSTLRGQGGWIA*VQEFKTSL 

PWMAlf PUT.VPTriflfT T/2/*"i\rrTfT.'PV"tnAl 


7201 . 


21102 


A 


7260 


326 


2 


KS AMQ I CI VD PLNAFCLE I CCHANLHCR 
PSNAFCFLKVDNVRAFVS PS I PQVSLIL 

TfWFQWPri\ri7AWTPI\TQNT\rRriPi<'iFPTZi* A 

QEFETILGNTGRPCLYFLKNNRKS 


7202 


21103 


A 


7261 


1 


68 


VNL C F I LWLG PG W AHACN P S T * GGRGG 
RIA* P * ELETCLSNTA* PGWAHACNPS 

T 


7203 


21104 


A 


7262 


282 


176 


GLKKTFFFLVRERVLLCYPGWSAW* SQ 
LTAN* TPGLKLFSCLSLPCSWNHRHVAL 

WVAFTCFT P* PttMPWPATWF AF AdFPiT ,PP 

RSSVCSEL 


7204 


21105 


A 


7263 


588 


351 


^u^^^VCLLESISSVNLCLPHLQHSPVKI»E 
I FLFFFSSDNFPSTHHITVRPLSPS * AI 
t pp VTrrvTc: p wp patp n v>nv y 


7205 


2J106 


A 


7264 


327 


2 


KIKERINNFIFICRDRVSICCPGWNAW 
TT .NQWAO ATP PPT .CT MQc; *1\TYPPAPTHPj 
QVTT I FFCRDQ I E I HMVP KYHLGNTLNY 
FFVLFCF VLFLRHP FSAHRNFRLP 


/ZUO 


L\ WJ 


7\ 


/ZOO 


i 


z /V 


A PiF fifJ FF T J .T Q RfiT .PTC a Q p. C a F T TRM Q 

HRALPSCY* FYVTLHGLTVACA 


7207 


21108 


A 


7266 


3 


227 


SSQVSLPNNWDYRREPPCLRFVFLFFVE 
TRS C YVASLELLGSGDPT VIiAS * SARVX 

HMf5 W PD W PMT . Q P T T J .HVP 
L>i v ionv>. xi yv fiN x*o x Xiun v is. 


7208 


21109 


A 


7267 


300 


1 


KMLLNTIMCNREILCKRKSPRVQQTSFV 
SYFKKLPQPPQPSATTTLISQQPSTSKQ 
DPPAAK* L* FAEGRLIiAFPSNlWLYQLR 

VTMPFT.PWTnTVTTT.VNT 
x xiiLr Xiirnx ox vnij vw 


7209 


21110 


A 


7268 


29 


344 


ASLLALKKKSGLGAVAHAKNASTSGGRG 
GRI T* AQEFATSLGNMARTCLYKKKKKK 
KKKGKWFTGVGGPQRGAP I FGPWGGGKS 
PAPGKPGGPNPFPGKGAPTLF 


7210 


21111 


A 


7269 


3 


209 


LHPLSPGPPAPVAAGIPGPQASR*GPCR 
PDPSPDSASVDASLPGPALSLWSWGSNR 
GEYGTTPASQIH 
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SEQ 10 
NO: of 
nucleotide 
sequence 


SEQ iD 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine 7 D=Aspartic Acid, 
E=Clutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine,IVI=]VIethionine, 
N=Asparagine, P=P.roline, 
Q=Glutamine, R=Arginine, S=Serine, 
J— inreonine, V— Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, Impossible 
nucleotide insertion 


III 1 


l\ l IZ 


7\ 
t\ 


IZ/V 


oc 

60 




ETSLGNTARPPL 


7212 


21113 


A 


7271 


45 


274 


GLGTVAQACDNPSTLGG*GGWIA*GQEF 
KTINNDKSNKWQYLAI IEQINNRMLI CK 


7213 


21114 


A 


7272 


1 


122 


VLVETGYLRVGQAGLKLtLTPGDPPALAS 
QSAGITGISHRA*SYMHF*LVETGYLRV 

RA 




O 1 1 1 c 

21 1 15 


71 
J\ 


1Z15 


3 


242 


LVVKGGFKTQTSGGLPG* AS KGVGI PGV 
SHCPGL IG VFNLTLKVAVS LMDL 


7215 


21116 


A 


7274 


2 


169 


LALVAQAGGQWHDLGSLQPLS PS FPQFF 

vt ct doo *nvDurnniJC7\T«7iPC t?itcatvt3 
x J^JbJb'oo^L)xKiitiri*xllDAWr br r i»NiK 


7216 


21 1 17 


A 


7275 


353 


76- 


FAHLGLPQCWD YRREPLCAT* KLFLKGK 
* Fbl\.r y V 1 r bbXiG I GH r bNALf b C VMW J 
IVLLI INFWVTGIDGRTSGSSSVKCGI^H 
SIYPMVLL 


7217 


21 1 13 


A 


7276 


r 
O 


87 


KL * AQTPGLRQSS CLGLPKC 


7218 


21119 


A 


7277 


1 


203 


FQVKMLC S G WMWI/T P VI QALGAVAHTGN 
Pb X^/GGRGGQI X *S * * FYTSIjDSGNPPA 
SASQSAGITLA 


7219 


21120 


A 


7278 


272 


51 


I FFPVN* CGGHDVQHHFNPS *NNLSSWS 
SWFMPIILPLWEA*AGK*LEPRSLRPAW 

7\ rpVT Tlt'MPVVVT -rri ttti\ 

A I KXjDbJNI b KKA X J. r. X HA 


7220 


21121 


A 


7279 


1 


131 


SRDYRHGPLHPTDFVFFVEMRFHHVAQA 
GLKLLGLSDLPALTS*SRDYRHGPLHPT 
DFVFFVEMR FHHVAQAGLKLLGLSDLPA 
LTS 


7221 


21122 


A 


7280 


411 


36 


FLECVPNF FPPKKKFFFKJ\JnTGQVFFPV 
TFL*KGPALFFFIFPFYGKKFFFLSSAK 
DFPPRVFF*KGPLFFFFFFFFFFFFFFF 
rrrrrrrrr r r r r r r r JS-tir JL x ivlvVHr 
*SSVHFQPIIFF 


7222 


21123 


A 


7282 


395 


181 


RKHES IHHDSVISVTFNMQSLRLFFFIF 
FILSL*LFRFIFFFSFFLVFFFFFFCFF 
FFFFFFFFFFFFFFFFF 


7223 


21124 


A 


7283 


392 


2 


I G YNV * FN Y X IjKNGIjF *QX»1iX * L»IN 
LFILRQGLIFWQVCNVLVFFFFFFIFF 
FFLFFFFFFFFFFLLFFFFFFFFFFFFF 
rrcr rr r r r err rr rrfriil yWrLbr X VJN 
FCKDTFKQQNIHLRTRGR 


7224 


21125 


A 


7284 


113 


291 


YVCVCVCVCVCVCVMEY* SAIKRDKI VA 
FTVTWMELETI ILSEVSQEWETTHHMI^S 

T TXT 


7225 ; 


21126 


A 


7285 


327 


44 


LIKVSPNFSPPKKKVLPKKPPCTIFPPS 
rr ^ J\lJrr / JCir'Xr J.JSJr JrnAjS.lU>JrJNoJrrFyj\ 
♦GPPQGFFKSPPPFFFFFFFFFFFFFFF 
LGFSSKLLIR 


7226 


2 11 27 


7\ 
J\ 


72 oo 


3 


215 


V X Jj X X r M 1 * riAT AW rUc V LjX V r> Jb Fb XJNiXi 
E*IiYDGGYYDGLVLTSGLGPY 


7227 


21128 


A 


7287 


367 


119 


LKKAPAPPFIiNPPFWGKNFILSPPEKMG 

c Xr i\.\jr I? xviwj it ± i. it a. i. i. i. ^ i. i r i. rr rr uo 

FFFFFFFFFFLQYWNMYFH* ILI INQ 


7228 


21129 


A 


7288 


73 


259 


INKYLQN* KKKKKKKKKKKKKKKKKGGP 
FKKTPRGAQ I S PGVKKKNFPPKGGFKKK 
QAPGPF 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I-lsoleucine, 
K=JLysine, L=Leucine, M=Me thionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
I — 1 nreonine, v=vaiine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


7229 


21130 


A 


7289 


337 


53 


LRSFAPFHLPINNLLTDIHMVIPLPPSS 

JjL.ori I JuriMCj V IVll Lie Mil tllAJUu 1 riM 

HTCNPNTLGG * GGRS A* AQES E TSFGNM 
AKPRLYLLNF 


7230 


21131 


A 


7290 


1 


96 


KHHFGFEAAA* YWHFVDW+ LFLYVS I Y 
* *no 


7231 


21132 


A 


7291 


288 


2 


SRVEFKGPFiVlAHCSVTIPGSSYVFTSAS 

SRHWQAGL*RVS S SDPPARACQSGG I P 
GVKPHAQSVSV 


7232 


21133 


A 


7292 


121 


233 


YFPKKKKKKKKKKKKKKKKKKKKRKKK* 
KGGGRSRKSYLG 


7233 


21 134 


A 


7293 


3 


346 


NPIjNPIjVNLiTVo pkrnssldtrkkpcre 
RHREKETLRKKRNQHTEARGRQRENHPE 
TE * ETGQRDRDKQTKKQKDTD I QAYT YT 
DREKRDKKRHRQIR*AEAERIRDTERQR 

Ul 


7234 


21135 


A 


7294 


46 


346 


ETGLKKWGDLSNGTVRAVLNLAQADFCV 
SIPSYHI KHGNTKNNCGGEGKKKILWT 
AKKbbPCa v vXHACiyPNlLrGG*G>G7RVI *C* 
QWFKTGLGNIVRPCST 


7235 


21136 


A 


7295 


254 


12 


RKDAENNGSRAB* I KR VKDLVQS KLKF F 
QELEKEGILPNPFYEASITLIPKPDKDT 

C? VDnVDTlT O T MMTT\7k VTT vrvTT AVT 

b rvclu i KF JL b IjIVIIn 1 DAK± LtN is. J. I_iAN 


7236 


21137 


A 


7296 


258 . 


1 


C IGQC VLVAVGRELLCVSRS WCGLLHVD 
EVGGEALGRLLWYPWTQRFS ES FGDLS 

1 rclJ/i ViyiLrW Jr*lV VKAHtjiv* vLiLjAj* blXaJLiAJj 

LD 


inn 
12.51 


21 iJo 




/2y/ 


iyi 


Z6J 


lITClfTTDyT VTMnVVA A T VDMY T X/IC T7\T7\ 
v^oi\\~Ij.fc # r -L I J.IY1D I a JL ixxvLN .IS. J- rio r Vn 

RWVQLEWSELTQEQKTENNMFS * WFG 
FLLLC 


7238 


21139 


A 


7298 


2 


164 


S RVAG I TGAHHHAQL I FVFLVETGFHH V 

pr\A 7\ T7 l/"T T T'CMLTLTA T3 Dft + T UCUVDCV 

trUAAr jvIjIjI bNHrtAXFU -uWr r x i* t* J\ 


7239 


2)140 


A 


7299 


307 


8 


VMII TFWLPHLSG Y I EKSTP YECGFDP I 
LPLP*ALQTFNLPLIVMSSLLLIIILAL 


7240 


21141 


A 


7300 


205 


317 


KSSPACTPSTLGG*GGWFA*VQEFETSL 
DNMVKPQLY 




1 1 14Z 


A 


/iUl 


14U 


EL 


r in r XjI* bir r»irrr f ccrrrrrr i* r x* .r X-» 
FFFAMNINVFY 


7242 


21143 


A 


7302 


382 


151 


SIHRSEFRPRRSHSVSRPESRGIILVYC 
SLYFLGSGDSAPWS * VAGTTGVCPWAS 

JjJL rijC r ecu CtMKLnj v AJiaot-Jrtj 


7243 


21144 


A 


7303 


318 


17 


LVPTKRPQVENYYPVHTYIWYCTKGKRK 
WAGIGGSCLSSQHFGRPSWQDPLSLGV* 

r\AD<T/^irDT7TDCT AIT* VKP/JTaTAT? DT) 

L>y f\j\lntLi!* I f oLy^' IvLn flwyfi rl V rn 1 y 

EAEAGEWCEPGRQSLQ 


7244 


21145 


A 


7304 


310 


132 


QWCDLGATTPANFCI FFVRNGVSPCWSG 

UO*DDTM VT C7\DT UI7r\CirCT?MDTTCTrT 

Wb^lrFJJJjKJjbAKJjrlr y b v r r IN Jf X 1 b XCsi» 
FFL 


7245 


21146 


A 


7305 


338 


168 


REPPLPAANFVFFVEMR3HHVRQAGPEP 
FbbblJiiFAbAbybrivjX ll?Vnl-rl/\Wr* * 1 
YK 


7246 


21147 


A 


7306 


276 


161 


I INAAI KKNEFMSFAGTWMKLEAI ILSK 
LTQELKVK* KLRKFNTDTILLLLLI FKI 
F 


7247 


21148 


A 


7307 


310 


61 


ITLVYCNFFYFINFNFLKLFDSFVI ILS 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

etb 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=lsoleucine, 
K=Lysine, L=Leucine, M=Metbionine, 
N=Asparagine, P^ProIine, 
Q^Glutamine, R=Arginine, S=Serine, 
i— inreonine, v— vanne, 
W=Tryptophan, Y=Tyrosine, 
X^Un known, * = Stop cod on, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 














LKQIYSCTAIVYILIL*AFVQFKFLCFY 

PT.VFTCKFTPTYlfNrT FYK* T'KYV*>tfMYFVF 
C Li± r AJ.M X V X I J\xi If I rvXIv X V ±N1 V 1 if IT 


7248 


21149 


A 


7308 


257 


1 


CGRVDLH * AVFFFFKRWSLTVLPSLDLD 
S WAQVI LLP * PPE * LGQQS CTTPPS SLS 

T FT . Y A FT &f T PVMTCM Q n FVrJTCDR CM F F 3 Y 
irti j-irix tr v in xvl i Dor v vjxvi>r\.\ji i-c cox 

H 


7249 


21150 


A 


7309 


2 


181 


RDG FCHTGHTGL * LLTSGDP PASASQSA 
GITGVSHHARPCLHSLTAFFKNWMVWPG 


7250 


21151 


A 


7310 


25 


340 


FCVFNTMDVKNI * IFLIFVPDVHLG*QK 
NLLLDTI*KKKSIRIQE**QYKSISKRN 
YNTVSDKSWLGTVAHACNPSTLGGQGGR 

T2\ * TOP PHTC3 T .FiWPl/P D PVY Y 


7251 


21152 


A 


7311 


2 


193 


FVDFSMHILDMLSLYLSIVICYITLFLS 
I IDNI YVNGPHKI I IPYFYCIKYIFYI Y 
tmpppljpppt\7t t phfcvp'T'fmtyvpqvt 

HYFLLLGVFGLFHFF*FLYLYMVLFLVF 


7252 


111 CI 

21 153 


A 


7312 


1 


245 


Kx irVjj_ii\.yooiir OXjXjooWLj x rtxi V irXjttjriTXN 
AFFFFKGP*SEPLTLASSLHWYFPGGHF 
LQGPAKIDDFNFSPILGKLKKKSPF 


7253 


"111 C/f 

21 154 




7313 


167 


2 


UAr V~XtXi V v L v l^Klyo^2\A_L3lli\\j X V3XjJYK.iS.ttXj 

RMGK*GQARWLTPVIPALWELEAGGSP 


7254 


nil cc 

21 155 




/314 


1"' A 

3j>4 


i <o 

i:>2 


v.HV/\yiiVJVyW X 1/1 XI O 0 i fX VjXjK i Orl 

LSLPSGWNHSWTPPHPANFFIMGSCHLP 

1_J JLlv X\ 


7255 


21156 


A 


7315 


2 


164 


PKVGQDGLELLTSGDPPASAFQSAE ITG 
VSHHARPTFYLS * QVGLTVTS FRNPQ 


7256 


21157 


A 


7316 


155 


9 


QRIKH* FHWLGAVILTCNPSTLGGQGGW 

TTfZnOIfPPTQT HVMATPET Y 
x luuyAr a loiiri x i iv-ix xtxcxj x 


7257 


21158 


A 


7317 


1 


293 


GIPSTWDYWCTSPQPHF*FFFFFF**KR 
DLAMLPGGVxxNSWIPAILQPKGPKVLGL 

y/\VM J XXjxx\\jX XolvJtrlWnxNxSXUKLiXHi/Vij 

SKLLTTKRFKRLL 


7238 


21 159 


Tn 
J\ 


7318 




15 


olM^ijV X XjX ivlrXJO V X Xr XjXjX* VIX. 1 VootlVA 

QVGLELVASSSTPI *ASQSVEITGVSHH 
ARPKKVT 




111 £Lf\ 

21 lou 




73 1^ 


loy 


2CD 


n^RMaT.QP^pnfirQPP+firHfTPaiiOPTP 
QNPA 


7260 


21161 


A 


7320 


169 


260 


KPQKI PVACDQQFTDKRNCSHGDYECNM 
VDFFFFFFRERVSLCHPGWSAW*SGDH 
TTALQPG 


7261 


21162 


A 


7321 


580 


400 


FFFFERESRSVARQPGVQWRDLSSLKSP 

FEFW*RWGFTTLARIVLLS* PCDPPTSA 
SQSAGITGVSHCAQP* FFFFERESRSVA 

PSSWDYRRAQPCPANVFEFW 


7262 


21163 


A 


7322 


130 


26 


SPGPLFGLIEPLPSRFKGFFWLNLP*TW 
EYRLALTAPIKFFKFLIKTGFPH*GIKL 


7263 


21164 


A 


7323 


17 


303 


KSSGTIKGRAQSPFLQLVAIEQCQDCFH 
FRSQF * YLKKQLQPGMMMKAYNPS YSKG 

* CPT3 TT* anPPTfQPT fiMTTVT Pr*T IfTKTM 
oKK X X * ±v^jl,c X\oKXjvji1nX X I\XjL.\^1jj\X1>J1N 

SLIFKITENIE 


7264 


21165 


A 


7324 


194 


S 


DWVEVLKKSTYRPDTVAHACNPSTLGG* 
GGSIT*GQKFEIRLGNVVGPCLKKNLTN 
QLGVGG 


7265 


21166 


A 


7325 


413 


72 


GVPIFLKTLTGKTISSERESKVPLEKGK 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
etb 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=A!anine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=GIycine, H=Histidine, I=Isoleucine > 
K^Lysine, L=Leucine, MNMethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
i=I nreonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














ANFRDKRGIPPHHKRVIFAEKKL*KGRS 
PFDDGFQKKLTQPRVSAFKGHPPLQPLY 
At 1 ivb r Kblj 1 HHQDCPr WHIj 1 Li-jbAKKU 

Q 


7266 


21167 


A 


7326 


394 


177 


NVNHTHTHTHTHTHTHTHGWCKSDTGYS 
oDb*CiTPRCInDP JHiVPoSbyTVNIjAMG 
LTLAKEVLANMTQTEV 


7267 


21168 


A • 


7327 


232 


22 


SRPSGSXKGAPALQQSSCLS FLS S WDYR 
HKPPYLTNF CI r L* SQSr PMLPRLLSNS 
QPHSNLPKWELQV 


7268 


21169 


A 


7328 


232 


3 


YHRSSGSLISLIQIRLRSKI*APSIPII 
KEWLSSVAHVYNPNTLGGQGGRLT*GQE 
FETSLGNIARPCLHENCLKN 


7269 


21170 


A 


7329 


364 


3 


PKARRATVRRTRVS KAPSNRQCLGVNTH 
VLGHRRPVKGNAKTRWSASESQAHTCNI 
TTLGGQGGLIT*DREFESSLATMQNPVS 
V SGWLQKMFS KQKL YSKFFPNVAYKAF R 
IWLLCARA 


7270 


21171 


A 


7330 


177 


2 


DPHLHKKFKARCGGMTL* SQLLRRLR* K 
DQL S LGVGGC FE LL WCYCTP AWVTER D P 
SC 


7271 


21172 


A 


7331 


413 


40 


RHMYLMFYTYIYVLDIYTYTVFFMYIYI 
P*CVCKYISCLHIMYRFISMYPCVYVYV 
CM CV CNV I HV I VENAG S CLAL SAL I EM E 
GMARDS KMWT FN P D E KMRLS Y KKQLEKL 
NFMILCPTWKEF 


7272 


21 173 


A 


7332 


147 


49 


KNY I Y FCRRGL PML PRDGLNS RAQ E I LL 
PQPPM*LGLNSRAQEILLPQPPM 


7273 


21174 


A 


7333 


148 


361 


S WLGTVAHAYNPSTLGGRGRQ IT* VQE F 
ETSQSNIARPLLYSVLAYAVQPPLPMVA 
MRKL WL I P TSK I TAL 


7274 


21175 


A 


7334 


305 


2 


CPGGF VTRVHR * YVRLSQPDLRVFTSGA 
MPASAAQSAG ITGMSHRARPVCS IIIFL 
NIFKV I CI Fv ELTI PNQNKQNTFL5STV 
ASCRILASRPTPIVWL 


7275 


21176 


A 


7335 


1415 


1164 


FFLRQSLLYCPGWSWAQSQLIASSITQ 
A*AI JjPPUoWD jcbbyrt 1 WJjl\.r r r C*LKjr 
SMLPRLVSGSWAQMVLLPQLPKAQAKL 


7276 


21177 


A 


7336 


1 


206 


GTRADHLMSGFQDQPGGHTETLSLLKIQ 
KoyAWWHVPVIYl.LiP IK* lUAKlSbljfc.KLjrll 

IALLKVRNRYIF 


7277 


21178 


A 


7337 


65 


302 


KSSLr YFLKIETGr Hi vAUIGL*LLbSU 
DPPASVFQSAGITSMSHHTQPKELLKST 
RNTLEEPNREWAKAREWAHRRNT 


7278 


21179 


A 


7338 


208 


38 


mdhc 7\ vMTj , u , vnDP'i , 'c , rir nfrnTi T7*\ r/^\ T iTi_r"r>T 
WPvh>AisJ>JJic*jPKtjl r UJ_^iUAc.vyi«HJJJj 

GSLQPTPPGLK* FS * LSPRV 


7279 


21180 


A 


7339 


2 


198 


ARADHPRSGVRNHPGQHGETPVSTKNTK 
N*AWWKVPLVPECWDYILFYMFTNCMDN 
KLPKWLTLC 


7280 


21181 


A 


7340 


2 


227 


ARGE I MS FAATWMQLEAN ILSKLTQE * K 
AKYCMFSLLSGAKHWVLMNI KVGTGDTR 
DYIajCSTQAVLKNIiGTRAE 


7281 


21182 


A 


7341 


3 


300 


HEGRNYTLTGRDS CTIAAS AEKACGACP 

VD VNCKEQTMPEC * AVAL * CTVHL I YVT 
INNALCLPQLTLFL 


7282 


21183 


A 


7342 


210 


351 


RGLGT VAHACNLNTFGG * GRW 1 1 * AHEF 
ETSLRNITKPSLRENPDML 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


JYI 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F~Phenyhlanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Metbionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threomne, V=Vahne, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


7283 


21184 


A 


7343 


306 


164 


I YMLYFI Y I TL I Y IHTHTHTHLYLDKY I 
YIj * CVHCFL * * AYM I CHQR 


7284 


21185 


A 


7344 


442 


8 


S QMS KLKT W FRAVAHA WN PS TliGGQGRR 
IT*VREFETSLGNTARPNLYWtI 


7285 


21186 


A 


7345 


3 


248 


HELSLSLLLSPFFFKNEKGSR*VAHAGL 
KXjIjTS S DPP T * AS Q S ARI TG I S HCAR PL 
LSHFLAYLALPCHQNRLCTGHLPST 


7286 


21187 


A 


7346 


2 


246 


HELYQSILLSPLFFKNDIGSR*IAHAGL 
QJjLjT* SDPPT*ASQSARITGISHCARPIi 
LSHCLAYLALTCHQNRLCTGHLPST 


7287 


21188 


A 


7347 


1 


130 


GTRLLKRLTQETRLNPGGRGCREPSLKY 
CT PT WARQ S QTVAHN * KRLTQETRLN PG 
GRGCREPSLHYCTPTWARQSQTVAHN 


7288 


21189 


A 


7348 


516 


288 


FFETESRFI TQTGVQWCDLG PLQPLPPG 
FKGFSCLSLLSS*DYRCLPPHPANFCIF 
SRDGVS PC* LGWSQTPDLK 


7289 


21190 


A 


7349 


417 


148 


STS WVQAGVQWHDHGIiLLPGPPRLK* S 
SHLSLPHG* DYRH I LSCLANF* NI FVEA 
GFRHTAHGLEPLGSSDLPASASQSAGIT 

/~+KA PT7T T\ 

GMSHIiA 


7290 


21191 


A 


7350 


143 


2 


TQKFKTSLSSWDYRRVSAHVANFCIFSK 
DRVSP Y* PGWSRTPDPARA 


7291 


21192 


A 


7351 


189 


37 


EAIAVRRI I *NLFFFFFF*RWGLTVLPK 
LVS NS WAQA I LS PQP S KVLRLQA 


7292 


21193 


A 


7352 


131 


2 


KSRLGTVAHACNPSTLGGRKGWIT*SRE 
FKTSRANMVKPRPRA 


7293 


21194 


A 


7353 


1 


106 


GTRGQCLTMLPRLVLNYWAQMILPPWPP 
X VLGLQV 


7294 


21195 


A 


7354 


939 


653 


KGNFI FLRQS FTLVAQAG VQWRDLGSPQ 
PPPPGSK* FSCLSLRSSWDYRHGPPHPA 
NFAFLVEMGFLHVGQAGLELPT*GDPPT 
SASQSAGITGR 


7295 


21196 


A 


7355 


270 


26 


EARFLGLLRDI AQLC YPGWSAAV * S * LT 
AASTSWAQVKASTHLSLPNN*D*ARATM 
PG * LKK YV F F * RQGLA I LPRLRIjV F 


7296 


21197 


A 


7356 


329 


206 


HHLANF * TFCRDRGLTMLiARLNSWPQAI 
LlLPUP P KTV JLiGIjQM 


7297 


21198 


A 


7357 


128 


253 


MRPDTVAHTCNPSTLGGQGGRIT*THEF 
ETSLGDMMKPYLYK 


7298 


21199 


A 


7358 


2 


108 


AREGSGGQGCSEP* WRHCTAAWVTEQDP 
VSKTIKK 


"7299 


21200 


A 


7359 




340 


GTR ERNWHLA * ALHTHTHTHTHRHTHTH 
THFSFPANS * RDTFVI F * QSLLLE IMTS 
FSPSLGLKCSFSQQPSGLLSSIAGERDS 
RI HGENTVSMCLHNRLKAELRSDV I SLR 
V 


7300 


21201 


A 


7360 


385 


1 


SVGPSKPCAGYNLLVCHLLRPLEKRS IR 
VGVT* FSRCHLSPLSLTRKANSLTPCTS 
WMRRCL I LLRLTLGVPHPMSCTHCPTLP 
S E PQLE MQKS P I F CVAHAG S PR LE 
LFLFGHLGSTPNLPSC 


7301 


21202 


A 


7361 


234 


29 


LSTEPKGSKALKFNVTLiiLNIAF* CNSG 
CNSNL*S*QQY*RVIQSHHLSQTLWLTP 


7302 


21203 


A 


7362 


68 


339 


DRD I AN KPAWQLCQSR I LYLAQLQFI DQ 
GELLPDFRIHYKVI * TCTAWH*RRNRLI 
DQWNRADI PDLNPCQAWRLTFV I LVLWE 
ADVGGS 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
{^Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=lsoJeucine, 
K=Lysine, JL=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine y 
i—i nreomne, v— valine, 
W=Tryptophan, Y=Tyrosine, 
X = Un known, * = Stop cod on, /—possible 
nucleotide deletion, V=possible 
nucleotide insertion 


7303 


21204 


A 


7363 


168 


2 


CFYS THAHI FKE LRQRFG AVAHAYNLS T 
t .raip czczv t .T*nr\ vppt'cjt „zusjwr\rK'p'P<: r* 

JjvjrtJXvVjvjrtJj i. Uy IS. C ili J. o l_Lrli>J l\i V iVlr XlO 


7304 


21205 


A 


7364 


399 


62 


QPGQYGKHPVLIKNSKIKPFWGDPPWP 
NAREG*A* KMVEPGKVRVQSAQIKALEF 

AFGKASKGGSPRSRILRPPGFLPYFAHF 


7305 


21206 


A 


7365 


289 


76 


VCDVLRESRAQLSRLKCSGI I *AHCNVS 
VDRFEAYGSRGYVCA 




2I2U/ 




nice 


^CA 


1 Qn 

1»Z 


y VMrliJriv^lM r JVL)XA?oor* Jtr±r 1 XjAJt* vHuJI/ivj 

TCHCAWLI * FLTWKIGFKTSLKI IVLLG 
YFWF 


7307 


21208 


A 


7367 


155 


2 


ELGTKREG6^EKSArVAHACNPSS2jGGRG 
RWIT*GPEFPTSMANMMKAHPRA 


TO AO 

/iUo 


21209 




7368 


J 42 


1 OA 


T ,CT DCCIJnDUVTDDDT 7Al\JPPIf PPPTf P * C 

ijoijr'ooVviJfCK x 1 tr fKi-iHiNr r ivr r r jvk o 
LTMLPRL YRPFY I TTHQTLNSAMTF 


/joy 


21210 


-ft. 


nicc\ 

15 69 


I 


21 1 


KLNEILTHATTWMNLENMLIGRS*TQKA 
TCCVIPFMDSIYV 


7310 


2121 1 


A 


7370 


190 


344 


I iSJNV 1 ViyK.XxjX Vitro 1 VAJri/iUJNJrolJuvjVjy 
GGRFG*GQEFETSLGKIARPCLY 


7311 


21212 


A 


7371 


24 


254 


KPQG IGAHASSQH YGRPRREDHLSLG V* 
DQPGQHGEIPSLQKKKKKGLPGRGGVCL 

VOTaTCT VTT /"* I? T/TvTT 7T\7T 7T TDVP 

JNJrWr JjKXJjtjCiKiN VVV VIjI.c'Iyo 


7312 


21213 


A 


7372 


43 


358 


DSNLNYS Lj FFKG EADLGTTQ VLTHPS TT 
AMYFVHYCQSP* ILYGTINT* PPWEKN 
PIHIKTPSPCLQASTAINLQLSHINCNS 

j\>\X JrJTlr'J-Ajr X yy X I Jr ir XiX vrio X 


7313 


21214 


A 


7373 


2 


288 


IHI I LAFTSSLLG I LVYRSHLI ASLLCL 
HGI ILSLTI IAPLI PLNTHSLLAPIGPI 
AILVI AA* QAAVGLALLVS I STTYGLNY 


7314 


21215 


A 


7374 


335 


1 


VQWNDFGS LQNP P PG VSP FFWLSLPNjNL 
GFKGPPPRPGPFLKF* *KPGFSALTRMV 
SLSRPRDSAPLAPPKAGVSRLNPRARAP 

PPPPPVPr'PUPPT T CPPWQT.aHK'fir'T 


7315 


21216 


A 


7375 


399 


91 


RIFFFPPSLKAPPHRQKPTPFPPTKKPL 
VFSIPPIFPPKFLKKNPFSPWEVFLFLS 
ppnimA ppripmr* vdf^wwPYdTi dot .t(~\ 

NSPQGVPPKKKKKKKKKKN 


11 l£ 

/J Jo 


2121 / 


t\ 


/3 /O 


inn 


131 


VT.VTTf VVMRTT TT.PVTT?TT71MT TT* RMWITP 
x Jux jl. j\r lshc ± ± ±xjc i xr x tin ir * r r*i w i.x r 

NLLFYIFYYILINFNILIILIINFFYFI 

liLIILNIILILYIIIYYFIIKNIIKT 


7317 


21218 


A 


7377 


2 


156 


FHHW*AGLELLGSSNPPTWAFQSAETT 
o voJriirii^v,j-*Jt nr xxjxcxvyoxjxtoxjv^ij 


7318 


21219 


A 


7378 


319 


59 


DTKMNSQD ILYSI QSNRT* W * L I KRQ * T 
KQKSAPRNGNEVP*THQHSTATTLISQQ 
PSTSKQDPPAAK* L* FAEGRLLAPPSNN 


7319 


21220 


A 


7379 


308 


12 


FSPTISNFNYKSNPSPLIYLFQ* PITQL 

T PTKTT QTWTDT >KfT.* PVT.PT.TVQTQTQT T. 
Xr 11\ J.O 11M X tr J_il\ Li r I XjKXjX lol oXoXjXj 

PISNNVKIK*QFEHTXPTPFLPTLIALT 
TLLLPISPPILIIL 




0 1071 
1\jLL\ 




/JOu 


i i 




RRDLAMVPRLFSS S PGLK* SLHLGFPEH 
TOYSHEPPSWLLFFLFLITEGYFF 


7321 


21222 


A 


7381 


300 


2 


KFFPPPGFFFKGFFSPL*SHQKKTQIFF 
GEFFWGFKKFFLFFFPPGGFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFLFFFF 



1000 
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SEQIJ> 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
resiaue oi 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=A(anine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=PhenylaIanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, PHProline, 
Q=Glutamine, R=Arginine, S=Serine, 
i — inreomne, v— valine, 
yv— i rypiopnan, x— jyrusine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, Impossible 
nucleotide insertion 














FF* TTJCJT.FTrVM^FT .T 


7322 


21223 


A 


7382 


163 


160 


PGTSNDEIQRRNSRNRISPIAER*GLSL 
LPRLEWMGSNEAHSNLOBLQGSSNPPASA 

K TM T TRV Cl h r a * PP T .P wm n Q WT? 2\ W Q w 
o * jl i*i J. j. v onv^rt. it rvi_iCt nt luoxvCiruiijiii 

LELQGSSNPPASAS KSTMITGVSHCA 


7323 


21224 


A 


7383 


1 


183 


VCVCVCVCVCVCIKWDSYSAIKKNELVA 
LICGS 


7324 


21225 


A 


7384 


2 


196 


KIQMWPGWAHACYPNTLGGQDGRTA*G 
LTYDGDKS 


7325 


21226 


A 


7385 


3 


192 


RNLSSLQPPPPGLK*WSCLSLPSSWDYR 

PPPPPPANPFMT iPPPPRTNfiVPPP^Pfiy 

YFEPRK 




*5 \iin 
Z IZZ 1 






ZOO 




TMT.QfSRr .P c ?anT.TMTCl*PWnPPa c 57iCinQ 

AGITGVSHRARPLVLFLRDSLYSGTQAG 
VPWRDLS SLQSRLAEQKRFCFP 


inn 


Z IZZo 




71Q7 




j 1 1 


C3 If HO ICPT . QPVZXW P PTOP Q T T .PPP PP W T M + 

GQEFETSMASMVKPCLSLLKL 


7328 


21229 


A 


7388 


3 


218 


EIMSFAATWMQLEANILSTLSHE* IAKY 
CMLSLLSGAKHWVLMNIEVGTGDTRDYL 
GCT YPS STEQLGYYS 


7329 


21230 


A 


7389 


521 


58 


SDCGLTPIAQTGVHWHHLSSLQP*LPWL 
HiVETGFCHVAQDGLBLLDSNNP PSSASQ 

CanTTnMCrr'7ir!T.DDT?*VT "D CU7T5VT \TJ T 

AMQVNEQGTSQAGEAVSEADAGQGCCCG 
LCLHRIIiSQQYALP 


7330 


21231 


A 


7390 


169 


322 


APGNFKTN*RKNGFWAGNTLGVSV^3K 
NGWAWWLTSVI PALWEAEAGGS P 


mi t 


ZlzJZ 


7\ 
J-\ 


I5y\ 


1 


1 C2 


VLGSHARKNSLPVHEVKAYLL* KYLSIL 
RATHILISPDGCWVPIFIAQSYIFLSSC 
GFFLLFFLLLLTLSFCSWHLIISFLAbF 
VIPLY 


1111 
IDOL 


Z1ZJ.5 




/3yz 


im 
5 /Z 


3 


TnBPAPPaPfTtlCVTTDfiJKrDO T'V/nV'T UC 
l^ftu/i^l MCi^ JlJr o Jxvj J. KWIN rolv l!AjJ_>riO 

DMSWSLCLFLSGAISVLCNFRLSGSSDS 
PMSA*QAARTAVIRHHTRTjIFCIFSRDR 

FTP FttWVPnFNTT .T .M T J »T jHT »F QP T /IF <^ T <? 

ATLLTLMLLV 


7333 


21234 


A 


7393 


362 


176 


ARMECSGAI *AGLRGSSDSLASAS*EAG 
YTGKCGHAQ*TFVFLVEVRFHHVGQNAI 

WTPTQT. 


7334 


21235 


A 


7394 


2 


304 


ARADCCARNE*GSVGMGPSEPGVGYNLL 
VFRFLRPLEKHS IR VGETRFSRCCLSQL 

ALHPLSCTQRPTSHSE 


7335 


21236 


A 


7395 


256 


3 


MYYAARYWNKAKITTESLRHWAGHGGSH 
PYNPSTLGG*GGQIS*AHKFETSLGNMY 
KPHLSKNKQTNKQTNKQTNI SWSWCTCL 




2 J 237 


A 


Tina 


346 


2 


GPLNFTFHLQRKNCNLGHTRSPGGVYHV 
GRTETATGPIKRRNGLGAVAHTSNPSKL 
GGQGGWIT*GQEFKTSLSLPRV7LTPVIP 
hV 


7337 


21238 


A 


7397 


2 


242 


RRYSDYPDAYTT*NILSSVGS FI SLTAV 
ILIIFMI *EA*KKKKKKKKKKKKKGGPF 
KKTLGGPKFNRRMEGKNFSLKGGR 



1001 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I=Iso)eucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine. P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unkno\vn, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 


7338 


21239 


A j 


7398 


1 


289 


TKLLGLY YTLLQASNDFKS PFS I FDG I Y 
GSTFFVTTGFHGLHVIIGSTILTI*FIR 
QLI FHFTSKKHFGLEADA* YWHLADARR 
HSESITHQCCSL 


7339 


21240 


A 


7399 


346 


40 


III FTNKI YI KINKRLKLFI I KILDLYI 
1*1 FFFRK* FTFKQ* KF* I ININKI YYN 
NIK* * IFIYLFFFFFFFFFFFFLFFFFF 
FFFFFFFFFFFFFFFFFFFLKK 


7340 


21241 


A 


7400 


1 


334 


NSCRRQRLLDLLLLPITL*LPQLIGYIE 
KSTRYE CGFHP I S PARVPFS I KFFLVAI 
TFLLFDLQIALLLPLP*ALQTTNLPLIV 
MSSLLLI I ILALNLAYE* LPKGLD* AE 


7341 


21242 


A 


7401 


2 


302 


HTINLLGERYVLATTY* *STMTLLLTGL 
TVLHTGLYSVYLMTTTQWG*LTPHI INM 
KP*FTR*DTLVFIHLSPILLLSLNPDII 
TGWLVLLSYELYLLT 


7342 


21243 


A 


7402 


57 


338 


TTHNVIGYIEASTAYESGCDPISPARVP 
FSI KFFLVAITFLLFDLEI ALLLPLP*A 
LQTTNLPLIGMS SLLLI 1 1 LALSLAYE * 
LQKGLD*AE 


7343 


21244 


A 


7403 


407 


68 


QEELVDPLTTVREQCEQLENCVKAR*RL 
* LCDERVS SRSHSEEDCSEDLFDFLHAR 
DHCVFLILYINLKYMCELT 


7344 


21245 


A 


7404 


494 


154 


SLFI FYLYLI FLF* * FFFHYFFYFLLS I 
FSFKLFFFFLLHFFFHMFFFLFFSPPNK 
IFFPKIPPRFFLFPLF**KFFFFLPFKF 
LSPLGFFF*SPPFFFFFFFFFFFFFFFF 
F 


7345 


21246 


A 


7405 


1 


400 


NPGRFLSTSNS SLYERTRE I RPTSQSAF 
PRK* YHLKKKKKKKKKKKKKKKKKKKKK 
KKGGPLKKKPRGGQK* TGGEKKKFFPKR 
GGKKKPPGKF*KKTFFWGGKKMGKTPQK 
KINPLGKKKIFKGKRGKTPPL 


7346 


21247 


A 


7406 


2 


257 


RASLCHPGWSAW* SWLIAALTSWLKQF 
SHLiNS LKCWD YRH E P PHLA * LRFEHRHS 
GSSICTLNR* ILLSFPILSIRYKNGLIL 
Y 


7347 


21248 


A 


7407 


259 


47 


YVFFFFLFLLFFIRLQSVSPFFFFFFFF 
FFFFFFFFFFFFFFFV r K*SFIEMFSFW 
LYLLRQVS I S IAYTLFG 


7348 


21249 


A 


7408 


3 


248 


VTFL* CDVEIAVLI PIiP*ALPTTNLAL I 
DMSSLLLI IILALRLAYE*LRGGLD*AE 
EHQATHS AYERAP PL 


7349 


21250 


A 


7409 


2 


350 


RSGMPRRYSDYPDAYTT*NILGSVGAFI 
CLTAG I L 1 1 FM I * EAFAS KRKVLI VEQP 
SINLE*LYGCPPPYHTFEEPVYIKSRHI 
RKESNPPKLVSSQPHGLHDFFKKKLGTI 
TEKN 


7350 


21251 


A 


7410 


2 1 


232 


LDQTYAKIHFTIVFIGVDLTLLPQHFLG 
LSGMPRRYSDYPDAYTT*NILSSVGSFI 
ALTAAIL I I FMI * EAFAKKF 


7351 


21252 


A 


7411 


2 


321 


TSLLAVLLIIMTL*LPQIoNRYIEKSTPY 
ECGFD PIC PGRVP FS I KFFLVAI T FLLF 
DLETALLLPLP*SLQTTNLPLIAMSSLL 
L 1 1 ILDLNLAYE * LQKGLD * AE 


7352 


21253 


A 


7412 


2 


194 


S I ECFLVP I TLLLCDLEIALLLPLP* AL 
QTSNLPLI VMSSLLLI I ILALNLAYE * L 
QKGLD*AD 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=GIycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W-Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possibJe 
nucleotide insertion 


7353 


21254 


A 


7413 


1 


197 


LYTT Y FVTKTLLLTS LFL* GRTAYPRLR 
YDQLIHLL*KNFLPLTLALLI*HVSIPI 
TISSIPPQT 


7354 


21255 


A 


7414 


1 


185 


STPLP* ALQTTNLPLI VMSSLLLI IILA 
LSIAYE * LQKGLD* AEKKKKGAAVLKDP 
SGGPS 


7355 


21256 


A 


7415 


347 


228 


LE*LYGCPPPYHTFEEPVYIKSRLKRKE 
SNPP KLVS SQPKGLHDFFKHI KNYRKNK 
RKEGLASVKENDRGP 


7356 


21257 


A 


7416 


1 


296 


HAYDHASAQLNG Y I EQSTP YECGFDP I S 
RARVPFCIKVFLVAITFLLFDLEIALLL 
P LP * ALQTANLP L I GMSSLLL III LALS 
LAYE*LHKGLD*AE 


7357 


21258 


A 


7417 


3 


202 


TCS I * HVLVRMT * L I YDLE I ALL * P VP * 
ALQSTNLPLISMTSLLLI I ILALSMAYE 
*LQTGLD*AE 


7358 


21259 


A 


7418 


1 


321 


SNTLLALVLITITS*LPQLSGCVGECTP 
YECGICRILNARALCCMQI FLKAITFLL 
FDLE I ALLLPLP * ALQTTNLPLI DMSSL 
LLI 1 1 LALSLAYE * LQKGLD* AE 


7359 


21260 


A 


7419 


J 


212 


RTRG 1 1 FR PLSQ S L Y * LLAANL VI LT * I 
GGQPVSYPFTIIGQVAYVLYFTTILILI 
PTICLIENKILKWA 


7360 


21261 


A 


7420 


2 


257 


GRVGGRVG YS I AFFLQAIT VLVLDLESA 
LLLALP* SLQTTNLSLIVMSSLLLIIIL 
ALSLAYE * LPLGLD * AEYWGEMMTTTLD 
A 


7361 


21262 


A 


7421 


367 


269 


HCDLKLLGSSDPPASAS*VAWITGTCHH 
TKLS 


7362 


21263 


A 


7422 


2 


116 


ALQTTNL PL I VMS S LLL 1 1 1 LALRLAYE 
* LQKGLD *AE 


7363 


21264 


A 


7423 


2 


336 


KPECGREG ILPCCSSSAWPEGSFRPFQM 
NLGSFLSFFFLFFFFF3KGLFLCPPV*S 
PYGKSNLLEPSASRFKQFFCLGPLNNWD 
YRLFPPFPGNFGVFKKNRVFPFWPSLV 


7364 


21265 


A 


7424 


3 


154 


LFL*IRTAYPRFRYDQLIHLL*ENFLPL 
TVALLI*HVSIPITISSIPPQT 


7365 


21266 


A 


7425 


3 


154 


LFL*IRTAYPRFRYDQLIHLL*ENFLPL 
TLALLI*HVSIPITISSIPPQT 


7366 


21267 


A 


7426 


1 


314 


INTLLTLLLI I ITL * LPQLNGYIN* STP 
YECGFDP I SPARVPFS I KFFLIAITFLL 
FDLEIALLLPLP*ALQATNLPLIDMPSL 
LL 1 1 1 LALS LAYE * LHQGLY 


7367 


21268 


A 


7427 


1 


331 


I L I INTLLALLL 1 1 1 TF * LPQLNGY I EK 
STPYECGFDPISPARVPFSIKFFLVAIT 
FLLFDLEIALLLPLP*ALQTTNLPLIVM 
TSLLL III LALRLDYE * LQPGLD* AD 


7368 


21269 


A 


7428 


2 


373 


SDRNHI IHALVITI LLGLYFTLLQASEY 
FESAFTISDGIYGSTFFVATGFHGLHW 
IGSTFLTICFIRQLIFHFTSKHHFGFEA 
AA* YWHIVDVAGLFLHVS I YRGGS YNHQ 
IIPSVAQESTPA 


7369 


21270 


A 


7429 


378 


63 


PSFFPPFPP*NFFFPPR?LFSPGGFPPF 
FPPPQKGPPPKIPPGFFFSPPFWEKFFF 
PPPPLILPP?RFFFNPP?PFFFFFFFFF 
FFFFFFFFFFS FFFWDFLGSGC 


7370 


21271 


A 


7430 


1 


109 I QTTNLPLIVMSSLLLIIILALSLAYE*L 
1 OKGLD*AE 



1003 
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SEQ 10 
NO: of 
nucleotide 
sequence 


SEQ 10 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ 10 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, 0~Aspartic Acid, 
E=Glutamic Acid, F-Phenylalanine, 
G=Glycine, H=Histidine, l=lsoIeucine, 
K=Lysine, L=Leucine, MNMethionine, 
N=Asparagine, P=Proline, 
Q=G!utamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


7371 


21272 


A 


7431 


358 


290 


FLLVT PNFLFHFLI Y 1 1 IS I VPNVS FFH 
IIYILSIFFPPIIFILLLSSFNFILFFF 
FITSFPFFYFFFFFFFFFFPFFFFFFFF 
FFFFSFFFFFFF* FFFTN 


7372 


21273 


A 


7432 


1 


95 


PTRPPTRPHNSLQL*TPGLKQSYPRLPS 
SWD* RHMPSHPANFRKFF* RWHLYCPGW 
S*TPGLKQSYPRLPSSWD 


7373 


21274 


A 


7433 


1202 


870 


HNIFKIRPHCRLTHRGKPFYWGLLVTPV 
FSPSLRPSFLPSFLPSFLPS*LPSFLPS 
FLLFLSYSFCPLSFLCLRCI PHSAFFWG 
NPRQVQENCVIVTIYRYLSFTALHQL 


7374 


21275 


A 


7434 


163 


1 


S VYVKR I GLGR VAYTCNPSTVGGQDG * I 
T*GQEFETILAKRREPLFLACFLFLI 


7375 


21276 


A 


7435 


148 


362 


NDDNSVEFGIRLHKGLGTVAHTYKPSTL 
GGQGGHIA*AQEFMTSLGHMT*SSLYHK 
LTEYGWRDTASLLMC 


7376 


21277 


A 


7436 


61 


233 


GGDLRHW P S SD S F FLS LS F F F * N KNGGG 
IALLPRLDLNSWSQAILPLKPPNALGLQ 
A 


7377 


21278 


A 


7437 


44 


242 


FVCLFLFFFFEKESTFQHDLSSLQPPLP 
RFKRFSCLSLPSTWDYRIiAPQCLiANF* I 
F**KSGFTFG 


7378 


21279 


A 


7438 


119 


300 


YCVSLLIFFIFI IYI FILLI FKFLFITL 
NYIS FLFF YF*H * F YF * KNF 1 FVFPFGY 
FPRP 


7379 


21280 


A 


7439 


1 


221 


LKLQHHDPTTISLVITRNWRQPKCPNVQ 
Q IVK * SAI KRNELL I HAKTWMNLRGI ML 
SEICQPQKILYLEIPFI 


7380 


21281 


A 


7440 


35 


345 


FSFCCVCVCVCVSLPKHAI IEMLNQTKP 
YVPKGVAYLPEREP F I VPMDPELTA* YE 
DYRTHESAQEPSEPHQLSRRSLPH*PGH 
ATRDHSPPTDSWGKRRTSH 


7381 


21282 


A 


7441 


3 


384 


VIHL* AGAVI IGSRSKSTNALAHFLRQG 
TPTPVILVLGI IETMNLLIQPVALAERL 
TDNI TAGHLLMHL I VS ATLAI L*T I NLTS 
TL I M FTVL I LLT I LQ I AV ALMHA Y VFAL 
LVCLYLHDDAYDHGG 


7382 


21283 


A 


7442 


15 


277 


GLAILPRLVPNSWPKVILQWPPKMLG+ 
QA 


7383 


21284 


A 


7443 


21 


694 


NGGLNAHLASASEFDHSGVQLIEREEE I 
CIFYEKINIQEKMKLNGEIEIHLLEEKI 
QFLKMKI AEKQRQI CVTQKLLPAKRSLD 
ADLAVLQIQFSQCTDRIKDLEKQFVKPD 
GSNRARFLPGKDLTEKEMIQKLDKLELQ 
LAKKEEKLLEKDFIYEQVSRLTDRLCSK 
TSGCKQDTLLFTKKMNGYSRRIKNATEK 
MMALCA*LSMKQAPTIELQKEVQGERRP 


7384 


21285 


A 


7444 


141 


428 


DRSLLEAGDGRIAGDFFFF* TGTWFVAQ 
AKVQWGIHSSLHP* PPGLKQSSYLNLLG 
SYHHWLRPHALFFFLKNRGSLFCPGWFG 
TGFRGTTFWGAK 


7385 


21286 


A 


7445 


35 


343 


GYS I IGCPKFLDEKI FDSGHEKERKINS 
LLF* KKKKKKKKKKKKKKKKKRGGLLKK 
TSGGPKI KRGGGIKNFPI KGGVKKTSGE 
FFGKKPFFGGEKFWEILP 


7386 


21287 


A 


7446 


403 


193 


E YI KQGAE* NNQPKLHAHRKRLTWKETP 
RCIHEGIAPAIMNSDCLVFDTSIAQLFA 
ENGNLGINVTISMC 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
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od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 
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beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid,F=Phenylalanine, 
G=Glycine, H=Histidine, I=Iso!euctne, 
K=Lysine, L=Leucine, MNMethionine, 
N=Asparagine, P=Proline. 
Q=Glutamine, R=Arginine, S=Serine, 
T-Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 


7387 


21288 


A 


7447 


411 


110 


LTDS L INIVYR* KMS Y IS* TNS A* VYLY 
IFCIYIYIYIYIYRRIYSIYRC*YIYYG 
PAHIVLVYSSSYILVYYESHKYGYIMGL 
NILSYIFIYSYFSACS 


7388 


21289 


A 


7448 


1 


380 


PTRPQKLKKNCNGVSNGDLCFFGGGQFL 
IWGFFFFFCILIEGGLNSKNYELKSLIK 
TYGAKKAEIEHTEN*KNRETLLERAAQL 
AIKVTNPNAMLRWEHDYAMYMHNFWLN 
KLNNCHKVAISCKQ 


7389 


21290 


A 


7449 


200 


437 


VQSNNNKNFKRRRL*PGVVAHTCNPSTL 
GGQARRIS*AQELETGLWJLMRPCNYNK 
F INVLDDVHGVTS PVSQYYKKVC 


7390 


21291 


A 


7450 


405 


3 


PPRFFGFFFPFYPLKFFFFPKGFNFFGG 
FFPIFFPPKKKFFFKNSPGGFFLPPFLG 
KIFFFLPPFKFGPPRGFF* RAPLFFFFF 
FFFFFFFFFFFFFFFFFFFFFLQRINFT 
ARPGTPLSRAVPLRGALPPSA 


7391 


21292 


A 


7451 


2 


348 


PRVRPRVRYKLQTKRCDLTLPS * GHSHV 
. SLRNTTFDALKS * DTDSLLS PKLECHGV 
NTPNCNLRLPSLSD YPASLSREAWI AS T 
RLRAWEI FVFLAQMRQDYCMNPGGGGCN 
ELK 


7392 


21293 


A 


7452 


1 


168 


LVHDGHAGLKLLTSGDRPA* AYQSAGI T 
GVRPLGPPTYYYFRTASFMTFICEFATV 


7393 


21294 


A 


7453 


184 


260 


SSRGGSLASRSSR* STLLGLPKCWDYRC 
MPPSLANFFFFNF* I *NFCFW* RRSLGM 
LPRLWNS * MQAI LPLR IRLQG I HFAGT 
QTFSL 


7394 


21295 


A 


7454 


405 


64 


FFFFFNFF*KKKI * DVFAKI FFKI FFFF 
SGLKI F WGGGFQNAP PQKKF F F * KIKSF 
FFLNFFF*KKIFFFVAGWFFLLNFFFR 
GAPLFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFF 


7395 


21296 


A 


7455 


368 


2 


SQS FGRPRWENRLG PEQH I KISRYRPAQ 
VHACHPSTLGGRGRWIT*GQEFKGSTHL 
GLPKYWDYMPEPLLLVYIQCFHFRTFCN 
KAAIHCFKPEQLLYNKTSYFKESYKVKA 
ALVACKPTRP 


7396 


21297 


A 


7456 


2 


183 


TERDS VSKNKY 1 Y IHTHTKTHTY I CVHG 
VIYVYI *SLLLI YNIAYILI YHVFYILL 
IFKA 


7397 


21298 


A 


7457 


281 


127 


YNFCIFM* RCGFNNVGQSGLKVLLRGYS 
PASASQ IAGITGVSQCAQPVAYS 


7398 


21299 


A 


7458 


2 


194 


TSVRKYF*YPFTITDRIYGSTWWATGL 
HGLHVIIGSTFLTICLIRQLIFHFTSYH 
HFGSEAAG* YWHFVEPARLFLYVCIY* * 
GSTWWATGLHGLHVI IGSTFLTI CLIR 
QLIFHFTSYHEFGSEAAG 


7399 


21300 


A 


7459 


22 


319 


GGS VTGAYHRI LDHDRNQI FLA* GGSVT 
GAYHR ILDHDRNQI FLALLI SMLLGL YF 
TLLQASKYFECPFTICDGIYGSTFFVAT 
GFHGLHVI IGSTFLTI CWIRQLILHFTS 
RHHFGLEAAA* YWHFVDW* LFLYVSI Y 
* * GSVTGAYHRILDHDRNQ I FLALLISM 
LLGLYFTLLQASKYFECPFTICDGIYGS 
TFFVATGFHGLKVI IGSTFLTI CWIRQL 
I LHFTSRHHFGLEAAA 


7400 


21301 


A 


7460 


1 


222 


TILLGLYYTLLQA* EYFKATFTI YDGI Y 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
EXJIutamic Acid, F=Phenylalanine, 
G=GIycine, H=Histidine, I=Isoleucine, 
K~Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X-Un known, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














GSTFFAATGFHGLHDIIGSAFLTICFIR 
QLI FHFTSEKHFGFEAAA* YWHFVEAT * 
LCMYVSIY* *GSTFFAATGFHGLHDIIG 
SAFLT I CFIRQLI FHFTSEHHFGFEAAA 


7401 


21302 


A 


7461 


3 


246 


LNYSLFFHGEANLGTTQVLTHPSTTAMY 
F VHYCQP P * I LYGT I NT * PP WHKNP I K 
KKKKKKKKKKKKKKKKKKKRGGGF 


7402 


21303 


A 


7462 


2 


155 


SRSRAALLLPLP*ALQTTNLPLIVMSSL 
LLI 1 1 LALSLAYE * LQKGLD * AE 


7403 


21304 


A 


7463 


1 


136 


PTRPAPSHLLYCKNIKKLFPSGTANEAL 
IIADI FLRCKKKKFKVK* * VLKASEE*N 
GCKHLHGMH * NS AVCTVFWLKQ I MNS D * 
* * KVNTQ * KKKKKKKKKKKKKKKKKKKK 
KKKKKKKKKKKKKKGGENNK* KNI KKLF 
PSGTANEALI I AD I FLRCKKKKFKVK 


7404 


21305 


A 


7464 


166 


192 


NIN*MWYMHTVECYSDFKRKEILIHATT 
WMNLEDIMVSEISQSQKDKYCMILHIEY 
KLYIHM 


7405 


21306 


A 


7465 


416 


55 


LFFLRLGYFFLPNIRVFFFLSTLFFFFF 
VWGFFFFGRFFFFFPPPQEVCFFFIF*G 
I FFFFLFK* KNMFFFFFFLWGPPPFFFF 
SGAPLFFFFFFFFFFFFFFFFFFFFFLR 
KANTKKMF 


7406 


21307 


A 


7466 


295 


84 


I WGNVTLLQL VS CL FCFL FLH * G FF CC C 
CCLFLFLFVCLFVCWFLNLKPS I KCLLP 
FSLSLGCAEIPSSF 


7407 


21308 


A 


7467 


142 


258 


PGTVAHTCNPSTAGGQGGRIP*GHEFKT 
NliANMVKLHLY 


7408 


21309 


A 


7468 


3 


192 


PSEYFESPFTISDGIYGSTFFVATGFHG 
LHVIIGSTFLTICFIRQLIFHFTSKHHF 
GFE AAA* YWHFVDW* LFLYVS I Y* *GS 
TFFVATGFHGLHVI IGSTFLTICFIRQL 
I FHFTS KHHFG FE AAA 


7409 


21310 


A 


7469 


2 


110 


GRVGKHHFGFEAAA* YWHFVDW* LFLY 
VSIY**GS 


7410 


213)1 


A 


7470 


380 


8 


TPQKRKKLKKAGEGVFPPPPKKTTPPPP 
PIFIYFIFFFFFIYFFFFFFFFFQYMSF 
FYSHIMIQRHFLNF*ESFYSCISILQ*L 
FKFFS FLGGL I KL 


7411 


21312 


A 


7471 


1 


107 


PTRPHHFGFEAAA* YWHFVDW* LPLYV 
SIY**GS 


7412 


21313 


A 


7472 


2 


360 


VNTLLALLLIMITF*LPQLNGYIEKSTP 
YECGLDP I S PARVP FS I KLFL VAI TFLL 
LDLEI ALLLPLP * ALQTTNLPLRAMSSL 
LWI ILALGLAYE * LQRGLD* AEWLHSL 
TQPTNNN 


7413 


21314 


A 


7473 


423 


167 


VKVHKCFLPGVAPHAFNPSTLGGRGGRI 
A*AQEFKTSLGNIVRPPSDTCNPIVLGL 
QA 


7414 


21315 


A 


7474 


1 


73 


I KPE * YFLFAYTI LRSVPNKLGGV 


7415 


21316 


A 


7475 


131 


3 


ISWVGAEWLTRVIPAL*EAESGVSRGQE 
I E T I LANT VKPDAW 


7416 


21317 


A 


7476 


2 


103 


PRVRSTGFSSIAHITRDVNYG* IIRYLH 
ANGAK 


7417 


2131S 


A 


7477 


2 


325 


NTLLALLLI I ITF * LPQLIGYI EKSTP Y 
ECGFDPI S PARVP FS I KFFLVAITFLLF 
DLEI ALLLPLP *ALQTTNLPLIVMSSLL 
l.II ILALSIAYE * LQKGLTERDT 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Pro!ine, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


7418 


21319 


A 


7478 


423 


170 


FFSFFFFFFFFFFFFFEIDKNVYLIM* C 
TNPQEKRNLLFCFLVMVSPVCITPHTYI 
CVCVCVIYIYIYIYIYIYIYPFIFSCCD 


7419 


21320 


A 


7479 


433 


50 


RVFFFFFFTPEKKTFFPQEIYLFFSPFP 
L*NFFFSPKPLFFLGGFAPNFPPPKKVF 
FLKFPPGFFFSPPFKKKFFFFFPRFFFA 
PPRFFF *GPPS FFFFFFFFFFFFFFFFL 
TVKFTRGFRGFCGQI 


7420 


21321 


A 


7480 


401 


337 


FFFFFGKFFFPQEFTVFFPPFPFKIFFF 
SLRLLFFWGGFAQFFPPQK*VFFSKFPR 
WFFFPPLFRKKFFFFFPG* FItAPQGFFL 
KGPPPFFFFFFFFFFFFFFFRPI*VGGR 
VGNNPN PNTNYL PN 


7421 


21322 


A 


7481 




84 


KNFLPLTLALLI *HVS IPIT I SS IPPQT 


7422 


21323 


A 


7482 


562 


412 


L>FSFHVSGTMAHTCNPSTLTG*GRIT*G 
LEFKTSLGNTVRSHRYRKKKIA 


7423 


21324 


A 


7483 


12 


233 


DIFHFTSKHYFGFEAAA*YWHFVDWRL 
FLYASIY* *GSLL 


7424 


21325 


A 


7484 


385 


56 


GNF PT P KKFNF FY P F YLS I F F FF PKKLK 
FF*ElFPIFPPPKFIiFLPPNFPVLFFFF 
FLFWEFFFSFLFWFFFFPFFFFFFFFFF 
FFFFFFFF*KTTEFYFVNSNFQHTK 


7425 


21326 


A 


7485 


424 


6 


FFFFFSEGSGLIREDPIGAGALYDYGR* 
LVVVTG*TLFVGVYIVIEIARGNRLCD 


7426 


21327 


A 


7486 


16 


129 


FXXXRXXLIXKBPSINLE*LYGCPPFYH 
TFEEPVYIKS 


7427 


21328 


A 


7487 


183 


387 


IASGLDIFLLLLIiETGSHSVTQAGVQWH 
NHDS LQPRLTGLK* SSRRS PRGS WDYRV 
LLCHPGWSAEA 


7428 


21329 


A 


7488 


351 


169 


EKKIFFFSPRGYGPPPFFFLKAPPHIFF 
FFFFFFFFFFFFFFRGGGGDHKNFFFFF 
WGFLKNFPLFFFFPRV*I* IFFFFFFFF 
FFFFFFFFLGGGGGTIKIFFFFFGGF 


7429 


21330 


A 


7489 


47 


212 


THASALFL* IRTAYPRFRYDQLIHLL* K 
NFLPLTLALIiI * HVS IPITISSI PPQT 


7430 


21331 


A 


7490 


98 


273 


TSGSCCSAHILAFKP*AGMATFITKSL* 
* CHTHTHTHTHTHTHTHYMYI YTHMHTY 
IF 


7431 


21332 


A 


7491 


70 


405 


LYGCPPPYHTFEEPVYI*EAFASKRKVL 
I VEEPS INLE*LYGCPPPYHTFEEPVYI 
KSRQKRKESNPPKLVSSQPHGLHDFFKK 
KKKKKKKKKKKGGGPLKKPRGGPNLTGG 
GKKNFFFFRGGEKKPPGGFWKKTLFLGG 
GKLGTPPPRRF 


7432 


21333 


A 


7492 


22 


208 


RSSLPARPTRAVI LI I FMI * EAFASKRL 
VLIVEEPSINLiE*LYGCPPPYHTFEEPV 
YIKSYV 


7433 


21334 


A 


7493 


2 


204 


KLPETVKKKKKKKFYSFRSIVMLLLYLF 
S * SLSFFLLYSLLFFLFLLS FYI YSFLC 
FPLCKTATTEH 


7434 


21335 


A 


7494 


2 


104 


TTLLLSRT*NKLT*LT?LIPSTLLSLGG 
LPPLTG 


7435 


21336 


A 


7495 


291 


160 


PRHF* IFVEMGSHYVAQAGLKLVASSNL 
LAWASQS AGVTGVSHGSLLI TS CL 


7436 


21337 


A 


7496 


422 


180 


LGAPQGVF*KGPPFSPSSSSSYSSSSSS 
PS * QNI FNKI FLFVCFVLETES VTQVGV 
QWRDLGSLQPLPPGFKRFFTITTH 


7437 


21338 


A 


7497 


244 


296 


FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
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amino acid 
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peptide 
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ding to 
last amino 
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peptide 

sequence 


G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Profine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














FFFFFFFFFFFFFFFFFFFFFFFFFFFG 














GNF*FYK*LLGNRDGVCLPPSSYL 


7438 


21339 


A 


7498 


1 


410 


PTRPDQL I HLL * KNFLPLTLALLI * HVS 
IPITISSIPPYT 


7439 


21340 


A 


7499 


51 


478 


PQKYTLIMKICI IPPGKGFRPVEMLAGG 
NENTRCLIEERS YKYHYSLI KLGLQQQC 
I FS S LLYKYNMI YCLiMLCYY I GY I F I FY 
DMYYRQYSVLYYKLYNYVMLI * I YILYL 
FMC I * KYI YFI FM* YKY I F I YL YL Y* YK 
YIFYIYLCVYKNIYTLYLCNINIYLYTY 
IYININIY 


7440 


21341 


A 


7500 


3 


110 


FTS KHHFGFEAAA* Y WHF VDE V * LFLYV 
SIY**GS 


7441 


21342 


A 


7501 


168 


327 


AFC YKATVI KT I *Y* YKHRYTEQWSRTK 
NPDINPYICGQMI IFLQLKSLHKI 


7442 


21343 


A 


7502 


400 


159 


NNFS PLRNLQGFFGPLKFPMGFQAGPPG 
*GFSFFLRSNFFLDVFRGKY*TFGF*DF 
F YC P S PNE VLGE P PLMGGNKPG F P 


7443 


21344 


A 


7503 


401 


257 


TSQKKKKKKKKTP KKKKKNRE KP PKKKR 
GTRG EPF KTGGG KE KKKF FKKKKKKGKG 
PP * PQLVKRGGREKKKKNLWKKKGAF* K 
KKKKKKKNPKKKKKKPGKTPKKKKGDPG 
GTF 


7444 


21345 


A 


7504 


44 


344 


LLASLANLALPPTINLLGELSVLVTTFS 
* SNI TLLLTGLNI LVTALYSLYI FTTTQ 
WGS LTHH INNIKPS FTRENTLMFIHLS P 
ILLLSLNPDIITGFSS 


7445 


21346 


A 


7505 


286 


252 


I *FAFHHVGQAGLEFLTSDLPASASQSA 
RITSVSQHAWPVHNIFYSLL 


7446 


21347 


A 


7506 


3 


213 


RELPGLKRFFLLSLLSSVJD* WRMPSHPA 
TFC I FCRDEVLP CCPAGLL I LLVWNLKN 
CCCYHHHRYYYCY 


7447 


21348 


A 


7507 


400 


20 


SLPQKKVSLLKLKSAVQLVMF* DRFSLC 
HPGWRA*GQSGVTTASSLRAR*SSCLTW 
EYRPS PLPLADFLFW * RRGLP I * PRWS 
NS*GQVGLLHQPSKMLKFSSLSPCARPL 
FFFFNRSTNLRVQS 


7448 


21349 


A 


7508 


349 


227 


SDFLTSANPPASASKSFGI TGVSHHTRP 
*PRFLKVNLIYSEMEVYN*ASFRHVD*G 
GLELLTL*FTHLGLPKLWDYRREPPHPA 
LTKI FKGKPYIQ 


7449 


21350 


A 


7509 


371 


21 


FLVETGFHHVG*AGLEPLASSNLPASAS 
QTAGITGMSHCAQPNPGSLLNKTMLAVA 
TEQWVGYMWPSVFGHITARPEGKRLSDC 
LHSDTWVCWCVCSGLIFVCFETGCRFA 
LQPG 


7450 


21351 


A 


7510 


330 


14 


DKVCSVTQARVQWHDHGSLLFLSSGLRQ 
S SHLS LLS S WDHRHMPPGMAW FFYF* * R 
LRSHYVAQAGLKLLASSNPPTLTCQSAG 
ITGVSHNTWPKALFSVTDLKH 


7451 


21352 


A 


7511 


417 


33 


APPCGGGPPKRIFGPSRKRGKGPPPPKR 
FF* KNKPPKGGGKKSPPPPKNFFPKKKN 
LGPKKKKSGFWGRFAPPRAPGGGVFNFP 
FKAGGEKFFSRGEINPPRGAGKKKGFFL 
G KKNTKKKT P KKKDL 


7452 


21353 


A 


7512 


81 


296 

O 


CLREGAS H S AAQAG LH WHDHS S LHP *NP 
GLRGSSRLNLLSIWDYRCVPPSLANFES 
CLLKIILGSPFPKNL 



1008 



WO 01/64835 



PCT/US01/04927 



SEQID 
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nucleotide 
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ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D-Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Va)ine, 
W=Tryptbphan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


7453 


21354 


A 


7513 


412 


279 


KKYTQGLGWAHAYNPSTLGSQGRRIP* 
AQFETSLGNTGRPCLH 


7454 


21355 


A 


7514 


74 


126 


KI *DWVPKGAPLQGPVF*NSTFRPRVRN 
RGPPWPRPGPTVWKG*NKKN*PAPRGSN 
PPFFFPGNPGPFKLRGWPGSS*GSFQNC 
KKAQAG* APT* KPGPPPPQNKV 


7455 


21356 


A 


7515 


332 


20 


GERQVGSYFFMNTERLFGMTKKF* KWI V 
SMVAQHSS S I SGVWVFRGQELAFPLS PD 
WQVDYESYTWRKLDPGSEETQTLVREYF 
S WEGAFQHVGKAFNQGKI FK 


7456 


21357 


A 


7516 


1 


167 


RTRGQRFTVXARIVLIS *HCDLPASASQ 
SAGITGVSHCTRPGLS FLYTPTAKHST 


7457 


21358 


A 


7517 


2 


223 


GRMEEMGSHYVA*AGLEFLGASDLPA*A 
PQRAKILLLLGGHKPGLTILPRLSSNSW 
PHVKWPRWPLKALGLRG 


7458 


21359 


A 


7518 


151 


476 


SLGKFAVSFSFSFLK*RRALAVLPRLCS 
NSWP*AVLLPWLPRVGIAGMSYHTQPDY 
H FQARAVAE I LGLVSTQH S S VTGL P VEN 
SFRAGHGGPRLQSQHFGRLRWADC 


7459 


21360 


A 


7519 


3 


134 


CS FRLGAVAHI CNVTTLGGRGGRIA* AQ 
EFETCLDNIARPCLY 


7460 


21361 


A 


7520 


14 


219 


APS I HG FGFLYTY IHTKTHTHVYMY I * V 
YIHTH I ESI KCQPLTQLQSNKKANRI VF 
RTQFMLTCYFMR 


7461 


21362 


A 


7521 


134 


334 


RKHSC*IFFNFCFYLSLETRSHYVASTG 
LDLLAS SNP PAIiASQRAR I TGMSHHAP P 
VLTRFKKSLF 


7462 


21363 


A 


7522 


406 


145 


FLKTGFSPFGPFVF*TPPPGFPPPPPPP 
IFGVPGGGPFAPPLFFFFPQTKHFPPFL 
NPPSPPPQRGFFRGFFLFPPAFFLKTPP 
FPF 


7463 


21364 


A 


7523 


3 


382 


HPQGEGGRLTSLE*NTSLRLKKKKKKKK 
KRGGPFKKNPWGAQI * PGKKKKNFFLKR 
GAKKNHLGNFGKKPYFWGGKKLAKPPKK 
N * TFKGKKKFLRGKGGKKTPKPWLLKI F 
FSGFYLKKIFPPGP 


7464 


21365 


A 


7524 


319 


106 


GTRS CS VT* AG VQ WHDHS SLQP * TP WLK 
RSSWLRNRWDYKHEPPCLNNFYFFCRDG 
GLLCHPGWSQSPELK 


7465 


21366 


A 


7525 


433 


22S 


MQPYNWEVNSSSQLSLSSARQRVTHVLT 
GNFFIK* ERWDLPVLPRLVSKSWPQVIF 
PPWPSKVLKLQV 


7466 


21367 


A 


7526 


26 


351 


CRSLV FS LPKPGLR FVS I LH I QVFSHFD 
FF FLTGNTQK * RK* TPF I FPA* LFKTP A 
PKPGMVAHTCSPSYSGGLDGRIA*AQEF 
RTS LGNKTNPS LEGGRKKKKP S GS 


7467 


2I36S 


A 


7527 


438 


62 


QLSKIHRPPAGGRFFKSGGGFKPFCPVG 
LPPFGKKANQPTPKNPPKKQKPSGGGLS 
PMGKPP * FPCSMPSDSPGSVS FSRGGVA 
PAPKDYCPIV*AALIPSKKKKKSKPG*V 
KKGKPNLYSIYNK 


7468 


21369 


A 


7528 


3 


132 


FYYL * RRGFP VLPRLVLNS WLQV I LLP W 
PPKVLN*AAHHPHG 


7469 


21370 


A 


7529 


2 


628 


FFFLPGWNRVLLMGRGLGEANSREGRG 
PSQPRGMHGQLEVRGGRSQGDGWHGTLS 
PPWGR I YRGAPPTFAAPQAPKPFRQLLP 
M*LPGRPSSCLLPETPALSPLPSAEWGG 
F KLLTGT S ET PG P S G S PS VGS ANAOAAG 
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ATT I VARGHTPCTRGPLDTHAHS SLRWH 
AHTRP PPRPLAHEHGTHTHAHPHTPGAW 
VGEGPFQGWGRA 


7470 


21371 


A 


7530 


63 


205 


GR PTRPRTRGVK I L * Q FEHTKPTPGLPT 
LiIALiTLLLPISPi? ILIIL 


747J 


21372 


A 


7531 


390 


56 


KTFFFFNLIAF7EAKSVLLFLLLFFETG 
SRSSAQPGVQWHHQSSLQPQPL*LKGSS 
CLS LLSN* DYRCAP PCQ * S P CLS LLKCW 
DYKREPPHPAKESHMQQSYLIYSLYIA 


7472 


21373 


A 


7532 


543 


335 


LAAYGGTCLQS * LFSRLRWEDHLSPGVQ 
GCNELQLQHCS PAWATE* DPVFKGEKKE 
RVLVKVFS 


7473 


21374 


A 


7533 


3 


145 


WCDHGSLEPRSPGLKQSSCLSLPSNWDH 
SSEPLHTTSMWNF* IFKFLNLK 


7474 


21375 


A 


7534 


386 


106 


SWVTDEITMEI *ECVLNDNSDTPYQNLW 
DTEKWLRAKFTAIiNACMKKSERAQIRH 
TCVHTHTHTKTHTHTGSKCLLRKYLLLT 
IQQQNIRIN 


7475 


21376 


A 


7535 


398 


2 


SLQPLPPRVKCFSSPSPPRRWDYRGAPP 
RGGKFFFFGKKEFPPVGQGGF* FLPPGV 
LAPPPPQKGRISGGSPRPRPFFFFFKIR 
ISYPI LAKQLKLCFLKKKKKS I QAI KLY 
HHQKNPITKSSVFWESQPRA 


7476 


21377 


A 


7536 


2 


344 


HSKC VPA2 WTMAYYIRYLS I S S LLH I L 
EN1VHFFFHFLY*GPNNLFFFYFAGTRL 
IQRSWK*PMII**VHVQIKREGQIFNK* 
IFCMIGRGG*ITCGQEFETSLVNMVKPC 
LY 


7477 


21378 


A 


7537 


116 


224 


LHTHTHTHTHTHTHTHTQFNKLP *LFIP 
CGHIPRRY 


7478 


21379 


A 


7538 


1 


383 


WHERTHSRI I ILFQGLETLLPLI AF* * L 
LASLANLVLPPT INLLGELFVLVTTFS * 
SNITLLLTGLNILVTALYSLYIFTTTQ* 
GSLTHHINNIKPLFTRENTLMFIHLSPI 
LLLS LNPD I ITG FS S 


7479 


21380 


A 


7539 


31 


332 


DNYLS SYS SSGS WMRELI LAS QG I RWE P 
I VDRTPS HHRTHTHTHTHTHTHTHTHTV 
LFRLGPWTKAREPRGHICGN* ZRNQSP* 
KNPRTRGXNLRTPHGW 


7480 


21381 


A 


7540 


199 


15 


NNVQ I K* Q FEHTKPTP FLPTL I ALTTLLi 
LPISPFILIIL 


7481 


21382 


A 


7541 


337 


157 


AEIVPLNSSLGNRVRLH*MESYSVAQAG 
VQWHDLGSVQPLPPGFKRFSCFSLLSSW 
DSRLYFRCHD 


7482 


21383 


A 


7542 


28 


313 


RARIGSDQCLSAGTQWENKSQISGDFM 
ISTNDFRS*SVTRVGVRWRNHGSL* PWIPG 
LR* SSGFGLPNCWDYRDRNEVWGGXKEV 
TCPTVNLQKSQ 


7483 


21384 


A 


7543 


493 


180 


GGFRGSWLVPRDGGQGVGFCSLPPPPSR 
VKQFFCLRFPRRWGGREGSPRPSYFFFP 
*EKPRFFFLGRWLNFLGQGVGPPWPSQ 
RVGFPGLTPGPRASFFFNLI 


7484 


21385 


A 


7544 


403 


426 


FF+KFFIT7HTLFFFFFFFFFFFFFFFF 

ccccrccT^r add q^p ppcc n rvrivx i?t 
frtrrrrr - Lif^fi^irK^ysr cforr c L 

KFPGQPEKFALPFGGGFFF*GKFPQFFF 

CFLVFFFFFFLFLENGVSLYCPGRSRTP 

gZjKls srllaslrag r tg vsh 


7485 


21386 


A 


7545 


3 


335 


RHYSDYPDGYST*NILSSEDSLISLTAA 
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LLMVFMI *EAFAYRRTGLIA*EPSMNIjE 
*LYGCPPPYHTFEEPGYIKSRLKRKESN 
PPKLVS SQ PHGLHD FFKKKKNRGGP F 


7486 


21387 


A 


7546 


424 


140 


RRVKFVGGGAPQGPPPKRGVLPKIPREK 
VFPPPPG*NGPPGPAFKTPPKKKKNISS 
PPPGNWAPPGGLLKGPPPFFFFFFFFFF 
FFFFFLNPLK 


7487 


21388 


A 


7547 


2 


348 


FCHVAQSGLKLLAS SDPPTLAS QSAR I T 
GMSHHGGLK*GEFLGPSSNKGALLLWEA 
TWLQS FMKSASLFVKPRPTFVYFVFAFV 
GYRB IYFTASSLI KVI QEGLKYTMKCAP 
LTL 


7488 


21389 


A 


7548 


2 


90 


RKQAYTHTNTHTHTHTHTH* KKTGIHAH 
KHTHTHTHTHTLKAEKQI EGGK 


7489 


21390 


A 


7549 


256 


405 


CEL Y SG KEMELVFG L F I FTG VTRLET FY 
KKECS S FFHSQS RGHKLQPS I LNEW WH I 
RNPLGLTHFLLL * KTGVAE YKNS LNWH 
HPF FLS YAVS FLLQGWSFAMLS KLGSNS 
* VQARCLPQSPKVLGhKES PQERTVNVS 
SIRGKKWSWYLDYLFSQGLQGLKLFIRK 
S VHKS S I PRAEG I NCNHQ Y 


7490 


21391 


A 


7550 


2 


362 


ILIMNTLLALLLI I ITF*LPQLNGYVEK 
STPYECGFDPISPARVPLCIKFFLAAIT 
FbLYELE I ALLLPLP * ALQTTNLPLIGM 
SSLLLI I ILALSLAYE*LQKGLD * AELK 
EEQKTLQC 


7491 


21392 


A 


7551 


1 


317 


TTFDNSALLFFWDGGEIEFHSVSQAGVQ 
WHNLCSL* PQPPGFKQKLILPQI VKENV 
SKISSQLLFSRVNINISPSEQCITPPSL 
QLAR IG I F YAAK I HLT KGLRG 


7492 


21393 


A 


7552 


479 


183 


YLFKVHTTASHSNGACTGPTVGDI IVSS 
TL* SMTEQVTMPLASAVSDGTVPSVRTA 
SRGSEQAAESMVSSEHIEILEHAGELVI 
AS PEGQLE VQTV I V 


7493 


21394 


A 


7553 


454 


105 


KPPWGNKIRPPLFPGRKVPPPRF*NPVR 
GS PPKK* FVFPVGRVLGTGVTPLS PFLK 
TTPLLLWDPPLSQPPGGVKPENSL.YPGK 
KRFR* PKFPPCPPAWGTKRKP PLQKKKK 
RQVL 


7494 


21395 


A 


7554 


439 


1 


LLKRCVRKDSPPPQNKIFFFILKXFVFF 
FAPY * VRKFYFLTAHFGKRPPQ I Y I FG P 
PPVFLI FCCFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFNKQVFI 
ERLLCARHCFRPPPEEKTKNKI EAANPN 
SSGEER 


7495 


21396 


A 


7555 


323 


82 


FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFR* *AYL 
VRSPLLFIFVFSYF*LI*HVYHRYSVYI 
ELDVLLTGSKIILI 


7496 . 


21397 


A 


7556 


194 


412 


KMYRTSLGHLVEAKKKKKKKKKKKKKKK 
KKKKKKKKKKKKKKDSRGGW* KKFGGG 
HNTRGVKI I FFFSLGG 


7497 


21398 


A 


7557 


414 


219 


KGFKKLFPPPVF*FFLCPPPIFLKGFP* 
GRKPPPPKNPQGGPPFFFFQTPPPLFFF 
FFFFFFFF 


7498 


21399 


A 


7558 


377 


25 


THVGGVFWGKFFFSRRVFFFFYHLIQIL 
SPPPQKRPPPKGTPEG*IFPLFKEINFF 
FF*DFFFAPPPFFFFFFLFFFFFFFFFF 
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FFFFFFFFFFFFFFFFFFFFFLKVKTGV 
YCVPPIAVILYR 


7499 


21400 


A 


7559 


386 


2 


FFFFSFGEIFPPPGISGFFPPFPL*NFF 
FPLRPLFFLGGGAPFFPPPKKGFFPKIP 
RGVFFPPPLREKFFFSPPP* I WAPPGFF 
LKGPPLFFFFFFFFFFFFFFYRKARTKP 
ICLWGDVSPSKHFQCI 


7500 


21401 


A 


7560 


1 


396 


KICHRILRPIKKVK*EKKKKKKKKKKKK 
KKKKKKKKKKKKKXKKKKKKKXKGGGAS 
LKIPSGGPQLPRGGEKNPPPFLGGEKKP 
PGVFLKKHLFLGGGHFGPPPPKKNSPWG 
EKKNFWGGGGKKKQNIPGGKKIPSHKKK 


7501 


21402 


A 


7561 


1 


390 


NYVFKEFSDKLGAVAYTCNPSTLGG*GL 
WIT*GQEFETSLGNMVKPYLYQKYQKKK 
KKKKKKKKKGGGPFKKTLGGAQFNGGGE 
KKFFFFLGGVKKPPRGFLGKNPFLGGGK 
MGAPPPPKIKALGEKKNF 


7502 


21403 


A 


7562 


406 


182 


FVFFFFFXXXFXXXXFXFXXXF*TXXFF 
FFXXXXFFFFFFFFVCFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFF 


7503 


21404 


A 


7563 


2 


113 


FHFTSKHHFGFEAAA*YWHFVDW*LFL 
YVSIY+*GS 


7504 


21405 


A 


7564 


313 


97 


SPTYPCYIRNSAQSFSLIAKSFCSHPHT 
YHHDPHLEKSLTYCPQSPS * PI IPPVTC 
TYINRWPEATEEPQNK 


7505 


21406 


A 


7565 


3 


144 


DAWVFVATGFHGLHVI IGSTFLTICFIR 
QLIFHFTSKHHFGFEAAA*YWHFVDGV* 
LFLYVSI Y* *GSTFLTICFIRQLIFHFT 
SKHHFGFEAAA 


7506 


21407 


A 


7566 


1 


130 


FIRQLIFHFTSKHHFGFEAAA*YWHFVD 
W*LFLYVSIY**GS 


7507 


21408 


A 


7567 


434 


1 


PDLNKVIFFPHRKKKEPYKCHIVGKNLR 
NEMYSIQDTNSLPRRER*RKTRTCIIFP 
HK I KTS EHS FD I KEKH*NYVTLCKVLRK 
LS TNQRQS * I FLS TNKNDTLGRARWLTP 
AI PALWSAEAGGS * GQEI ET I LANKVKP 
RGRV 


7508 


21409 


A 


7568 


441 


86 


NFSFREKGGEFVPPPPLKI FFFPPPPNF 
FGGGGPPSPPPQKKVFFPKPPRGFFFPP 
PKKKKNFFPPPQKIGPPPSFF*TPPPPF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFL 
MFVCLY 


7509 


21410 


A 


7569 


1 


155 


PTRPDLRSLRFEAKS PNSRRTLHKPGVT 
I *EAFASKRKVLI VEEPSINLE * LYDYG 
FFYYKIKEPVYFYSNQKKE 


7510 


21431 


A 


7570 


434 


1 


SPTEAGGKNFFKKNPGRKTPPKKKKNI F 
FSPLTPKKFFFPPRG* ILGGGGGPNCPP 
PKKGVFPKNPQGVFNTPPKKKKKKFS PP 
GKNGGPPGVFLKGPPPFFFFFFFFFFFF 
FFLRQSHTLLPRLECSGA 


7511 


21412 


A 


7571. 


493 


196 


SSLDCS * SLQGS I * FLTLMMLAVD FVD V 
LY * V* RVLYVLVQVAITKCKI VSGLNNR 
HSFLTAVKARKSKI KVPADLVPGEVSLP 


7512 ■ 


21413 


A 


7572 


1 


239 < 


-KGLTI LDAIKNI CDS * KGVKI L I FSEVW 
KKLVPTLTNDFERFNTSVEEVTAHWET 
ARELELEWPQDVTELPQSKDQT 


7513 


21414 


A 


7573 


391 


32 


FFPQPSPPGVKGFFCLNLPNNWGFRAGP 
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PPPGYFVFLVNKGVFPVGPPGFELPSPS 
*VAPPASQKGGVSGVSPCPRWTVPFFKK 
KKKLRGVEQRCTMHLLVYS KNVPVKPQK 
KSSIGSL 


7514 


21415 


A 


7574 


2 


383 


PRVRSTAFSSIAHITREVNYG* IIRYLH 
ANGAKKKKKKKKKKKKKKRGGALKKKPG 
GGQKKGGGEKKNFFFKGGEKKKPGGI FE 
KKTFFGGGENGAHP PQKKKTPGGKKKI L 
RGKGGKKPLYPWGEK 


7515 


21416 


A 


7575 


402 


69 


KIFFKNSPLWKISHPPTYRGFFPPFPP* 
NFFFLPGPYFFLGGCSHLGPPPKKGFFQ 
KIPPGLFLSPPFKKKIYLFPPRIFLAPP 
GVFLKAPPPFFFFFFFFFFFFFFFFF 


7516 


21417 


A 


7576 


1 


119 


LI FHFTSKHHFGFEAAA* YWHFVNW* L 
FLYVSIY*+GS 


7517 


21418 


A 


7577 


219 


69 


• NDISANTAIKKFWKLPKCPTTEGWLNT* 
LY I HLVE YYATLKVCGDL YVR 


7518 


21419 


A 


757S 


3 


287 


HASAQLNGYIEKSTPYERGFDPISPARV 
PFFIKFLLVAITFLLLELEIALLLPLP* 
ALQTTNLPLIVMPSLLLS 1 1 LALRLAYE 
*LQKGLE*AE 


7519 


21420 


A 


7579 


10 


125 


ALQTTNL P LWNS S LLL 1 1 1 LT* ALQTT 
NLPLWMSSLLLI 1 1 LTLI LAYEGLQKG 
LN*AE 


7520 


21421 


A 


7580 


3 


299 


DAWVRLLDFDF*VELPTEARIRIIITSQ 
DVLHS * ADPTLGLKTDAI PGRLNQTTFT 
ATRPGVYYGQCS E I CGANES FMP I ALEL 
VPLKI FE IGP VFTL 1 


7521 


21422 


A 


7581 


3 


116 


AFASKRTGLGVEEPSLTLE* LGGGPPP* 
HTLEEPVNI 


7522 


21423 


A 


7582 


398 


2 


RVFPAPKNSLQI FPPLFFFWGSPWKFFF 
TPFPHFVFWFGNFKFNPFFAFLFRLPFP 
GEKI PLVFFFFFTD*NFVENFFFFLSLL 
FFFGFFFFFNM* S * LF*SPANFNV* VTH 
LFIFMVFFHPDFWSFGPILW 


7523 


21424 


A 


7583 


288 


9 


RGSKNLEKNSFPYKVNNHGNGSPLFPKL 
FFSPRGVGPFFFFFETRSRSVIQAGVQW 
HDLSSPPGLK*SSHLSIQSSWDY*RAPP 
HPANFVYF 


7524 


21425 


A 


7584 


1 


124 


RDGGFTMLPRPILLTGLKQSSCISLPK* 
WDYRCEPLHLASC 


7525 


21426 


A 


7585 


339 


76 


THPGHNG I S VS P KKKKI PRGGGP P PLF P 
PLPRVGGGDSL* PRKGGFN* TKPAPFPS 
SWGKKKELPFSKKKKKKKRKKGS I STPF 
SRIE 


7526 


21427 


A 


7586 


205 


87 


SISLSLSLSHTKTHTHTHTHTLSHPRCV 
PQPIGLPISKWAKVKELIT*R 


7527 


21428 


A 


7587 


353 


256 


PHPSVTTLVTSQQYQDPPPAKRL * LAEG 
SDDC 


7528 


21429 


A 


7588 


159 


195 


QICRHRWHINFLMSMGSIMITPPSFLF 
LNFL*SYVAQAGLKHLSQSDPPASASQS 
AEIRGWSHNAWPHF*YFVELVHFIYINK 
FVGIELFIDTS 


7529 


21430 


A 


7589 


76 


205 


RKKPEPEKTCFDNITGNTYPCLSTYEHD 
SD*YYVNRT*LCAER 


7530 


23431 


A 


7590 


218 


20 


LSLYLASHLWLKPPHPLEWAPSKSSLRL 
GTVASPCNPSTLGGRGGWVT*GQESETS 
LANMAKPCFY 
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7531 


21432 


A 


7591 


142 


12 


QIDYYYYLFI*RKGLTLMLRLVSNSWLH 
I I LLP WPPKVLGLQM 


7532 


21433 


A 


7592 


2 


224 


QIIEALLITILLGHYFTLLQA*EYSKAP 
FTISDGIYGSTFFVTTGFHGLHGIIGST 
FLT I C YN PQL I FHFT S EH K FG LQ AAP * Y 
WLFVDVARLFVPASIY**GSLFYPPTSL 
RVLQGSLHHFRRDLRLNIFCNHRLPRTP 
RHYWLNVPHYLLQPPTNISLYLRT 


7533 


21434 


A 


7593 


1 


323 


HAYGNTLVA*LVVTMTF*LPLVDGYIKK 
FTP YBGGFEPKS AARVLFS I KVFIiVAI T 
FLLSDLQS ALLLPLP * AVQTTNLPLI VM 
AALLLDI I LAL S LA YA* LLKGLE 


7534 


21435 


A 


7594 


1 


104 


RACVCVCGVCVCVLCLLKMCVCVCVVCV 
CVFCAC* KMCVCVCWCVCVFCAC 


7535 


21436 


A 


7595 


383 


43 


VFISETLCEHVAWSQTTIESLGLGTVAH 
TLNAS TLRG* GRM I A * GQ * S KTRRGNMB 
SISSHSQTIAQSPHLLGRHQPKWCMCVQ 
AAEAVLKGQVETDASFQIQKIPINSLIN 
L 


7536 


21437 

- 


A 


7596 


2 


331 


W P PTG I TP LNPLEG P LLNTGGLLA* G VS 
MT*AHHSLGENNRNQVIQALVITILLGL 
Y FTLLRAS E YFE S P FT ISDGVDRS TF F V 
TTGFHGLHVI IGSTLLSIWFIRQLICNF 
T S KHH FGFKAVA * YWLNFTKYLVHPPTN 
M 


7537 


21438 


A 


7597 


384 


23 


LFKTKQYRFLLPPFPLKIFFFPLKA*IF 
LGGFSHYFPPPKKGFPPKSPPWF*IPPP 
YRKKHFCSHTPKNLAPRWFF*KRPPHFF 
IFFF* *RWGLAMLPRLFSNS *AQAVLLL 
QSPEVLGL 


7538 


21439 


A 


7598 


401 


189 


RVLPCFAGWS * TPGLKCPTCLGL* KCWD 
YRREPPCLIRFYLFVYFLRQGLAMLPSL 
VLNSWP*VTLLP* SP 


7539 


21440 


A 


7599 


81 


368 


KC VI YP FLS I TLG KAK YDFF FFLEKELA 
FFPQGEMQGGNLG*LKPPPLR*RDFSCL 
ALPRGGNGRGAPPS PTNFGFLGGNGVYP 
SGPGGFETPDLK 


7540 


21441 


A 


7600 


264 


25 


AVEHPQLRLFCSHY*NNIVEERLGVLAR 
TCNPCPLEDKAAW I T * RHEFETSLANMV 
IFCLDIGSKSFLSMRFSRCLLWKL 


7541 


21442 


A 


7601 


1 


196 


WEPSLVGETNVNSFNQKYINWPGAVAHT 
YNAGTLGGQGGWI T * GQELE ITLS KKVK 
PSTYFTNTR 


7542 


21443 


A 


7602 


315 


82 


TCTQVFIALFITAKKFKLICPLTDNWIS 
RWYVHTMKHISAI KRKAVIHATMWVKL 
ENI ILSE I KQSQKATGCMIS FI 


7543 


21444 


A 


7603 


334 


118 


PKIKTEGGPPPKKNAGGMYK*KKFILVL 
AAPPHPP FGVFS VFS F Y3FW FFRFFFFF 
FFFFFFFFFFFFFFLR 


7544 


21445 


A 


7604 


3 


288 


DAYTT*WILSSVGSFISLTAVILIIFMI 
*EASSSSSSPPGGLRPRAFPVRPPGAGL 
VF*VSVPPDLPQCSLAGLQVLRFMELAR 
LAPHIRWIIQ 


7545 


21446 


A 


7605 


354 


33 


KKKFS FS KKKKKKNQLWLGAVSHANNLS 
TKGG*GRGSRFT*A* EFRTSQKLGTKGN 
MVKPISTNTFKNERGGASLSQL 


7546 


21447 


A 


7606 


401 


161 


YL VSHLLQ 1 1 L.PPALF VVT FFF * DR VLL 
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E=GIutamic Acid, F=Phenylalanine, 
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K=Lysine, LHLeucine, M=Methionine, 
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CGPGWSAWWSYCSLCLLGSNSSPCQSL 
P S S W D YRU LLWR FLM P RG I L C VYV 


7547 


21448 


A 


7607 


354 


50 


SGAQDPAGQHGQTPFSQKVPKKVYPAWW 
CMWPATQEA* LKKSLEPWRRRFLLREA 
FFVYLGVWIjSNGEPPPLGGGGSVFLEAIi 
EKSVGFSSLGFGWVFFF 


7548 


21449 


A 


7608 


370 


115 


T I KNI CNS WEVKI PRFTGVWKKL I PATIi 
MDDLEGFKTLVEEPTEDW* ITGELEVE 
PENVI EFLQSHDQTSTDEGLLLMDEQRR 
W 


7549 


21450 


A 


7609 


120 


358 


TTAVYLGHYCQSP* ILNGTVNT*PPWH 
KDP I L FLT? FPC FHAS SAMNLQLSH ISC 
NSKATPHPIiGYQQTYPPLTVHMT 


7550 


21451 


A 


7610 


3 


164 


HE* AS IP*NTHTHTHTHTHTHTHTHFYK 
EENLSHLWNISCIFLGAHKGKKKMN 


7551 


21452 


A 


7611 


85 


324 


FRFFAFFFF*KKI S FCPQAGGQGGILPS 
LDPPPPGLKKFSGLTPPRSWN*GPCPPP 
G L I G VFL KKRG F PL VGREG PNI>WT 


7552 


21453 


A 


7612 


268 


317 


F LEN C FNP GGGG CS Q P KLG P C P P AWGAK 
* ASLS KKKKNKNKN * LS YKAPS YGYKGK 


7553 


21454 


A 


7613 


101 


339 


AVPLTM V K I HALW KRVWRFHN KS KL * L P 
CDPAISLVSMYPKEMKSVCLKDVCSPRL 
ITGPLTIAGMWNPPNRSSMDYSG 


7554 


21455 


A 


7614 


366 


208 


LELLTSGDLPASASQSAGITGISHRARP 
G TLFF* A VNGGGNQV FLFLRVLNG L 


7555 


21456 


A 


7615 


30 


328 


N Y CLD FHGE TELGTTH FLTHP S TRAMVF 
E ? YCQAL * I L* GVI NT * PTVLHRNRSH I 
KAACLCLQARTWNLQISHINWKY*TTP 
HPL*YQQTYPALEYI 


7556 


21457 


A 


7616 


39 


159 


TPGLKQFFHLGLPRCWDYRHEPPRPCPN 
MYLI*ISF*CV 


7557 


21458 


A 


7617 


290 


91 


KMSFL I FFRDMAS LCC PGWT * TPGLKQS 
S C FHL PRS WDHRCT S LHLAVKTFLYY FI» 
KMFYYQNVWH 


7558 


21459 


A 


7618 


37 


246 


TQQELVY I YR YI DTHTHTHTHTHTHTHS 
HTHYWIYTWKHTLKGPKLF*EKTKMGST 
P I I SWKERLSLYRS 


7559 


21460 


A 


7619 


23 


190 


IPPQPCPFLLSLFCT*IDMSCNMCACVC 
VCVCVYVCVI CDMVLLPFCFCKLECSK 


7560, 


21461 


A 


7620 


367 


2 


FFSPKALIFLGGGGPIFPPPKKSFFSKI 
PPGGFFPLPGFLKPPPPPLF*NPPLKKK 
NFFFPPPENWAPPRVFFKGPPPFFFFFF 
FFFFFFFWSSRFLIFPQFFFLNNKIGKK 
KIFRHLGAG 


7561 


21462 


A 


7621 


6 


25 


S VI LQS I I I FI ATTNLLGLLPHSLTPTT 
QLSINLAMAIPL*AGAEIIGFRSKIKNA 
LAHFLPQGTPTPLIPILVIIETISLLIQ 
PIALAERLTANITAGHLLMHLIGSTTLA 
ISTI*LLYYNP 


7562 


21463 


A 


7622 


383 


67 


AFLNPGKKPFPPGFKRFFCPPPPKKWGK 
NRGPPPRGKI FFFFFFVKTKFFHVGPGG 
F* ILTSGDPPGQQGGLGGFFFFFQIGGT 
GDLNLSHKS PMPFT INTHNKG 


7563 


O 1 AHA 


A 


IOZ5 


51/. 




TnnPMPXfV "P PP PPTfOfS T? P RK c ?TCiR'K! F P 
X y*\j v r r " r f r J\\^\j i. t c rvo ± v?oxn uo c 

KNPCPKGPRVSPPPPPF*NRGPQGGPPP 

PGXKFPFFFFFFFIETGFPHVALAGLEL 

PGSGNLPTSVSQSAGIPGISTWPNGVFQ 


7564 


21465 


A 


7624 


287 


6 


LSGVEWVDHMINICLFFKKLPPCV*KWI. 
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£=Glutamic Acid, F=Phenylalanine, 
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N=Asparagine, P=Proline, 
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nucleotide deletion, \=possibfe 
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YQPVAVAHACNPSTLGGRDRWIPGGQEF 
KTTCLSLPKRWAHRREPLY1AFCPFYNG 
IGCLLIIGF 


7565 


21466 


A 


7625 


271 


277 


NRGKGKTGSLGILEKNPFFGGGFFWANP 
PP*GLKFWGGVGPKRPPPKKRVFFQNSQ 
GTSFPLPPVLKSGPGLVLKKAPQKBKNI 
TFQPPGKFGPPRGSLKRAPPFFFFFFFS 
LRKSSVIEK 


7566 


21467 


A 


7626 


2 


269 


LLGELS VLVC * F PPKHTTFLLTGLS I LF 
TGLYSLYILTTTQWGSLTHHI INLKPLF 
TPKNTLMFIHLYPILLLSLNPDIIKKKK 
KGGAF 


7567 


21468 


A 


7627 


397 


164 


fpppkkvffqknpkkkfspprvf*tppp 
ppfflpppkkkkfffsppp*iwpppgif 
*kppplffffffffffffffk 


7568 


21469 


A 


7628 


3 


324 


DGMNVSTL YGATGSHGLHD I MGSALLTI 

wyirqlilkftskhhfgfeaaa*ywhfv 

DVD * LFLCGS I Y * * GS PGG P KFSRGGKG 
IAVPHWGDD 


7569 


21470 


A 


7629 


2 


246 


SGLGTTLS* KN I SVLLTGLK I LVTGLY S 
VY I FTATQWGS LTHHINKI KP * LTRENT 
LMFIHLCP I LLLSLNPD IS YPGFTS 


7570 


21471 


A 


7630 


112 


290 


LISGKGQWTQMR!iPVLVTLFTDSFSK*K 
NPSTLGEQCGRIT*GQELETSLGNIVRP 
HLY 


7571 


21472 


A 


7631 


248 


2 


KVP ASG KR PEP HW Y P YHAVP GCGR I PAT 
AIWTQKWYQGFPPALSPRLEC3GAISAH 
CYFGFPGPSDS PAS PS * LAGTTGTRP 


7572 


21473 


A 


7632 


383 


106 


GWGQCFKPAI PALERFNPQECFRPGVLN 
QP*PQNKSPFFFFFFKKKKKYIYICMVV 
HAYWPSYSGG*GGRIT*AQKFKTSLNNT 
VRPHLYKK 


7573 


21474 


A 


7633 


435 


133 


YACLGLPKCWDSRHEPPGLANFCIFSKN 
RGFPISDRWINSWPRVFG* PKPPKMGG 
FQTLVQGAS LG FLLI FLKTG F P FFSQHE 
EQGWDTSSFQHRFFRGK 


7574 


21475 


A 


7634 


1 


209 


NAYRIVILCQKLFPLiSGKMNFKKTRCW 
LGMVAHACS PSTI J GG*GGRI A* AQEFGT 
SLGS ILRPCHCKK 


7575 


21476 


A 


7635 


117 


495 


STSFIDKVQLRHSFLFFSLFFETKSCSL 
TQTGVQWHDFGSL* PPPPRFK* FSHLGP 
LSS WDYRHAP PRPAS FFVFLVEMRFHHV 
GQAGLELLTSGDPPASASQSAG I TGMSH 
HALAHLYFSSGKS 


7576 


21477 


A 


7636 


1 


241 


LPGSSDSPTSASQVAGITGMCHHAQLFF 
FLAF*LIFNF* FFFGNSFFFVPQAGGHW 
GIGG*LNPPPPGLKRFSCLTLWGG 


1511 


21478 


A 


7637 


3 


93 


EAEAGES LEPGDRGCSKLRS CH YTPAWQ 
QSVTLVSKKETKNTKTSEVP FGVLNLLY 
KVS INILIFKE I FYQAPLILV* DNPDSS 
TLSSALQPGQQSKTVSNKTKQKTVS* PG 
NRV 


7578 


21479 


A 


7638 


46 


228 


GIPHIQPSKSVFNNVHGPGTVAHTRNPN 
TLEY*GGbl 1 *oHEFETSLGNIVRLPPP 
SLQSI 


7579 


21480 


A 


7639 


1212 


1023 


KNHSNEWIKKM*YICTMEY*AIRRNEIM 
AFAAT*MKLETI ILSEVTQEWKTKYCMF 
SLVSGS 

. — . i 
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Predict- 
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nucle- 
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peptide 

sequence 


Amino acid sequence <A=Alantne 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine. H=Histidine, ]=Isoieucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
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X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


7580 


21481 


A 


7640 


479 


318 


AEFLNLGDGGCSEPR*HHCTPAWATRAK 
FHLKRKKK 


7581 


21482 


A 


7641 


470 


172 


RNIRPRENQYPVFMVAFFLIAKNW*QPK 
CPSSEEAMNKLRCIHTTAVLHSSEEEQP 
TDTKNNS DLRS I APS WRSQTRAAAHRMI 
PWHSGKGKTPSTENA 


7582 


21483 


A 


7642 


2 * 


201 


AGAPPPAWLPPCRLISDC*ASNQRDSVG 
VGPSEPYATVRKYLELLLCQMHKDMCTY 
RFSIRIVLNL 


7583 


21484 


A 


7643 


521 


385 


GGSPLETH*YTSQGGGIPYTDLTGHHPS 
QGRIQEAPKLTHL 


7584 


21485 


A 


7644 


331 


3 


DLVPKKGGGGKKTTPRIKKFPPKGEDFF 
FFTPHYFWFWVFFWGGAFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFT*DYYIS 
F*EERFNKFYRCQSTLVQIGHrSNL 


7585 


21486 


A 


7645 


88 


367 


HCS FLLLGKKKKKKKKKKKKKKKKKKKK 
KKKKTKKNEYKNLLGGAPP * KPPSGALY 
KYRRENFIPFSFERVSKYPPRDLCKKNL 
FWGGLNIPS 


7586 


21487 


A 


7646 


378 


37 


FFFPXFFF FFS P P PX FFFLFXFFXX PFF 
XXSFFSLPPLPLXXPPPPPPLFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFNI*TI 
VL 


7587 


21488 


A 


7647 


34 


293 


TGI PGVI S AI FM I * EAFVSKRKVLIGKK 
PSINLE*LYGCPPPYHTFEEPVYIKSTQ 
KRKESNPPKLGSSQPHGLHDFFKEKKKI 
IK 


7588 


21489 


A 


764S 


22 


106 


KNFLPLTLTLL* KNFLPLTLTLLI * YGS 
IPITISSIPPQT 


7589 


21490 


A 


7649 


1 


231 


FDRTI SKDISK* MVLVSKKKKKKQKKKK 
KKKKKKKKKKKKGGG P FKKT PGGAP KNP 
GEEKI IFFFLGGGKKNTWGFF 


7590 


21491 


A 


7650 


79 


286 


WSRCLVSLRNPCI IKTCS *AKKKKKKK 
KKKKKKKKKKKWGGAFLKI P WGGPI FPG 
GGKFFFFFFGGGY 


7591 


21492 


A 


7651 


57 • 


375 


SGKSEFHRVPQWPGTGADACNPSTLGGR 
GRW I T *GQEFETS PQGDPI STKKKKKKR 
GALL KDS LGG PNL PG FGNL KVFS FRGG I 
LKPTWEFWEGTFILGGEKIGPN 


7592 


21493 


A 


7652 


3 


357 


LAFLLALSKITHASIPIPVSSPSKSPRS 
KGTKKKKKKKKKK K KRGGPLKKTLGGP K 
INGGKKKKIFFFKGGEKKTPGGILEKKL 
FLGGGKMGPNPPKKIKPLGEKKNF*GEK 
GEKKP 


7593 


21494 


A 


7653 


113 


378 


MGAFNLGP FLLMGGGNPSG PNGHWVLGA 
GSYLGPGKTPFKKSQKGGPPPDL*GRGG 
QCSPGTKPGGGGEKTGLAPPFTKSPLGF 
LQKK 


7594 


21495 


A 


7654 


341 


147 


NFFFSLKPFIFFGGFCPIFPPPKKSFFS 
KIPPWFFSPPF*EKIFFFPPPLNFAPP 
RVFFKGPP 


7595 


21496 


A 


7655 


395 


3 


FFFFFFFFFFFFRGKSE*FVLFPVSPAP 
SLRG 


7596 


21497 


A 


7656 


2 


162 


ES KGYTHTHTHTH I YTHI YTHTYTPREH 
+ PKKADVAMLILNKLFFKPRCVALL 


7597 


21498 


A 


7657 


333 


28 


LEFFFNFFFSYFLFFS*RVWGPERPPPP 
LKKAV+KFFLFDIFLFFFFFIFGLGFFF 
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FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFLEHIRLLMFVTFSSHR 


7598 


21499 


A 


7658 


1 


137 


HS S LGDRVR PCLKKKKKKKE KKKKKKKK 
KKKKKKKKKKKKGGGLF*EKKKKKKKKK 
KKKKKKKKKKKGGGLF 


7599 


21500 


A 


7659 


236 


32 


1 1 FL IKFLYF* FI FI * F* FFLTFFYFKR 
LKILFFFFFFFFFFFFFFFFFF* KNFI K 
NIIYFFI FYLFYYLKFFIYIKTFLFLKI 


7600 


21501 


A 


7660 


301 


65 


KAPPKIFWFFFFVFFFFFVSFYIFFFFF 
FLFFFFFFFFFFFFFFFFFFFFFFFWKS 
LAAT * FLRG FETF * LAY I LKTT 


7601 


21502 


A 


7661 


70 


273 


KHSPPHKPASD*NTELTIKKKKKKKKKK 
KKKKKKKKKKKKKKKKKGGGALI KNPWG 
GPIHRGGEKFFFSFLRRD 


7602 


21503 


A 


7662 


1 


353 


ILIINTLLALLLI IITFGLPQLSGCLQK 
STPYEGGSDPISPARGRFSLKGPLQAMT 
FLL* ELE I ALL* PLP * ALQTTNLPLI DM 
GCLLLNIILCLSLAYE*LP*GLDCSRYH 
SQRIQ 


7603 


21504 


A 


7663 




131 


FIRQLIFHFTSKHHFGFEAAA*YWHFVD 
VA*LFLYVSIY**GS 


7604 


21505 


A 


7664 


2 


192 


PLSQSLY* LLAADLLILT* IGGQPVSYP 
FTI IGQVASVLYFTTI LILI PTI SLIEN 
KILKWAY 


7605 


21506 


A 


7665 


356 


2 


TFFFLSSPGGSCGSPTPARPKKNPPPWG 
GGLPLFSRRGALFPKNFFWGGYPFFFLF 
*KKPPFFP?PGPQSPVTSPKDWPPLRM 
P PPR PHVR PLGL PKKS FS S P R WE KQ VKK 
TKKRAA 


7606 


21507 


A 


7666 


312 


2 


GFFFFFFFFFFFL*DRVLLLLPRLECNG 
VISAH 


7607 


21508 


A 


7667 


1 


233 


FWLSSNAE FDSANSCE CLBVQRM IQDQD 
SFPTYHYFDMYVCI YV* *RSLAVSPGLV 
SNSWPQAILQPQPPQSLGLQE 


7608 


21509 


A 


7668 


174 


295 


I FFLPFCGEHSLAVLPRLVLNSWTQAIL 
PPLPP*ILGLQA 


7609 


21510 


A 


7669 


183 


330 


NKFKLYWSPVAMAHACNPRTHRGGWIT* 
CEEFETTLANMVNMVKPCLY 


7610 


21511 


A 


7670 


3 


319 


TSNTLLALLLI 1 1 TF * LPQLNGY I EKST 
PYESGFDPISPARVPFYIKFFLVAITFL 
LFDLE IALLLPLP*AVQTTNLPLIVMSS 
LLLI I ILPLALAYE * LQKGLD 


7611 


21512 


A 


7671 


1 


313 


ARGERERERERERERERERBRESGGGGP 
TQTDCKGGRNT*RGREI YRESE* DDRPP 
FLPTYRVNLQRPVGLRRLKGAGDKTFCL 
I LTLARDYVWPD YRMKRADHM 


7612 


21513 


A 


7672 


3 


91 


TRRERERDR*REREREREREIFREKNSQ 
S 


7613 


21514 


A 


7673 


1 


115 


DELANLFII *KAGFPVLPRLVLNSWLQV 
ILLPWPPRLN 


7614 


21515 


A 


7674 


3 


232 


TRRERERERERERERERERERERERERE 
RYRE* DRERERERERDCVWGGGV I SLS R 
AVALSGAPAVGRQTRERISR 


7615 


21516 


A 


7675 


1 


234 


ARGERERERERERERERERERERERERE 
REIERERERLTQRERERERGRGVWHAPL 
AIERG*KPQGGFWWKEERVKKEPLGKLL 


7616 


21517 


A 


7676 


I 


262 


ARGERERERERERERERERERERERERE 
RERERERERERERERARAPQNL* EEWAH 
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nucleotide deletion, \=possib!e 
nucleotide insertion 














TPPAPGVSLSWREHTRGGPPWGKKEGGW 
GRLGHPPPSILRAKGG 


7617 


21518 


A 


7677 


214 


2 


QQASVAQAGVKWPDLCSLQSMPPRFK* F 
SCLSLLSSWDYGPPPPCAGLHIWRRNVT 
LALQGLPTTSLALV 


7618 


21519 


A 


7678 


3 


341 


HERHELEELI KTLFFFFFFRKKNHLCKP 
GGKKRGPIWFLKKFFSPPQKKPPPPPPF 
LREIGAMAPQPGGGLLI*NKTPFRPRGQ 
GGPKPPTPGKPNPPPPPKGNFPL*GPNP 
QPFGSPTPPPPQKVI FPFEGHQA 


7619 


21520 


A 


7679 




357 


GTR I NTLLALLL III TF * L PQLTG Y I KK 
STP YECGFDPI S PARVPFS I KFFLVAIT 
FLLLDLEIALLLPLP*ALQTTNLPLIDM 
SSLLLI I I LTLSLAYE*LQKGLD*TDFY 
SVGGEAN 


7620 


21521 


A 


7680 


365 


1 


PDAS Q * HCTNGQSNRLLNLL I KALLTTA 
KIWIQPKCPPVDARIKKMQ* IPRMEHPS 
AI KKE I LPS VATWMS LEDT VP SRI SQ AQ 
KDKYHTVS CMWNLRMLNAE P EGRP WPR 
GVQGWGAARA 


7621 


21522 


A 


7681 


387 


3 


F FFKKE FCPRKKAKK* NWG PGNLGPRG * 
KNFP PPPPQKRGKKGGPPQGGE I LGFL * 
KKGFPPMGRGGSKLWPLGFSPLNPPKRG 
GKKGEPPPPPQRGKPGGPPPKGIiFFFFF 
FFLRQSLTIiSFRLLV 


7622 


21523 


A 


7682 


107 


290 


ELNKRWGLGAVAHACNPST * * ELNKRWG 
LGAVAHACNPSTLGG * GRW I T * ALEFKI 
NLDNKARSHLKI KKQHQQQKNSGLDA 


7623 


21524 


A 


7683 


1 


348 


GTRERKPTWLWYHRERGMERDCSCVPGS 
SGISESRVWVQVGMEVYPAI I YLCLEPK 
YLVISEFSPAFRIWFLGLSWAHTCNPS 
NLGGQGGRIT*SQEFKTNLANIVRPHLF 
FFLF 


7624 


21525 


A 


7684 


364 


77 


GGPS I FPRLALLGGGGPGGPPLPRGNFS 
FFNFFVKKGVFFPPGVFPFLPPGVLPPP 
PPFWAPG* PRPPNFFLGAPGFFPFFFLG 
GGFFPVAPGGV 


7625 


21526 


A 


7685 


3 


24 


HEET 1 1 QVKREPVE * KKI FANPTFNKGL 
TSEI YKKQLNRKKPNNPI *KQSSK 


7626 


21527 


A 


7686 


1 


125 


GTRPGMPGTYSNYPDTYTA* KI I SS IGS 
FISKTAFQPGPWE 


7627 


21528 


A 


7687 


459 


298 


LSLPSSWNHMCALPHPASFFL*RQGLAL 
LPRLVSKSWAQVMLLPWPSKVLGLQV 


7628 


21529 


A 


7688 


374 


248 


FPHHNVHEVIVRPLSPRYPIISHVTCTY 
RSRWP*ASEGSQKK 


7629 


21530 


A 


7689 


284 


163 


GTVAHTCNLSTLRGRGGWIT*GQEIDTS 
P AWPP KVLGLQA 


7630 


21531 


A 


7690 


83 


255 


KVDYVSIKSEFF*DRVSLYHPS*GTVAR 
S*LTTTPNPGIKQSTCPTVPSSWDYRHV 
LPLPTRTI FS VPTQSLATIL I * I TLNLY 
INLGKIHFL 


7631 


21532 


A 


7691 


3 


193 


HERLDPADFS FNFFS VDVGLAMFPRL.VL 
NS * HQVI LLDWPLTVLD YRHELLR PAWG 
FYYLNFT 


7632 


21533 


A 


7692 


1 


364 


NFKSFFP* LFYLYHLLFFGFGLFLFFLI 
KLGSSLFINFSKKSVL*FTNWIYFSVLY 
FTDFCFIFI ISFFGFSLI YFI IAVFPLK 
FIGFILSFI * ITVFYGOAOWLTPVI P.TX, 
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NO: of 
nucleotide 
sequence 
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beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Pbenylalanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y-Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














WVAEVRGSL 


7633 


21534 


A 


7693 


370 


151 


GKKYSVASPGKLWPPQGFFKTAPPFFFF 
FFLK*RCGFTMLPRLVLNPWPQGILPPQ 
PPKVLGLQVLRHHI WLR 


7634 


21535 


A 


7694 


3 


373 


W PG YTLNQAYAK I H FT 1 1 F I GANLTF F P 
QHFLGLSGMPRRYSDYPDAYTT*NILSS 
AGS FISITAVILI I FMI *EAFASKRKVL 
IVEEPSQSAGITGVSHCARAEYLFIDRR 
DGLSLCWPGWT 


7635 


21536 


A 


7695 


406 


394 


KFF*KKFLFPPTPTFLLEACLFFFLPPI 
KGFLFKYIAPGIKTPPPKKEKFFSSLKV 
LFSPPYFFFIPPPPIFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFRFEG 
ECWRL*W*FYFYFFKESPHCIP 


7636 


21537 


A 


7696 


2 


334 


ILINWTFLALLLLIITFWVPQLKGYIKK 
STPYECGFDPISPARVPFSIKFFLAAIT 
FLL FDLE I ALLLPLP * ALQTTNLPL I VM 
ASLLLI IILALSLAYE*LQKGLD*AB 


7637 


21538 


A 


7697 


2 


95 


LNLTI Y 1 1 LTTT AF LLLNLN S S TTTLLL 
SRT * NKLT* LTI Y 1 1 LTTTAFLLLNLNS 
STTTLLLSRT 


7638 


21539 


A 


7698 


423 


326 


RQCLTMLPRLVSNS * SQVILLPWPLKVL 
GSQA 


7639 


21540 


A 


7699 


400 


32 


FHKEYESYFFSPNQPQFFFFFPHR*NFY 
VGVIiKKQPPKKKFFFLLFTPERFFFPFL 
*KKNIFFFP*YFFFPLVIFFL*PPPLFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFSSWSFPFY 


7640 


21541 


A 


7700 


125 


407 


FINFSSTFWKPTTCNMQKHTPIARTKD 
LCITIFFFFLEKNFLFVPQVGGQGGILG 
* LKI LLRG* KQFSCFTLPGRWTNGGI PP 
PPKNFLKKFF 


7641 


21542 


A 


7701 


50 


.225 


P WNM VKMS CWLGKVAH VCNP S TLGGRGR 
RIA*AQEFETS*VTQ*DPVSVIKKRKKN 
FK 


7642 


21543 


A 


7702 


2 


146 


EIALLLPLP*ALQTTNLPLIVMSSLLLI 
I ILALSLAYE* LQKGLD*AE 


7643 


21544 


A 


7703 


1 


93 


ILI INTLLALLLL IMTMGLPQPKGYIKK 
STP * ECGFDP I FPARVPFS I KI FLIAI T 
FLLFDLETALLLPLP * ALQTTNLPLIVM 
ASLLLI I ILSLSLAYE+LQKGLD*ALLL 
LIMTMGLPQPKGYIKKSTP 


7644 


21545 


A 


7704 


2 


112 


GRVGKHHFGFEAAA* Y WHFVDVG* LFLY 
VSIY**GS 


7645 


21546 


A 


7705 


1 


325 


TAGQFLPKLSILLSYNPAITFLGIYPKI 
LKTYVYMKTCTWMFIAALFIWQTWKQQ 
S KLW Y IQT I KY YS VLK * NELS S YENTSK 
KLRCILLRERSQYKKPPYFLIPTM 


7646 


2 J 547 


A 


7706 


2 


169 


SRSRAGTLAI *TINLPCTLMIFTILIVL 
TI LE I AGALIQAYVFTLVVSLYLHDST 


7647 


21548 


A 


7707 


263 


346 


MLINVPLGLLFVGVILSKESPSVDQGGV 
QWINLFSMQPPPTGFK* FSCLKA* ASDN 
LSPHEQYRLALSFLKLTL 


7648 


21549 


A 


7708 


343 


163 


PKEF* *RQGFFHVGQAGFKLLRSGNPPP 
SASQNGKITGVNPLAWQTNNS I PPMAP 


7649 


21550 


A 


7709 


459 


J 44 


FWPGASSHAFDPTTLGGRGGR IA * AHEF 
KTSLGNIVRPPSDTCNPKVLGLLA 


7650 


21551 


A 


7710 


2 


83 1 


NFLPLTLALLI * YVS IPITISS IPPQT 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 
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NO: in 
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26 
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beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaroic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I-Isoleucine, 
K=Lysine, LHLeucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X^unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


7651 


21552 


A 


7711 


19 


181 


KYVHTKACT * MF I S SLF I I T KN WKQP I F 
SSIGE* VNKT* Y IHTMBYLLFRNKRK 


7652 


21553 


A 


7712 


392 


49 


SDI FNS DI FSHY * CSVF * S YLAyNR YLS 
FL*LFLEFIYSYP*IDKAFHCFLHPVSS 
FWIjDFFLjFI fsfflfs yffsyfyf* fyf 
*FIFNFIFNLFFILFYF*FSFFSKCLKF 
PC 


7653 


21554 


A 


7713 


334 


208 


FFF*EWVLTTLPRLVLKPQAPAILLPWP 
PKVLG FTGVSH I KI 


7654 


21555 


A 


7714 


2 


408 


WNGME WNGL KWNRLER YGMQ WNGLEWNR 
TE * TRMVWNAME WTR I ELNGLE WNGME C 
NGMHLNGLEWNGMERNGTDPEWNGTDPE 
WNGTDPEWNGTEWNGMEWNEVDSNGMEW 
NGI DSNEMGWTRMSWNGMESTRV 


7655 


21556 


A 


7715 


362 


223 


VWWFMPIIPTLWEAEVGG*LEPRSSKPA 
W ATWQDAI FKKKFKHQr. P 


7656 


21557 


A 


7716 


2 


335 


LIVI INTLLALLLI I ITF* LPQLNGY IE 
KSTP YECGFDP I S P ARVP FS I KI FLVAI 
TFLLFDIiEIALLLPLP*ALQTTNLPIiIA 
MSSLLLI IILAIiSLAYE*LQKGIiD*AE 


7657 


21558 


A 


7717 


2 


142 


SITL*LPQLNGYIEKTTPYECGFEPICP 
ARVPLS I KFFLEAITFLLFDLE I ALLLP 
LP* ALQTDDLPLIVMTSLLLI I ILTLSL 
AYE * LQ KGSD * A YM PR PR P F VHKI LLRS 
YYLLII 


7658 


21559 


A 


7718 


6 


92 


FFEAAA* YWHFVNVV*LFLYVSI Y* *GS 


7659 


21560 


A 


7719 


417 


164 


PLEPL I S GRGL PQI AP PPKKGS PPKS PR 
WFFFPPP*KKKNYFPPPPKILPPPGFF* 
NPPPPFFFFFFFFFFFIFFYFFKTALPL 


7660 


21561 


A 


7720 


25 


401 


TENS * D PNWD * I PHYA* P * TSTVKKKKK 
KKKKKKKKKKKKKXGGGLI KKFRGGPKY 
TGGGKI I FFFFMGGKKKPLGDFLKKNFF 
LGGGNLGKHPPKKLSLQKKKKNFKGRGG 
KKTPLCRRGKKFS 


7661 


21562 


A 


7721 


388 


2 


APPFFFYFFFFI FFFYFFLGVGFSFK *N 
SKVFWISNFLKKILRVFICSLEKGINPF 
LEKCFYMFFFFVAFIIRLNFSLQAKFFL 
LNFALLPFPEIFFFFPFFFFFPQIEDFW 
HPYIEQVYQLSLFSLS 


7662 


21563 


A 


7722 


389 


71 


FFLPHQKQVFFPPPPFKIFFFFSRVFFF 
WGGWAQKPPPPKKFFFLKTPPGFFLPPP 
♦KKKFFFFSPRFFLPPPGFFFKPPPPNF 
FFFFFFFFFFFFFFFFFFFFLN 


7663 


21564 


A 


7723 


77 


220 


T P RRG WAY WLM PAY PA I L EA * AGGLLE P 
KSLRSTWPTWRNS ISPKRK 


7664 


21565 


A 


7724 


16 


258 


ISSFVFKRRSFAMFPRLVLNSWPRWLP 
PRP PE * LGLQACTTTFSFNFFDG I FLRP 
LKED VTARDQRE I LYVFAL FYYGGA 


7665 


21566 


A 


7725 


353 


111 


IFGAGPRFVFKAGGQGANSRSLKPFPPG 
NKwSFPFSPSKKVJEGRG*PPRPPFFFFF 
FFIREGVSLCCQGWSQIPGLKQSS 


7666 


21567 


A 


7726 


2 


185 


MSMGHTRLSSAWTGKPPLS VEDD FEKL I 
WE I SGG KLEAE I DLDPGKDEDDLLLELS 

*MTnT 


7667 


21568 


A 


7727 


346 


143 


SQAFLYLESFLLLLLLY* FLFFEMESHS 
VTQDRVQWRHLSSLQPLPPRFQQPSRLT 
LPS S WEAEVGG 


7668 


21569 


A 


7728 


411 


221 


NLRPPGSSYSPASAS* IAGIPGARHHTO 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
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sequence 


M 
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09/515,1 
26 


Predicted 

beginning 

nuc\eotide 

location 

correspond 

ing to ftrst 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, EHHistidine, 1-Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Prolme, 
Q=Glutaraine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
YV=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possib!e 
nucleotide deletion, \=possible 
nucleotide insertion 














LIFAFLAEILKNLAKLYLIPRKPNIYYL 
PNCYINI 


7669 


21570 


A 


7729 


2 


303 


AAAS TNM LL F FLG RH L WE WLGHM VGL I 
FTFKKLPNQAGRRGSHL* FQHFGRQRWE 
DHLRSRVQDHPEQHSKTPSSLQKNKMS I 
KSNLLNIHKLFSTGRM 


7670 


21571 


A 


7730 


67 


65 


SICEYCFVSSVFVLKRLFSVMQAFSLEG 
CVSCQS * AFLQH YLFKKLY FLY * WGLTM 
LSRLVLNTWPQVIHIiPWPPKVLGL+AFC 
GNEA 


7671 


21572 


A 


7731 


354 


238 


QSILDDVAMVLD*QAEVFIVKMWRLLIY 
VHEAKKIGLVK 


7672 


21573 


A 


7732 


33 


186 


RGALAVXiSRL I LTPGLKES S CLDLPKGW 
DYRWEPPRPGCFF* LMVLVLS F 


7673 


21574 


A 


7733 


1 


181 


VNAGADCS S IGGVP FLQHKKCHGKDYEP 
RGITTLERS YVEETTEHLVSKSK* PLRA 
QINIi 


7674 


21575 


A 


7734 


336 


80 


HKLKEPPGVFPVFPFKNLEFGPGPPKFF 
FKKAFSPILSFFFGKIFKI PRFRGENFA 
P* NFGKNS * KPRFS PPPPKKKGFFFFFF 
F 


7675 


21576 


A 


7735 


2 


163 


TPVSTGTPVPTLT* VPSPI IFPVSEKWA 
GCLKLCLNFTCTELRLLTSLLTIRS 


7676 


21577 


A 


7736 


87 


258 


KAPSVCLFSALLMLLRMSARTSVCTVRP 
LSPS*AVISPVTCTYTSRWPEATEDSQK 


7677 


21578 


A 


7737 


1 


103 


LDSKGIYMTLQPSKLSPKLEANVEIREH 
MLENSSRF*RDLYDSAAFKARTKARSKC 
RDKRAHVGEFF 


7678 


21579 


A 


7738 


193 


300 


GGLPPPPFFFFFFLKKRGFPWVTRGGPN 
PPPLEMGGPTPPKGWNYGGGPPP*FHPF 
GGVGPP I S KGGGLG PPRVTQGKPLFFKK 
KKKKNGGGGKP P YS PFLGGVKQKKGVNP 
KGGGSKKPKFPPPPPPGGKKKKPFLKKK 
KKKK 


7679 


21580 


A 


7739 


150 


17 


GRVAQVWWLMHAI PAL* KAEAGRSLEPR 
SWTPDWGTWHLPISTK 


7680 


21581 


A 


7740 


1 


358 


SPPRPPPPPPPPPKEKFKKLENPPPPPK 
NPQKQNPPPKTKKKSPTTTPKKKPKIGT 
PPQTPKK INCS * KKKNNPPENKKGLNYP 
NKPNHPHAPPKEPKQREKPLVPTKKKKG 
PSPKKKK 


7681 


21582 


A 


7741 


235 


147 


F*FFLGDEISLCCPGWSLTPGLEQASCP 
GLPRRWNYRCEPPHSVKTIF 


7682 


21583 


A 


7742 


131 


12 


AASTYGQ*FKICGAILRLMPIVIEFIPL 
KILQIGPVFTL 


7683 


21584 


A 


7743 


1 


184 


NQ YPWCFCNHMSMGKES KTLNRSGMVAH 
TCNPS TLGGQGGR I T * A* GS KS SLGDKV 
RPHLY 


7684 


21585 


A 


7744 • 


347 


202 


S LFVETG YHYVD * AYLELLASGS P PAS A 
SQSFGISSVSLCTQPMFQFE 


7685 


21586 


A 


7745 


138 


263 


KKRKTYIHTKTCM*MFIAALFIRVRKWK 
QS KCHSADE W IN KF * Y I KI CLS I HQLND 
IWIASTF 


7686 


2)587 


A 


7746 


358 


169 


AAAVAERTQNTEKTEDLVLGLWI KKVT Y 
WPGWAHVCNPITLGGRGGWIT*GQEIE 
TSLPTRR 


7687 


21588 


A 


7747 


78 


346 


AIQMLRWQVKIKLPKSPVEI IPFKI VPQ 
FOVOKI KSONNFI.I.LFQ3SQI FFATSTG 
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Predict- 
ed end 
nucle- 
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Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E^GIutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I^Jsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tbreonine, V=Valine, 
W~Tryptophan, Y=Tyrosine, 
X=Unkno>vn, *=Stop codon, /=possible 
nucleotide deletion, \=possibie 
nucleotide insertion 














SVRPLSPS*AIISPVTCTYTSRWPEVTE 
KSQKK 


7688 


21589 


A 


7748 


222 


98 


KKKARHGLGAVAHACNPSTLGGQGGR I T 
*VQEANIVRTCLY 


7689 


21590 


A 


7749 


314 


52 


LKOTDIiQFFLGPRKKKRjSTLNDSSLFVSA 
EEVRLMYCFNKLAI LHYF* FSNKFI VFV 
* KFGHLLDENMGS KFDNI GMNAMANKDN 
ASK 


7690 


21591 


A 


7750 


371 


216 


YHTHTHTHTHTHTQI PSLPLNLEGPVGY 
LLSCGSLHLKNLW* KCMRKERKK 


7691 


21592 


A 


7751 


3 


238 


FL I QTG FGHVS QAGWE LLR * GN LPALAF 
QSAGITGMSYLCVAESLYLLPPFLKILC 
SSLLTLS PYDTWKVALLCS YTV 


7692 


21593 


A 


7752 


2 


247 


NSDIFDIYIIQYIFLIYLIYIIQYIYLI 
* VFLI I FFKFTLK* EFI FISLSISFRLA 
FI VYRDVILLFLFFFS *NFVGDS IS 


7693 


21594 


A 


7753 


359 


2 


TLKIRPIiLYLPLFFFFFF* KKKIFFFPP 
GGGKGV* SGFYI.MGRGTFSLFFKRFFLF 
YPPKEGGLRGSPPPPSFFFFFFKKRGFP 
FLARGV* KKGPQGTPLPWLPKSLGLRG 


7694 


21595 


A 


7754 


3 


174 


CSVIQAGGQWCGHSSLQP*IPGINQSSY 
LSFPSSWDYRHATMPS*FLFFIEIKMLQ 


7695 


21596 


A 


7755 


292 


219 


IWPLSHVHSNPLTHSPSQVDQGLFTTLL 
PCLSG YMYVQFPLAEMP F I S S LLLENAC 
LSLCEGSEAQKPLLWEGLSYLPPLNQLP 
LCL*LCVCISGGVCRCLMCILGVCVCMH 
VCVSVCPCTHTLAQETKGC 


7696 


21597 


A 


7756 


2 


195 


VALMADGAIDTESNDYGAFMPLGIERGL 
DRIWEMPELWLRPNEFDCMTD* PCIQHA 
PSVSCGLA 


7697 


21598 


A 


7757 


4 


280 


DHIVDMITPSFTRRTIAVF*DLNLYIVI 
RGK I TS I IiKPNKNLKLWY I YT I E YYS AL 
KGNEILIHATIWINLENMQDEINQTQKD 
IYCTISLI 


7698 


21599 


A 


7758 


352 


107 


FLTRAS RGHNG ET I VTS WPGAVAHVCNP 
SSLGGRRRWIT*GQEFKTSLANMVITFF 
I SKNSNR FLSQFP I S LRPTH YKVLS 


7699 


21600 


A 


7759 


92 


4 


RPSPRRGGWTA*GQ2FKTSLGNMVKPCL 
Y 


7700 


21601 


A 


7760 


92 


4 


RPRRRRGGWTA*GQEFKTSLGNMVKPCL 
Y 


7701 


21602 


A 


7761 


328 


124 


PLFSFLFFPFFFFFFFFFSRRDGVLPCC 
PGWS*TPGLKLSSSLCLPKCWNYKHEPL 
HTAHHNF 


7702 j 


21603 


A 


7762 


1 


167 


VDSTDKR PGAVAHN YNPSTVGSQGGQ I T 
*GKFNPSIiANTVKLFKKKKKRKRGALF 


7703 


21604 


A 


7763 


309 


226 


R PRRRLTLS PR L* CSGAISAHCKLRLPG 


7704 


21605 


A 


7764 


349 


7 


QVFCFLFFVCFS S I FMTFNE * QGLKVTS 
GISSNVY*FIiLIWII»GKITYQDIGRCFS 
EYGSPEQHNLGIVRl^ASSRLGWAHTCN 
? I TLGG * GGW IT* GEGFETSLANMGKPL 
LY 


7705 


21606 


A 


7765 


43 


339 


FFFFLFFFFFFIIFLLLSFFF*DHFFVS 
LLFISNFVFLFPI FIFTVFQAHLFICLL 
FFPICMFLNLFFPNFNYYFYPFPLFFYL 
LFFFFG* FFNHLFYL 


7706 


21607 


A 


7766 


352 


102 


HTSCLIIKSSLSKMNISCVSSSISFFF* 
FILIEIGSR* WQAGLOLLGSTDPPASA 
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nucleotide 
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ing to first 
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residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
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location 
correspon 
ding to 
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acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=A)anine 
C=Cysteine, D=Aspartic Acid, 
E=G\utamic Acid, F=Pheny)alanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine t P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
YY=Tryptophan, V=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














SQSARITGLSHHAQLSSTSLVVENVEW 


7707 


21608 


A 


7767 


346 


90 


TPPSSFFSFPPFPFFSFIPFRFFFFSNR 
DRVLVCYLGWS *TPELK*SSQFGLFKCW 
DFRCGPPCLTSTAFYISITVNFSPSNCY 
S 


7708 


21609 


A 


7768 


278 


87 


AASTGVGPSEPSAGYYLLVCRLLRSLEK 
CS I RVGVT* FS RCHLS PLS LTRKGNS LT 
PCASRVR 


7709 


21610 


A 


7769 


5 


272 


AVYHP INESCVLN I GKDS CLLY * LFEKK 
KKKKKKKKNKKKKKKKNKHYE* FNTFPG 
VARKINLFLCIQKVDAGGGTLWGGGPPI 
FFFFL 


7710 


21611 


A 


7770 


198 


33 


QRPQSRQCRAWLGMVAHAYNPGTLGG*D 
RRTT * GQEFKTS LGNI VRP YL YEKM F 


7711 


21612 


A 


7771 


97 


410 


A*WLMPVIPIIWEVXWRLLEPGSLRPS 
WATWRKP 


7712 


21613 


A 


7772 


5 


415 


XLCVYLHFVGTHPFVHL*YMHFLPLTLA 
LLI*YVSIPITISSIPPYT 


7713 


21614 


A 


7773 


1 


342 


WRVTSGHSG*AAAYAT*YILSAEGSFF 
PLTEELLI * FMKREAFT*KR*VLiI IKE P 
S INLD*LYGCPTP YHTFPDPVYLNSKRR 
RRDSHPPKLASRLPHVLHDFFKKKKGGR 
RF 


7714 


21615 


A 


7774 


398 


122 


SPPPPFPSSPSFFLPPPSSSFFLFFLFF 
FFFFFFFFFFFFFFFFFFFFCRD*VSLC 
CPA * S QT PGL K * Y S YHGLP KFWNY WHE P 
PCLAYFL 


7715 


21616 


A 


7775 


1 


336 


FNFLLI I I IEMESRSVSAHCKLSLPGSS 
NS SAS AS RVAG S RGAHHHTWL F F F FFF F 
F* KKKGAP PGGQGG FLTLKKKNP P PRA P 
QRGG I NGGTPR PQNGGGKFF P WGE KKKF 


7716 


21617 


A 


7776 


1456 


1232 


FAI E SHCVTQAGVQWCNLASLQPS PTE F 
K* FSCLGLPSSWDYRCVPPKPANFYIFS 
RDRVS PCWPGWS KTPDLK 


7717 


21618 


A 


7777 


400 


2 


NTEFPYG?PISLLGM*P*EME7YIPRKP 
CTQMFLEVLFTIAKK* KKPKRS * LI NG * 
ILVYPLMEYYSATKRFDVMIHTKNVDAI 
GNIILSERRQM*KTHIVYSIHRASCTAE 
VRIKVSYKRAATWI KS ILIA 


7718 


21619 


A 


7778 


360 


218 


P YLANF* NLL* RWHLNMLPRPDSNS WPQ 
VILSLWP?RV1,G*QRGGRVEQRGARGNW 
MS KCPEGGDCLCWVAGA 


7719 


21620 


A 


7779 


1 


372 


FEVRSCSVAHAGVQWHDHRGFEFLGSSD 
DPSVSASCISRITGASHCTQG*L*LLTK 
VL*VSAFCLSVKTLRLTSLLKSDTFLjGS 
RSKLNSLGSSLQTYISPSRHFIQLLCFR 
SLFPRFliVKHKK 


7720 


21621 


A 


7780 


372 


219 


ATSLLSYFKKLPQPPQSSATTTIISQ*S 
STSRQDLPPAKRLELTEGSDDP 


7721 


21622 


A 


7781 


358 


145 


I KNIHDS WEDVKI STLTRV* KEL I PVLT 
DNSEGLKTSM*EVTAEWEIARELELEV 
EPKDVLVQFHDTINR 


7722 


21623 


A 


7782 


317 


146 


GRVDCKCP FYRFQMLLVRLGMVAHVCNP 
STLGGRGGWIP*GQEFKASVSRMPRPHL 
Y 


7723 


21624 


A 


7783 


13 


186 


DRVSITQTSWAHPPTSAS *VAWTTGMHH 
HTWLNFVFFVEMGFHHVGQVGIRLPLLS 
WK 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
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26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=PhenyIaIanine, 
G=Glycine, H=Histidine, Wsoleucine, 
KHLysine, L=Leucine, M^Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


7724 


21625 


A 


7784 


339 


221 


KWDLAMLARLVSNSWLQGSSYLSL*NCW 
DYGSEPLCPAL 


7725 


21626 


A 


7785 


355 


3 


FGNSGRVDLFFFFFFLDRVSLCCPGCSA 
VTNLEVHHSALQPQTPGLKPS SHhShhS 
SWDYRRVPPHPS * GLF* *GK* YSITQLR 
FLNLHLS SGLRSLLPKPVAST VRI RTQS 
GLGLK 


7726 


21627 


A 


7786 


1 


201 


QQFSHVFRLLFQFSFSCQEPPSSHIQPG 
VCTRNTKVSQAW * CVPV I PANREAETGE 
SLEPGRRSTRP 


7727 


21628 


A 


7787 


270 


1 


RPRRHLVPCVLMLGPKCPGLPHTLTTLP 
YTLTTPSPTRSLGPPECK*SFHLSFPSS 
WDYRRMPQRLANFCRGRILLCYPDWSSS 
PGLKQS 


7728 


21629 


A 


7788 


229 


2 


DGWVR I NWAQ PGTVAHAYN I SNLGGRGR 
R IT * G QE FKTS LTNM EKPC L Y S DHE VRS 
LRPARPTWRNP I STKNTKM 


7729 


21630 


A 


7789 


118 


263 


CSKS YYWPDAWHTCNPSTLGGQDGW IA 
*AQEFKTSLGNIARPPISVF 


7730 


21631 


A 


7790 


246 


359 


TFI FSETRSHS VTQAGV* WHDHGSLQP * 
PPWAQVILSC 


7731 


21632 


A 


7791 


70 


. 102 


AA*SRLTATSLSRVQAIPLPQP?EQLGG 
QGGRIS 


7732 


21633 


A 


7792 


7 


312 


FLDFQLRLHSNSYCEDKGVFDSEIYEIC 
I VL I CKK * RKFLNQKKKKKKKEKPGL I * 
KKKKNSQKKWFDFPWLPPNPETKNSLFL 
PKEFLWLKTKPPFPLTLT 


7733 


21634 


A 


7793 


1 


341 


I KPE * YLLFAYT I LRS VFNKLGGVLALL 
LSILILAIIPILHISKRRSIIFRPLSRS 
LY*LIiAADLLILT* IGGQPARYPFTIIG 
QVASVLYFTTILILIPTISLSENKVLLW 
A 


7734 


21635 


A 


7794 


1 


143 


YGSTFFVATGFHGLHVIIGSTFLTICFI 
RQLI FHFTSKHHFGFEAAA* YWHFVDW 
* LFLYVS I Y* *GS ?FFVATGFHGL»HVI I 
GSTFLTICFIRQLIFHFTSKHHFGFEAA 
A 


7735 


21636 


A 


7795 


2 


257 


KWAIIEEFTKNNSLIIPTIIATITLLNL 
YFYLRLIYSTSITLLPISNNVKIK*QFE 
HTKPTPFIjPTLIALTTLLLPISPFILI I 
L 


7736 


21637 


A 


7796 


22 


326 


RDASDCSFQNLPVPLWVEK*MVFLLTKK 
KKKKKKKKKKKKKKKKKGGGPFKKNSWG 
AKI I PGKKKI I FFFLEGQKKKLWGFFKK 
KPLFWGGKKRPNPPKXN 


7737 


21638 


A 


7797 


506 


281 


RGAN*NRSGCGKRHEERERERERERERE 
RERERERERERESPRPPKRQRERETEIQ 
TLSLTVSLAPPPTCVF 


7738 


21639 


A 


7798 


435 


336 


MRSNIHFFSHTKTHTHTHTHTF* ILKQH 
TFSK 


7739 


21640 


A 


7799 


411 


106 , 


RNPPPLFFSPPLQRGGFPPIYWGPPRFF 
PPPPL*KT?PKLKIGAPPKKKKPPPPPG 
EKMVSF* PPPPFFFFFFFLRRPFPLFSP 
GGGP WGHFRGPP P P P PGV 


7740 


21641 


A 


7800 


2 


296 


FVPSTANWCFFLWSVF*TGGSNYFFVHR 
S YSQAGVDLI FI RLANT I SNLS F I * * RG 
GLAMLPRLVLETRAQTI LHSWPPKVLGL 
QVWATVPS FQFLKN 
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SEQ ID 


M 


SEQ ID 


Predicted 


Predict- 


Amino acid sequence (A=Alanine 


NO: of 


NO: of 


etb 


NO: in 


beginning 


ed end 


C=Cysteine, D=Aspartic Acid, 


nucleotide 


peptide 


od 


VSSN 


nucleotide 


nucle- 


E=Glutamic Acid, F-Phenylalanine, 


sequence 


sequence 




09/515,1 
26 


location 
correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


G=GIycine, H=Histidine, I=Isoleucine, 
K=Lysine, l>=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=G!utamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


7741 


21642 


A 


7801 


344 


105 


PGFFFFPPPGKRGFFPKPFFLGTPGFFP 
PPLF*NPAP*FFFWAPKKKIFSPPPPGK 
KIFLLKGPPLFFFFFFFFFFFFF 


7742 


21643 


A 


7802 


68 


216 


VHTIAKIWKQPQCPSMDD* IKNMRYLYT 
VE * YS VLKERE I LL YWGH YAM 


7743 


21644 


A 


7803 


407 


3 


GFIH*FPLFSGYTLDQTYAKIHFTIIFI 
GVNLAFFPQHFLGLSGMPRRYSDYPDAY 
PT*NILSSVGSFISLTAVILIIFMI*EA 
FASKRKVIiIVEEPS INLEWLYGCPPPYH 
TFEEAVYIKSRQKRKESNPPMY 


7744 


21645 


A 


7804 


2 


156 


THRR III LSQGL * T LLPL I AF * * LLASL 
ANKKKKKKKRGGPFKKFPWGAK I 


7745 


21646 


A 


7805 


2 


309 


THSRIIILSQGLQTLLPLIAF* *LLASL 
ANKKKKKKKGGAPLKKIPGGAKI KRGME 
I KNFS QKGGEKKTORGI FGKXP YLGGGQ 
NWEKLPKKFKGLKGKKNF 


7746 


21647 


A 


7806 


110 


362 


TLSDLERLLMKAVSHFLM I DLLE KC * CV 
L KNQSS KKKKKKKKKKKKKKKKKKKKKK 
KKKKKGGPLLKNSWGGPNFPGGEKIFFF 
FFRGGF 


7747 


21648 


A 


7807 


295 


108 


KQRGFVFFFFEMESRSWQARWQWCNL 
SSL*PPPLGFKQFSCLRIVYRKDSLSFN 
KPRLLP 


7748 


21649 


A 


7808 


2 


142 


GSTFFVATGFHGLHVIIGSTFLTICFIR 
QLIFHFTSKHHFGFEAAA*YWHFVDVV* 
LFLYVSIY**GSTFFVATGFHGLHVI IG 
STFLTI CFIRQLI FHFTSKHHFGFEAAA 


7749 


21650 


A 


7809 


390 


2 


PASLLHCG*ISDCCASNE*GSVGLGPSE 
PGAGYNLLVCHLLRPLEKCS IRVGVTRF 
SRCCLS PLPLDRKGNS LTPCTSQVRQ CL 
ALLQLTLGALHPVS CTHCPTI SGEMNPV 
SQLEMQKSPIFCVTHAG 


7750 


21651 


A 


7810 


1 


157 


FLHFGQAALELLTSGDPPASAS * SAG IT 
GVSHRAQLVCTF I T I YVFLKNS S Y 


7751 


21652 


A 


7811 


368 


44 


QNFPPPKKRLPPQPPLFFFLVPS FKGEG 
FFFFLRGFFFFSSAQRALL* FI YFIFFF 
FFFSFFFFFLYFSISPDSKGDTHHDLLL 
GVSWWTRSLPLWIARHNMHKWGW 


7752 


21653 


A 


7812 


400 


80 


PQNSFSPPGIGGFFPPFPL*NFFFSPKA 
FFFLGGFSPFFPPPKKSFFFKI PPGFFF 
SPPLKKKI FFFPPPFFIAPPRFFFKAPP 
PFFFFFFFFFFFFFFFKYFRHI 


7753 


21654 


A 


7813 


119 


230 


SEEFETSLRCI VI PSL* KKKKKKKKKKK 
KKKKKKKKKKKKK 


7754 


21655 


A 


7814 


3 . 


412 


HEQEHL * LLLP* * PLAI I * VI STTAETN 
RTPFDLAEGETELDSGFNIEYAAGPFAL 
LFIGEYTDI I IRNTLTTTI FLGTTYDAL 
SPELYTTYFVTKTLLLTSLFL* MRTGYP 
RFRYDQLIHLL* KNFLPLSLALLI 


7755 


21656 


A 


7815 


1 


183 


LRLERLSEFSTRRERERERERERERERD 
TRIDIYIVSQKRKKYIV* IRRNIFVYHA 
EFSKR 


7756 


21657 


A 


7816 


484 


97 


QPRTPDLK* FACLGLPKCWDYRHKPPCL 
ASDGNHS * SSPGLJj VLS&ASVSPSGHMS 
PSQQTS PWVSEESLLLGRVPGFIiY FPPV 
DAPGPGAWLAHWQAVLKLLGSSDPPVL 
PSQSAGFAGLSNCAWPW 


7757 


21658 


A 


7817 


2 


219 


ADRLRNS*EA+EREREREREREREREPY 
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peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H==Histidine, I=lsoieucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine r P=Proline, 
Q=Glutamine,R=Arginine, S=Serine, 
I — I nreomne, v— valine, 
W=Tryptophan, Y=Tyrosine, 
X = Unknown, * = Stop codon J / = possib , le 
nucleotide deletion, \=possible 
nucleotide insertion 














PPSHI YTDRNRH * MRHEERTL.FLWGSGE 
VRDAAPWVSHI PNTAL 


7758 


21659 


A 


7818 


190 


292 


NSHLrl FPWTKVFFMS*CTYNLVLPGSEK 
KFYSHA 


7759 


21660 




no ~\ rv 


498 


139 


DT5aaTt»aDaaai?'DDDDT nPP-frPPQPPPn 

TQRDARGRRRGED PGNS PFHPNPDRQPS 
rvMXi^o i i^rviMiirvijW x oir/ir r i_>i"i x djjo v 
AVI PRNGGPGGVSSGPCLLQLLCGQAGS 
STIRKIPS 


I /oU 


zlool 


A 


/oZU 


A OQ 


33U 


PT ,riPT .UtfPDZinTVPT J SI * qnppzvga gt.dt 
SWDYRHDHRTGHHPGIYDSKMCIF 


7761 


ziooz 




7821 


a orv 


ZOO 


PPT .YYMYPYRPPVYYT7YYPY PYVTr PYPP 

FFXXXFFFXFFFFFFXFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFLY 


7762 


21663 


A 


7822 


493 


482 


VGQAGLKRLTSGDLPASASQSAVITGVS 

l-TO Zi P P T M VPT? WO * Zl TfP CWV * P KTV 


7763 


21664 


A 


7823 


456 


121 


ASFFPIQYRGLGGATPPQEGGCG*GIPI 

VTAir'VVDDDPr' DUPP QVD DD^* l^YCVC T 

tVVVo x JSXKPo^irn^vjolVllFJr Xy^JViw I r X 
NVIiVLFYLRDKGLAIPTLVSNSWAQVIL 
PPWPPKVLGLQTRVTVPGQNILKENMFF 


1 /64 


ilocO 




/oz4 


Z94 


4Z> / 


T HTfPVTT TCVFWTIvrP7tfWVr , PPriTUZYT4T 

CNPSTLGG*GRQITRSRDRDHPGQHG 


7765 


21666 


A 


7825 


26 


235 


SVWWNSQTGNKVKFTISW* IDKQMRCIH 
IjyiEYYSAIKI^EVLINATTTWINLNTI K 

T OVDOATT VCMTT 

j_>C)is-Koy x LiJxori 1 xj 


7766 


21667 


A 


7826 


30 


303 


SYVSWEFPGSSGPTLFSPWHQTPRTYL 
NSVASAINLTQCPH * PEKKKKKKKKKKK 
KKKKKKKKKKKKKKGGAFKKKGP*KTPK 
KTPGGF 


7767 


21668 


A 


7827 


464 


21 


REPPHPAPADILILHFWPLEL*EIiISW 
xjN 1 KC V v i CL/D i FKiVJU X IVFrUrK ULiliQ X 
EPL*VLFSNLGSFGSLSKS *MGLVNFFC 

RLVLNCWPQVILPPPPPKLLGLQVEATT 
nCtV no x £/ 


7768 


21669 


A 


7828 


372 


205 


FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFIKKFFI 


/ /oy 


ZIO/U 


J\ 




4o3 


313 


LKNRKPNIVCTHL 


7770 


21671 


A 


7830 


449 


266 


WQMRFLHVGQAGL* LLPSGDPRASQVQ 
SAWI TG VSHHAQRS I FFKKRNNDCDRPD 

IM X Hi v» 1 


7771 


21672 


A 


7831 


480 


148 


I ELRLSHGAAEFHRPREWYGI FKVLKER 
DFYPRI I YPAKI ILRHEGEMKTFPDKQT 

ATNNHLKVKKKKNSLGW 


7772 


21673 


A 


7832 


23 


131 


QRGNSKGYHLKMIQQEEIRKLEEEKNQL 
EGEI IHFYKMK7VASEALQTQLSTDTKKD 


7773 


21674 


A 


7833 


16 


339 


NTDTLGSLMAFCRDGLAMLPRLFLNTGL 
KRSSCPDLPNCWDYRHGPPYLASFVLLK 
* TLS ILLLPYYHKMHTCVFMVLCTNFCG 
p\7rDPivn?T t nT.inrvnvT QT ttwtpu 

.Lj V \_ i. VjL w ]11«XjXi\_xj A. V i vj i ijoXiXVrl 1 r n 


f / /4 


Z JO/ J 


A 


7834 


jjj 


70 


KKTPRGFSGVFLGPFYKKGPLFFFFFFF 
FFFFFFFFFFFFFFFLFLQFFILCQQFL 
S*STEKTVHSKYILSFPFHCLIFCPFIIi 
LCSTFCNGAHY 


7775 


21676 


A 


7835 


494 


282 


PENGMIQGGGACSEPR**HCTPAWAKER 
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ing to first 
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residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, 1-lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














DSVSNAKTIQNKKNLGN*HSISTAKAKN 
IIIS* YKSELRKKM 


7776 


21677 


A 


7836 


305 


185 


LVIETCLTVRPLS PS *AI ISPVTCTYTF 
RWPEVT3ESQKK 


7777 


21678 


A 


7837 


179 


191 


LGCRKYLFYLNGGLKCLYIKCVCVCVCI 
Y IG VY I GMCVY I C I YI YLMAS * NQVTS F 
LSFYVFLLNYR* I P VKV I TVL FNLRMTG 
LP 


7778 


2J679 


A 


7838 


466 


343 


LGRLRQEiNHLPPEGRGCSEPRSRPCTPA 
WETEEDAVSKINK* INK 


7779 


21680 


A 


7839 


451 


89 


LRDTR * SGVI CLPKQAWAMVGAPPPASL 
PPCSLI SDCCASNQRDSVGVGP SQPGGG 
YNLL VRR FLS LS E KRS I R VGVNRFSRCH 
LSPLSLTRKGNSLTPCASRVRQCLALLR 
LAKGARTH 


7780 


21681 


A 


7840 


442 


164 


AHTNQFSQCI KKSVTPDLMEEMYKKAHA 
AI * EN P VYE KR P KKE VKKNRWNCPKMS L 
AQKKHQVA* KKASS LRAQEQAAES 


7781 


21682 


A 


7841 


494 


359 


ICMLPRLVLNS* PQVILLPWPPKVLGLQ 
A 


7782 


21683 


A 


7842 


577 


224 


IFFFFNKIFLFRPGLNLMGDIWVPSTPP 
LQG*TNFPPQPSQEVGLQKAPPPPGLIIj 
VFLSRQGFPQLGRVGFHFPPPGDPPPRP 
FKKIVJSHPGGGKSPPKKKKKKEIAEVLG 
AFRLR 


7783 


21684 


A 


7843 


404 


56 


HTLSTNVCSSYLNLDFFSLRGRGLIMLP 
RLALNS WAQE I LLPQPPE * LQLLRRLRQ 
ENLLSPGVQGQPGQHNETPTS 


7784 


21685 


A 


7844 


1 


323 


INTLLALLLI I ITF* LFQGIGYINNSTP 
YEGGFDPISTARVPFCIKFFLEAITFLL 
FDLE I ALLLPLP * ALQTTNLPLI AMAS L 
LLI I ILALSLAYE * LQKGLD*AE 


7785 


21686 


A 


7845 


419 


159 


FLFFFFXFXXYVLS CGI Y* YWCNLFSS * 
QYFFFYVMAFRIFLFTFLFLHFYYNMF* 
CTYFYLSCL* FEYICSHIIHYFIYFLHL 
ENF 


7786 


21687 


A 


7846 


475 


124 


FFFFFQPPQKEGGPPPPPKNFFFFFTRI 
FFFGVFFFFFPPQKNFFFF*KTKGVFFF 
PFF* * KI FFFFPGVFFFFPVFFFLSPPP 
SSSFFFFFFFFFFFFFFFFFFFFFFFFF 
FFLKRGQAS 


7787 


21688 


A 


7847 


6 


189 


LIDTVSPSVALAGVQWCNHSSPQP*PPS 
LKRSSRLALPRFWDYRCAPLCKAHSFNS 
NHEK 


7788 


21689 


A 


7848 


20 


249 


KINFIWELTCSNTVHTFYVYGFDKCIL 
PTTQLFFFL*ETGFHSVARLECSGVIMA 
HCSLD I LG S S WRPANFLNFL 


7789 


21690 


A 


7849 


101 


398 


LFQKKKKKKKKKKKKKGGPLKKKPGGGK 
NKGGEKKKIFFLKGGEKKNPRGNFKKKT 
FFGGGKKGEKPPKKKKSPEEKKKF*RGK 
GGKKSLI CWVEKNLG 


7790 


21691 


A 


7850 


87 


226 


GSLS P I ML VLP FL I HLL* KNFLPLTLAL 
LI*YVSI?ITISSIPPQT 


7791 


21692 


A 


7851 


2 


440 


GAIIRRLLH*FPLFSGYTLHQTYAKRHF 
TIIFIGVNLTFLPQHFLGLSGMPRRYSD 
YPDAYTT * N I LS S VG S F I S LTAAVLTTL 
MS *EAFAS KRKVLLVEEPS I DLERLYGC 
PPPYHTFEEPVYI ISRPRRKESNPAKLV 



1028 



WO 01/64835 



PCT/US01/04927 



SEQ 11> 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C-Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phcnylalanine, 
G=Glycine, H^Histidine, I=Isoleucine, 
Kf Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucJeotide deletion, Impossible 
nucleotide insertion 














SSQPHG 


7792 


21693 


A 


7852 


1 


80 


SSLLLIIILALSLAYS*LQKGLD*AE 


7793 


21694 


A 


7853 


5 


135 


ATFLYFS * K* SLSMLPRLVLNS WTQAI L 
PPWTLKVGLCFFDPL 


7794 


21695 


A 


7854 


38 


419 


FVMMPLHSSLGEGVRPYLKKKKKKPPWE 
GGS PPKNFKNQ I FA+NQKP PFFFF I PPN 
XKWGSPPGGGASPL*SHPSGGPGGPNNK 
VKISKPPCPPGGTPVFTKTPTFT* PWGP 
I PEKEGRKTVCPRTH 


7795 


21696 


A 


7855 


398 


61 


FFSPRPRGGGFPPPPPKNFFFSPPTIFF 
GGGVWNFFPPP KMVFFLKI S PWFFFPL 
IRKKIFNLSPGSFWSPQGFFF*GPPPKF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFF 


7796 


21697 


A 


7856 


1 


121 


LDASGRLP?SPKPIKIKNYNRP*EPK*T 
KICSLHSLPPQS 


7797 


21698 


A 


7857 


1 


332 


TPPIARPAENSARGAEGRRR* I IALQVC 
NKKWPPTKLHCI KKKKKKKKKKKKKKKK 
KKKKKKKKKKKLWGGGGGKNFFKKGGKK 
KLGGG FKKRGRKKKK 


7798 


21699 


A 


7858 


3 


191 


SPRHLTRAAGIRHEGQTGLKLLTSGNLP 
AS AS* SAGITGMSHRIGPRMLDLLTLVS 
WKQHGL 


7799 


21700 


A 


7859 


503 


62 


FFFFLNCCLSGLRHLIKLFEPQFLHL*N 
GGiWLSIILSIKSPLRSPSLSYSWL**P 
REKIKWWGGQVRWFMPINPRTLGGGGGQ 
IA*SQEFKSSLDNMVRPYLSEPP 


7800 


21701 


A 


7860 


355 


86 


Y I KG * QM VGNRGAS Q AGMTG YGMPRL I L 


7801 


21702 


A 


7861 


336 


225 


PQLAGPVKTVRKVYKKEKARVI PEEKKN 
FKAFASFRKGRANGRVFGIRAKKAKEAA 
KQDV* KKPPI PLAGGS PTRE PPSQAHLF 
PQNPPRGNPP 


7802 


21703 


A 


7862 


465 


170 


GGPPPKTHPFGGERGGKYHRGKKKPPGH 
QRVNPPS PKKKKKNWGRGKNPFTPL IGG 
GQAKKPPQPGRGECP* KKVPPNNSVLGK 
KKNFFPKKKKKKKK 


7803 


21704 


A 


7863 


495 


275 


RSFFFFFFFFFFFFFFFFLVARK* YYFL 
AT KKKKKKKKKKKKKKKK 


7804 


21705 


A 


7864 


493 


70 


PLTASISLPVNWRYIAPHRWKIK*KSV 
Y * KYLA* CLAQSKCPKSVCslKLSS P * LH 
FVPPPNPAEAMWTARQC*CIRGLGGKGT 
LCVAHTYNPSCSGG*GRRIV*AQEFKAS 
LGNSEPLI LKKERKKESQAWWLI PI I PA 
F 


7805 


21706 


A 


7865 


67 


385 


VARTTGMQHHAWLI FFI FI I FVETGSHY 
VAQTSLEFLASSDPFTSVPQSTWITGVS 
YSWS*PGIAHTCNPSALGDRGERIT*GQ 
EFKRSLGNIVRPCLYKNNKNKKN 


7806 


21707 


A 


7866 


454 


0 


LSFFFFPLGDRVTLCCPG* SAVARSQLT 
VALNSWAQAILLPQIiRKHVPPCIiANF*N 
FF*RQRYTMLHRLVLNSWAQVILQPWPT 
CPTAAQA 


7807 


21708 


A 


7867 


1619 


698 


PATSSS SS S SSSSSAAAAVAAAATAASL 
PPCRPPLRPPQPLAPHEVATLAPWRG+K 
PSFTSSGKRWLTSEPQTSGSCASCL*SM 
RASSPSSG* SKKKPPLPAEAAASVAACA 
VC WRVRAP P YVAATTAYTMAVMG WMAFP 
WEVIWTRWRMMSOAIRPLOTWTNSVTAP 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E>=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, Wsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q-Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop cod on, /= possible 
nucleotide deletion, Y=possible 
nucleotide insertion 














s 


7808 


21709 


A 


7S6S 


52 


187 


LDFN * RRCLTKLPRLVLNS WAQ VI LPP W 
PPKLLGLQV 


7809 


21710 


A 


7869 


33 


373 


AGVQWRDLGLLQPPPPGFKRFSCLSLPS 
GWDYRHAP PHLANF WLVEMGFHHVGQA 
GLKLPT*GDPPTLAYQSAGITGMSHRAR 
PGISVFHLKLHS FDPFKI I FRPGAETMP 
S 


7810 


21711 


A 


7870 


100 


119 


VFVE I VFPEGLLFS PCSFHLCG FRIHLK 
AEF* FFVETGSMLPRLVSNS WAQVILPP 
WPPWL*LQA 


7811 


21712 


A 


7871 


471 


446 


LLMLPRLVLNSWAQVILPPQPPKVLGL* 
AYI 


7812 


21713 


A 


7872 


449 


33 


FSFFFFFFLGKKDLFFSFRGGGGGANSA 
FFPPPNFGKKGSAPPPPPLKSV*NPAAP 
TGVFFFFCGGGA 


7813 


21714 


A 


7873 


334 


41 


PCPSLFFFFFLRQSLTLWPRLECSGVIS 
AHCNLCLPSSWDYK*HCPYLATGAPGLL 
SQSELS YVTL* LQESRPLCI PVDFMTFL 
PSLCFQS PNSTTL 


7814 


21715 


A 


7874 


342 


716 


STGIFLFVCF*DGVSTLAQAGVQWPDLG 
SLQPSPPRLK*FSCLSLASSWDCRHVPP 
LLANYLYF* *REGFTVLAMMVS FS * PCD 
PPTSASQSAGITGVSHCAHPGMFFLNWP 
LKSCPSESGAAV 


7815 


21716 


A 


7875 


454 


215 


PPPARCH * PP* * TLVHAVEFKSHHYSLI 
SSTQGHKHCGRPQGPLPRKTRDLCSLVY 
LLTFPSLLSYDPAKSLSARNTQE 


7816 


21717 


A 


7876 


492 


176 


P CGGRGAGGNWGWP P WHLRASAS SRWAW 
ALQFHTRSSQPAGPTGPRQ*GAQHLGQQ 
LLCS I SHQALAAPPQGRAQDLQPAMPE P 
SPRPPLPWAAQ 


7817 


21718 


A 


7877 


281 


439 - 


TWSIDL*HMAYFLFFSFLFFAFLSFSLL 
SFSLLFFAFLFL*VPSFSLFQRQSL 


7818 


21719 


A 


7878 


507 


338 


LLRRLK* EGHLS PGGRGCSKPRLHHCI P 
HWATE*EPFSKTINK*N*S*MKMKMKIN 


7819 


21720 


A 


7879 


55 


298 


PPCLANFLKFFL*RQSLPMLSRL?SNSW 
PQAVPLPRPPKRNQLYLYFGCCSQLCNP 
QHLCS KQHTFKI LFD I EKH FRLREH 


7820 


21721 


A 


7880 


145 


292 


YFGGKQIYFIFCRDRISLC*PVLLTSLV 
SNSWPEAILPSQPPKVLGLQA 


7821 


21722 


A 


7881 


3 


203 


LSLLPRLECSGTFTAH * SFKLLGSRDS P 
TSAS * I AKTTGMRHHAQLRFFFFVPGGV 
CVCVCGCVYV 


7822 


21723 


A 


7882 


324 


211 


AW WLN P VI P TL * EAKMGGS LEARS LRF V 
WAHNETPFLF 


7823 


21724 


A 


7883 


2 


124 


QHFGRLRQVDHLRLGVHYQRGQHGETPS 
LLK* INVYI YMT 


7824 


21725 


A 


7884 


367 


141 


KCWDYRREPPRLAQGLEFLFNFCFEIVT 
EPPCPALAYF*NFL*RWSLATLPRLVLN 
SWPQAILPRWPPKALGLQE 


7825 


21726 


A 


7885 


471 


215 


RGMCVSHIFLCVGARYNISSARESLLKE 
FTECTERKHTHTRAHTHARARTHTHTHT 
HTHTHTRV* NSRS QP YCRVHACS P AYLG 
G 


7826 


21727 


A 


7886 


462 


204 


RHSS*LGLPKCWDYRHEPPHLACISSF* 
RKSDISLSDELGTKCHKWWPQWKSRAG 
SVGRDPCSGHFILAGFGOLGA 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=G)utamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine,R=Arginine, S=Serine, 
Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


7827 


21728 


A 


7887 


409 


152 


PVFLALERPNQQDFLRPGVLDQP*QNNK 
ISFFFFFFKKKKKYIYICMWHAYNPSY 
SGG*GGRIT*AQKFKT£LNNTVRPHLYK 
K 


7828 


21729 


A 


7888 


1 


152 


GVHVCCDCKCICVCLCWGVHECLCVCV 
SK* VHMCHRBGT I FGN I S WEGD 


7829 


21730 


A 


7889 


101 


399 


EI IN I VFFF Y FFLPLLFFFF WGGR FKKQ 
LNPEFLTNSWKKNHKTIGGKNPMGG* KK 
AITNPQKKGRKRSFSF*NKAQPTQGKNG 
QNSQGS KAVGALTAH 


7830 


21731 


A 


7890 


478 


142 


KF S KKKKRGVLNF Y KKKAF F S KP P KP VF 
KKKKLLKNPFFKAKKQFFFKTPTRPLFR 
EKFFPGPPKKKKKRPPGLFKNN*KGPPW 
PREKKRVKKKKKKVKSH YYNNHMP I S VH 


7831 


21732 


A 


7891 


497 


320 


PPWKFEFEFWFSP*LHKYYPPYVCVCVC 
VCVCVCVFDFRFFKPAPSPQSHFGINLS 
FLL 


7832 


21733 


A 


7892 


217 


23 


IPPFKKFF*EMGSHAVT*ARWQWHNHA 
SLQPETCGLKGFSHLTLPNSWDCRSVPS 
SSARRQSH 


7833 


21734 


A 


7893 


652 


470 


SSSSSSSSSSSSSSSSSNSSSSSQFERK 
TRSAPQVEGKGPNFCSLEAPPPGLSLFS 
CLNF * RS WDPGPPP * PHATPKTRAGGDL 
LCATHHRARHCTPGIiRFPLFLAL 


7834 


21735 


A 


7894 


25 


178 


GILFFWRDGGLVI liHRLVSNFPVSSDSP 
ISASQSWDYRCDPPCPACPLIFW*LLKH 
SF * FFWRDGGLVILHRLVSNFPVSSDS P 
ISASQSWDYRCDPPCPACPLIFW 


7835 


21736 


A 


7895 


104 


451 


ILSFMTTWMEAEDIMLSEI SQTQTDRYC 
MILLVESKNVNLRPGMMAEAYNPSTLGG 
* GER IS* GQEFETS LVNI ARH YFYKRN F 
KIKSKFKKIVRKSQKKNVRRWLLEARNK 
GRYG 


7836 


21737 


A 


7896 


510 


335 


SCIVSLHRSWDLWHVPPFLANF* IFRKG 
GVLPCCPGWQIYTFQLQIPFLSITPVAF 
SG 


7837 


21738 


A 


7897 


37 


260 


AGNSQNQGGGACSE PRWRHCTPA* ATER 
ESVS KWKQTKFLCS VPYFESTHFHCTNE 
ATGAWGVEKVALGHAASW 


7838 


21739 


A 


7898 


35 


466 


THTCGGIRKKLTNKRENQK* SQQKPSAP 
KEPEVKNKQKKTKNKPKKKKKTPQIQPT 
KKPFRNPNLRVFTFFVTTQNKTPTRGRK 
TFQKVKLFPPNPCLGNKPQTLLPRGGLQ 
KKKKKIPFFYSHPPPLAQKKKATPWQNG 
LGFT 


7839 


21740 


A 


7899 


494 


149 


PARLLSLHSYEVKSVLPKEEGLLNFFVQ 
S VTAHPTS CIGLEE I *LLD 


7840 


21741 


A 


7900 


494 


114 


GAEGGPLSKKKKKNLLPDYHTGPMSLSL 
YLWPFLSNGHTLTLMCAHTKTHTHTHTN 
H* LSRVGLP VLKPGQYQANGNRWS P* YN 
IKE* PRKTRVMPVTPQLRNDGPMLTFWC 
S SRNS FAMHVLGKL 


7841 


21742 


A 


7901 


19 


454 


VWCNSSVS IHTHTHTHTHTHTHTHTRSW 
VGWHMP I VLSTQEAEAGRLLGSGSLRIQ 
RAMIVPLHSSLGNAARPCV*HTGSCCVS 
QAGVQWHNHSSLYPQTPGPKQSSSLSLL 
SR*DYRHVPPHPASCVCVRVCVCVCVCV 
CVYGDRGITPH 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=Glycine, ENHistidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V= Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possibte 
nucleotide insertion 


7842 


21743 


A 


7902 


3 


393 


DAWDPLNPLVNLFVSPKRNSSLDTRKKP 
CRESKKFNTHSRPKSSHQLRKRSSSTPT 
T*KIPNI*LNSSTPSRAKKKRKKKPKKK 
KK* KNSQKKKKPMRKTKKKKGGGLLKKI 
LGGAKPFGGRKEKI FFF 


7843 


21744 


A 


7903 


424 


139 


LFFFFFFFF * G * GFVI FCYVGQVGL * LL 
TSGDLHISASQSAEIRGVSHHAPPLSLI 
WFISQCVGILNHTVHLQYIQCSFVHHF 
LINQEKIDIIK 


7844 


21745 


A 


7904 


1 


181 


SQ*LY* LLAANLFILT* IGGQPGSYPFT 
IIGQGASGLYLTTILILIPTICLIESKI 
LKWA 


7845 


21746 


A 


7905 


3 


281 


HAYAHASATFFVSTCFKGLHDIMGSAFL 
TMCFI RQLIFHFTS KHHFGFQAAA * YWH 
FVDW* LFLYVS I Y* *GAVLKEPWGGQS 


7846 


21747 


A 


7906 


1 


84 


PTRPLTMLPRLVSTPGLKQSSQLSLPKC 
* DYRCBP ACTAYS F * LTMLPRLVSTPGL 
KQSSQLSLPKC 


7847 


21748 


A 


7907 


3 


392 


DPDNYTLANPLITLPHIKPE+YFLFAYT 
I LRS VPNKLGGRLALLLS I L I LAI I P I L 
HISKQQSIIFRPLSQSLY*LLAADLLFL 
T * I GGG P RS Y P F I L I GQVAFV L» YFTT I L 
ILIPTISLIGDRILKWA 


7848 


21749 


A 


7908 


2 


231 


GCVEKGTLSHC*WECKLVQPLWKTVWGF 
LKELKIDLPFDPAI PLLGI YPEENKFTL 
GPLHSVGRSSRKLPGYSPRV 


7849 


21750 


A 


7909 


529 


297 


KIFIGAPVFCPPPHFFFLIPPF*GVEKI 
FFSFFVLLPFGFFFLFLGLFFFFFKLFF 
FFFFFFFKI PFFFS KKVGVGK 


7850 


21751 


A 


7910 


410 


61 


FNNQAMQWFPPFPLKIFFFP *RV* FLE 
GGWSNLSPPPK*GPSPKFPTRVL*GPP* 
GKN*NFGFPG*NWAPHRFF*RAAPFFFF 
ODRVLLPRLVSNS * AQVIHPPWPPKVL 
GLQA 


7851 


21752 


A 


7911 


512 


314 


ANFC* LF I ETGFPHIGQAGLELLTSNNP 
PALASQIAGTTGVNLRPGQDFSSYHSEN 
PLVYGSLYIA 


7852 


21753 


A 


7912 


1 


264 


SSTTTLLLSRT*NKLT*LTPLI PSTLLS 
LRGLPPLTGFLPKWAKKKKKKKKKKKKK 
KRGGGGWKKKPKKGGGGETSSKKDTFFK 
KGGP 


7853 


21754 


A 


7913 


436 


129 


FFFFFFFAPPPKI KTPPFFFKRE VYI Y * 
ALS VFQS VCKALY I HFRI I S SQ YPC * LG 
FFFFFFFFFFFFFLKGSLDLLSAYCVPG 
SVQGALHTLSH 


7854 


21755 


A 


7914 


397 


34 


IFSNLFFPPQKKMFFPPPPLKYFFFSFT 
ALFFFLVFFPFFPPPKKIFFFNNPPKFF 
FYPS * KKKFFFFI PVYFLAPPEFFF* AP 
PPFFFFFFFFFFFFFFFFFFFFFFFFFF 
FSVSKQPLFCLKI 


7855 


21756 


A 


7915 


396 


86 


IFLSPRK*GFFSPPPP*KFFFSPKPLIF 
FGGFFPKFPPPKKNFFFQNPPGVFFYPP 
FKKKKFFFPPPLNFGPPRVFF*RPPPFF 
FFFFFFFFFFFFFFYLWVC 


7856 


21757 


A 


7916 


351 


112 


SFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFPI*KYVFIYISLSLIPFCKFLK 
GRECDIHFCDIQHSYSKYLGHFKCSTND 
RQMNK 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Aianine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Giycine, H=Histidine, I=lsoleucine, 
K=Lysine, D=Leucine, MNMethionine, 
N-Asparagine, P=ProIine, 
Q=GJutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop cod on, /= possible 
nucleotide deletion; \=possible 
nucleotide insertion 


7857 


21758 


A 


7917 


445 - 


2 


FFFFFFFFFFFFFFLSYF*LIHYAEGIG 
ASPCYIMLGYNFSSFPCGTISIAPGFNF 
YRLYFISHASADAWVDPRFLDRYRRGIN 


7858 


21759 


A 


7918 


2 


407 


PPPPQVYFNIWRTIFFGEGLLHIFPPQK 
RFCFYKTYTNY I KTLFIKKKN I FSLAHI 
KMVPPRS I Y*TPPPLFFFFFFFFLFFFF 
FFFFFFFFFFFFFFFFFL 


7859 


21760 


A 


7919 


107 


11 


KMWY I YTLECYS ALKKKKE I LLFDTTL I 
NLKDIMLSKISQHRKHTHT+ FKT 


7860 


21761 


A 


7920 


I 


70 


FFFF*AAESCSRIKCSGTIIAQCNPELP 
GLSDPPTSAT* VAGTKGVCHHTQLRFI F 
IV*LSGRILLQDQVQWHNHSSV 


7861 


21762 


A 


7921 


123 


260 


G IQARKEPGAVAHAYKLSTLGG * GGW I T 
*GQELETSLVNMVKPCL 


7862 


21763 


A 


7922 


184 


399 


LHLLMRVIiKKKKKMGGPLYRS PRGAKVN 
PALQRLDSLLIGSRILSNLGIFGKAPHS 
WGG * AFFAATGLHGLHVI IGSTFLTIGF 
IRQLIFH FTS KHH FGFKADD * Y WH FVD V 
A*LFLYDS I Y* * GS R I LSNLG I FGKAPH 
SWGGTKWDNPPPDMKSS 


7863 


21764 


A 


7923 


2 


407 


GRVGM I * EAFAS KR KVL I VE EPS I NLE * 
LYGCPPHPLHTSNKKPPHTKKKQKKKKK 
KKKKKKKKKKKKKKKGGGLLKKLWGGPK 
NTGGKKKNFFFFWGGKKKNLGDFIiKKNL 
FWGGGKIGPTPPQKKNPLGEKKF 


7864 


21765 


A 


7924 


1 


398 


PTRPPTRPTSSRSRAPPFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFVSAALFFGKKKNFFFFF* FFRG 
* GFYFFPPLYFF I CGGGGVFFFFFFAGW 
NI FF I PPKXNGI FKTLRGKGG 


7865 


21766 


A 


7925 


2 


335 


NLTFFPQHFLGLSGMPRRYSDYPDAYTT 
*NILSSVGSFISLTAGILIIFMI*EAFA 
SKRKVLI VEEPS INLE * LYGCPPPYHTF 
EE PS I KKKKKKKKKKKKKKKKKKKKKG 


7866 


21767 


A 


7926 


1 


328 


RTRGRTRGESNPSLRIQNSPCHLSHPIL 
KKKKKKKKKKKKKKKKKKKGGPLKKNPG 
GAKI * PGPKKKNLFFKGGLKKTPLGNFE 
KKPFFGGGKKRKKPPQKKKTLKEKKKF 


7867 


21768 


A 


7927 


27 


226 


LSR*KM+NKSHIYSFEKXKKKKKKKKKK 
KKKKKKKKKKKKKKQKKENRPKNGDSEN 
GHPKSKIFWL 


7868 


21769 


A 


7928 


398 


2 


GGGGGPGGRGRP * *NPHKKEGRDPPHPK 
KKFFPRRRKNRGGGGREKSPPQKKKAPQ 
KKPGRGFKRAPQKKKKTPPPPRKKGPPP 
KNFKKGAPPFFFFFFFFFFFFFFFFFFF 
FFIiGILARTTTTTNDMKNHR 


7869 


21770 


A 


7929 


3 


156 


HASAHASAPVFV* S VLMTAVLLLLSLPV 
IiAAGVT I LIADR I LNTTFFDP A 


7870 


21771 


A 


7930 


405 


249 


AS AHAS APL FV * S VL I TAVLLVLS LPVL 
AAGITILLTDRNLNTTFFDPA 


7871 


21772 


A 


7931 


921 . . 


699 


DHPGQHSE ILSLQKMKTMSQCGGAHL* S 
QVLRRLRREDCLS PGSQS CSEL * SHHCT 
LACETE * DLVFQKKKKF 


7872 


21773 


A 


7932 


423 


259 


HDLGSLQPLPPGFR*SSYLCLPSSWDYR 
CEPLRPAQRRGILMAATSRILTRSKGW 


7873 


21774 


A 


7933 


395 


35 


PRVKRFSCFWLPSKWGPRPRAPWPGYFC 
FFC*NGGSPFLPGGSWTLAPNYSPPPAP 
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QNGLVQALRPPGPPTFPFL I FPTKAPWS 
WPFFGGKKKKKLSPKGHSBRHLHHKYIR 
LLSQHGGM 


7874 


21775 


A 


7934 


1 


118 


NF* IFCRDWGLALSPRQLSNSWPQVILP 
PQPPKVLGLQA 


7875 


21776 


A 


7935 


322 


383 


KKKTGLIKISTLCFVKNTFKKTKWQAPE 
RKKKVPIFLSYKRPVFKIYKKLFPFCRK 
KIKKKIPPPGFTSFSCLSLPSSWDYRHP 
PPCPVNFFCIYNKNGVSPG* PG 


7876 


21777 


A 


7936 


3 


131 


GFHRVGQVGL* LLTLGDPPASASQSAGI 
AGLSHRSQPLTWSL 


7877 


21778 


A 


7937 


334 


146 


HPQGLNI *NKMVSSSGAKHSSRLSLPKW 
WDCRQEPPCPVIMFLKRGINHALYSPSR 
XLLHFL 


7878 


21779 


A 


7938 


2 


371 


SEPGAGYNL»LVGRF*SPSEKCS IGVGVT 
RFSRCCLSPLSLTGKGNSLTPCASRVRQ 
CLALLWLVQGVIiHPLSCTECLALPSEMN 
P VPQMEMQKS P I FC I ADAGS CRPELFLF 
GHLGSSPQSKL 


7879 


21780 


A 


7939 


2 


95 


RLN LGGGGCCE P * S CHCTPAWVTKRNS I 
SKK 


7880 


21781 


A 


7940 


1 


191 


PTRPSQTPGLK* FSHLSLPKSWDYRCEP 
PCIiAVILSIPQNSSMPSNIFAASYSS*F 
GLYSSRW 


7881 


21782 


A 


7941 


127 


383 


S KDC VR I VLLRAQAKAGSYRTVNWCRPG 
AVAHT CNHSTLGS RGGR IA * GHE F KTS L 
GNIVI LPLYKKNKKKKKRGARLKEPNLT 
P 


7882 


21783 


A 


7942 


235 


3 


KN I L PGYFCFFLRE KPPPPGLESLFS KK 
KKKKKLARHGVAPLLGRMR* EDRLRPGV 
QGCNELWWCHCT PAWATEQNP 


7883 


21784 


A 


7943 


3 


225 


EHGSLYPPTPGLKQSSYLSLLSR*DYR* 
AAPCPANF?IFNFL*REDLTMLLRLVSN 
SWTQAVLLPPKVLGLLV 


7884 


21785 


A 


7944 


2 


232 


TLLPDNLSQTIYPNKKKKKKKKKKKKKK 
KRGG P FKKKKF * TRGGGKKN FFFRAPKI 
FFGGRVLKKGGGKKPGEPTN 


7885 


21786 


A 


7945 


394 


41 


WCRWLETWGAGGSGAVPPPPFFPPHSSS 
WDLALQGSGASS P FS PGAVAHACNPSTL 
GGQGG W IT* VQE FETSLANMMKPHLFS K 
KGPVTAVAHPAKAHRAMWCTPTHMHTQH 
VLHVL 


7886 


21787 


A 


7946 


405 


100 


FFFSPRFSSSLFLFLSSFFFFFFFFFFF 
FFFFFSTGLF VY WS PTH * SLPVPR I FF F 
SYLWQTEVKTMLGTE 


7887 


21788 


A 


7947 


391 


212 


KEMQIMPAT/[R*HLTPVRMVITKKSKAKC 
WQGCEEKGTLAHSWWKCKGTAETGKLAG 
F 


7888 


21789 


A 


7948 


106 


301 


LQLPITQLWHSIYFMKKIRYRLGWAH 
TYNPSTLGGQGGRIA*GQVFKTSLSNTA 
RQKKKKKKG 


7889 


21790 


A 


7949 


342 


208 


GLGAGTHTYNPSTLDN*GGWITSGQEFD 
TSLANMVKHHWQTGQF 


7890 


21791 


A 


7950 


391 


21 


RQGFPKLPRL ISNS * PKGSAHLSLPKCW 
EYRSEPPSLATWYVTLSRHFCSTLLLGQ 
GRSKFKMKWESRLEGLLAKVSSLRSSYC 
SSLRILGLEILLSPLLFFLETGSGSVSQ 
AGVQPLCPGLK 
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7891 


21792 


A 


7951 


146 


355 


ENIRQTQNEDWIKSSKETVFCKNI I IS 
LGWAHACNPSPL*GQGRRIT*AMEF*T 
SLDNIARPCLYQK 


7892 


21793 


A 


7952 


289 


327 


KGI PPLPPRLKDKGPFFFFFFLKRESLF 
LP* FWKNGCRGPPLFFFFFLRDSVSLLP 
RLVSNSWP*VILLPQPPKVLRLQA 


7893 


2 J 794 


A 


7953 


861 


586 


DR VR S VAQARLQ WHDLGS LQP P PPGF * Q 
FSCLSLLSSWDYRRMHTWPFVF*WRWGF 
TCVAQAGLDLLSSDNPPALASQSARI IG 
ESCCVLLK 


7894 


21795 


A 


7954 


402 


52 


PRKVS IS* PHDLPAS AS QNAG I TGVSHR 
TRPSFNFSYIETAITETRKSLQPSPGMN 
* SL P S GM VG FKPH S F I KS AN F CE ALAVQ 
QTLGEVPGHAPLSPPRQEPRVCPKKVCL 
KTALK 


7895 


21796 


A 


7955 


259 


3 


YNTKVLS FPILL* TFLKHLWLGAVARAF 
NPSTLGGGGGRI A * GQEFKTSLGNI ERC 
RLYKNLKLFLESGLFLKAWRGAKRVRTR 
G 


7896 


21797 


A 


7956 


171 


386 


lp i rlfslvftt i eksrpgmvahtcnls 
tleg* sgwit*aqefetslgnmaklslp 
kiqesaslitrsqnk 


7897 


21798 


A 


7957 


397 


283 


PQPLPPGLKQFSCLSLFKC*DYRRDSAR 
PAKKNPTFNK 


7898 


21799 


A 


7958 


169 


324 


GTHRVEMCSQWLGWAHACDPS PLGGQV 
RKIT*AQEFGTSLGNIGRPCLYIK 


7899 


21800 


A 


7959 


290 


397 


DCS S CCS PLKGR * RG * AQWLTP VI PALW 
EAEAGGS 


7900 


21801 


A 


7960 


496 


298 


LS KTGSL Y VAQS WYWLFTGT I MVHDSL 
KLLCSSS PPVSAS * VAGTTGAHRCCEVL 
TTFQPDVYRL 


7901 


21802 


A 


7961 


404 


174 


PGPQKGSFF*KKXKKKKEIDHILSHKAN 
LNKC*RIKII*TMLSDQNEIKLEIIIKR 
QLLKETLFLEMKKYTIRKLS 


7902 


21803 


A 


7962 


25 


99 


PGLFLSS *HQVI LLPWPPKVLGLQL 


7903 


21804 


A 


7963 


436 


106 


GGGKSPPPGNWAPPGNPGEPLFF*KKKK 
XNPGGGGGRFFP P PPGGRKKKGVYPGGG 
GFYLTNFPPPPPPWGKKKNFFFKKKKKK 
KKKKGRIAICYI ILFITSINFKXAFI 


7904 


21805 


A 


7964 


212 


395 


ERERKERDRDGEREREREEREEGRKKHG 
GR*GGREGERHRPRGAEDPDRAGRATDK 
RTGAli 


7905 


21806 


A 


7965 


344 


345 


FSDCYKRSSCASWTWYTRLAHHQPPARP 
GAAPS PRKRWVDFAVNRNFPSRRHPCAT 
PRPAPGATGRPSRVPKSTAFTQMLTQPR 
DPHLPLPIPPLSQPPPSP*PAPLVMR 


7906 


21807 


A 


7966 


397 


298 


RGFTMLPRLVLNSRTQVI*LPWPPTVLG 
SLSIF 


7907 


21808 


A 


7967 


92 


268 


IKMSLCPNFFFFFKKQNVGPRAGAYPCN 
PTILGGQGRKIT * GQDLKTTLVNMGKPH 
FY 


7908 


21809 


A 


7968 


257 


107 


RTHTHTHTHTHTHTHTHTHTHTCENSKT 
TKNIANNCS * LESQRNNGKCTYAKF 


7909 


21810 


A 


7969 


413 


12 


PPPPPLLTPWPCFFPQIL*PVGHSAPPL 
FK*LKIPRPFFFPPPPKGIGAPP?KIFF 
PPGFSLPPFVKLPPGEKILFPPPKKKKY 
PPPPPQS*SF*PPPPPPFFFFFFSTSAK 
KSKKNFLFIFSPFFNPKYYNT 
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X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
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7910 


21811 


A 


7970 


399 


212 


TGVQVJSNHSSLQSQTPGLKRLFLLILLS 
I WD YGCMFP YLS FKN * FCTKGGSC YVAE 
ADVNIL 


7911 


21812 


A 


7971 


584 


424 


RPRRENRLNPGGGGCSEPRSRYCTPAWV 
TE*NSVSNRNKTNQNPI FFSCKQFD 


7912 


21813 


A 


7972 


105 


1 


PSMPQLSHL*NGILGRVRWLTPVIPALW 
EAETGRS 


7913 


21814 


A 


7973 


363 


28 


S S S LGGHPQ LW FHTL KLR P VTVS YA* NK 
LCDKI EKKLFFTIALHNKFLRKKLTRDV 
KILFNNK*IKKEGPNEF*NILCLWIERI 
NINKVP YPS KVMYRSNTLP I EVAVFKSQ 


7914 


21815 


A 


7974 


404 


250 


FFFFW*R*GFTMLPRLVLNSWTQAIHPP 
ALASQS AGI TGMSHHAQPLS I IF 


7915 


21816 


A 


7975 


2 


60 


FS CLG L PFC WDYRHAP PHLANFCRDGVL 
NY* PQVIHLPQLPKVLGL* ACTTTPG 


7916 


21817 


A 


7976 


325 


81 


KTIHS LLFGQ FFLLQP LL P S ? I PHAAP P 
TPHLI FLLLLFCF* FFK*RQGKQGLTTL 
PKLISNSYSQVTLLPQPPKVLGFQA ' 


7917 


21818 


A 


7977 


406 


282 


QWRDLGSLQPPTPWFK* FSSLSLPNSWE 
TKAGRSREFGGGD 


7918 


21819 


A 


7978 


382 


221 


DCIISASYLQKNFFFFLGDKVLLCHPGW 
S AVA * S * LT VTS KR VHF YS S E I S L Y 


7919 


21820 


A 


7979 


423 


29 


FLW * RRG FTS LPRLVSN FVIAQG I CLPVIL 
PKVLGLQA 


7920 


21821 


A 


7980 


3 


238 


SLAFFVETGSHFVA* AGLELLS SSNS PA 
LASQSAEITGVSHHTQPE*GYSHM*PQY 
PYQLRFNSDTI ILSNVLFVPNF 


7921 


21822 


A 


7981 


324 


128 


WLSVSPYTFELSLLVGYEFVIHDVI*TIj 
IFWWWSLFCFLWFFLVaWFWFFC 
CCFLSVFCI 


7922 


21823 


A 


7982 


1 


86 


NLTMLPRLVSNSWAQAICLSWPPKVLGL 
*AQAICIiSWPPKVLGL 


7923 


21824 


A 


7983 


1 


137 


RPANFCIFFVETEFCHVAQAGLELLGSG 
DPPTSASQGAGTTGMSH* D * PANFCI FF 
VETEFCHVAQAGLELLGSGDPPTSASQG 
AGTTGMSH 


7924 


21825 


A 


7984 


2 


277 


PRVRSVQRVIYQYV*NLQIHVPFDSTNL 
GIYLTD I LPHWNDNMYKVFAAVSFVIA 
KY*NPCLTRRQCNKLWPIHAMEYYVTIK 
KNEDGRA 


7925 


21826 


A 


7985 


211 


42 


GPQKWPRGPGENLGGFFFFFFFCRDDGL 
TMLPRLALNS*VHAILPPQPRKVPGLWV 


7926 


21827 - 


A 


7986 


409 


183 


LLRLK* SSHLSLPKWWDYRCEPPCPAI F 
FKRKQILKESFPGENTCCHLFNSLEVLK 
NKRRTCYNFFNTLAVLKNM 


7927 


21828 


A 


7987 


2 


194 


RFFETQSRLLQPPPPQLK* SSHLSLQSS 
WDYRHVPPCLANFFLFWYFLQGQGFAFL 
LKLKFLFK 


7928 


21829 


A 


7988 


483 


209 


SSPSLGNFCIPSSGGVSPCWPGWFQTPD 
LVIWPPGPPSLVFFFFFFRDEGLSMLLR 
LTLNF * AQ V I LL PQLP E * LGLQADRQ VP 
LYPAEVF 


7929 


21830 


A 


7989 


164 


29 


KFW F WLGT VAHACN P S I LGSQGRR J A* A 
QELE PSLGNEGR P CL YK 


7930 


21831 


A 


7990 


3 


474 


PTRAPLRAPFETIVQSEEFSKHHFGFEA 
AA* YWHFVDW* LFLYVS I Y * * GS PPPP 
PKILIKKNGAETKKFFF 


7931 


21832 


A 


7991 


469 


323 


VETGLHHVAQDGLELPTSGDS PASASQN 
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IGITGVSHHA* PDFTFLEQF 


7932 


21833 


A 


7992 


469 


13 


KGDTVSSSDSSPSCGGLFPVGAS*LLCL 
FTQALAMVGARRPFSMLPCSLISDCCAS 
NERGSVGMGPSKPGVGYSLLVCRWLRPL. 
EKRTIKVGVTRFSRCRLSQLPLARKGNS 
LTPCVFWVRKCFGSHSVGCTHCPAPAVR 
QAPVK 


7933 


23834 


A 


7993 


15 


439 


IAS GR P FS IKFFLVAI TFLLFDLEIALL 
LPLP*ALQTTNLPHVMSSLLLIIILAIi 
SIiAYE * LQKGLD * TEPNKKKKNKKKKQE 
KKKKKKKTKKTKGGGLLKKILGGAPILW 
GG KNMF F F FWGGE KKTPGG FLGENLFLG 
GGK 


7934 


21835 


A 


7994 


392 


186 


YF Y I INNFFFFKKI FI * * 1 1 FFFLXKFL 
* YS PKKVFFLFF* I FFFFFPSSSLFLFF 
SSSSPFFFFFFFFFFFFFFFFFLRG 


7935 


21836 


A 


7995 


11 


386 


TWEVEVAVS * DHAI ALQ PGQQE * MAI S K 
KKMTGRMFIEVSFI I ASSWKQLKCS* TG 
E*MKSLWYIHI I E Y YLTI KENK VLEHE V 
SRLTLIDTGRERSKTKEYTLYNPIYTNL 
RIAKLSGCGGSHL 


7936 


21837 


A 


7996 


3 


186 


DSFCF*RQSCS I TQAGVQWGDRGSL* PQ 
SPGLKRSSCFSLPKHWDHRDEPLHPAGV 
AVLN 


7937 


21838 


A 


7997 


316 


649 


RDKSFFFEMESRSVAQAGVQWHDLGSIiQ 
ALSSGFMPFSCLSLPSSWDYRHPPPCPA 
NFLYFLVEMGFQHVGQNGLDLIiTS * STR 
LGLPKWLGITGVSHWHPAQDGGFHHVG 


7938 


21839 


A 


7998 


468 


336 


RRGFTMIjARMVS is* psdmpasasqsag 
TTGVNHHIRIKRSLL 


7939 


21840 


A 


7999 


464 


283 


TLCGGIRPTNIMKGNLLYSKSIGLNVSL 
LPKNPRRNIQNNV* PNTWALQ P SHVD I 


7940 


21841 


A 


8000 


46 


487 


RQGRLSLQKFLLPFVQLCPAHRGGVYRG 
RQASLS CGGLHPVRVSQPLCLPT* VSAM 
AAAPHPASLLPCSLISDCCASSEQGSVG 
VGPSKPGAGYNLLVCHLLRLLEKRSIRV 
GVSQFSRCHLSWLPLARKGNSRTPCTFR 
VRRCLAL 


7941 


21842 


A 


8001 


295 


1 


TQPWGTKRKLPLKKKKKNSGDREWRWLC 
NS VT I LNATTM Y TS E WLR PGTVTHTCN P 
KTSGG * GG E I A * TQE FKAS LGN I A* S W L 
YKKILKISPIiWWQ? 


7942 


21843 


A 


8002 


144 


317 


ELFYLKEMV*RPGPVAYACFPSTLGDRG 
GR IT* AQE FRTDLGN I ARP CVYLR KKKK 
RG 


7943 


21844 


A 


8003 


69 


291 


HCVW TMI FI VIKYFYLNPQNUSMRYYYS 
HFMDKETDVQSD * MTC* RLGAVAHACN P 
STLGGRIA*GQEFKTSL 


7944 


21845 


A 


8004 


3 . 


176 


I VAHSSLTPGLKQFSCFSLPSS * VYRNM 
LPRIANY* FFGTORVSLSCQGWS * TLAS 
LLGSSSSPASACQVAXSrGTCYHAWLII 
DFLVQTGSRCLAKAGHELCS CL 


7945 


21846 


A 


8005 


419 


178 


N P PP KKLFLSS S S S S PLPHLNVJGTPGF F 
PPPPF*NPPPEFNFGAPKKKKKLSPPRA 
EKLVPFKGPPPFFFFFFF 


7946 


21847 


A 


8006 


429 


225 


QPLPPRFK* FFCLSLPRR*GHRHRPPHP 
FNFFFF * YFLRWS FVLVAQAGVQCMDLS 
IYLNEFWLLDF 



1037 



WO 01/64835 



PCT/USOI/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning . 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=A)anine 
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7947 


21848 


A 


8007 


756 


224 


PPRHQPQHVPPPLLHFSTPSPAPPKSHA 
AAPNSNPPQAHAPPAPPAPRAHSPSPPP 
RGATPPFPPPPFSFLAAVGRAPIFPREP 
LLPSSSSPPPPP*IGEPPGFSPPPLLKT 
RPPK*N*GPPKKKNFFPPPRPKNWFL*K 
GPPPFFFFPFFFLRQSFTIiVAQAGVHKT 
RLHLKKKSN 


7948 


2J849 


A 


8008 


170 


559 


SSHLSPQSSWDYRSGMVAHTCNPSMLGG 
QDGRLT*AQKFQTSLGNIVRCHFYFKKY 
FKNKIFFYSPFINTSTPKKKKKKGGAVLK 
NQSLRPRAGKVLVFLWGHLIQFRGPRFK 
NGGTGKPGGLPNLLAPR 


7949 


21850 


A 


8009 


1 


662 


RTRRTRSWWYMPVLPLSHYGEAGESLGP 
* KWMVD * APIAPLHSTLGDKMTLPD I KK 
KNGETVANTSPLSTTAKPAKLFLKAKKD 
QLMRDLFLPKTPKKPQMSTGLDARSKRW 
LKI I WRRHGIWPLKNIGPTEDVQASAHG 
GVEBNMTSDIEIPEAKHDHRPTEDVQVS 
AHGGVEENITSDIEISEAKHDHHI>VEDL 
SESliSVCLEDFMTrGSQWKPICLS 


7950 


21851 


A 


8010 


377 


2 


NFFFFFPPFWGFFFFLGPGFFKVFF* IF 
PFFFPRNPPFF* FFLGVPLFFKiC* IFFF 
PFSIRFCFFPKKFFFFLILFFFF*FPFF 
FFFVYVFDFFFSFILYIFFFVYFSLPFT 
VLiI I SL I YHFAPA 


7951 


21852 


A 


SOU 


1 


363 


• PTRPIiF**RGCFTMLPRLILNSWPQVIC 
PPRPPKVLGLQT 


7952 


2] 853 


A 


8012 


278 


39 


FXXXFFLSPPXFLI FFFFFFFFFFNFFF 
FFFFPPFFFFFFFFFFFFFFFFFFFFFF 
FH*DQ*FSNFDFTPFTFTLISNII 


7953 


21854 


A 


8013 


79 


412 


MNECILGKGGKDGGFFEQCENLGLC* FL 
KVP * DSD I WLGAVAHAYNPSTLEGQGER 
ST*AQEFKTRLGNMTRPCLYQKHTHKKK 
TKKKPH P GLG WG AWG PTYLGG VGT K I T 


7954 


21855 


A 


8014 


277 


19 


FGNIHNFFLIGSRSVTQAGGQWCNHSSL 
* PQTPVLKQSSHLSLPSSWEIEAAVSQV 
RITALQPGRQSKTLPGNSGPVLRADARR 
SI 


7955 


21856 


A 


8015 


1 


130 


ARIVSIS *PCDLPASASQSAGITGISHH 
TQLAHNTLKASPTLF 


7956 


21857 


A 


8016 


4 


417 


RLMPPHLANFLFF* RHGFTYVAQVGLKli 
LGSNDPSASTSQSTGITDVSHCTQPHLL 
KSSHSTFPLKTLDSMATNCSWDKVHYLP 
TACRALLPCPLPSACFPSLSQVPSVFTC 
CVPATLTITLLPLLRAFEHVTLCPLF 


7957 


21858 


A 


8017 


388 


158 


CTTWASNCINWICFPLS* INSHL.CNEI I 
VLI *GEI YNVCIS I YLCI YMCVCI YI S V 
CVCVCIYVH*TPYLIHKCLI 


7958 


21859 


A 


8018 


154 


382 


GIKRPGAVAHNCNPSTI J GG*GKWIG*AQ 
EFKTSPGNMVKPQI YLKKKKKKGGAVLK 
DPSLRPRAGKVIVFFWGPLI 


7959 


21860 


A 


8019 


396 


237 


FFFFFFW*G*GFTMLPRLVSNSWAQVIH 
LPWPPKALGLQVRTMVPGLRLANF 


7960 


21861 


A 


8020 


412 


168 


GNLLG SPL PGNVCKWKP FI LF * KKK* SL 
AVLPKLVSNFWP * MILPPWPPKI *GVQA 
RAPPPSQILKKKKSSHFYNSHVNSL 


7961 


21862 


A 


8021 


3 


124 


TRFLHVSQAGL* LLTSGDLPASAS QS AD 
YIFIISPAKIIFL 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
VSSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A- Ala nine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenylaIanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, tt=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *-Stop cod on, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


7962 


21863 


A 


8022 


3 


200 


HLSP YFYFFKDRVALCHPGWS AVVQ * P I 
VTSIFWSSC*KLTSWRRWRPSWLTR*N 
PVSTKNTKN 


7963 


21864 


A 


8023 


3 


132 


FLFFFQF*FMRLGLAMLPMLGMNSWA*V 
ILLPQPPEWLGLQA 


7964 


21865 


A 


8024 


3 


187 


VGQAGLE P PTSGDLPAS ASQS AE I TRM S 
HRAQTK*DFIGSFSEEFS 


7965 


21866 


A 


8025 


405 


231 


SDKW I KKMRYIHTTEY Y* ALKRKE I LTH 
ATTWLlNLENILLSEISQSPKHRYYVIPL 
T 


7966 


21867 


A 


8026 


175 


382 


GKRIFPPYPPQEGGKTGPPQTPGLFFFF 
LKKK * PGGLGGARFSPLLGGVGRKNSFT 
LKGGGAINLKGPPSLPPGGKRWAPPQKK 
KKKGQAWWLMPVI PALWEAEAGRS 


7967 


21868 


A 


8027 


83 


374 


GDQVWWIjMKACWPS tlggqcgritwdre 
FETS LANI FFP I S TKKKKKKKKKKKKKK 
KKKKKKKKKKKKKKKKKKASSGGGVS * K 
KNFS PRAGKEI FF 


7968 


21869 


A 


8028 


382 


148 


SPHCSLDLVGLKQSSHLSLPST*DHRCV 
PPRLAI FSFAFLFFWEATTTTTKATMLP 
RLVLNS * AQAI LLPQPLKVLRL 


7969 


21870 


A 


8029 


88 


208 


SGSV*BNNPLSWAWWLQSI IPATQEAEV 
GRLLE PRSS RS A 


7970 


21871 


A 


8030 


1 


283 


NKSR I KKAI KMTE KMKE K I EMMKLMNQL 
PNGEHHAMI QMMMKIEKKGRMKIKE * DK 
ESDKDDRKDEREDRDDETDEPTPERRTS 
RDDS DDDEDRDEGKNEDKRKDDS KDDDE 
ADEDTDQDDYDPMDAHKAADEDDDEDED 
EVAEQMTKKKKKKKKKKKKKK* * KDDRK 
DERE DRDDE TDE P TPERRTS RDDSDDD E 
DRDEGKNEDKRKDDSKDDDEADEDTDQD 
DYDPMDAHKAADEDDDEDEDEVAE 


7971 


21872 


A 


8031 


2 


139 


LQTLLPLIAF* * LLASLANLALPPT I NL 
LGELSAQFAVIGMYILY 


7972 


21873 


A 


8032 


80 


324 


YLHFYFFVR * KLMGLDTVAHT YNPS TLG 
DPERRIA*A*EFKTSLGNMVKPCIYKKK 
KKKKKKKKKKKKKKKKKKKKKKKKK 


7973 


21874 


A 


8033 


299 


158 


GNQPGASAHTCNPSTLVGQGGRI I *AQE 
FKASLGNMVKHYLYQKHKN 


7974 


21875 


A 


8034 


416 


60 


AQWIiTS I IPPL* EAEVGGFFEPRSLSPA 
WATE 


7975 


21876 


A 


8035 


398 


271 


FYFL*DEVSLLPRLVSNSWTPAILPPWP 
PKVPGLRHEPPRPA 


7976 


21877 


A 


8036 


385 


151 


FLYFL*RLGLTLLPQLVSNISSSDPPT* 
ASESAGITGVSNHAHLRICILTRLPGNS 
SH*NVRRAGLGDTQSSALIPPG 


7977 


21878 


A 


8037 


3 


406 


PAS AS* VAGITGTHHHAQLI FVFLVETG 
FLFFFFWAVFLFFKTKNGPPGGQLFFFA 
APPALGGGDFGGQGATFSPKGLGVLGET 
RGGAPAPKKLGAKKE PSHLLGGGAQNL P 
KPRGQKGFGFYFYFLARDFFLG 


7978 


21879 


A 


8038 


428 


20 


LANQ*WKPPSNPWETKISPKKKKGPPGQ 
GPPPLTPPLWEAPKARKLGSPWLPR*NP 
LFPQKPKKFFGFGGGPPYSPLPQKLSPK 
NGVTPEVGPSHKPKFPPSPPPLGQK*TP 
FPQKKKPKRLKRGMFLHGSTFQSL 


7979 


21880 


A 


8039 


3 


194 


QNKISQVWWYTPVIIATWEAEGGESVEP 
GRQRLW*AKTAPLHYSODEKYLKASVDS 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od- 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, Wsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glu famine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan 3 Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possibIe 
nucleotide insertion 














YCLISCH 


7980 


21881 


A 


8040 


435 


96 


LEPL*KYFPILDAIiCMICDSWEEVKIST 
LTGV* KKLI PMLMDNFEAFKTSMEKVTA 
DWEIGELELEVEPADVTELLQSCAKLE 
GMRSCFLWMSKESGFLRWIYSW 


7981 


2)882 


A 


8041 


413 


254 


PGFKQFSCLGFPKR * DYRKKPPCPASKY 
FKINYLRNIYRSLILKN 


7982 


21883 


A 


8042 


409 


102 


GFFPFGPFFF*PPPPGVFPPPPPPNFWF 
*GFGPPPPPLF*FFFRGGPKKIWFFPFF 
LWPPLFFFFFPFWPKKTPPKKARSLKIF 
FFLGPKKI PNPFFFFFFF 


7983 


21884 


A 


8043 


414 


153 


GRGWATNFYFFLI ILK*FFFIFKKFFFF 
HLFFFFFFFFFFFFFFFFFFFFFFYKI F 
LFTLVLTCPQTSCREAEPFDHKVCSVLK 
ML 


7984 


21885 


A 


8044 


401 


292 


NYLPPQ * KKKYYPQGPFRKP I KKSPPPC 
FFFFFFFFPSFPPFLNIPPPQKGKFPPK 
KYFFNPP!»FPPLFFliKPPPLFFF*GPKK 
KNLIFHPPSKKICPFKRGPP*FFFFFFF 
F 


7985 


21886 


A 


8045 


370 


14 


TPPPTKTPFFFLYPPPWREKREGFF*KA 
NMSPPVWKKKIPPLPPYIYL+KGGSRYL 
AKLSLNLKGSIYLSFCFPPLPPSLPLSL 
FERWGLT I LPKLVLNS WAQM I LLPWLPK 
VLGLKV 


7986 


21887 


A 


8046 


395 


268 


PFFFF*ETGSCSIVQVGVQWLFTGTILV 
LISVGVLTCSISDL 


7987 


21888 


A 


8047 


379 


98 


GPAHNSPPLGGESETPLGYPGKPRFF*K 
NKKINPARGPGPWPPPPQG*AGKRPLP 
PRRGFQLTQNGPPPPPPGGKKKPPFQKK 
KKKREPAIC 


7988 


21889 


A 


8048 


369 


177 


RNTRGGPTFFFFFFFFK*NLARLSRLEC 
HGT I SAHCNFRLLGSSDSLPLTSNDIGQ 
INKSLFA 


7989 


21890 


A 


8049 


1 


128 


ALGLVAHAYNPSTLGSQRGKIA*GREFG 
TSLGHRARPCLYKK 


7990 


21891 


A 


8050 


166 


163 


P VHQGEQTTQDKCLR * STHLGLP KC * DY 
RREPPCQASKLVPTGILADSKHSKCQTM 
DSCFSNQLYKQR 


7991 


21892 


A 


8051 


402 


245 


YFKSVCQAQWLNTYNPSTLGG* DGWITC 
GQEFRLPWPSGITGASHHAQPHFF 


7992 


21893 


A 


8052 


404 


268 


QQLWRLRQRDCLNLGCGGCSEPRSCYCT 
PA*ATEPDPVSNKQINK 


7993 


21894 


A 


8053 


1 


154 


GFLRVGQAGLELLTLRDLPTSASHSAG I 
AG VI YRAWPGEAYLFSG * WLRFT 


7994 


21895 


A 


8054 


406 


150 


PKKKKNPCPPPEVKFDSLKRAPLFFFFL 
EESCFVTQAGLECSGAILAHCNLCLVGS 
SDSPLSL * RWVGNSHF I FTSLLSVS PRS 
N 


7995 


21896 


A 


8055 


1 


125 


TAVILI I FMI *EAFASKRKVLI VEEPSI 
NLE*LYGCPPPTH 


7996 


21897 


A 


8056 


400 


260 


HMSPHLAFFFSFLFL*RLDMLPRLVSNS 
WPQAFLQLLLPQVLGLQV 


7997 


21898 


A 


8057 


396 


263 


FLQPGTVAHACNPSTFRGGLIT*AQEFK 
TSLGNKVRPCLYKKQY 


7998 


21899 


A 


8058 


402 


211 


FS CWWCTKPKAAG * FPA? WGPPFLNNPP 
GAGTNFPGPPRSGGFGSPPPPWPGPRRP 
CPNGIFFC 



1040 



WO 01/64835 



PCT/US01/04927 



r 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenyla)anine, 
G=Glycine, H~Histidine, l=lsoleucine, 
K^Lysine, LHLeucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X-Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


7999 


21900 


A 


8059 


I 


279 


VLL YHPGWSAWQS WLMQ* TPSNADI WR 
IARGFKRSSHFSLLSSWDHRHVTPCLAN 
FFLIL* IWGLAMLSRLLPLRLRKPVLLD 
*APSPCGLI 


8000 


21901 


A 


8060 


408 


315 


SLTMLPRIVFNS * AQAILLPWPPKVLGL 
QV 


8001 


21902 


A 


8061 


198 


2 


RIQHVESNCKLEVEGGWAQWLTPSTLGG 
RGRMIS * AQEFETRLGNTVRPCFYKNLK 
SQPGMWRI 


8002 


21903 


A 


8062 


3 


294 


GLSPTAHPITAFTSQPLPSGLSVCRLFL 
WAMCEQCPHLPIRPCCPRI IGPECALYT 
T S CLLHQLKKREG F * KKKKKKKKKKKKK 
KKKI F * WGGGKK 


8003 


21904 


A 


8063 


197 


184 


IKDFLFFF*GRVSLCCPGWSAAAQSRLT 
AL* PGELKETFYL* IRRSWDHKHAPPLP 
APF*LFWVTLIITPSTHEVSSAVCSLLY 
VKSEQKCQ 


8004 


21905 


A 


8064 


3 


373 


TSGSIiAKRPADCLSAQFTTPSSLCGCSP 
VRVCI TVPRCTHARAHTHTHTHTHTHTH 
SNHG * APGALY I QELQRGKVSHVPEKEK 
SDPETA*TKGG*HVTNPHGRQRSGEQEE 
HRICVGRCCSG 


8005 


21906 


A 


8065 


532 


106 


ENKP IHNLS INKYIGKI LRDKALLYCPG 
WRAVMQS*LPVASNSWAPAILLCPGFLS 
TWDYKRVPPQLVKI FSKKKKNKPTNI LR 
PKCHTRNQDIYKRGKVGFLCSYCPPALL 
LLLSLLLLLLLLFSFSTRDSASGAGCSS 
LP 


8006 


21907 


A 


8066 


411 


130 


PKAQL I FLRR * S LTVFPR WSNS WPQAI 
LPPWPPKVLGPQARAMVPSPKLVLESIL 
RI PTSVI FFPLNLCSNIKLAEI KEGCSA 
AFLFNQRFAN 


8007 


21908 


A 


8067 


3 


32 


DAWADAWGSLTDEWVEK I WYIHAMQ Y YS 
SFEKREIL*YVRTCMNLEDTMVSEIRQS 
QRDDYCMI PHI * GRTRGVH 


8008 


21909 • 


A 


8068 


410 


116 


STFNIQTWPGTVAHTYNPNTLEGQAGRI 
T*GQELETSLGNI TRPNFLLFCQACPGQ 
GLSTVRMCGSLHYFVLFLRPLLKCLDNR 
KSPFYHPVILPCYE 


8009 


21910 


A 


8069 


158 


289 


DLCS S LRN I VRPQL YKK* KI SQVRKKKK 
KKKKKKKKKKKKKKKKKKKK 


8010 


21911 


A 


8070 


388 


1 


APTPPCFLFKPFVLGGPPFGKAFFQKFP 
GGGFFPSISGPKPPPPRNPSLISKGKRV 
PSPQSPIMVFPGAF*RPPPFPLMVG*KI 
KIFLPPPPPFKTPPALiSPFFFFEAEFRS 
CWPGWNAVMRSRLSATS 


8011 


21912 


A 


8071 


15 


336 


KLDKRYDRTGSPGTTHASGYLIEPLVAE 
ASYELILSIAFFFFERRGLALLPRLEGS 
GVI VAHCTLKLLGSGGPPASAS * VAGTA 
GMNHHEGHEPGL FEKQNSGFI F 


. 8012 


21913 


A 


8072 


396 


274 


PHLHTWESSL*RQGLAMLSRLVSNSWPQ 
VILSPWPPKVWL 


8013 


21914 


A 


8073 


189 


54 


NKPLGPGMWQACNPTPLGGQGQWIP*A 
PEFKISLGNLAKPCFY 


8014 


21915 


A 


8074 


3 


257 


HAFVLLFETRSCLSLRLESSGAMIAYCR 
LELLGSGDPLTSASQVAGTKGM*DDAWL 
RQVSNSWPQVILRLSKPHMFLFQFSLET 


8015 


21916 


A 


8075 


1 


119 


T LP AT WEAGAGGLLE P KS LR V * CVFTAT 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ IB 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQJD 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ingto first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AJanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaniic Acid, F=Phenylalanine> 
G=Glycine, H=Histidine, I=!soleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V- Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon,/=possible 
nucleotide deletion, \~possible 
nucleotide insertion 














VNSSRTPAWAA 


8016 


21917 


A 


8076 


232 


266 


LPYCSGQWCLGLPIIQGVQWHDHGSLQP 
RLPGLNQSSCLSLPKCWDYGREP*YLA* 
YQARYHGSLARQNVFNIRPGIMVHARSP 
STLGG 


8017 


21918 


A 


8077 


390 


143 


LGGFLVFCFPAKKGFFPNPINLGTPRFF 
PPPPF*KPAPEINFGGPKKKXIPSPPPG 
VKFDPLKRPPPFFFFFFFFPFIRLLV 


8018 


21919 


A 


8078 


424 


144 


LKNYI FFTFL *NRDRVSLCCPRLVLNFW 
AQE I LLP WP P KVLGLQA 


8019 


21920 


A 


8079 


182 


45 


VH KAGMV7U4A YN PS TLRGRGG W I T * GQ E 
FETSLASLDNMVKPRLC 


8020 


21921 


A 


8080 


405 


163 


YFLEEMGFHHADQAGLELLT I GDP PGGM 
SHCTWLNLNVNLIQKHPHRNIQNNV*PH 
ISGPVKLTHEGNHHNWEMSTQKHSP 


8021 


21922 


A 


80SI 


39 


240 


QSKTVSKKKRKI IFCRYRVS* SPGFKQS 
SHLGLPKGVWYRREPPHPASTFSQSFPV 
KDHFCSGVSS 


8022 


21923 


A 


8082 


2 


263 


DCCVSSE*GSVGVGLSEPGAGYNLLVFR 
LLRPLEKRS I RVGGSRFSRY YLSWLPLA 
RKGNSPTPCTSQVRQCPTLLLGLHPLCD 
KPQ 


8023 


21924 


A 


8083 


1 


248 


S VG VGPS EFGA* YNLLVCCLLRPLEQHS 
IRGGVSQFSKYRQSWLPLARKGNSPTP* 
ASCGMRGPALLWLTLLGLHPLSNKPQ 


8024 


21925 


A 


8084 


3 


255 


HLSLPSSWDYKRMPPHLANFFFFFLGKK 
GVLP *GQTGLEL *N *KNPPA* PPKGAGV 
KGVTHRP KP * NGGKGRFE KDTPGGPFL 


8025 


21926 


A 


8085 


409 


145 


LRGPYKKNFSFQAPGRKIGSFKKAPPFF 
FFFFF*GQSLCCPVGLSTGVI IAHCSLE 
LLTSSDPASSASRVAGTTGACHCS*VLQ 
TFTF 


8026 


21927 


A 


8086 


199 


26 


GFPPPPNKTEIF*RPPDFLNFFFLVEMN 
LTSLSRLVSKPWAQAVLLPWPPKALGPQ 
A 


8027 


21928 


A 


8087 


222 


3 


S VG I TGVSHHVLW Y FTL KGFLGRGAHS C 
NNSALGGRGGWIT*AQEFETSLGSVARH 
HLYKKYKNYPVAWRL 


8028 


21929 


A 


8088 


415 


197 


TILCFYHQNNSVFSSNPFLSL* IFIF*D 
RVSFCHPGWSAWQSWLTVFPILCATVK 
YFTCQQY I IKI FALNS 


8029 


21930 


A 


8089 


297 


71 


FIFPENKNRPEWAHACNTLRGRGGRIT 
*GQEFEKSLSNIARPHLYKTKTQNILSK 
YIQYLGFQLTLQIDLANIY 


8030 


21931 


A 


8090 


185 


415 


PFGLKQSPHLTLLSS*DYKHTPPCRVNF 
FFFLGGFGFFPKAGFNLGN*RNFLASPS 
KRAGMAGINNLAGP I FRFFKK 


8031 


21932 


A 


8091 


2 


125 


RKQAS KKERKE * KK*RKKERKKERKKER 
KKERKEKERKEEP 


8032 


21933 


A 


8092 


8 


412 


KGPQGTTLTCIFDLSYSSSLSHWSQSLC 
VAFPLPECIFPHKPLCPACHFFFFQSWG 
LALLPKRHYRGKIGVNGNLELLG*NNPP 
SGTSKIAEPAELFHHTHARPFNWIIKTD 
FPYIAQTGF*LRPSRNPSTLEV 


8033 


21934 


A 


8093 


249 


143 


NI FAATWMRLET 1 1 LSEVTQG * KTKKQM 
FSLICGS 


8034 


21935 


A 


8094 


3 


280 


PERWDYRHEP PGPAS FT* LLLFVFNL * R 
KGL YVGQRGRQLLAS S SL PAPAS LS AG I 
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NO: of 
nucleotide 
sequence 


SEQ ID 
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09/515,1 
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beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, FHPhenylalanine, 
G=Glycine, H=Histidine, l=Jsoleucine, 
K=Lysine, L=Leucine, MNVIethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptopban, Y=Tyrosine, 
X=Unkno\vn, *=Stop codon, /-possible 
nucleotide deletion, \=possible 
nucleotide insertion 














IRMSHCTPAWVTVRPCY* INK* INGKDT 
IRQNLKKM 


8035 


21936 


A 


8095 


404 


16 


QMGFCLFFLPSSKEVFFPPIFFFYRESF 
FLFHFFLDPRQY * L YF 1 1 KKN I FLPS P F 
IFFLFIRVSPYI FFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
" FFFFFFFFFFFFFFFFFFFFFFWGSMEL 
LSKAHQCISSHGSG 


8036 


21937 


A 


8096 


358 


200 


KSYFIFSLI I FSHYFFLLFLKHLLATY Y 
CFLIDCTLFFSFYCPFCCLFVLLSGRFP 
RSCL* TFH*RLYFSFPI FNVQKLF* LSD 
ICFCCIYLFIYLFIYFETVFCSCCLGWS 
MMA* FNGMFKDR I 


8037 


21938 


A 


8097 


2 


132 


KEEKKERKE* KKRKKERKKEKREKKRKR 
EGGRKEGKEGKNSST 


8038 


21939 


A 


8098 


48 


331 


PGTWHACLGGEDILKWQTKGRGRRAGSH 
RTSWIEPLETVFLELEAGGRLESPGKNY 
SRLGAVAHICIASTLGGRGRQIT*GQEF 
DTSLADFCST 


8039 


21940 


A 


8099 


7 


373 


NGRLRRGFAMLARLVLNF* PRDPPT*AS 
QSAGITGMSHHARPTIESFIHYCWECKM 
E*PLCKLILYFHLFIFKRQKLGVHWCHH 
SLLQPPTPGLQRSFRLTPPSGWDYRRGP 
PCPANI*LII 


8040 


21941 


A 


8100 


116 


432 


YLRNLFYQRQAFSPAILNLMCSAL*ESH 
TVAQAGVQWHNLGSLQPLPPRFKRFSCL 
SLPS SNI FAPLS YFFRNS S FTNI SPSEI 
GPQITDPLFIKKKIKVSVMVE 


8041 


21942 


A 


8101 


361 


138 


FNLLGRKKKNFGGLKEGESRRPPNPGPQ 
FFFFFLFFFFFSG* RWGDTMLPRLVLNS 
WAQVILLPWPPKSLGLEV 


8042 


21943 


A 


8102 


1 


199 


PTRP FVFLMEME FYHIGQTALELLTSGN 
LPASASQSAGITGVSDR*LA*YLFFFHE 
LFCYFALLLI 


8043 


21944 


A 


8103 


412 


46 


KGGSPPPPPPPKIGRKGPPNPGDPFFFF 
SPFFFRGF*GSPFPPK*KVLLVGFLKPK 
PPFPLGWGNPPPPPVGGFSPPQSLQFFF 
FKKVFGWVFFFPLFFFFFFFSPFFLFFS 
ETESHHIAQA 


8044 


21945 


A 


8104 


192 


2 


EETELSFFIIKYYLPRNQKVCKRL+RRL 
GPVAHACTPSPLGGQREWITRSGVQDQP 
GQDGETP 


8045 


21946 


A 


8105 


373 


183 


GGPF* FQYILILFDNFFFFSLSFPFSQD 
FFFLSS FFFFFFFFFFI INYFYFF 


8046 


21947 


A 


8106 


373 


281 


SFKTAPPFFFFFFW*RLSLTMLPRMVSN 
SWAQVVLLPQPPKVLRL*ATQRKKHFRQ 
EQI 


8047 


21948 


A 


8107 


3 


114 


FLIFVHIGPNYVAQAGLELLSSDDPPAL 
ASQCAG I TGiVENHHAR P YYLL * TLN I PLP 
YDPEIFLLGIYPR*MKTYVHTKACT*FF 
ITPLFVI AKTWKQPK* PSVLGLQA ' 


8048 


21949 


A 


8108 


416 


243 


PGVFPPPPF*NPPPEIIFWAPKKKKYFP 
PPPPLNFVFFKGPPPFFFFFFFFFFFFF 
L 


8049 


21950 


A 


8109 


2 


211 


SVTQAGVQWHVSLLQPRLPGLEQSSCLG 
L* *CWDCGHEPWHLALVTTSKHTESCSL 
LCI PTATFLGLVT 


8050 


21951 


A 


8110 


172 


28 


HWCF I TS FFGFYFL * RQGLTKS PRLVP 



1043 



WO 01/64835 



PCT/US01/04927 



SJEQ ID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine, 
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NSWV*VILLPWPPKVLGLQA 


8051 


21952 


A 


8111 


2 


225 


QKSAKNFYHLSIYLSIYLSIYPSIYHLS 
I YLS 1 1 YIiSSTYE * VI ILEQKWNSIKKG 
CL*CTIQTNCL*QLKE*VIRLIPKL*YK 
I KLPTCWS VHLNWFTLCNI SWYLKYLAM 
* *RKLVCFA* FWYNSNKLLITIKRIGH 


8052 


21953 


A 


8112 


392 


221 


PFPKRES PPDKVNPPPPDI FGGGKTKKK 
TLANN* FPPKEKEFFVFFFKKEDFF*NI 
VGNFKTPS KI * AVCF *NPVFKKKKGAPK 
KKSSFKNF 


8053 


21954 


A 


8133 


369 


112 


I FFSTI YYFXHPFFFPPPLFFTPPPLFF 
FFLKKKKFFFFFFFXXXFFFFFXXXXFF 
FFFFFFFFFFFFFFFFFFFFFLAF*HVY 
SF 


8054 


21955 


A 


8114 


418 


248 


KYYLPTPGY*NLILLNGPPFFFFFFF*D 
RVSPCCPRLP*TPRLTQSSCLSI J PRSWD 
HRCMPLRLVTF * FTFKLSKQSLRTLLTY 
TLAGTTLG 


8055 


21956 


A 


8115 


229 


78 


SFKGAAPFFFFF* * RQNLTMS PKLVSNS 
RPQAVLP P WPP KVLGLHRGG W I T 


8056 


21957 


A 


8116 


356 


66 


PLFPFFYIKNFKKKFFFFFQYFFFFPPF 
PKHYSHKDIFFFFIIKFFPSSSFSFPPF 
FALSSFFFFFFFFFFFFFFFFHA*FPVF 
FLYKI PMLYPQL 


8057 


21958 


A 


8117 


212 


81 


ERLSHCSLNLPDS SNP PAS AS * VAGTSG 
THQLVMWHAGS PRF 


8058 


21959 


A 


8118 


367 


2 


GVFFFFFSPAKKGVFFHPFFFLGPRFFP 
PPPFFFPPLWFYFWGP*KKY* FPHPRGY 
EFFFF*GGPPHFFFFFFFFFFFFFFFFF 
FFFYAEGIGVSPCYIMLG YNFSS FPCGT 
ISPRVRPRV 


8059 


21960 


A 


8119 


38 


215 


FFKSAFHHTRLIFNFFFFLEKTGSHYVA 
QAGVELLG * SSPTTS AFE IGG I TGLFLS 
FKN 


8060 


21961 


A 


8120 


266 


2 


PQIFLFIFETRSHSITQVGVQWCDHSSL 
QPQNPGLKQSSLSS*DYRHTLTAPG*FC 
FL*RCGLAMFPMLVLSSWPQAVLKIHPC 
CGMC 


8061 


21962 


A 


8121 


3 


332 


DAWADAWGHVPP C PAN F VFLV * TGFLH I 
GQAGLELPTLSDPPASA3QSVFI YFLET 
E SHS PAWETERDS VS QKKKKRGGRFKGS 
NFTSAGLQRFIFFMGPPKLISRAGV 


8062 


21963 


A 


8122 


378 


66 


LFIPLPIREEVFPCIPG*NWAPPGVLYN 
GRPFFFFLLETVFLHAGQAGLELLGSSD 
LPVSASQSVGITGMSHRVQPKLRIFVNI 
SFPFEIFCEIMLLVPLAQP 


8063 


21964 


A 


8123 


2 


316 


I SMLLALLLRI ITFRLPQLNG YVEEFTP 
YECGFNPISPGRAPLFIEHFLVDITFLL 
FDVQIALLLPLP * ALQTGNLPLI VMPSL 
LLIIILALRLDYERLQNGSD 


8064 


21965 


A 


8124 


331 


14 


ITSILLI I ITF*LPQLNGYI *KSTPYEC 
GFDPISPARVPFSIKFFLVAITFLLFDL 
EIALLLPLP*ALQTTNLPLIGMSSLLLI 
I ILALSLAYE*LQKGLD*AE 


8065 


21966 


A 


8125 


2 


228 


IIITF*LPQLNGYIEKSTPYECGFDPIS 
PARVPFS IKFFLVAITFLLFDLEIALLL 
PLP * ALQTTNLPL I VMS S L 


8066 | 21967 


A 


8126 


451 


200 


KA*WNRFAILNVTKYTCSSQEEVKIPTF 
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AGVWKKLI PAHTDDFVHFKTSVQEVATD 
VAK I ASLLELEVESEDVTE * LQS QDKT 


8067 


21968 


A 


8127 


2 


396 


LIVIINTLLALLLI I ITF*LPQLSGYIE 
KSTP YECGFDP I S PAR VPFS I KFFL VAI 
TFLLSDLEIALLLPLP*ALQTTNLPLIG 
MSSLLLI 1 1 IjALSLiAYE*LQKGLD*TES 
GWEKLVKKE KADKLKGGRP 


8068 


21969 


A 


8128 


1 


350 


GTRGLQTLLPLIAF* *LLASLANIiALPP 
TINLLGELSVLVTTFS*SNITLLLTGLN 
ILVTALYSLYIFTTTQ*GSLTHHINNIK 
PSFTRENTLMFIHLSPILLLSLNPDIIT 
GFSS 


8069 


21970 


A 


8129 


375 


1 


LTCFS EHRP PPRANFFFSRKTRAP FFLP 
GGFQI PGPNFFAPP I FPNGGP I PGLDPQ 
PGPLNFF*KSFFPDPXFFFLFFPPPI *N 
SGVFLPRNPVFPKTPNFLFFFSPFFPGD 
GVLLCHPGRLPRA 


8070 


21971 


A 


8130 


32 


211 


GIRHGFAMLPRLVLNSWPQAILLPWPPK 
LLG * QAGVELLAS SNP PALAS QTAGMTG 
VSTRP P VLLPTLGTHLCLDACLGS S L 


8071 


21972 


A 


8131 


396 


5 


KYRDGPPPPGKFFFFFF*DGFSLCCPGW 
SQTSCPK*SSCLPEG*DYRHEPPHMTLS 
SSY*CSQMQKCINKNCLGSCWAHACNL 
SPLGGRRlTHGKKFETSliDNIVKNRLKK 
KKKKIYQGGGDEPCIFTN 


8072 


21973 


A 


8132 


347 


34 


KAGFVIPRGPQEFFFFFFPQDGILLCFP 
GWRAVAKS*VTATSNSWAQAITGMLPPC 
PANF*KIS*RQSISMFPRLVPNSWPQVI 
LPPQPPK*LGLHVCLVPSLA 


8073 


21974 


A 


8133 


102 


270 


DYRRTPPHLANFFYFL*RQILTVLPMLV 
SNSWVQAI LLLQDFFSVQLKMGFLS VP 


8074 


21975 


A 


8134 


338 


47 


FLSTQKESSPLHTHTHTHTHTHSHLLTH 
HHGRGQERGKRGR *GG * WRTGTHS PHGP 
HCVRLRKSRPSGELGAAQGGCSLIQPKI 
EDPGQSGGPREVP 


8075 


21976 


A 


8135 


68 


467 


SLLNRI PFCLCLNSFDCVFLLGLLFEAG 
F CS VAQAGWWYEHG SLQP * PRGLK* S F 
HLSLPKFWDYRNEPPHPPFSCFKALNFK 
DYCDSGTVPCPFMKELSMLFIHALTESF 
VLFIVMSKLITTKHLTI ISNV 


8076 


21977 


A 


8136 


500 


355 


FHHVGQAGIQLLTSGDPPA*ASQSAKIT 
GVSHRARPQTSFLNKLPSLR 


8077 


21978 


A 


8137 


58 


222 


I YG YEGRGS VAGSLS SL * SATTDS DLDY 
DYLQNWGPRFKKLADLYGSKDTFDDDS 


8078 


21979 


A 


8138 


345 


196 


KGDQAQWLRPGMVAHI CNLSTLEGQGGR 
IT*AQKFETSLSNTLRP?ISFK 


8079 


21980 


A 


8139 


500 


355 


FHHVGQAGIQLLTSGDPPA*ASQSAKIT 
GVSHRARPQTSFLNKLPSLR 


8080 


21981 


.A 


8140 


3 


231 


HASADAWGRNEWPCAVAHAHNPTTSGG* 
GGWIA*AQEFEFETSLCKIVKPHLSKKK 
KKKKNFSPGWGSPPCTPLF 


8081 


21982 


A ! 


8141 


416 


26 


ARIFPNNSFWPGQGGFPFPPRPLEGPGG 
VIP*CQGVQARPAPPGEPRLFPKPQXIV 
GGGGEPPLFPPPERAGPQKCRPFGEGGF 
N * PKPRPCPPGWGAQGGFVSKKKKKRNE 
DS KI PL I GKY P VG I RNMA 


8082 


21983 


A 


8142 


413 


35 


RGAGGGGSPPGPRYPSQQHLLRARTPTP 
FARSRGSFPGVRGSGWRGGLGG* SRERP 
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LPSLPPRALPRSGASSGPLDGGAQGPGR 
RERWAFLGGIRADARGCRAVTSPEPYRL 
PGAGAARARR 


8083 


21984 


A 


8143 


166 


368 


ENI LWAAMWLDCHPSTLGG* GGRIT* G 
QEFKTNLANIERPHLFFFFFFKRDSLFA 
PRAEGGGPILF 


8084 


21985 


A 


8144 


1700 


621 


DLRDHS VLQQGPACLTR CGEPI hSUSEG 
SKIGSERESGEHSRHPEAPPPSVEVGSR 
RS*GQDS*GFGPGNGQDTRGPLSHAAPN 
WTPDPAAAVAAGGTSLPPRGGGGI * RGR 
SHTARSGARVGSANAHARGGSGFSRHSG 
SHGCLRNQSPRLGKAGALCGSAWVSGL* 
GSGhTS PRGRRERLRMPG VAKGRPR *GN 
PRRGRGKQHG I PASGLPGRCWLGIKMST 
LQPRTRAPEELFVPQESSGSTPEKMLVS 
FHGSSLRNEATPRYSQQEEAGNGRWQQS 
LSLBRWPPWTSHPLGTPPLMPVAVARCC 
ILPGLWPLLSPPSGSASPVSQGPGCLSL 
WPNAFKKDDFPVRQGDTSWRSVS 


$085 


21986 


A 


8145 


390 


38 


NTTTFNIYFFYFFFFSLFIYFFFFFFFF 
FFFFFFFFF I F ILFI YSLLVFGVWVLGL 
GLALALLAKLFLVNSIjCRRYRG* SLLYY 
AWL*FFIFPLRYYIYCARFQFLSPILYL 
GKWFG 


8086 


21987 


A 


8146 


20 


187 


KLTTSTGRNPGRPTRPNF*VF*RDGDLI 
MLPRIiVSNSWPLVNLLPWPPKILGLQM 


80S7 


21988 


A 


8147 


417 


64 


VRPRVRPRVRPRIRKKKKNKKKKKKKKK 
KKKKKWGGGF * D FWGGEKGP P PLQKNDF 
LLF WGFFLKS FFFFCGGLFFWGGPFFFP 
PPPNFLGVGGKKKTFRLLCGPFSAFFGG 
EKN 


8088 


21989 


A 


8148 


351 


187 


RLGTF YLSNI LRFYLS ITHGP I PYIGRL 
TSQNLPLLLLDLT* LEKLLPKKSQT YN 


8089 


21990 


A 


8149 


465 


60 


PPHFVJGNPKKKNSQFFWAKKKKKIFLGP 
* KMVAKKG WFVJGRKKGFQRGGKKG * KG 
GQLFYPVLSQVLGGGQNHGPPGKAGFGE 
NFFFFGQGLGWAI SRWLRTPLFLAGGGE 
DPKKNFKPPPKI KTLGAGVKKK 


8090 


21991 


A 


8150 


1 


175 


WDYRHVP P YLANVFFVIFRGDKVSLCCP 
V*GPTTELKRSSCLGFP*GWDYRREPPH 
LA 


8091 


21992 


A 


8151 


59 


35 


RWNSRPRRPGLKFLTSGDLPASASGSAV 
ITDVSHHAWPETFFR*RA 


8092 


21993 


A 


8152 


269 


159 


ARVEVSKKYIYIHTHTHTHTHTHIHTHT 
HTYTTKKIjPS*LFQVCKMEKFGRIjWQ 


8093 


21994 


A 


8153 


1 


256 


PGPPPRFFFFFFFFFFFFFFFFF*TLKI 
RFIWRKQKTHNPKDG ILHQD 1SQVISLQ 
ITNHFHRRYFCTNFTCIQERDINQSLFL 
F 


S094 


21995 


A 


8154 


252 


41 


LIIPVLICIVFIVLQFKIFSVISLNKGL 
FRNKFFI F* *RWGLTMLPRLVSNSKAQV 
ILPPWPPKVLGLQV 


8095 


21996 


A 


8155 


21 


443 


HELSLI * F I STLAETNRTPPDLVDGESE 
LVSGFS IEYAAGPFALFFIGEYTNIII I 
NTLTTTI FLGTTYDALSPELYTTYFVTK 
TLLLTCLVL* IRTAYPRFRYDQLIHLL* 
KNFLPLTLALLI * YVS IPITISS I PPQT 


8096 


21997 


A 


8156 


2 


205 


LLHLMKHFNLEIQKAQ*rRINLKRSISR 
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HVI INLSKAKHKG* ILKAVREKMLIMYK 
KPLVKLQVISN 


8097 


21998 


A 


8157 


2 


205 


LLHLMKHFNLE I QKAQ*TR INLKRS I SR 
HVI INLSKAKHKG* ILKAVREKMLIMYK 
KPLVKLQVISN 


8098 


21999 


A 


8158 


2 


222 


VYTWGTGKARVFPGEF*LPVWIPTRHLK 
FYNEPIRDANESASAETENPQSSIIHPQ 
DEQNGD I RRTDKVTTHQ 


8099 


22000 


A 


8159 


1 


297 


GFHHVGQSGLELLTSGDPLASTS *SAGI 
TGMSHHAWPLNCSYGDSFSWLKCPELTV 
ACWWEEGWRKLLMENEQGREAHQKLBSV 
INFYIKDSMTKRKYK 


8100 


22001 


A 


8160 


138 


437 


NEYDHFSIIKERRDFIVCVFFFFFSKKS 
LTVGTQGGPSYLEPPFLGNKEFSCLTFL 
GS WDNGGATTRLG * FFVFFRKKGVS P CG 
PGWS * TPDLRKTP PSA 


8101 


22002 


A 


8161 


330 


148 


QQSKTPFFFFFFKKKKKYIYICMWHAY 
NPSYSGG*GGRIT*AQKFKTSLNNTVRP 
HLYKK 


8102 


22003 


A 


8162 


391 


255 


RPAPADF* FFFFYRLGLAVLPRLVWNSW 
PQATLLPQPLRVLGLQI 


8103 


22004 


A 


8163 


131 


3 


HKVSLCYPG+SALMQS*LTVASNSWAQA 
ILQP*PPE*LGLQA 


8104 


22005 


A 


8164 


97 


440 


NKGSLCPPGGRPGALSHFTARSRGSPGP 
TPQKTGNNGPGPPTPVNLEFFEKRGFTP 
VGQGGLKLRT * RS P P PSAPQKAG ITGGS 
RRPGPIQKKFLKTLLGETKWMAKLIKTY 
LK 


8105 


22006 


A 


8165 


541 


295 


FSQQKKKIFISTNLSHQESKIYICKEME 
SHSVIRLLEGSGAIMNHCSLKLPSSSDP 
PASVS * SAGI IGVSQHLANYTFKKIF 


8106 


22007 


A 


8166 


440 


259 


PPQKFFFFFFFVKTGFRHW*AGFRLLD 
S SDP PPLAFQSVE I TGMS P S ARPMFGVF 
HSLR 


8107 


220Q8 


A 


8167 


23 


405 


FR I CHLLR VHLLV Y F I P S FTDNHL LSTS 
AMFLYSNDWIAQNVSGPFLGI IFRTFF 
FFFLKGEFFFAPQVGGQ3GNLG*LNPPP 
PGLKGFSGLTPPGGGNSGGGPPNKPNFW 
I FKKGGGSTLWPR I I 


8108 


22009 


A 


8168 


23 


388 


PYFSCALPRHHPPTTNLGPKLGMGAPWG 
QPSVAHHTLLFFFFFLKRDFCFVPRVEK 
LWLEESSLNPPLLGLKEFSCLTLWRTGN 
NGGPPPPPVIFCF*KKGGVYPWGPGGGE 
TPALKETPGL 


8109 


22010 


A 


8169 


612 


498 


NIHFSLSFSHTHRHMHTHTKTHAHPTYI 
RTP*DNVPQ 


8110 


22011 

r 


A 


8170 


2 


288 


VNLTFFPQHFLGLSGMPRRYSDYPDAYT 
T*NILSSVGSFISLTAVILIIFMI*EAF 
AS KRKVL I VEEPS I NLE * LY I KKKKKKK 
KKKKKKKKKKN 


8111 


22012 


A 


8171 


381 


101 


QKSLQQPSNIATILEEVQVIFLGSPYHL 
QRWEMRLTKWLRGLGAVAHACNPSTLGG 
MGGWIA*AQELETSGDPPPSSNRQLFTG 
EHNNDNKLI 


8112 


22013 


A 


8172 


136 


360 


TKKHFGLLK* *TKKHFGLLKK KKKKKKK 
KKKKKKKKKKKGGGAFKKKKFFPRGGGK 
I FFFFGAPKNKFGGGVLKTGGGKKPGVT 
KK 
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8113 


22014 


A 


8173 


358 


58 


EKVSVFFLKMKCSGIILLPCRLTLPGST 
DSPASVCRMPAIAGPRRHA*LVLVETGF 
RCVGRAGLQPLTASDPFTSAS PGAAIAD 
GVSFTQCSMVPRIRVQ 


8114 


22015 


A 


8174 


254 


370 


GLTVLPGLVSNSWAQVILLSWPPE*LRP 
QAYATGAGFLL 


8115 


22016 


A 


8175 


48 


269 


FYVI LCLRDRVLLCCLGWS * TLGLKLSS 
YLS LLS S WDLQHVP PHSAS CLT F* VKAC 
FFLLKYLKRLKI YAKIM 


8116 


22017 


A 


8176 


1 


338 


PTR P INTLLALLL IIITF* LPQLNGY I E 
KSTPYECGFDPISPARVPFSIKFFLVAI 
TFLLFDLEIALLLPLP*ALQTTNLPLIA 
MSSLLLI I ILALGLAYE*LQKGID*AEF 


8117 


22018 


A 


8177 


426 


130 


QS LD P PGNMVTLS LQN* KNPP P * GGAVF 
PPPGRLGGENPPGGSKEQKRRPAPQGG* 
PRQPPSPKKKNADSEFSIFNLQPALLCK 
QYCIVLHALPNKILF 


8118 


22019 


A 


8178 


396 


244 


KIKLL Y I CMC I Y I YVI SRHY I LR * HTHA 
HTY THTHTHTKTHTP KNT KYT L W 


8119 


22020 


A 


8179 


2 


119 


TRPGNWPGWAHFYDPSTLGA+GELIT* 
GQEFETSSSLY 


8120 


22021 


A 


8180 


2 


80 


FLPLTLALLI * HVS IPITISSIP PQT 


8121 


22022 


A 


8181 


438 


333 


SPLGRLRHYNCLNWGGKGCN*PKSCPCP 
PAWGTK 


8122 


22023 


A 


8182 


393 


19 


IFFFFFPPPSPAKFFFIFKKKRFPPLWG 
GVF*SPPPGFQKTPPPQKVGFSRVFPPP 
PPKKFFFGGGKYFFFKEPPPPPPPIL*K 
GGAKI S P? PPPGKKKNPFFLKGG KKKKT 
PFIFFFPPKKIKN 


8123 


22024 


A 


8183 


24 


269 


GSQQVLGFCFRDR VLLCHPGWSAVM*S * 
LTVASTS WA*R I LHCSLLGS WEYRHTPP 
CL*KRNPQLTVKWRVKDHNNKKSHR 


8124 


22025 


A 


8184 


154 


361 


WHNEDPLQPQTHGLKRSSHLGLPS I WHY 
RHVP PRLANPLNLS FRQGLAMLPTPS VC 
HLLVFLS *HIYV?TFGYLS*SVCSHLLNF 
L**WHNHDPLQPQTHGLKRSSHLGLPSI 
WHYRHVP PRLANF LNL S FRQGLAMLPT P 
SVCHLLVFLSETLGFK 


8125 


22026 


A 


8185 


228 


337 


RPGMVAHACI PG I LGGRGGWI T * GEKFE 
TSLANMVK 


8126 


22027 


A 


8186 


314 


127 


DIQPKNEGEGEEEQELWLGMTAHAYTPS 
TLGD*GGRIA*VQEFETCLGSLARPPSL 
QKNFKN 


8127 


22028 


A 


8187 


10 


420 


TQEAEVAVNWDHATALQ PGRQSETP S PE 
KKKKKRGPPRWGENFFPTPGGF* KDGGN 
KKTGFPPFKRDGPPPPKNWTPWVFQKRG 
APPE P PPLGGALGRE PLGGFFGKGGVET 
PPPPIFKKNNPPEKKPHRGFKFFAF 


8128 


22029 


A 


8188 


398 


1 


LKKKRPQTPFSPKIQF*KFFPPFPFKKS 
PFPKKLLKKKRAPGGPIFFFFFYPGLFK 
NPSPL*MGSPPPPLFNPKN*KKPPLLKK 
GPGF*WPFFQKKKPPFFFFFFWRQGFTV 
LPRHTPS CLVNFC I FCRDGI 


8129 


22030 


A 


8189 


30 


153 


AWYVRREDHVSLGG*GCGKPWLCHGTPA 
WVTEENPKGKIK 


8130 


22031 


A 


8190 


206 


45 


HPSQNSLSKN* ERVSVCCPGWSAVAGSQ 
LTAASTSQAQVILLPQP?E*LGLQA 


8131 


22032 


A 


8191 


3 


228 


TCVCLCVCVY I CDVCVGCVCVY2 DRY I Q 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N^Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 














I YR Y I HT Y I D I YTH I * I Y I HT Y I DTHMW 
VCVYIYIYIYRCMGVRET 


8132 


22033 


A 


8192 


41 


416 


ICRVKEIASRPLLETHSLSLVELS*PCL 
TAQ * RND FLAL KHLTAVLYNL I LHSSGF 
WSSLKSHFI FLAL * YYL* * FI * SN* RRD 
LAVLPRLVSNSWPQVILPPQPPKVLEIR 
YEPPSLALILFFN 


8133 


22034 


A 


8193 


2 


183 


KRhS I WQKKVEGEGFQGMDVGPVS VAHA 
CVSSSLEGQGGKIA*GQEFETILSNIVR 
PHLY 


8134 


22035 


A 


8194 


2 


465 


AGVQWHDLGSLQS P P PGFK* FSSLSLPS 
SWDYRSAPPCPADFCI FRRDWAGWGVGC 
G FHR VARAGLELLTS GD P PAS AS PGAG I 
AGMSHRTQRGQDNFASWRRGGPQMSPSP 
PSCLFLQNTFSLKPSWVAPYPHASALGG 
GE YGGPPS PQPHLE 


8135 


22036 


A 


8195 


397 


149 


L I CLTDLP I RFLSLLALVFTVFNTKVKV 
KDL I KLG WAHTCNLNTLGG * GGR I A * A 
HEFETSLGNI VRAVI PVCCFSVTLSR 


8136 


22037 


A 


8196 


355 


95 


KKIMPTLCLKI PS * YRCEEI IQLSLQSS 
CDHRHMPPHPTNPF*RQHLSMLRSLVLN 
SRPQAVLPHGPPKVLGLQAPATAPGILS 
LF 


8137 


22038 


A 


8197 


382 


112 


KTDFGKKGSPSSPPPFFFFKKSFAPSPR 
GEKKAPLKSQRGLCPPKPKDPP PPGSG* 
TQDKGPPPPGQP I FFFFLKKGVCLFSRG 
GSKTFI 


8138 


22039 


A 


8198 


32 


361 


ASRIGFILQEFLPVRVWTTNPSAGADGY 
N FLEKQKGTE KTS CS VAQAGAQ WCHRNL 
L* P * TSELKQSS CLSLLSSWD YRMYFVL 
LGKQVTLLI GKRALS VY PRD WSLC 


8139 


22040 


A 


8199 


1 


314 


I NTLLALLL.I I ITF * LPQLSGY I EKSTP 
Y3CGFDP I S PAR VP FS I KFFLAAI TFLL 
FDLKNALLLPLP * ALQTTNLPLIGMPSL 
LL 1 1 1 LALSLAYE * LQKGLY 


8140 


22041 


A 


8200 


3 


109 


FTSKHHFGFEAAA*YWHFVDW*LFLYV 
SIY**GS 


8141 


22042 


A 


8201 


3 


216 


DAWAAVGHHHTRL IF* I FVEIGSLCVSQ 
AGLEILDSSDPPSSASQSAGIRGMSHGA 
GQEHLFHRPIGKIK 


8142 


22043 


A 


8202 


1 


267 


R IHTGEKP YKCEECGTGFKGPS TLTTHK 
FFVYCREVAVLLKNCYSHLYPK* I I VNG 
SEADKREMCLLCIFFHPLiAEEQENVENK 
RKYNP 


8143 


22044 


A 


8203 


422 


92 


PLVFALPPKIGFAPTTPSSSPPPPPLFF 
FGPPVFPPPPFFNPPPFFFFSPPPKKKF 
PSPPPRGKIFPF*TPPP?FFFFFFFFFF 
FFFFFFFLFLYYLFILC 


8144 


22045 


A 


8204 


445 


260 


VCSPPLFIWVAPGFSPPPLFKNPPPNFF 
FGAPKKKNFFPPPPA*NFFFLKGPPLFF 
FFFFF 


8145 


22046 


A 


8205 


103 


248 


GPGWAHACNPSSLGG*GQWIA*AHSFK 
TS LSNAVRP I S AKKKKGGP F 


8146 


22047 


A 


8206 


407 


197 


QLIFKFL *EQASHYVAQAGLEL»L»GSGNP 
PASTSQSVKVTGMSHGAWLLAPTLRSTS 
LNYNSLSVLLLPRP 


8147 


22048 


A 


8207 


1 


142 


S CS VAQARV * FRDLGSLQP PPPRFT? FS 
CTPAWETVRLHLRKKKKKN 
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nucleotide 
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ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=Glycine, H-Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X= Unknown, *=Sfop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


8148 


22049 


A 


8208 


2 


370 


PEVFDYPN*FKNL*LKKTCTQ*L*AALF 
V I T KT WNQG C PLTGE W I NKLE Y I HTMD Y 
LAI KINELANHKMM*RTLKCI LLSERS * 
SRKATYYMI S I IRHPRKGKTIEIVQNSS 
CQGLGERQKGL 


8149 


22050 


A 


8209 


363 


108 


KKPRVE ACLS PGVGRERAKRER* KRERE 
RERERERERENEL*KLSLADLCIKNG*R 
EFFKEKRYVKRYAKILRVSGRKQKYGNV 


8150 


22051 


A 


8210 


386 


254 , 


LNPGGGVCSEPRLCPCTPAWVIR*DSVS 
EKGKKKRKGPENSLTATNF*ALSCNLDY 
LRCSMLKMQRQRKRKERTEVRFWQLHHN 
LTLGFV*LIVYQRE*RRTGWFVAKYMLY 
TFS * PGSFPRLRKCVEHVFGYEPACSPL 
FPLIHNQLHKAKC 


8151 


22052 


A 


8211 


3 


131 


AHLFIAWFTEYFKPTVITYCSKKKIPFK 
MLLL I DSAPS YPRALMEMYKGMNWFMH 
DNLTF I L WPMDQ * VI FTS * S YYLRN * FC 
TIAAIDSDSSDGSGQSNLKI F*KGFTTL 
DAI KNI CDS * EL 


8152 


22053 


A 


8212 


86 


286 


WSIWECTVISVQLFHKPKTSFKNTSYWS 
GVMAYTNNLSTLGSQDPQ IT* GQE FETS 
LANMVKSCLY 


8353 


22054 


A 


8213 


393 


344 


SFPFSLPSKWGFKNLVPSPGYFLFFLNK 
GFRYFGQGGFKILNSNYPPPLASQKGGI 
SGISPWPRALVFF*WRNLGPN*RTIVSP 
NYGLK 


8154 


22055 


A 


8214 


3 


313 


QGLVLWHSHGLLQP*FPGLRRSSCFSLP 
CSWEHRCTPPHLANFL1LCVCVCVCVET 
GSHFVAQAGLELLGSSDPPLS ISQKSWD 
YKV* AI I PRAKQNYLMLFY 


8155 


22056 


A 


8215 


165 


348 


RPCPLFPEDLLTFSREGPGTMAHACNLS 
TLGGQGGGIA* GQ2FETTLGNI VRPHLY 
FYLKK 


8156 


22057 


A 


8216 


379 


218 


VFLVETGFLKVGQAGLKLPTSGDPPASA 
SVSAGITGVSQKCRF*YNVFIFQKSSLL 
NALP I CS LTMLLGNLFSMFMTPKFFTCS 


8157 


22058 


A 


8217 


32 


276 


RYLP I IAALFTIAKI *KQPKSSLLDKWI 
KKLW C I YTME Y YS ALKV I LTWTTWMNL 
VGFIPGDINHIQKAKYCMISLICGI 


8158 


22059 


A 


8218 


171 


2 


KGVFGLGRLFFFLDRVWLCLPEWSTWQ 
SRLSATS AS * IQAI LPPQP PEELGPLCP 


8159 


22060 


A 


8219 


530 


73 


TKPTVGSNSRSYINQTQYFPSTLTFGGL 
MRLTI YHFFFLLVY YNPRVMVTPAYM* L 
LILFYSSSIRDLLXAKQFCLPWEGFSPH 
AC* TNATLLFP YS I * YLGLPS FHLIFFV 
CFSFYFL*R*GLTMPPMLVLSSWPQVIL 
PPQPPKVLGLQA 


8160 


22061 


A 


8220 


350 


135 


FFFFSQTGVAQNRVQ*RDHNSLQP*PPG 
LRQSSCLTLPKCWDYRCKPLCLASWIYF 
KDKVYTSTPVHTQEY 


8161 


22062 


A 


8221 


185 


331 


PGMVAHAYNPNTLGG*GGKLT*AQEFET 
SLGNINSVPKKRRKRRKEMM 


8162 


22063 


A 


8222 


73 


232 


YCS FKF I TLTCLS WLGTVAHTCNPSTVE 
GGGGR IT* G WE FTTSLGNIVRSCLY 


8163 


22064 


A 


8223 


418 


211 


AMPSLLTYLLTFI FCRKKKI FFYWGEFT 
MLS *LVLNS WTQVILPPWPAKVLVLPRL 
ALFYSLLVLFPSF 


8164 


22065 


A 


8224 


227 


33 


ESHSTTQAGVQWHDHGSLQPQPPGMSLY 
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SEQ ID 
NO: of 
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SEQ ID 
NO: of 
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nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Pbenylalanine, 
G=GIycine, H=Histidine, ]=lso leu cine, 
KHLysine, L=Leucine, MNVIethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T^Threonine, V=VaIine, 
W=Tryptophan, Y-Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














CH* CliRFWSWRPAEKPGWCSSLRFMELE 
SWSGASDAF 


8165 


22066 


A 


8225 


244 


287 


TDS PTS AFR S AG I TS I S HHTLP KS TF * I 
FCRAGGLAMLPRLVLNSWPQVILPSQPP 
KALGSQA 


8166 


22067 


A 


8226 


53 


235 


NKKWQGVVAHTCNPTTLGG*GEWIT*GQ 
EFETSPANMAKI PPLVMFAFCFEETDNX 
SHTK 


8167 


22068 


A 


8227 


275 


25 


GVLMFNFKFC I *QNSPFMLPGWLNSWI 
PAILPPQRPKVLGLQAWTILYKWYYGKE 
MWHKE I CEGTHDVLSKLLTTRRFKRHL 


8168 


22069 


A 


8228 


386 


211 


QLKLPPLKGNKLRPP* *KFFFFFFFQRQ 
GLTILPRLVSSSWHQMIFPPQPPEVLGL 
QA 


8169 


22070 


A 


8229 


I 


117 


VFILFYF* DGLAML PRLATN S WAQA ILL 
PSAPKVQGLQA 


8170 


22071 


A 


8230 


356 


26 


WHVHGSQQP * PPGLKQS FCFSLPSS WDE 
RCAPPHPINYFQICRDRVYKRWANMVI S 
PNPLVLISQSAGITSKIPCPGSEISTJF 
I FSHQLLLVRGVYPDMLVLQLGKTCI 


8171 


22072 


A 


8231 


3 


188 


FPSLRPA* IFFYI YFFFLKNKVFLCHPG 
WTQLLDPNWTQAAGLKQSSHLSLLSSLN 
FRGSA 


8172 


22073 


A 


8232 


2 


253 


KLDY INCGTNLAQKCLFCLFGS LFFQR * 
GLTLLLRLFMNSWPQAILLP* PLTVLGL 
QATMLGSQVPVFDLGITLCRCRFGKLE 


8173 


22074 


A 


8233 


373 


65 


PEKAGSCLELAFGLQMCHPPWASTLPAA 
LTD FGLATVPNHVSQFLKSHTHTHTKTH 
THTHTCTRTLLVLFL*EALTTTPLLRS I 
CKHRESACSLSLTWPLCC 


8174 


22075 


A 


8234 


390 


46 


RVFKPG*NPPLVKAPAPPLFFGGV*GRS 
PKKKNPQGPGGGGPPPGFHPFFLTPFFP 
IFLRREPPPFFFRGGLGPPPPPFKTPFF 
FFFWVKGGKFLPQKKKKKFFQS I STRS V 
IHK 


8175 


22076 


A 


8235 


387 


216 


QKLGVGLVPPGRVIFFFFFFFCC*GGGL 
TMLPRLVSNSWPLAILLPQPPKVLGLQV 


8176 


22077 


A 


8236 


1 


201 


RFFFTIPPQCFPQFILSF*RRGLAVLPR 
LVLNPWPQAILPPQ P PELLGLQAHPTTL 
GPVFYSYCVRA 


8177 


22078 


A 


8237 


399 


229 


SVGGGGLTMLPSWSQTPGLK* SSHLSLI 
KCWDYRPEPPHLAHFSIiLNLQLYCHTKL 
S 


8178 


22079 


A 


8238 


192 


335 


SHTLPHSSNFS YFW *RQGFAMLPRLVLN 
S WARAVCLPQS PKVLGLHA 


8179 


22080 


A 


8239 


442 


254 


GRDRVLLFCPG WS QTLGLKRS S CLGLPK 
HWDYRHEP PHPAEN I KFCVW PTLYFYWP 
ALVQINLP * QLQGTECVACMC 


8180 


22081 


A 


8240 


381 


180 


VLKLFFCSHRGPTMLPRLVSNSGLK*SS 
HLGLPNCWD*RCEPWHPASLCLLTSTFW 
PPTLICNIHK 


8181 


22082 


A 


8241 


388 


260 


NAIPFYITKCCNPSTLGG*GG* IT*GQE 
FENCLANMVKPCLY 


8182 


22083 


A 


8242 ! 


1 


145 


GFTI LVRLVLNS * PLD PPALASQGAG I T 
GVSHCAQLIYVFQRSLQLLW 


8183 


22084 


A 


8243 


1 


173 


GLTVSQAGVQWCDHSSL* SQTLGLKQSS 
SLSLPSSWDYKCKPQHQADRFFLMEFFQ 
M 
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peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D^Aspartic Acid, 
E=GJutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K-Lysine, l^=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T-Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


8184 


22085 


A 


8244 


202 


327 


TLFITFIFF*R*GLAVLPRLVLNSWPQV 
ILLLWPPRVLGLHV 


8185 


22086 


A 


8245 


42 


289 


KEKRS IILFI FFCRDGVLLWLPRLDLNS 
WPKTIFPPWPPK*LALQAHTTIPRGGGS 
IILICI RDVKRERKQYAKS KNPEGET 


8186 


22087 


A 


8246 


592 


485 


NIVSQAGVQWHNLGSLQPLPLGLKWFSC 
LSLLSS*DYRHVPPCPANFCISVETRFC 
HVGQAGLELLASNNPPAPASQSTGITGV 
SPCTRPRVYVFLLWILANGPS*RAETLV 
NLATNQEDADSSLAPGLGSGHCF 


8187 


22088 


A 


8247 


307 


12 


MCSPVFSPKQKNPLFLNFFKKEFLFFFP 
GGRQGGHFGFLAPPLPGLNKFLVLPFPG 
NGDTGKGPPARGNFFFFFFFGIFSRDFV 
SPC*PGWSLIPDPR 


8188 


22089 


A 


8248 


392 


237 


HYGQASLAPLTSGDLPASAPQSAEVTGV 
SHRAWPTVLI * S FS * KFNWV 


8189 


22090 


A 


8249 


153 


386 


FFPPGWPKPSLLEPPPPWLKGFPPPTLL 
GGGD * KRPPPPPGfTFCFFKKNGVSPLWE 
GWF*TSNLGNPLPSPPKRVGIT 


8190 


22091 


A 


8250 


387 


166 


I ETILDAI KNIHDS WEEVKI STLTRVWK 
N* I PTPMDNFEGFKTSLEEVTADVMGI A 
RELE VE P EHV I ELLHD IS 


8191 


22092 


A 


8251 


405 


266 


LFAIVKR*NHPKCLLIYARINKMWHISI 
GWNIISLI SKGMKFS YMFLTWMNLEVIM 
LSEISHSQKVT*YMVPIA*GHLEKLNCL 
PIKQQTPIPSSSSP 


8192 


22093 


A 


8252 


379 


201 


PGANYACNPSTLGGRSGWIA*AQELETS 
LSNMTKPQPCSTPQCFbVNQLLLQNLYP 
YFP 


8193 


22094 


A 


8253 


78 


311 


IjTLY FHTRKKKGLK* I FYFI * S I LSQKD 
LKVGTGPGAGAHTYNPRTWGGWKGLQAG 
KRIT*AQEFETSLDNTVKTCL 


8194 


22095 


A 


8254 


303 


81 


TPCAENFKLRKRFHHI ISLQRKMNYLI S 
FLYF* *SHSLTII»PRLVSNSWPQAILLP 
LPPKVPGLQVSATLPGL 


8195 


22096 


A 


8255 


2 


376. 


MGYNLVCHLLRPLEKRSSIRVGVT*FSG 
CPPSPLPLARKGYSLTPCASQVRQCLAL 
LWLTLGGLHPLSCPHCPTSPSVMNPVPQ 
LEMQKSPVFCVSHAGSCRLELFLFGHLG 
TQSSLDSFFKKVI 


8196 


22097 


A 


8256 


70 


356 


Lrl I AYNLTYDLSWKMFYVYLRKI YTLLK 
WVECS I YTCWV* FVYSMVSWNRI DSPGK 
KKTNIYDQLIFNKGAKSTKLRKNSLFNK 
WGQDKI STYKK 


8197 


22098 


A 


8257 


403 


85 


PFCPWFPPVFIPPPLRKSGTPPGPGPEF 
PGPGFSNGGNPRPLKKS PKFSPGWGRGP 
LFPPSRGV*AGNFLGPGRWRGPWAKFVP 
LPS CIiGHKAR FFFP KKKRRK I K 


8198 


22099 


A 


8258 


249 


2 


YCEGDLGSYRPGVSKCMEFSPRRDIRIM 
PWPGAVC YAYNLSTLGGRDGWI A* A* EF 
GTSLAYIVKPHLYTHTHTHTHTHTMY 


8199 


22100 


A 


8259 


2 


189 


IQCVCVTKSHFCHPGWSAVAQSWLTAAS 
TS*A*VDFSHIiSLLWSWD*RRVTPRLVN 
LILFPS 


8200 


22101 


A 


8260 


453 


128 


FFFFFFLI ETVSHDVP * AGLELLS S SHP 
LASAFQSAEITGVSHHAQPSSVSWLIHS 
S 


8201 


22102 


A 


8261 


420 


79 


KNPTPFFFFFFLIETVSHDVP*AGLELL 
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Amino acid sequence (A-A)anine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine, 
G=G1ycine, H=Histidine, I=Iso!eucine, 
K=Lysine T L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamme, R=Arginine, S=Serine, 
T=Tbreonine, V=Valine, 
W-Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop cod on, /=possib!e 
nucleotide deletion, \=possible 
nucleotide insertion 














SSSHPLASAFQSAEITGVSHHAQPSSVS 
WLIHSS 


8202 


22103 


A 


8262 


1 


262 


NSRTERLGSWQSHFCHKEMEAHFWQSG 
VQWCDLGSLRPLPPGFK*FSCPILLSGW 
DYSQCRPCRTCSLI FMHSRSPSQPSEKK 
WTC 


8203 


22104 


A 


8263 


321 


100 


SATKNYLLSRQSFIGK*NNGLGAVAHAY 
NS S TS RGLGG Q I A * A * E FETS *GNMMRP 
HCYRKYKNLARHSTYSPR 


8204 


22105 


A 


8264 


145 


275 


KPRDYS FFF * WKRDLAMLPRLASNSWTH 
AILPPWPPKVLRLQG 


8205 


22106 


A 


8265 


73 


351 


SLCYRSFARCDRPKKGQHFSKSYLQDAQ 
WARTLLSTIAQNNSAEHHLKTTDW*GAV 
AHACNPSTPQGRGGRIA*VQEIETSQGN 
TGRPHLYKK 


8206 


22107 


A 


8266 


382 


232 


LVETGFCHVGQAGLKLLTSGDPPASASQ 
SAGITGVSHCAWli*T*YS*DFQ 


8207 


22108 


A 


8267 


397 


130 


IGQAGLXLLTSGDLPTSASQTAGITGMS 
HRAWPTFSLIAR*NASFQVGHFSPKYQV 
GTLAKSLKCHQVRSDTLEFILCCRI I YE 
TLGKF 


8208 


22109 


A 


826S 


309 


97 


DTWYRTRRKDKCTITETRIQSVRPLSPR 
*AITSRVTCTYTSR* PEANEEPQKITFR 
HCDLFLPTLTDQLTL 


8209 


22110 


A 


8269 


2 


224 


DSSGIPGSPTRPPTRPKKQKLPCDPTVS 
LLGICI YDQRQ* SVFQRDICHSHMFITA 
LFT I VN I WKQT* HPLMDE 


8210 


22111 


A 


8270 


374 


2 


TSWESARKPPPIHSHTL.KGFFFFLFMFF 
FF*NKI CKDGSLTMLPRWIiNS WAEGI L 
LPRSPKWGIQELATQRRASILLC^TY 
ACGI FQHWVNVKLLMKQCPFCPidPrK 
TQNFFPRV 


8211 


22112 


A 


8271 


1 


379 


PTSASQVAGTTGTHHHIVLFFFFGIFEK 
KGVFPFSPGGF*TPKPKPPPGFGPKRGG 
KKGGGPPPPARGMGSGVFYPPPQKPFFV 
LIGGQILFKGRTLDFKKRAPPFPPFSLF 
LPKHTPSFWIKWDP 


8212 


22113 


A 


8272 


80 


401 


AWGMRFCLLLYS CQ YLISD IAHFS KAFF 
FWKREPPPAPQVKGQGPNLS * WNPWLQG 
♦RGFSGLTPKGGGNYGPPPPPPLIFFFL 
RKNGFFPGWPGWSKTPDLRKPPC 


8213 


22114 


A 


8273 


405 


95 


PPKRFWGSSKGFPQREKKLGSPTFPKKK 
KDFIYSYLSKSLKKE* INRP SAAAHTCN 
PSTLGGQGGRIT*GQRFKISLSNIGRPH 
LYEK I KNVKKLK I KKPNKL 


8214 


22115 


A 


8274 


167 


52 


DREQRPGAMAHTCNPSTLGG*GGWITRF 
LANKVKLRLY 


8215 


22116 


A 


8275 


276 


23 


KYVSSRPGTVAHSCKPSPLDGQGEWIA* 
VRKFKTI LGNMMKPHLLFYKKLTFKNVK 
C I FKK YCATESR YVTKE KQKDRKKGTKE 


8216 


22117 


A 


8276 


1 


114 


FTMLARMVWIS* PRDPPASASQSAGTTG 
VSHHARPGLS 


8217 


22118 


A 


8277 


2 


87 


TMLLRLFLNS * AQAVI IiPWPPAVLGLQG 


8218 


22119 


A 


8278 


357 


347 


SEGLALYQLEAFTVCDLRERGFCRGRSL 
WPLITWAWRGGVFLLVQFQEVCCRLALG 
SLSPDPVLLPESGGPTSASQSAGI IGVR 
DRAQPNCVFEIGSYS VTQAGVQ *HNHGS 
L*P*P 
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Predict- 
ed end 
nucle- 
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ding to 
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residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G^GIycine, H=Histidine, I=]soleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T—Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


8219 


22120 


A 


8279 


371 


125 


FFPKKNEPPG*PPPKK*RKPGKFILAPS 
PPFFGGGEFYKLKRGVLKQKPGPPGVPK 
ILKTPPRLGKNKPISPPPGQKKQNFF 


8220 


22121 


A 


8280 


377 


63 


PPVFPPPWGGQGGGFP * ARGLRTPWPTR 
ENPVFF*NPKN*PGGGGPF* IPPPWPG* 
ALKFPLPPRGGVPLTKNIPPPPRFSHKR 
RVFFPKNLKKKKNPCLIESL 


8221 


22122 


A 


8281 


2 


284 


LPK*VKNFVHAKICT*MFVAALFVITSS 
WKQPRYYSSIEKWINKLWHISAMEYYSS 
LKR*ELSSHEKT* KNFKCGSLSEKSQSE 
KDTWCLIPVI 


8222 


22123 


A 


8282 


318 


40 


APRLSLWFMWFIKRLDSSPGSRITWT 
NCFENHLCV CVCDRFLLCQAREQWRVLG 
SPQSPSPKFKRFSCLSLPSSWDSRCKTP 
* PAMFCGLC 


8223 


22124 


A 


8283 


30 


343 


CYQKASHATEKSFMR*RVNPCSKLCCCY 
KILPQPPQPSAVITLISQQNKKKKKKKK 
KKKKKKKKKKKKKDGGGALKKKKI FSPR 
GGRI FFFFFWAKKNNLGGGL 


8224 


22125 


A 


8284 


163 


392 


FWLYNFS VI YS I SDPCWYKEQLCKS VF 
SFLSS+SLLDEPNPNSPANSQAAQLYQE 
NKREYEKRVSAI VEQSWNDS 


8225 


22126 


A 


8285 


406 


63 


KKKLILAKPGWNLDSLKRAPPFFFFFF 
RHGVPLCCPGWSKTPEPK* SSCLSLWSS 
WD YRREPLCLAYS PVLNI THLSS PNVKL 
CRDSQESYDPI IMMMVI I ITEGGMICGV 
IM 


8226 


22127 


A 


8286 


1 


322 


TNTLIiALLLFI ITF*LPQLNGYVEKSTP 
YECGLDP ISPARVPFS I KFFLGAITFLL 
FDLE I ALLLPLP * ALQTTNLPL IGMSAL 
LLIIILALRLAYE*LLKGLD*AE 


8227 


22128 


A 


8287 


395 


181 


HGSLHPGTPGLQQFSRLSLPSSWGYRHV 
PPCLANF* IFCRGGVSLCCLGSVEGLEC 
QARLPVLVRAELGLD 


8228 


22129 


A 


8288 


3 


246 


SSTLSFPSG*DYRHVPPCPANFFFFWIL 
EKTPFCFVPQGGFNLWA*TSPLNPPPWA 
SKGAKITGGTPGMGLINLANGRPP 


8229 


22130 


A 


8289 


138 


421 


LLSSVFFFFERDFCFVPQAGGQWGNLG* 
WNPWFRGLKNFPGLTPPRSWNNGGGPPP 
PLIFCFFLKKKRVSPRGRGRANFLDLGT 
PPPGPPKERE 


8230 


22131 


A 


8290 


141 


375 


RTFFFFFKTEGYFVAQAKGQGGDHGSLQ 
PQSPGLKGSSLLRRWDYGSMPPCPSIFF 
F* IF* KNG AS LCG PNNS KTGG 


8231 


22132 


A 


8291 


407 


209 


IGGPPGFSPPPVFKNRPPNLIFGPPKKK 
KNFPPPGGKNWFF*KGPPPFFFFFFFFF 
FFFFFYYTL 


8232 


22133 


A 


8292 


86 


433 


PPXPNKLIEXGGAF*KRHFFPPPRGRVR 
FFFFGPQXKTPAPGQNPGGGGKPWGSPL 
FSPGEKNXPPTHPXGGPVLGAPPPPPGI 
VLI*KKTGLPLFVGRAGTNPNLGGPPAP 
PPKG 


8233 


22134 


A 


8293 


147 


342 


liVKFEYTVLPKNRYRLGWAHAYNSSSL 
GGQGGRIT*A*EFKTSLGNILRSHLYYK 
KRKEKKLN 


8234 


22135 


A 


8294 


6 


152 


CCGHAPPHPANFYSL*RRSLAMLPRLVS 
NSWTQAILLP*PLKVIiALQV 


8235 


22136 


A 


8295 


18 


205 


IPTRPPTRPIPGRLNHbTFTATRPGVYY 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, Wsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
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T=Threonine, V=Valine, 
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nucleotide deletion, \=possib!e 
nucleotide insertion 














GQCSEI *GANHSFMPIVLELIPLKIFEI 
GPVFTL 


8236 


22137 


A 


8296 


1 


199 


YL Y I YLS ILLS I YLS I YLS I YLASYLAT 
*LAI *KEIYYEEYVLVMTEAEIFHILPS 
PSWRPRKVMV 


8237 


22)38 


A 


8297 


1 


318 


PTR P PTRLLS R P I K I QI ALH KFS LCPE Y 
LQ* ADYLF* SLW IRPDAWPHACNLS IL 
GG*GGRIA*AQEFEISLGNIVRPCLYCI 
I KNKI KKKKPLQ I I AFFLCFKD 


8238 


22139 


A 


8298 


416 


304 


GFLHVGQAGLKLLI SGHLPA* ASQNVGI 
TGMSHHARL 


8239 


22140 


A 


8299 


352 


31 


EMILKAAREKK* IPYNEPLICLTVDFLV 
ETLQARREWYDI FKVLKKKNKTKNPFYT 
KIEYLVKLSFKYEKVTKTLLDP*Q*LRD 
FINTRHANRNTSNRRKRPI IKHK 


8240 


22141 


A 


8300 


382 


266 


RWDFVMLPKLVSNS*AQAILHVGLPKCW 
DYRREPPHLA 


8241 


22142 


A 


8301 


2 


273 


PLTRGPSSASDALSSGGPYHPSECCFTY 
TTYKIPRQRIMDYYETNSQCSKPGIV*V 
VHTHHTGGRGSQQGLLEGSREWWWNGDP 
QRTSQV 


8242 


22143 


A 


8302 


77 


430 


GMGLSMHRTHFQ YLAHIHP CVHVHTHTH 
THTHTHTHTHTLCM * HTE I HKS VRYGTE 
HAQDPFS I FSPYTPMCACTHTHTHTHTH 
THTHTH S V YVTH KVCEETCLS FFYPESG 
TISHAQE I LKRCLENKTHLGWCFTSLL 
SVANIVFFSYKRQVAAARGGLFFSLSQA 
E 


8243 


22144 


A 


8303 


137 


396 


SS VKTMNYLKPWHMVADRC* LFQCLPS 
KKKKKKKKKKKKKKGGPLKKKK I LTRGG 
GGIFFFFGAPKKISPGRV*KKKKKKKKK 
KKKKGGPLKKKKNFNPGGGGNFFFFWGP 
KKNIAGARLKKPGGGKNPRSP 


8244 


22145 


A 


8304 


329 


144 


NGPPFFFFFF*DGVSLCCPGWTAVARSR 
LTFAVFFTLNCS WG IHSELQHWFQDYL 
KLKTF 


8245 


22146 


A 


8305 


404 


157 


RTYN PSTLRGRSGQ IT* GQDFDTS LGDR 
VGPHLRKRKRNYYSLVECFYRIKEKYPE 
SHKKAMKMLFQLKPDFLHIFQHIATN 


8246 


22147 


A 


8306 


394 


294 


SLCFLTPPRDFNLGALIKNFYSPTPGWV 
SCVLSKGPPFFFFFFFKDGVLLSCPH*S 
QNPFVFALEWSGTPELK* SSHPGLPKAW 
D YTYE PPHQAQVELLR Y* RS FPVLLLDG 
ICIFHSFFSPRGSLIP 


8247 


22148 


A 


8307 


280 


2 


WDSHVTFFTSVITVMLFSIQYVKQLEN 
IKSWPGIVARACNPSALGGRDG*AQEFD 
SSPGNTAKPCLPQKNKTKQKTKNKVGGT 
LETRSWRL 


8248 


22149 


A 


8308 


136 


330 


MSYQRKSVNKYKSNNACEEVTTWARHSG 
SCLRLYNPSTLVGSGGKIP*TQEFETSL 
SNIARPYLY 


8249 


22150 


A 


8309 


3 


118 


I FWLVQMGFHHVGQASLEL * PQ VIHPP 
QPPKVLGLPA 


8250 


22151 


A 


8310 


2 


366 


TRVDPRVRVRSTHRNLCLPGSSHASTSA 
S * VAGTTS ACHHARL I FS FS FFFLG FSE 
KTGFNFFAQGGFEPLGEREPPPFGPPGF 
WGGGGGGTFKRPRRWKFQKDI IPPLVSN 
RGDQPEPFF 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenyla!anine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


8251 


22152 


A 


8311 


253 


446 


S KTFQARGGVTEMGFRHVGQTGLE I LTS 
GDPPASACQAAG I TGVSHKARLR I * YFI 
RIFILMTC 


8252 


22153 


A 


8312 


430 


88 


GVGRIQISWSPWLPKPKNSPGSPPQ*VG 
APGGPPPPGPYFFFFFQKKGFPLFSGRV 
*GSGSKNFFVFPPPKNWGTSPPPPPEGR 
GFFFFFFFLSRDGASPCCPPRPPKVLGL 
QV 


8253 


22154 


A 


8313 


400 


323 


HRLMPPHPIDFFLFFVETGSCYVAQGAC 
ELLGSGNLPGSAS*NARITDVNHHAQP* 
KSSSGPGTQQC 


8254 


22155 


A 


8314 


4 


421 


GGRIARTQEAKVAVSQQCTTALQPGRHS 
KTPSQKTKKKKSPGGPPGPPPGRFFFFF 
XT*KKGPFLKKPPQRGGDPRGKKKRGGG 
LWKKPGGKLGGPTKKKTRGGGANPPPLW 
GKGGGPKGAPRAFP FTKKKP PPSGRNQ 


8255 


22156 


A 


8315 


213 


32 


RCPGTVGHGCHPSPIjGGQRGRIP*GQEF 
ETSLANMVKPYFFLNSKKKTKNHPTKKN 
SQAW 


8256 


22157 


A 


8316 


422 


153 


AATLYTHKTPHCIYFFLDRNLLCCPGWS 
QTPGLK*AFCLSLPSSWDYRHKPPRPVT 
SPIFRIKSYDVIPPPYLE*NPMMLYLPS 
YCHHI 


8257 


22158 


A 


8317 


1 


284 


ERQDWESRLEAMECAFHLEKSVNQSLLE 
LHQLAMEKGDPQLCDFLESHFLNQQVKA 
IKKLGDYLSNLCKT*APEAGLAEYLFDK 
LTLGGSEEDT 


8258 


22159 


A 


8318 


187 


37 


TRDFF WPG WAHA * NCSTLGGQGRR IA* 
AQKFESNLGNRVSHCLYKKFKN 


8259 


22160 


A 


8319 


2 


250 


KYEQSLQEVWDCVKRPNLRI IGVPEEEE 
KSKSLENIFGGI IEENFPGLARDLDIQI 
REAQRTPEKSIAKRSSSRHIVIRLSK*E 
VWDCVKRPNLRI I GVPEEEE KS KS LEN I 
FGGI IEENFPGLARDLDIQIREAQRTPE 
KSIAKRSSSRHIVIRLSK 


8260 


22161 


A 


8320 


371 


196 


NNFSILFFFFETRSH*VAQAGLVLIiGSS 
NPPCSASRVAGVTGVHQRYQQDDFKNFR 
HN 


8261 


22162 


A 


8321 


3 


235 


FISVSFKYSKHIPNNMHLKFLVLFLNFF 
FRDGFWLRSG*SWAILRCGHSSLQPLT 
GLKQSSHLS FQGS WD YRRAPH 


8262 


22163 


A 


8322 


1 


395 


LEVPLLNTSLiLLASGVSIT*AHHRLIES 
NRNQI IQALLITILLGLYFTLLQASEYF 
ESPFTISDGIYGSTFFVATGFHGLHGII 
GSTFLTICFIRQLIFHFTSKHHFGFEAA 
A*YWHFVDED*LFLYVSIY 


8263 


22164 


A 


8323 


322 


83 


MHQSFLVAKVKKGKHGRWFYSCLMIKSA 
PVLSVTHNTIiLCWPGAWHACNPSTLGG 
QGRWMA*AQEFETSIiDNVGKPHP 


8264 


22165 


A 


8324 


12 


249 


GFFLPPPPGRGEKRGPPPPPLFFWFFFK 
KGEFPLGGGNFLTLEPPRLAPPKGGK*G 
GDPPPPGFFFFLKKKPKPLLKG 


8265 


22166 


A 


8325 


139 


335 


ENILWAATWLDCHPSTLGG*GGRIT*G 
QEFKTNLANIERPHLFFFFFLKRNFFFA 
PRVEGRGPV 


8266 


22167 


A 


8326 


1 


246 . 


FRRGLAVAPRVLLCYPGWRAMVQS * LTA 
ASNSWAQAIHLSLPKCWNYKHTPPYPTC 
*NSACWDESSFPLLVSAMEVCGQPHS 
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K=Lysine, L=Leucine, M=Methionine, 
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T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


8267 


22168 


A 


8327 


422 


107 


IYYVRQQRKKRGLFFPFFRFPLPPGGGQ 
WGHFWFLQPPPRGV+RFFFLNLPRNRPW 
DFKGLPPPPAYFAFFLKKGVFPPGQGDF 
* IIiAPKGPPTSTSQKGWGFKD 


8268 


22169 


A 


8328 


I 


161 


FRG I RWVCHVLFFYLS FFLRQGLAVL PR 
LILNSWAQATLLPQLPB*LGLQTSS 


8269 


22170 


A 


8329 


422 


103 


PPYSPPFGGPGGGVPKVRESGPPGPPQG 
NPFFFKKPKNNPGGGGGPLIPPSSGG*A 
KGMGLSPRPRLPLS*NWPPPPPPGGQNK 
LPFPKKKKKERKKBKFNFEHRG 


8270 


22171 


A 


8330 


46 


422 


STPFPLASQSAGITGVSHRARRSSSFQC 
TQIPLFPETLPNLGFGSRHGRGFSSQVL 
PAQSSSFDLDLISVLGY*MLSHISLGPN 
RALGFIPPHLSKWENGSTLRSIVLSQIE 
GLSSTLSFPHPVS 


8271 


22172 


A 


8331 


429 


112 


FSHliGSSPRQGFTMLARLWNS * PCDP P 
TSASQSAGITGVSHHAQPHAVFYIRLPD 
SQVLCHQMP YWPTAHYPHRW I FSLVLQK 
NLS VPPPLWPIQSQPKAS ILLN 


8272 


22173 


A 


8332 


291 


19 


KTPFPPKI *KFPPNFLKFTPGCGGPPPP 
FLKKFVSPGNPPPKKIFFFFFPPPPPFF 
FFFFFLRRGSL I S I AQDG I Q WHYHGSLQ 
PQLPGLR 


8273 


22174 


A 


8333 


307 


123 


K YF RGGGGGGR * FPPLGGVKQKKRGNPG 
GGG S KKP KLGP S P PTRG KKKN F FS P KICK 
KKKKK 


8274 


22175 


A 


8334 


3 


169 


HASADMCHQAQLLLLLLVETGLCHDGRA 
GFGS * SQEIHPPRPPKVLGLQHFGL I Y 


8275 


22176 


A 


8335 


1 


208 


LC VCVCVCVCVCVCVLFC * WRKGLALS P 
RGECSGTNMAHCSLNFSGSSNPSVSPSH 
VAETTGVRP I LG 


8276 


22177 


A 


8336 


1 


352 


I LF YF I FYYFS FCFEMGS RS VSQAGVQC 
PDLSSLQPLPPRFKRFFCLTFPSSRDYR 
CVPPHLAKFCI IRRR*AAIGQAGIELLT 
LSDPPASAS * SARNTGVSHCSGPSF INF 
LiNWQI 


8277 


22 178 


A 


8337 


420 


272 


KAPCQFSGEKTVFPKHCWNSWVPFRKKQ 
QQKKNLCPCLVLYTS INSQ*TMGLNI KA 
RTVNI S * KYKEDP* TGKK* INKSCPWTG 
KKKKFLARTQKT I AI WDTL INWTSPKLK 
MSTKQRHHLKMNR* PKWLLFFVS 


8278 


22179 


A 


8338 


427 


202 


LIIYFCHQSVQKKGVLPLLQEREGWAEA 
SLKISKI SLSHTHTHTHACAHTHTHTHT 
HTSS*WGIKTWITIKSFA 


8279 


22180 S 


A 


8339 


2 


164 


TPGHG ISLCRPGWSAVAPSRLTATSAS W 
VQTILLPWPPE *LGLQESRLLRDHLC 


8280 


22181 


A 


8340 


422 


1 


PPGPPGGKTKRAPPPSFFKGPNPLLKGK 
GPGAKPPPKNGVFFFPPKKVKGPQKNFG 
GTQI SQPNKRGA* KFTPFFL * KPPQKKK 
GPPPGEKIFKGPKF*KFPSPGFFFFFLK 
QGLTLSPRLERSGANTAHCSLKLLSSSD 


8281 


22182 


A 


8341 


1 


456 


RTRGAVSQGHAIALQPGQQSETPSQKKK 
KKNPAPGGKGKIGGTAKKKGRNFLDANI 
LGPNSRPGGAFFGG*TAPPGKKRGSPGP 
IFGGGKGGPKKKILSF*KAGGGAKFFPS 
PVGTPFFRGF*KGKILLKKKKFSKPGGG 
NPENLSPTNFFF 


8282 


22183 


A 


8342 


407 


2 


RQFILGAPGVFPPPGF+KPPPKI IFWGP 
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beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid Sequence (A=AIanine 
C=Cysteine, D==Aspartic Acid, 
E^GIutaroic Acid, F=Pbenylalanine, 
G=Glycine, H-Histidine, I=lso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














KKKNFFPPPRGEKFFSFKGAPPLFFFFF 
F FF FF FS PFFFG ALF FF FFT I LFKTKFG 
VFSCKGIPPKFFKRSRPPNGAGRLPLLG 
VGPLPGVPQGGEAERKISQGSFQ 


8283 


22184 


A 


8343 


441 


1 


PPPYILGGPRVFSPPPFFKTPPQIFFLG 
PQKXKNSPPPPGKKIFFF*RAPPPFFFF 
FFFFFFFPPFFFGALFFFFFTILFKTKF 
WISWKGIQPKLFKRSRPPNGGGRLPLL 
GGGPPPGVPRGGEAERKFSRGSFQRTGG 
SAPEFRA 


8284 


22185 


A 


8344 


466 


323 


FGVAGTQNPGRNPRVPSFRGGPPPGVPQ 
GRNPPPPWFFPLNPPKHFLTP*VFPFLE 
PPIEWGPGPP*PSYGFSSPVARK 


8285 


22186 


A 


8345 


657 


347 


LFFWFFLRQSFTLVAQAGVQ* CDLGSLQ 
PPPPGFK*FSRLNLPSSLDDRCPAPCPA 
NFVLLVETGFHHVGQAGLELLTSGDPPA 
SASQSAGITGVSISAQPAQ 


8286 


22187 


A 


8346 


3 


212 


DCSRGPPPHLANFGLLEKTGSNHVDQAG 
LKPLS LRDP PAS AS * S PG I TGGSHRAQF 
*FIFKRGMAPKGF 


8287 


22188 


A 


8347 


9 


255 


IFGPTHPSDFFLYEIDLPFKN*KGFPPP 
PRGGGKNPPFPPPPFFFS* PPWKKKITF 
QTPLTGVFGAPKNFNFKGGFKKIGF 


82S8 


22189 


A 


8348 


79 


312 


NAAGFE VGI SKA* ALETLQ 1 1 TREGHSN 
KPRYAVTFESHEMGTVLHLVEQEHTVGY 
ITASCSALKC* IESGGFLNDYI ! 


8289 


22190 


A 


8349 


1 


369 


FFFFLKRDRVS PLSSRVKYKWHD YSSLQ 
LQTPGLQ*SSCLRFLSSQDYRHIPPCPT 
NF* KCFVEMGS CCVTQGWSQTPGAQAVL 
PPWASQRHWGLQGVRPLCS LLLSNCPHS 
RliVWSWLFSGC 


8290 


22191 


A 


8350 


434 


324 


LPILSPWWKMPWPGTREAEA*EQ*SL . 
EPGRRRLQ 


8291, 


22192 


A 


8351 


3 


253 


YMCSKFFFLECSGAISAKCHLELPGSSD 
PPPPT*NF*VIGTTGTCHHEWLLFELFG 
KMKPHY IAQVGFKLLERNFLKTKKVFP 


8292 


22193 


A 


8352 


3 


1431 


SPRGGI PSLKQVYSSLTTNSRPSQHLF* 
EEALQRAREEEEKRKEITSHFQSTLTDI 
QGQ I EQQS ERNMKLCQENTELAEKLKS I 
IDQYELREEHLDKI FKHRELQQKLVDAK 
LEQAQEMMKEAEERHKREKEYLLNQAAE 
WKLQAKVLKEQETVLQAQLTLYSGRFEE 
FQS TLTKSNE VFATFKQEMDKTTKKMKK 
LEKDTATWKARFENCNKALLDM I EEKAL 
RAKEYECFVMKIGRLENLCRALQEERNE 
LHKKIRDAEISEKDDQSQHNSDEEPESN 
VSVDQE I DAEE VNS VQTAVKNLATAFM I 
IHHPESTPHQSKETQPEIGSSQESADAA 
LKEPEQPPLIPSRDSESPLPPLTPQAEA 
EGGSDAEPPSKASNS PAGLGAETQCEGL 
PVGAQADQASWKPEAEASGQAPQAPTEA 
SLQKMEADVPAPACAAEEHVAAMVPACE 
PSRQPPRAAAEELPVGASAGPQPRNVA 


8293 


22194 


A 


8353 


376 


65 


I HCGDHVQE VPKLKVCALHMSS WAQSQI 
PKAGDKILTFDQLTLDTLKGCGTILLSG 
PHKGQEVYWHFSKALGTQHSHTKPCVHS 
RDQKFKH I RG* WASQGYKN 


8294 


22195 


A 


8354 


459 


85 


' LEEVESLERVLEKXMIESEELKSKQTRF 
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nucleotide 

location 
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ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=A)anine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S^Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop cod on, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














LEEIKNQDKLNKSLKEEAMLQKQSCEEL 
KSDLNTKNELLKQKTI ELTRACQKQYEL 
E*ELAFYKIDAKFEPLNYYPSEYAEIDK 
APDES P YIGKSR Y 


8295 


22196 


A 


8355 


231 


358 


PFFIFYLYQSNDIARGLERGLEPEKVIG 
ATDSCGDLMFLMKW*ESVAPMTFSGSRP 
LSKPRAI SLLWYK 


8296 


22197 


A 


8356 


2 


164 


I HRG VENQMAGEQGWDKDVS VYWVGAVA 
HACNPNTLGGQGRWIT*GQBFETNLP 


8297 


22198 


A 


8357 


23 


291 


CTHTRAQAHAHTHHXjFYYFLYI FNFFEE 
TGSH* I VQACLELPAS SDP PAS FSQSAG 
ITGMSHHAWPTSCILKVKMALCSINCSA 
ACFLY 


8298 


22199 


A 


8358 


1 


307 


FFFFEIESCSFAQDGVQWCDLGSLQPPP 
PGFKQFSCLSLLSSWDYRHVPPRLANFC 
I FSRDGFHHVGQAGLQLLTSNDPPT*AS 
QSAGI TGVSPRAQPCPAF 


8299 


22200 


A 


8359 


75 


266 


FVKTRNLFLTVLEAGKSNI KARAG* VSG 
EEHGLCLQDGALLPLEGRNAVSSHPRGQ 
NMMRAKC 


8300 


22201 


A 


8360 


1 


295 


NTWVFCFVLLFRDRVSLCCSVRSTVASS 
* LI VNMNLS FPSS * AYRHVS SHLDFFFF 
FKIQRILILPRLISNFWPQEILLWLPQG 
LGLQPRPQVLVGNS 


8301 


22202 


A 


8361 


376 


293 


MGFHHVS OAGLKLLTS GDLPAS AS QSAR 
I TGAGHHAQ PD F * HSGDSWQAPPE PRDQ 
G 


8302 


22203 


A 


8362 


375 


142 


PQEEFLRFATDVGEYRAVTELERPVSES 
WNTQKDLL * QKRGRVSTYCRHTYGVGES 
FSVQRRGEHVGGRGLGPCELEI 


8303 


22204 


A 


8363 


36 


363 


QLSFNPLIGKKVKLRTVFIGKQFLFLLG 
GGVREAGAHSGAQATVRWHNLS SHAKRP 
S CLSLPS CDHRHLLPHPANFC I FWTD* I 
PPCRSGWVQTPELKQFTHLGLPKR 


8304 


22205 


A 


8364 


455 


235 


GAQARAPPRGIIFFFFW*RKGFCMLPRP 
VSKFWG*WPLPWPPKRVGLDSWAPPPR 
DFIFFSFFVKKRVCILW 


8305 


22206 


A 


8365 


432 


175 


STDQVSVAMGPS * PGAGYNLLVCRWLRP 
LKKCSIRMGVS* FSRYCLSWLPLDKKGN 
SLTPCASRVRRCTVMLRLMLRGLHPLSD 
KP 


8306 


22207 


A 


8366 


383 


83 


RRLIVFI CYFSKK*GLPLLPRLVSNFWP 
QVILVPWPPKVLGL * ALKKL WLDMVTHA 
YNPSTLGGQGTR I T 


8307 


22208 


A 


8367 


27 


304 


FFGF ILTLKKSSN* CTLRFCFLS IGKVR 
IHTLGNI LSRRGYGERCSLPHCLWES IC 
LPVQSN*ALHKKVQMHI?FDQAIPSRGI 
LPTDTGHK 


8308 


22209 


A 


8368 


165 


204 


QNIP I AVS F F FFFLKRVS FCP PG * RAGG 
QSLFIGALTFQV*GILIPYPPGNWELRG 
PPPNLTNFF* FLNLRFLLGPINLHYVHD 
KTSPLLYLFFFFF 


8309 


22210 


A 


8369 


400 . 


201 


KERPLP FRL I GSP FQGTW PRFFS KGE VK 
NSNFSKY*PPPFINPAPRVCFEGIEVIT 
KFFQVRPFFP 


8310 


22211 


A 


8370 


21 


408 


LIPLSTYVCVRVCECVCVCVCVKI *GYA 
CNTLL I ACN I LQDVP FTS PTKDKI FSGD 
FLTLVGKK 
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nucleotide 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Iso!eucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \= possible 
nucleotide insertion 


8311 


22212 


A 


8371 


1 


94 


NTLGHEN*LNSGFGGCSEPRSCHCTPTW 
ATE 


8312 


22213 


A 


8372 


546 


972 


PIHSLYGR*PSFCLGLALFKTLiQVMHFL 
KDSVRNLKSSNLKVGLNFRSPQIQATIS 
. PPLQPKTKTYEEDPKSKPKKEEKNMEVN 
MKEWILRYAEQQNEEEKNENSKSLEEEE 
KFDPNERYLHLAAKLLDAKEQAATFKLE 
KK 


8313 


22214 


A 


8373 


502 


178 


QHFWRLRQKDSFALGG*KQPGQKGENPF 
FQKKAKKILG*WDQPWPTTQKGGGGES 
L* PGRTKS PPCTSGWGKKQNPVLKKKKN 
PQKTNC I PLC I YH I FL INRKTNTV 


8314 


22215 


A 


8374 


362 


112 


YRMSVNPQVRAKRVSQRTNTPPNFFF*D 
RVLLCRPG* STWQSQLTVAS I S WAKAI 
LLPPQPRQAKYFLRLVKCSSEKGERIE 


8315 


22216 


A 


8375 


411 


21 


VLGIYPTLVSPSPGLKMTRVKLPSGSKK 
VI S S SNRP VFGWAAGG * FDKP I LKAGR 
AS S K YKAKRNC W P RVRGVAMN P VEH P FG 
GGILQRIGKPSAIRIYAPAGRKVGLIAV 
RRS GRLRG ANTV QE KEN 


8316 


22217 


A 


8376 


450 


108 


AEGGRGPRK* *LRSCRPVDGSAGQSPGP 
LRTESLETPGCSPQGWQELEPPPRRPQ 
WWPGPPGHRDRRQHRRPGARSRQQGPES 
SH*QRSGESRPPAVPRPPCPLPWfCKGVL 
E 


8317 


22218 


A 


8377 


25 


306 


EQQPPPVAVHQSGPPGAPPEASPSHPGS 
SEKPQPVP PPRAPLVPQPKP * LS PPHHP 
AAGLLAVHALAGGPGTVQRGLPGRAARA 
VSVRVPAARC 


8318 


22219 


A 


8378 


376 


149 


PAHCSLLI PGFRGSSCLSLPCNWYPGRA 
PPRVGNFFFFF* EMESCYVAMLPSLVLT 
SWALAILPVLPPKLPGLQA 


8319 


22220 


A 


8379 


98 


374 


SVPPPVGTYEGDAAPTGGGGLPALGR*G 
AAGGHVQQGHPGPHRGHAAPKPSAQGGP 
GHSRECL.EPSGGAAGGEREPPRAGGGQS 
WMRAPSMT 


8320 


22221 


A 


8380 


2 


807 


RPQCCVRAPRAGAVSGVPKLWSVP*SGR 
SAPHPTRS * VRS WSRWLRTQS WQTSRTR 
RRSES KVRVS YQAWTAS PP PAAS ASARP 
A*RTSSRSYSSSSTCCSWLWPSSWSTGP 
SQTFVRNSSTLSCLCLTRKWIAMMPQVLi 
?CTP VRPSCSAVS TI TRSFLL * QALASR 
VT*IAPPRGSTTRTPSPIRL*NLP*LSR 
GPGK*KSGSWSSSSSA*TRWRTSAPLI 
TSSSLLSRSSCKAQTG*ASCRPVRVPIP 
AGSSLGASAPGSRCHW 


8321 


22222 


A 


8381 


104 


327 


CIAFPEMLGIKYEVGKCPSGLKSLKILK 
NTHTHTHTHTHTHTHTHTHTHALYLIRE 
GRKK* CVCMRKTDSTCVS 


8322 


22223 


A 


8382 


400 


270 


KPKLEVRASGQVQ WLS P 1 1 PALEEAKAG 
G*LEPRSLRHFMVCW 


8323 


22224 


A 


8383 


178 


588 


ADLLSGAPGPPGGSQDNANSSEHSRACA 
PAAPGP * WLQPWLCP * WTQSPLEARGS * 
NPGVSHSSRIPGI PGAAGTSRPSGALPP 
QPAGKGGSSGELHVARGCACGRPLWSHR 
PGCX3QLLRVTAADSHRCASRGAAPM 


8324 


22225 


A 


8384 


423 


78 


QGQELAELTGVTVESPEEPQRLLVS*GG 
TVPCLACSLHRTL* TVGHGSASSFLAQG 
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ing to first 

amino acid 

residue of 
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Predict- 
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nucle- 
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ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=GIycine, H=Histidine, l=Isoleucine, 
KHLysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
\V=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=pos5ible 
nucleotide insertion 














DMEPLGDRGLQGTEQLH*GGDRASLQDI 
CFLLPVLSLLCPRSLDLHRHRVGVTGFT 
ED 


8325 


22226 


A 


8385 


434 


176 


FADDSRFD FLLLV* CAMLML I REQLLEG 
HFTVNMRLMQDYPI TDVCQ I LQKAKELQ 
DSK* PGGKRPMFGREASRPCAVAPGTHR 
NL 


8326 


22227 


A 


8386 


2 


305 


RLWEDPPGRNTPPGAR* PGPCGEGGHI I 
REEQACGWPRGHSGPWRHPQAGPLQLGQ 
WRGKGAGQGARRSPQGSLESRPPTLSAA 
PPAIHICSVGGVRLHCY 


8327 


22228 


A 


8387 


331 


91 


EMRSHSVTQAGVQWRDYGSLQTRPPGLS 
FNVNWDIWGL*GDPSSPPMLLPSS*ARP 
GGGWGPGGEGHVSKEAAGRLQGPP 


8328 


22229 


A 


8388 


2 


250 


IHVRLWSQLFRRLRWEDLLSTGGGSCSE 
P * S CHCTVTWAFFLES CLR CAHWYVTL 
LTALLG SLP CAN PRQTAV S LPQAQLV 


8329 


22230 


A 


8389 


388 


176 


YWDSQ* CMPS FAVP VESAC I CAIGHNFS 
KS VNS VI AVCVDGS SHKYVFTHDGRCHR 
EAFDVYRD I CDDDA 


8330 


22231 


A 


8390 


445 


330 


PPRFTPFSCLSLRSSWDYR* PPPRPDN? 
LSLYSVLHET 


8331 


22232 


A 


8391 


559 


142 


LRTLPVTLGKSLALSVPSFPYVGGNWIT 
NRE P S AKGRGS G PNRG PGHAAS CPARTH 
SLRHS ? AGG VM LTQ PR P I L I L PETQH P S 
ES AVARTDI S KARRWLHSRS PWPH * DET 
SAGAERPGIRCYIGHPAT 


8332 


22233 


A 


8392 


104 


400 


YYEIFKKEMYSYYHILHFFFFFSFFFFE 
TKPNSVNGPEGKGPILG*LKPWLLG*KH 
FCLTLPSGGNYRHVLP PPVNGFFFFFFF 
KKKKVFPFVAPRGV 


8333 


22234 


A 


8393 


408 


169 


DQVWI KDWNIGSLRPR* KGPQTI ILTTP 
TAVKIEGI PAWI QHSQVKPAAPETWEVR 
PSLDNPCKVTLKKMTSPAPVTLRS 


8334 


22235. 


A 


8394 


1 


269 


YCCPLFSSKALTQENSPYSSFRIiVNPPG 
LSLHPEGEGGKWINERGRELGPSAGPLL 
LFLHFA3AGRRQPPDWADS EADLQQVRH 
KLGQP 


8335 


22236 


A 


8395 


2 


204 


KDCKVNKEVERVLREFHQAGKPIGCDAS 
TSLPAPAPWWSWNAALCCPGLSCVWCWQ 
GESRTCVGEHQ 


8336 


22237 


A 


8396 


3 


285 


KVTVKYDRKELRKRLNLEEW I LEQLTRL 
YDCQEEE I PELE IDVDELLDMESDDARA 
ARVKELLVDCYKPTEAFISGLLDKIRGM 
QKLSTPQKK 


8337 


22238 


A 


8397 


1 


192 


ETGFHHAGQDGLELLTSGDCPSLASQSA 
RITGVSHCTRAQLLTEQEYFKNQNHLPS 
APFYKRAF 


8338 


22239 


A 


8398 


8 


241 


LNPFLYAFLDPRFRQACTSMLCCGQSRS 
SGTSHS IRGEKSAS YSSGHSQGPGPNMG 
KGGEQMLEKSI PYSQETLWD 


8339 


22240 


A 


8399 


2 


240 


SCLN P FLYAFLE PR FRQACTSMLCCGQS 
RS AGTLHS SRGE KS AS YS SGHSQGPGPN 
MGKGGEQMHEKS I PYSQETLWD 


8340 


22241 


A 


8400 


10 


272 


CNWQNAYLTWRCAQDCEDYFAERLYRS 
MKGAGTDSETLIRI I VTRAE VDLLG I KP 
KFHQKYQKSLSDMVRSDTCGDFRKLLVA 
LLH 
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Predict- 
ed end 
nucle- 
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location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartrc Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l^lsoleucine, 
K=Lysine, JU=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Ty rosine, 
X=Unkno\vn, *=Stop cod on, /^possible 
nucleotide deletion, V-possible 
nucleotide insertion 


8341 


22242 


A 


8401 


47 


248 


TRG I FFFFLNTRSHS VAQAKGLGHDHSS 
LKPQP PRLKQS FNLS LLSTWN YGGLLPH 
SALWKAEAAKS 


8342 


22243 


A 


8403 


497 


303 


TVEAS VS YLL YVAMVMQLP WGKS PASGA 
ESHRQGCGGSWPGGGMEEGRGAERGGGC 
LLPQALLY 


8343 


22244 


A 


8404 


3 


235 


LTLLAHCRMMAAAHMGGPPMMPMMGPNT 
PGMMPVGPAPGMRPPLGGHMPMI PGYPM 
MRS P ARLMMVP SQPRMTRPDR 


8344 


22245 


A 


8405 


3 


246 


LVFSVDVGLLAHTRQRGLKRSSGMSSLL 
GKIGAKKQKMSTLEKSKLDWESFKEEEG 
IGEELAIHNRGKEGWIKGAQPLGQ 


8345 


22246 


A 


8406 


1739 


804 


WEPDVSGSAAWLASAAFGMATFSGPAGP 
I LSLN PQEDVE FQ KEVAQVRKR I TQRKK 
QEQLTPG WYVRHLPNLLDETQI FSYFS 
Q FGTVTRFRLS RS KRTGNS KGYAFVE FE 
SEDVAKIVAETMNNYLFGERLLECHFMP 
PEKVHKELFKDWNIPFKQPSYPSVKRYN 
RNRTLTQKLRMEERFKKKERLLRKKLAK 
KGIDYDFPSLILQKTESISKTNRQTSTK 
GQVLRKKKKKVSGTLDTPEKTVDSQGPT 
PVCTPTFLERRKSQVAELNDDDKDDEIV 
FKQP I SCVKEE IQETQTPTHSRKKRRRS 
SNQ . 


8346 


22247 


A 


8407 


2 


241 


CLRFPSSPAMGLLRSGTKLI FRRRPKQK 
EAGLSQSHDDLSNATATPSVRMKAGSFS 
RRLIKRFYFKSKPKANGNPSPQIi 


8347 


22248 


A 


8408 


285 


186 


KQGLTSLPRLVSNS WAQAFLLLWP PKVL 
RLQA 


8348 


22249 


A 


8409 


1 


246 


RHENREELQVIADLCIKYDTLCISDEEF 
MNGVEY IWKGP CFCLTTFTLYLKGDI FP 
RPPSGCIiNLWIVLNPMYPVIHLITSC 


8349 


22250 


A 


8410 


1 


102 


AGS YTLHI IKGDDGTRGVTGRFTFTLHR 
IQTSEE 


8350 


22251 


A 


8411 


877 


647 


FFFETESHSVTQAGVKWGDLGSLQSLAP 
GPKKFSCLNLPSSRDYRSPPPPLANFCT 
FSKNGVSPCWPGRSSTPDRR 


8351 


22252 


A 


8412 


1 


130 


RMLNDKTLRTDIGGNFPKNGWAAIATHS 
FEFAQFDNFLEEATR 


8352 


22253 


A 


8413 


97 


401 


CNPENGLTASALGRRMCLATCKAPRTLP 
DSGDTAS CRFPAVPRPPHSRRSSGSGHL 
PGRPRCPALPGLEWSNPPGPPTSGYLFP 
TFSTPAAHSHQKTLLG I 


8353 


22254 


A 


8414 


348 


516 


VSAYGF I TEGHER FS DH YYDTS WKRLI F 
YINHDFKLEREVWKRLHDEGI IRLYQRL 


8354 


22255 


A 


8415 


714 


410 


LGVFR S ALHGS LW LLLR S FPQKS PN PLA 
LliLFLQCNTAYQCLL IADQHCRTRKYFL 
CLASG I PCVSHVWVHDSCHANQLQNYRN 
YLLPAG YS LEEQRI LDW 


8355 


22256 


A 


8416 


2 


102 


GHDPQDRLLAQDSEHNHSDRNWQRNRPW 
PXDSY 


8356 


22257 


A 


8417 


1 


309 


FYFENALSKSNKPIHTIILNPHVHLVGD 
DAACI AY I RLTQYMDGSGMPKTMQSSET 
R VWHRRDGKWQNVHFHRSGSPTVP I KPP 
CIPNGKENFSGGTSLWQNI 


8357 


22258 


A 


8418 


3 


242 


ARAL.TNAASHVDDMPNALSALNDLHAHK 
LRLDPVNLKLLSHCbLVTLVDHLPDEFT 
P VEHA CLDNVLAS VS TVLTFKYR 



1062 



WO 01/64835 



PCT/US01/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end. 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, HNHistidine, 1=1 soleu cine, 
K=Lysine, L=JLeucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V= Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


8358 


22259 


A 


8419 


711 


296 


NVANSDGLIASLWKEYGKADARWVYPDP 
TIVSVEILTVALDGSLALFLIYAIVKEK 
YYRHFLQI TLCV CELYGCWMTFLPEWLT 
RSPNLNTSNWLYCWLYLFFFNGVWVLI P 
GLLLWQSWLELKKMHQKETSSVKKFO 


8359 


22260 


A 


8420 


3 


97 


FWS S LCNGL I AAQLLFYWNAKPPHKQK 
KAQ 


8360 


22261 


A 


8421 


1 


279 


GCG IADLAMSAI FNFQSLLTDILLLICT 
CAY IPS LA PNLLDRNKTGLLG I FWKCAT 
IVERKSPYVAVCCILMAFSILLIQKLVK 
MPQCICHNI 


8361 


22262 


A 


8422 


1 


228 


RHEVFIELNHIKKCNTVRGVFVLEEFVP 
E I KE WSHK YKT P MAHE I C Y S VL CLF S Y 
VAAVHSSEEDLRTPPRPVSS 


8362 


22263 


A 


8423 


3 


182 


SEDTGEEQWTAEFINRGEYEIDIAGYR 
FQAKAKLYPVASLFTQKRRKDDMSLSDL 
HGK 


8363 


22264 


A 


8424 


1 


70 


VQFVFDAVTDV I I KNNLKDCGLF 


8364 


22265 


A 


8425 


2 


151 


ETTASSCTPASLESRRCCAPCRMPRTGF 
FGSSPLWRPSGSRSLKPGFQQ 


8365 


22266 


A 


8426 


1 


187 


RGRVGPGGERLVPGVPGAEAQQPAGDGV 
RAGPLQARP PAP VG VSQGRCQAAGAAAG 
PPRPDG 


8366 


22267 


A 


8427 


520 


423 


GKYQLQSQENFEAFMKAIGECWTERQSC 
GSHQ 


8367 


22268 


A 


8428 


1 


627 


GTSGTRGVTGYFTFTLYLETPKPSISSS 
NLNPREAMETVILTCDPETPDTSYQWWM 
NGQSLPMTHRFQLSETtfRTLFLFGVTKY 
TAGPYECE I RNSGSASRSDP VTLNLLHG 
PDLPRIHPSYTNYRSGDNLYLSCFANSN 
PPAQYSWTINGKFQQSGQNLFI PQITTK 
HSGLYVCSVRNSATGEESSTSLTVKVSA 
STRIGLLPLLNPT 


8368 


22269 


A 


8429 


3 


390 


ILGCNILRVEYSLVICVSVPGSKKVILD 
LPLVIGSRSGLSSRTSSMASRTSSSMSW 
VDLNIPDTPEAPPCYMDVIPEDHRLESP 
TTPLLDDMDGSQDSPI FMYAPEFKFMPP 
PTYTEVDPCILNNNVQ 


8369 


22270 


A 


8430 


3 


208 


NGTHVI I LLCLKTCGT WNVANDM I VAS 
NLGTGVPNQTPVSSGDLLIRSNGLLIPG 
TCE ITRLYT I SE 


8370 


22271 


A 


8431 


3 


563 


LPTSRVDPRVRLDRMKKDQEEEEDQGPP 
CPRLSRELPEWEPEDLQDSLDRWYSTP 
FSYPELPDSCQPYGSCFYSLEEEHVGFS 
LDVDEIEKYQEGEEDQKPPCPRLNEVLM 
EAEEPEVLQDSLDRCYSTTSTYFQLHAS 
FQQYRSAFYSFEEQDVSLALDVDNRFFT 
LTVTRHHLAFQMGVI FPH 


8371 


22272 


A 


8432 


2 


119 


DGDNILVTTFIYIKSVTELNGDI ITNAS 
WRCSERVAPSA 


8372 


22273 


A 


8433 


192 


1285 


AGVLSIIEETDSEGLQTKWENQTYDER 
LEINDSEEVASIYTPTPRHQGLPRSAHIi 
PNKAMADNSSDECEEENNKEKKKTSQLT 
PQRGFSENEDDDDDDDDSSETDSDSDDD 
DEEHGAPLEGAYDPADYEHLPVSAE IKE 
LFQVNQLRCVPTDVLDHKLKPFI PDFI P 
AVGDIDAFLKVPRPDGKPDNLGLLVLDE 
PSTKQSDPTVLSLWLTENSKQHNITQHM 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 1 
C=Cysteine, D=Aspartic Acid, 
E^Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Iso)eucine, 
K=Lysine, L=Leucine, MNMethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KVKSLEDAEKNPKAIDTWIESISELHRS 
KPPATVHYTRPMPDIDTLMQEWSPEFEE 
LLGKVSLPTAEIDCSLAEYIDMICAILD 
I PVYKSR I QSLHLLFSL YS E FKNSQHFK 
ALAEGKKAFTPSSNSTSQAGDMETLTFS 


8373 


22274 


A 


8434 


3 


251 


TLQADHFNTRLS CGDAAQTLWARTG YLG 
FVRRTELTAATGERHDALYWGALDETL 
ELRGLRYHPIDIETSVSRIHRSIAEW 


8374 


22275 


A 


8435 


1 


225 


QTFNLEGSQI YEDS I VLQSVFKSARQKI 
AIEEESEDESNEEEDEDDEEYHEWKRYD 
RLGENMCLNCIWNGGGYIV 


8375 


22276 


A 


8436 


95 


340 


CGCG I AG LAM S A I FN FQS LLT V I LLL I C 
TCAYIRSLAPSLLDRNKTGLLGI FWKCA 
RIGERKSPYVAVCCIVMAFSILFIQ 


8376 


22277 


A 


8437 


1 


317 


GPKPLVRTSREPGKDVTTSGYSSVSTAC 
PTSSVDGGLGALPQPTSVLSLDSDSHTQ 
PCHKQARKSCLQWRPPSPPESTVSQQQV 
KRINLCIHSEEEDMNLGLVRL 


8377 


22278 


A 


8438 


80 


213 


PHLS FNAG I TT I KVN I RNANS LGGGFHC 
WTCDARRRGTLQSYLD 


8378 


22279 


A 


8439 


3 


767 


HEDNIKQLKEMKFTYLINYIQDEINTIF 
NDY I P YVFKLLKENLCLNLHKFNEFIQN 
ELQEASQELQQIHQ YIMALREEYFDFS I 
VGWT VKY YELE E KI VS LI KNLL VALKD F 
HSEYIVSASNFTSQLSSQVEQFliHRNIQ 
EYLS I LTDPDGKGNEKIAELSATAQEI I 
KSQAI ATKKI I SD YHQQFRYKLQDFSDQ 
LSDYYEKFIAESKRLIDLSIQNYHTFLI 
Y ITELLKKLQS TTVMNPYMKLAPGELT I 
IL 


8379 


22280 


A 


8440 


103 


354 


NGCECDFLFLFLFYFFETESRSVAQAGV 
QWHCLGS PQPS S PR FKRFSCLS TLGGRG 
GWIMMSGVQDQPGQHGETPFLMQAGLKT 


8380 


22281 


A 


8441 


3 


160 


KLYPLKIVFGMNGRVWVKGKTVQQTLIL 
ANVLEACELMTLDQRI LMFI LAES 


8381 


22282 


A 


8442 


459 


3 


CGGLHPVRAS WLLCLPKQAWAMAGAP P P 
AWLPPCSLISDCCASNQQDSVGVGPSEP 
GAGYNLLVRRFLSLSEKRSIRVRVTRFS 
RCHLS PLSLTRKGNSLTPCTSRVRQCLA 
LLRLAHGALHPLSCAHCLALPSEMTPVP 
QMENAEITRLL 


8382 


22283 


A 


8443 


2 


318 


RQGNMTAALQATLLNPPINTKSQAVNDR 
ARG I VLNVLISFKAND I EKAVQSLDKNG 
VDLLMNYM YKGFES PSDNSNAMLLQWHE 
KALAAEG VTS I VRVS TALIPA 


8383 


22284 


A 


8444 


2 


109 


VTPFPMS CDLQGDCACRNPQAQEHSRKD 
LRGYSHG 


8384 


22285 


A 


8445 


1 


69 


LVLNSWPQVIRRPWPPKWGLQV 


8385 


22286 


A 


8446 


1 


163 


PSEKHNIWVGVTQFSRCRLSPLSLTRKG 
NSLI PCAS RVRQCLTLLRLVHGACTH 


8386 


22287 


A 


8447 


1 


147 


GQAGLELLMSGDRPTSASQSAG ITGVSH 
RARPSS I S F I LELRG S VRKKF 


8387 


22288 


A 


8448 


914 


729 


GLTMDTQKDVQPPKQQPMIYICGECHTE 
NEIKSRDPIRCRECGYRIMYKKRTKRLV 
VFDAR 


8388 


22289 


A 


8449 


134 


392 


TCPPPPEPPS PLTCAVL VPQTRR WRGLG 
SLFRRACCVALPLQLLLLLFLLLLFLLP 
I REEDRS CTLANNFAR S FTLMLR YNG PP 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenyla)anine, 
G=Glycine, H=Histidine, Wsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine > R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














PT 


8389 


22290 


A 


8450 ■ 


37 


248 


AKI TPLLQPGRQEQDS ISNKK I LRCCLT 
MLPRLASDSSGFKRSSHFSLPTSWDYRH 
SPPPLASVPFLLMC 


8390 


22291 


A 


8451 


367 


165 


NFFVFIiVETGFCHAGQARLQLLASCDLP 
VLALQSAGI TGVSHCARP I S TLKDT I KT 
SHFVPLKISGK 


8391 


22292 


A 


8452 


2 


282 


GKRMAGGPEMMQLRLDGNRLYITTSMFS 
AWDKHLYPDL I REGS AMLHVDTOTALGG 
LKLN P Y FL VDFWKE P LGPS LAHE LR YPG 
GDCSSDIWI 


8392 


22293 


A 


8453 


4 


163 


HHDFHAQSLLANIERIAILVFAACEAAA 
VLSLLVSISNTYGLDYLHNLKLLQC 


8393 


22294 


A 


8454 


374 


197 


TPEPKQSSHLGLSKCWDYRHEPPHQARN 
STFDSNHSSLAAEDKLLGYFGLSQWQPG 
YQP 


8394 


22295 


A 


8455 


343 


161 


MGMGPS VPGTGYNLLVCRLLRPLEKCS I 
RVGVSQFSKYHLSRLPLPRKGNSPTPCA 
FWVR 


8395 


22296 


A 


8456 


1 


193 


MGFLHVGQAGLKLPTSGDPPTSASQSAG 
ITGMSHRVQPYYVLSICQTLHSVCICLL 
IILLTTVL 


8396 


22297 


A 


8457 


342 


263 


WLRP WPALWEAEAGGSLELRS SRPA 


8397 


22298 


A 


8458 


101 


303 


KAAK Y IjFQRLH FCFLNLKKDNMLLFNHR 
MEN F KEL FGFGT VAHTCNPS TLGGRGGW 
ITRSADRDHPG 


8398 


22299 


A 


8459 


3 


140 


YLKHLCMYVCKYLRQGLLMLPRLVLNSW 
AQR I LLPWLPKVLGLQV 


8399 


22300 


A 


8460 


327 


248 


FPRLVLNSWPKAILMPWLCKVLGLQA 


8400 


22301 


A 


8461 


327 


79 


SIPFILAFFIYLSFFFKETGSQSVTHAG 
VPGI I IAQCGLQLLGSGDPPDSASLWG 
TTCTYPTPILSVFPNMPSFFILLCMLF 


8401 


22302 


A 


8462 


365 


273 


MGFQHVGQVGLRWPQVICLPWPLKVLGL 
QA 


8402 


22303 


A 


8463 


375 


217 


Q AGLKLLTSDDLPAS ASQS AG I TG VSHH 
AQPRHHFFKTRFIYIFFSSPSSVLS 


8403 


22304 


A 


8464 


238 


354 


RGKCFVQAFL YKECSGGQARWLTPVI PE 
LWEAEVGRWIT 


8404 


22305 


A 


8465 


367 


248 


FQMRIHKWLIDLHSPSEIKQITSISIEP 
GVEVEVTIADI 


8405 


22306 


A 


8466 


341 i 


153 


NMILGWAQW VTPVI PAIGEAHWGGLLYP 
RSSKPAWATQQNLLFTKKKKNIYIYIYI 
YIYERY 


8406 


22307 


A 


8467 


368 


101 


VEMGFRHVGQAGLELLTSSDPPASVSQS 
AGSSTSLPTADLLTPGWVGVYGLVDKNP 
VTLIAMQRLPAVLEKKSVARKNSEiVKIV 
LKRGQ 


8407 


22308 


A 


8468 


362 


248 


HVGQAGLALLASGDPPTSASQSARIAGV 
SHRARPEAPL 


8408 


22309 


A 


8469 


384 


196 


SARLGLPKFWDYRREPPCPADDMLILMV 
GILETFSLIFTATMESRNYCVYFKMRNK 
HIISKTS 


8409 


22310 


A 


8470 


386 


300 


LAMLPRLVPNSWSQWLSPLPPNMLGLQ 
V 


8410 


22311 


A 


8471 


380 


178 


PKKNKFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFMYIYIF 
AIVYQTLACNNKFLQMERVTRKLRQC 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspoo 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Gfutamic Acid, F=Phenylalanine, 
G=GIycine, H^Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine 7 R=Arginine, S=Serine, 
T=Threonine, V= Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unkno\vn, *=Stop codon,/=possible 
nucleotide deletion, V=possible 
nucleotide insertion 


8411 


22312 


A 


8472 


3 


154 


FLVDMGIHKVSQAGLKLLTSGDLPASAS 
QS P I FFFLFFFF2KQ I LAQAGG 


8412 


22313 


A 


8473 


3 


162 


GLYHVGQAGLKPLTSGDPLASASQSVRI 
TGVSHRTRPKI I FKL I FFLFFLVA 


8413 


22314 


A 


8474 


266 


107 


LCLFLIEMRFPYWQAGVKLLSSSHLPS 
SASQSDRITGVSHCFWPEVFKRGIF 


8414 


22315 


A 


8475 


361 


169 


RLPG P SNP P I S ASLAAG I TGARHKAQL I 
FVFLVEMGLCQAGKHNFEKYSRVLIYSP 
IVSFTHFI 


8415 


22316 


A 


8476 


I 


154 


KKLI PTLMDHGAGFKSS VKE ITTAWEM 
AREVELEVE PEDGNELLQSRGKT 


8416 


22317 


A 


8477 


3 


193 


SHYQSLISSNHGHKHCGRPQGPLPRKTI 
DLCSIiDYQLTFPPLLTHDPVKSPSVRKT 
QELSLIP 


8417 

- 


22318 


A 


8478 


408 


103 


TGSRLTLSSRVECSGVITAHCNLELPGS 
SDPLASASQVAETTGMKHHVWLIFLGIF 
LVKMASHYVAQDDLKLLGSTDPPVSASQ 
S VG I TGI SHHAWATI C YK 


8418 


22319 


• A 


8479 


59 


166 


NVNTCLCLHTHTHTHTHTHTHTHNYFKE 
TPIHQYI 


8419 


22320 


A 


8480 


394 


253 


FLLLLLESGFCHVAQAGLKLLSSCDLPA 
LASQSAEISGVSHCAQPLA 


8420 


22321 


A 


84S1 


379 


184 


CFS LPKC WDFRG VP I S PGDFWN FI I KLG 
VCP FCQGGFELLAPNDVAASGFQNAG IS 
GINFWAWAL 


8421 


22322 


A 


8482 


367 


170 


CGLSSGLGGRSGRESERKKERERERDRE 
KKKEGEGERKRERERESVSSYKDTNSMG 
SGPHPYSLI 


8422 


22323 


A 


8483 


324 


136 


QRSSCLSLPSS WDYRRMPPHLANFCI FC 
RDGVLPCCPSWSQMTTSFLSLNFLICLV 
GIILGRG 


8423 


22324 


A 


8484 


1 


57 


GERRYGTCIYQGRLWAFCC 


8424 


22325 


A 


8485 


35 


339 


GWLLRFPWWQWKNGNNLNVSLIDGWLSY 
DTLGYNGILKCHMYILTEGDSQKKKKKK 
KKKKKKKKKKKTDPAGNSKI YNRKI FKT 
PRG FFKWALGPLL I LFF 


8425 


22326 


A 


8486 


330 


66 


RPGVSRFNPWPLI FFLRRVFLCRPGWNA 
WQSLLTKISASQVQAIFLGGGGCSEPR 
SCHCTPAWATTAKLCFKKKNGKNFTDIM 
FSK 


8426 


22327 


A 


8487 


3 


209 


CLS LLR S WDAQAP P PHPAN F FLM FN FCR 
VGVLLCCPGLASSDPPPLVSQSARITGM 
SHCTWTIFLSFK 


8427 


22328 


A 


8488 


3 


216 


GQS ITVSCTATNSDVGSSNLVSWYQSSN 
LVSWYQQHPGQAPKLIIYEDNKRLSSEK 
KKKKKKKKKKKKKK 


8428 


22329 


A 


8489 


326 


62 


PAYFS IGS FFLFLFFFKTECHS VTRLEC 
SGAISAHCKLRFPGSSNSPASGGGGRSE 
PKSHHFTLAWTTRAKLHLNKNKI KTVKE 
FLSS 


8429 


22330 


A 


8490 


2 


156 


SSAGIGRTGCFIATRIGCQQLKARGEVD 
ILGIVCQLRLDRWVCGCKQGQQR 


8430 


22331 


A 


8491 


399 


254 


KNNKRNKTINNKI TNKLI I KLKN I SFLY 
FFFFFFYFFFFFFFFFFFFF 


8431 


22332 


A 


8492 


2 


130 


FRRVGQAGLELLTSGDLPASAS QGAGI T 
GVSHRAQPTFLKSC 


8432 


22333 


A 


8493 


3 


94 


HL PAE FTPAVHAS LDKFLAS VS TVXiTS K 
YR 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=A!anine 
C=Cysteine, D=Aspartic Acid, 
E=GJutamic Acid, F=Pheny la Jan ine, 
G=Glycine, H=Hi$tidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V^Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


8433 


22334 


A 


8494 


327 


70 


KSGFSNVGQAGLKI LTSSDLP VSASQS A 
GITGMSHCTWPLYPYFGLGSRSFHSIEE 
HCWYQLAPFFSCRNPFVIVPFFVHRFYL 
TF 


8434 


22335 


A 


8495 


97 


219 


PGTVAHACNPGVLGGQGGWMTSGQEFQT 
SLASAAKPRSTRP 


8435 


22336 


A 


8496 


3 


191 


GIRRGEILEVIEFTSNEEMLCRDPKGKY 
GYVPRTALLPLETEVYDDVDFCDPLENQ 
PLPLGR 


8436 


22337 


A 


8497 


144 


304 


S YFYEE I LFSTVI KKNAI VS FTATWVEL 
EAI ILSELTQEQKTKYHMFSLVSGS 


8437 


22338 


A 


8498 


2 


90 


LTILVLAMWKVGFFKRNRPPLEEDDEEG 
E 


8438 


22339 


A 


8499 


1 


156 


PSSQDYMHAPPHLANFLNKFFVKWCLAM 
LPRLMSNWPQTILLSPPKALGLHV 


8439 


22340 


A 


8500 


67 


319 


KLMSFGPTLLSARCVSTGQICMNELNLN 
IDAMSVAATINNEIMSFARTWMKLKAIT 
LI ILTLEHKTKYCMCSLISGSLRMTTQG 


8440 


22341 


A 


8501 


3 


299 


ETGSHCVDQGGVQWHISAHCSLDLLGSS 
DPPASASRWGI TSAHHFLLWS I SSLPP 
LLLHPHNLLNLLLLLLPSLS IVLNLHIL 
PPLLPPLRLLCSPC 


8441 


22342 


A 


8502 


56 


142 


NNSAKKKKKKKKKKKKKKKKKKKKKKGG 
AAF 


8442 


22343 


A 


8503 


486 


348 


TGFHRVAQAGLELLSSGNPPALASQNGG 
ITGVSHHAQPAEYNFKK 


8443 


22344 


A 


8504 


158 


403 


RIFFFFETVGFVAQDGVQGRGAILQPRL 
FELKRFSCLSLPSSWDYKRAQPNPNDFF 
KRQMAFDPKIQR I SLGSKKKKNHPIE 


8444 


22345 


A 


8505 


3 


78 


DWHIAYVLLYGPRRVEIMEEESEQ 


8445 


22346 


A 


8506 


164 


291 


TRFCCCWLFWWFCVLCVCVAVFPPWCI 
LFFIWLFFFFFFF 


8446 


22347 


A 


8507 


372 


69 


CAPLPPLKFFFPPRGFIFRGGGSPIFPP 
PKKGVLPKNPPKGFFPPPPFLKPAPGGA 
FNYPPFGGKI FFS PPRGKWAPPGGFLKG 
APPFFFFFFFFFFLFF 


8447 


22348 


A 


8508 


282 


1 


S FQHVAQAGLKLLGSRDP YTLAAQS AG I 
TGVSKHARPVL YFS S VLFSMEFLS TMSA 
VILPCICSFCVSIiAVPSSCILQQPLPPT 
CTYIKTETCI 


8448 


22349 


A 


8509 


400 


45 


TPPVS SLFFLPLKEI I FFCPPFRQSLPP 
PFFFFPPPLFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFKYLSLPFTL 
SKEKNK 


8449 


22350 


A 


8510 


241 


419 


TPLGGWEKKKKKKKKKKKKKKKKKKKKK 
KKKKNKKKKKKGGGFLKKPLGGANFIGA 
GNN 


8450 


22351 


A 


8511 


317 


28 


QSLSPSPQTVYFI YLI I YAVFFTTLTI I 
LIFSPS FRMELKFLLRCYFFAP FYAFL F 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFSQPGAQTTAGRAGV 


8451 


22352 


A 


8512 


124 


292 


MCCYIMSLLKVICTIIKEFSTKKKKKKK 
KKKKKKKKKKKKKKKKKKKKKKKKKIKK 
KGGVPL 


8452 


22353 


A 


8513 


1247 


910 


QAQLjGDIGTSCYTKSGMILCRNDYIRLF 
GNSGAAGACGQS I PASELVMRAQGNVYH 

L?KCFT CS TCPJS! HLV PG DR FHY I NG S LFC 
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Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E^GIutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, l=lso!eucine, 
K^Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamj»e, R=Arginine, S=Serine, 
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X=Unknown, *=Stop codon, /=possib!e 
nucleotide deletion, Y=possible 
nucleotide insertion 














EHDRPTALINGHLNSLQSNPLLPDQKVC 


8453 


22354 


A 


8514 


99 


262 


MNKRFLLNKKKKKKKKKKKKKKKKKKGG 
PFKKNPGGAQ I FPGGEKKI FFLKGGL 


8454 


22355 


A 


8515 


85 


2 


GWGWAQWLTPVIPVLWEAEAGGSRGQE 


8455 


22356 


A 


8516 


244 


423 


VFAVFNFLGWVMDMLINLAWI ILQCAC 
I SNHHWHLEYIQFLFANYTS I KLGKKP 
SKSV 


8456 


22357 


A 


8517 


3 


114 


KINGBDKQK I LDKCNEI INWLDKNQVCV 
FFFFFFPI 


8457 


22358 


A 


8518 


413 


91 


GGVRGK I S FRGGG FIiK KKKKKLLKKKKK 
WGGGPKEKKKRGGFFFFFLLNFFFWGGV 
KKKKKKKKKKKKKKKKKKKKKKKKKRRK 
GRKEGREGRRKEGEGRQEINMTA 


8458 


22359 


A 


8519 


1 


163 


S RLPGRSRRRS RGRRRRRRRRRRRRRRR 
SRVL I FP WDQNLDLNIVYI I EDVDAA 


8459 


22360 


A 


8520 


191 


21 


RFTTHVDAWATVTVFLIESVFLHVGQAC 
LELLPSDDLPASASENARVTGVSHRAWL 


8460 


22361 


A 


8521 


2 


144 


AAATSQAWWCMPWPATWEAEVGASLKP 
GRLQLPLTMIALVHLSYAI 


8461 


22362 


A 


8522 


90 


8 


KPTPFLPTLIARTTLLLPISPLILIIL 


8462 


22363 


A 


8523 


305 


146 


VFRIHSCGIRGSVDVKITDGLLVIRRIE 
NVPPGPNNKNKNPYAIFQSSS I ESQ 


8463 


22364 


A 


8524 


176 


2 


LAWPFLFLPKCWDYRHEPLQPASLTTFV 
REWDKIGWAWWLTPVI PALWEAEVGRSQ 
GQ 


8464 


22365 


A 


8525 


2 


200 


CLIPSSWDYRKMPPRPANFCIFFFFLGD 
HVSQVGLKLLTSKDPLAWTSQRAGITGF 
SHHTKPQGFF 


8465 


22366 


A 


8526 


2 


217 


SFSLLFPPSFSLLLPPLASLLLPPLPSF 
SLLFPPSFSLLSPPSFSLLLPPSFSILL 
HPTFSHLYPPSPSFI 


8466 


22367 


A 


8527 


2 


186 


IRGLRPLEKRGIRVGVSRFSRYHLSRLS 
FARKGNSPTPCASRVRRCPSLLHGLHPL 
SDKPQ 


8467 


22368 


A 


8528 


3 


153 


CLGFLSGWDCRCIPPCLANLKKLFVEMG 
SLFWPQWILPPWPLKVLGLQV 


8468 


22369 


A 


8529 


2 


388 


RTAVHTF I LVLQVCVCRWKLS 1AALT PA 
HLVCSSHR I LS YLS I RGS WDHRCMTTTP 
G LVI S I FFVEMK Y RYAVHAGH I I»LGS S Y 
PASAYQSAE ITGVSHWAWPTDSLWQKPH 
MVIWCISCFLNVLIGTM 


8469 


22370 


A 


8530 


293 


64 


YRIS I KVSKWFKGWVSSFWI FFFFVFFF 
SFFFVFFCFFMFFLKQYLKFIINIIIIL 
TQALY I CTTS KI CYS FLYFR 


8470 


22371 


A 


8531 


2 


250 


SCLGLPSSWDYRGTLPCLANFCIFNRDG 
VSLCCQGWSQTPGSQVIDEAVYS I YAYY 
IIALLIVYVAQWEVWLKHFCCILXYT 


8471 


22372 


A 


8532 


2 


201 


FHHVGQTGLELLSSGDLPASASQCAGIT 
GVSHCTWPSILYRMNFKNLSAVFIAWVI 
EGNYTNKTCI 


8472 


22373 


A 


8533 


319 


200 


TFTFFLQRWGLTMLPRLVSNS WAQAI CL 
LQPPKVLELWA 


8473 


22374 


A 


8534 


160 


297 


KTETLALVAHACN PSTLGG IGRQVS SAH 
EFKTILGNMTKPILYTKL 


8474 


22375 


A 


8535 


333 


228 


PNRGH YI AI VKSHDF WLLFDDD I VEVS S 
FLISYF 


8475 


22376 


A 


8536- 


3 


325 


PPSSLLPCSWILDCCASNERDSVGVGPS 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoteucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














EPGAGYNLLVCCFLSLLEKRS IRVGVTR 
FSRCRLSPLSLTRKGNSLTPCASQVRRC 
LALLRLAHSALHPLSCSYCLALP 


8476 


22377 


A 


8537 


216 


5 


KLGFLVNLVEFQVKFFFLFFFLEIILIE 
MRSHYVAQAGLELLGSGDLPASASQSAE 
ITGMSHHALPNYSF 


8477 


22378 


A 


8538 


136 


22 


RPRRRLVLNPWPQAILLPWPPKVLGSQV 
WATCGRRPAS 


8478 


22379 


A 


8539 


136 


22 


RPRRRLVLNPWPQAILLPWPPKVLGSQV 
WATCGRREAS 


8479 


22380 


A 


8540 


8 


319 


YIYMSMYMLIYVCIYVSIYICVCIYINS 
ISFNTYMIIYMLIYVYVYMCVYICVCVY 
IYTNSISFNTHITPIKHSDVHTVSTTRC 
NLGGRGFCHTWPLPHLAGW 


8480 


22381 


A 


8541 


331 


156 


SVNGLLSLPACLPVCLPACLPSFLPSFL 
PSFLSFFLLLLLLLQKPVNSLIFSQSLS 
IS 


8481 


22382 


A 


8542 


72 


1 


KIGQARWLAPVIPALWEAEAGGSP 


8482 


22383 


A 


8543 


2 


83 


QAGVQWRDLGSLQALPPGFMPFSYLGG 


8483 


22384 


A 


8544 


293 


2 


KTVWHYLVNAPEVEIQAIYSQETCKSMY 
QETCTRIFIAILFI IAPNWKQRQDTMNP 
GNKENG KVATLDS DGFL VRFQS LMR S NC 
PLYPFNKFPILLK 


8484 


22385 


A 


8545 


314 


122 


LVETGFLHVGRAGLELPTWSDPPTSASQ 
GAGIAAVSHRGQPVDFKNNISTQIQGRP 
IICNYKTF 


8485 


22386 


A 


8546 


316 


123 


LVKTGFLNVGRAGLE IPTWS DPPTWASQ 
GAG I AWSHRGQP VDFKNS I STQIQGRP 
IICNYKTF 


8486 


22387 


A 


8547 


3 


247 


GIMATERLANYTGGIYAEYQDTTYINHV 
VS VAG WG I SDGTE Y W I VRNS WG 5 P WVRC 
FHFLKVIPRKKTSWYSSRLNGSVYV 


8487 


22388 


A 


8548 


289 


25 


QRREKFFKNGPPIFFRGPFGFPQIFVSF 
PFFFPKIGVFFVFFFRFSLVLKGDFFSN 
FLKINCFFFFFFFFFFFFFFFFFFFIKN 
RPCL 


8488 


22389 


A 


8549 


289 


2 


KKKIGETLCKLKLCKKFLDVTPKAQFIK 
QKKEKLINWQFIKIKNFRSPKALLRRGK 
GKPGTGRKRNANHI SDKGPLCRIQKKNS 
RGRAQWLTLW 


8489 


22390 


A 


8550 


290 


187 


RQENPLNLGGGGCSEPRWCHCTAAWVTE 
QDSINK 


8490 


22391 


A 


8551 


293 


183 


RDEGRGISYMLPRLVLNSWPQAILLFQP 
PKVLELQA 


8491 


22392 


A 


8552 


364 


87 


MLPLDFFLGPSLDFCPPFCESGSGGIKV 
PESTTPRASAFLPPGAANLHHI LQLVGA 
PRVPPGFHHVGQDGLDLVDLVIRLPRPP 
KVUGhQGG 


8492 


22393 


A 


8553 


57 


225 


KWQKWTPWAGGVAHACNLSTLGGRGGKI 
ACAEEFKTILENITOPCLSKNKWISFS 


8493 


22394 


A 


8554 


326 


2 


KSKRGYSEPLVFSICSARAPRSHSVSQA 
TVYGHAHASTQPFPPRLNRSSYLSLLSS 
GDYRHDPPPLAYICISVIGNDVEHLFLC 
LLSTCISSVNRSHVYSTLVPIFGP 


8494 


22395 


A 


8555 


372 


148 


Y I CFLRSS VRQ I LFTPDCSDCSMWVPCN 
IRLPGSTHWCASASQGAGTTGARHHARL 
I FFWVFFCI FS I FSRENH 


8495 


22396 


A 


8556 


422 


269 


AADQERLHTYQLNYYKFCKRWGLTTFPR 
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nucle- 
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location 
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Amino acid sequence (A=Alanine 
C=Cy stein e, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, 1-Isoleucine, 
K=Lysine, Lr=Leucine,M=Metbionine, 
N=Asparagine, P^Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threontne, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *-Stop codon,A=possibfe 
nucleotide deletion, V=possible 
nucleotide insertion 














LVSSSWVQAILPTWPHKVLRLQA 


8496 


22397 


A 


8557 


669 


407 


AGVKWRNLSTLQPPPRRFKRFSHLSLPS 
SWDYRCPLPCPANYCIFLVEKEFRHVGK 
AGLELLTSGDPPAWVSLI AG I TGVSHRA 
HPD 


8497 


22398 


A 


8558 


195 


352 


LGLQAGTTASGYSTVILGGMLKPRSWAW 
WPWPATREAEVGGSLEPGRSKLQ 


8498 


22399 


A 


8559 


345 


219 


MLARMIS ISLPCDPPASASQSAGITGMS 
HHGWLNL PKS I HL 


8499 


22400 


A 


8560 


137 


2 


VKY CLPTRSLI KTVRPLS PSQATASPVT 
CTYTPRWPEVTEESQKN 


8500 


22401 


A 


8561 


324 


193 


RS SGNL PASAAQ S ARITG VS HRTRP KC I 
ISKE YVAENNHLKS TS 


8501 


22402 


A 


8562 


326 


121 


HNHTHTHTKNHKKHQQPQHTHTYHTQTD 
IYVLQAASQKSLILISSLAV 


8502 


22403 


A 


8563 


3 


85 


HEHTHTHTHGHTHTHRHTHTHTIiCARM 


8503 


22404 


A 


8564 


2 


153 


ARDRHI PPCLTNFSVFWRDGGLAMLPRL 
VSKLWAQAVHP PWLLKVLRLQA 


8504 


22405 


A 


8565 


2 


177 


ARVGFHYVGQAGLGRLTSGDP pasasqs 
AGITGMSHRARPMS SLTLHILQDTWRAY 
AG 


8505 


22406 


A 


8566 


353 


212 


AGFRHVGRAGLELVTQDGPRTSACQSSG 
ITGMSHCAHPEKSTAFILF 


8506 


22407 


A 


8567 


76 


191 


ELI F CRDGGLTV FP KL I S NTW P YAVLPL 
WPPKVLGLQV 


8507 


22408 


A 


8568 


1 


142 


GTRGFLHVGQAGLELPTSGDPPAIjASQG 
AQMTGVSHCASQKKEFLEW 


8508 


22409 


A 


S569 


250 


95 


SASRAPLPIiPPSHLPIiRAAGLSPLCPPR 
LVSS YRPHVILLPLPPKVLGLQV 


8509 


22410 


A 


8570 


3 


226 


HEHTHTHTHTHTHTHTHGHTHTHTHTLV . 

LrVYSLCEHIWRFIVISRARVCCIYVWF 

FECGYADRRVADKWLWIH 


8510 


22411 


A 


8571 


1 


216 


GTRG LGVG VSHL VS KNTHTHTHTHTHTH 
THTHTATASQLS VF K I WAFQRS LD KGLT 
YISNSYCSSKFYLYNH 


8511 


22412 


A 


8572 


356 


268 


LTMLARLVLNFRPQVIPQPWPPKMLGLQ 
A 


8512 


22413 


A 


8573 


12 


175 


RWGFHPVGQAGLELLTSSDLPTLASQKA 
EITGMSHCARPSFFSFFLSFFFWKHL 


8513 


22414 


A 


8574 


1 


163 


GTRTG FHHVGQADLELLTSGDP PVS ASQ 
SVGITGVSHRAWPVGLCI IALISNEY 


8514 


22415 


A 


8575 


344 


176 


EFLVETGFRHIGQAGFELLTLGDQSTSA 
S QNTE ITGVSHPAPGLAI I FKSGNQAFF 


8515 


22416 


A 


8576 


2 


179 


ARGILCHSGTFKLDSRPSMVAHTCNPST 
LGSQGGRI ACGQE IDTSLLKVARPRLYG 
QIF 


8516 


22417 


A 


8577 


1 


100 


GTRGFTMLPRWSDSWAQVILLPWPPKV 
LELQM 


8517 


22418 


A 


8578 


3 


114 


HERHEILSIYLTIYLSIYLSIYLSIYLI 
YLPTITRI 


8518 


22419 


A 


8579 


1 


264 


GTRPCRIELYRWESIiAKAQETSGEEIS 
KFYLPNCNKNGFYHSRQCETSMDGEAGL 
C WRVYP WNGKR I PGS PE ITGDPNCHI YF 
NVQN 


8519 


22420 


A . 


8580' 




245 


GTSGTSGILGFYLEREDRLLQIRTDTIT 
LSHGYSTFSLNRCDSRHHMNRMFQMLYQ 
SPFLGYIQLSHEVLAFGISYIYWTE 
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C-Cysteine, D=Aspartic Acid, 
E-Glutamic Acid, F=Phenylalanine, 
G^GIycine, H-Histidine, Msoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, PHProline, 
Q=G!utamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


$520 


22421 


A 


85SI 


1531 


1457 


QARNETIARRLWDVSCDLLGLPID 


8521 


22422 


A 


8582 


191 


2 


TITSYFSPTPAPGSHYATFCFYKFDYSR 
YFI KVELWEAETGGSRGQE IETILANTV 
KPRLSRA 


8522 


22423 


A 


8584 


2 


212 


ARAG I YPKE YKS F YYKDTCMRMFIAGLF 
TIPKTWTQSKCPSWIDWIKKMYRQGPIL 
SPMLECSGEVSGFK 


8523 


22424 


A 


8585 


2 


192 


ARGNS PASS SRVAGITGAHHHARL I FVF 
FLFVRFFFSFLKTGFKLVAHSNFQPLTL 
FAPLHFK 


8524 


22425 


A 


8586 


277 


66 


PLDFRLGHKRRLPFPKKKKKESKFVCVC 
VCVCVCVCFKRQCLVTLPRLVSNSWAQA 
ILLSWPPKILALQV 


8525 


22426 


A 


8587 


395 


286 


PHLGL P KCWD YRRE PPCIiAE WLKK YVW 
VTYFSFRR 


8526 


22427 


A 


8588 


307 


177 


RYPSTINYS ILNRDKIMPKLDRMVYKAR 
PRVMCLPWPPKVLGL 


8527 


22428 


A 


8589 


167 


349 


RNLRELGEFFFFFFWFFFFFFFFVGKFS 
VFLLVGNLFLFFGGGIFGLSRVMCFKEF 
RCFS 


8528 


22429 


A 


8590 


375 


240 


LLSRFSWFCFVFRQGLTMLPRLVLNCWA 
RGILLPWSPKVLGLQA 


8529 


22430 


A 


8591 


347 


215 


ETGFPHVGQAGLKLLASSDPPASASQSA 
QITGMSRRAQPSAKRF 


8530 


22431 


A 


8592 


340 


114 


HRTAHCS LYLSGS S DPATS S SQVTGTTG 
KDQHTWLLFKYFCRARVSKINKYKS INK 
FYFLNTQTTPSEYGKASPG 


8531 


22432 


A 


8593 


86 


253 


LRLCCGLSCSTAKKKKKKKKKKPKKKKr 
KKKNLHPKRGGFEEILFFFPKGGKNLF 


8532 


22433 


A 


8594 


3 


135 


GFCHVGQVGLELLTSGDPPALASQNAEI 
TGVSHRAWPPVLFFF 


8533 


22434 


A 


8595 


2 


274 


PRVRTISLLGILVYRSHLISSLLCLEGI 
I LS LF 1 1 ATL ITLNTHSLLANI VP IAI L 
VFAACE AAVGLALLVS ISNTYGLDY VHN 
LNLLQC 


8534 


22435 


A 


8596 


3 


316 


S I P YSWGEKEGI PCMAPPQ I QGTSRLND 
FTALSLHLNTHTHTHTHTHTHE I QHLPP 
QGITALILNSLLRHYCPSLAFPSLSPKQ 
NLTVRHCILSRDTWLATASK 


8535 


22436 


A 


8597 


385 


291 


KAPPLFFFFFFFFFFFFFFFFFFFFKPV 
KKILGV 


8536 


22437 


A 


8598 


275 


369 


GVQVLKLLTSGDLPALASQSAE ITGVSH 
RAR 


8537 


22438 


A 


8599 


98 


344 


VHYEEFMCVCVCVCVCVCVCWPRVRGVD 
MNPVEHPFGGGTHQHIGKPCTIRRNASA 
GRKVRLMAALWTVRIGGTKTVTLRKT 


8538 


22439 


A 


8600 


328 


195 


THVPPSLAKFVFLVKMGFLHIGQAGLDL 
STSGDPPALASHVFKR 


8539 


22440 


A 


8601 


2 


214 


HVDWMGTFAILSELHWDMLHVDPENLR 
LLD S VLNCELANHFGTDFTP PVQAAYQK 
WADVANALAHMYH 


8540 


22441 


A 


8602 


89 


181 


GLRHENRLNSGGGGCSEPKLCHCTPAWA 
TE 


8541 


22442 


A 


8603 


340 


57 


KEGHQEMEQAEDGGHGRKAPGTS P WTLG 
ELPKSARGHTGLFVSYRGCRTQFRFKKV 
LFGGDRNRGLAMLPRLVTjNSWPQEILLP 
QPSKVQGLQK 
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residue of 
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Predict- 
ed end 
nucle- - 
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correspon 
ding to 
last amino 
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residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=G 1 utamic Acid, F=Pbenylalanine, 
G=Glycine, H=Histidine, 1-Isofeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V= Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possibfe 
nucleotide insertion 


8542 


22443 


A 


8604 


330 


132 


TGFLHVGQSGLKLLTSGDLPTSAPQNAG 
ITGGSHCTRPFSLFHS YLVLFYGCS SSQ 
VSLSILARM 


8543 


22444 


A 


8605 


546 


279 


RPGPTVSPRLECICMIMAHCSLDFPGSG 
VS PTSAS RVARTTS THKHAWL I FYS S VE 
TGSHHVAQAGLELLGSSNPPI SASQSAG 
ITAL 


8544 


22445 


A 


8606 


3 


249 


NRFNLGSGGCSEPRSHHCTPSWATEQET 
CITYLEYCFPPVSHTNLVYFKPLNVTIR 
SDYC I LGLHAS S IGGSDSL 1 1 DCWD 


8545 


22446 


A 


8607 


159 


324 


VIQYRTNILHLHLCEVLVFWFLPYFFLK 
KT Y TGI GQMRWLTPVI PALWEAEAGGS 


8546 


22447 


A 


8608 


334 


179 


TGFPHVGQSGLKLLTSGDLPTSGSQSAG 
ITGVSHCTWLEVIYFLNISIYPS 


8547 


22448 


A 


8609 


278 


202 


QQPSTLRQDPPPAKRLHLSEGSDGH 


8548 


22449 


A 


8610 


236 


66 


AASTFFLQKIKRHLLSGTVAHACNPSTL 
GGRGIRI SRGREFKTSLGNMVRPRVYKK 


8549 


22450 


A 


8611 


29 


177 


GKQVMALHCFI FFHFFFRRSLAMLPRLV 
SNSWVQEILPPWPLKVLELQA 


8550 


22451 


A 


8612 


1 


215 


HTSRTLFVHLRLKNFNKYLIEWGLSMLP 
RL VLNS WAQAVXiP P WPS KVLG I TGMS HC 
TWLYLHLSTDLLKLC 


8551 


22452 


A 


8613 


3 


91 


LRKGNCLDPGEGGCSEPRSCHCTPAWVT 
E 


8552 


22453 


A 


8614 


2 


167 


FT I S LLG ILVDRS LTHH I I KPS FTRE 
NTLMFIHLS P I LS TRKQLAI QHLSRLE 


8553 


22454 


A 


8635 


2 


281 


FCI LVETGFHCVAHAGLQLLSSGNPPAS 
ASQSVGITGMSHCTSPNLSLLTQSKSFC 
LLIQNHNPYAFINSQLTFS ILFLLSYFI 
FYDSLFLLF 


8554 


22455 


A 


8616 


282 


182 


GQAVTWLTPVI PALWEAEVGGLLEPRSSR 
PAWST 


8555 


22456 


A 


8617 


379 


207 


SNOTTKKYINFFFVEMRSHYVAQAGLKF 
LGSSNPPTSASQSASITGISHYVWPISP 
S 


8556 


> 22457 


A 


8618 


705 


483 


DKSFALVARAGVQWHHLGSLQPLPPGFK 
QFSYLSLPSSWDYRHAPPRPANFCILSR 
DGVSPCWPGWSRTPDLR 


8557 


22458 


A 


8619 


359 


220 


RQS LTMLPRLVSNS WAQAI LP P WLSRLL 
GLQALAPI PGKS YNEKTP 


8558 


22459 


A 


8620 


3 


186 


HEVSWVKRKQDEWIEFDDDTVS IVAPED 
ILRLSAGGDCHIAYVLLYGPRRVQILEE 
ESEQ 


8559 


22460 


A 


8621 


1 


116 


GTSFCRDGGLTVLAQLGSSFWPQRILPP 
WPPKWELQA 


8560 


22461 


A 


8622 


369 


188 


MYSLEMGFIGNGHAGLELLASGDLSALA 
SESAGITCMSLRYRTRFSFKRATCGLDL 
SLQS 


8561 


22462 ! 


A 


8623 


202 


22 


YILGFYFMPVICYQERKKDKKNWEVTTI 
MS F VG I CMELGAI I LS KLWQEQKTKYH I 
FSQV 


8562 


22463 


A 


8624 


387 


315 


SNLGGGGCSELRLCHCTPAWATE 


8563 


22464 


A 


8625 


2 


135 


TRDLCSLVYLLTFPPLLSHDPAKSPSVR 
NTQELS I KKKKKGRP F 


8564 


22465 


A 


8626 


3 


232 


HASAFEEPVYIKSRQKRKESNPPKLVSS 
QPHGLKKKKKKKKKKKKKRGRGLKKKKK 
KHNKKFKKGQKKYWFGRGAF 
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SJEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
VSSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nuc)e- 
ottde 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=A la nine 
OCysteine, D=Aspartic Acid, 
E=GJutamic Acid, F=PhenylaJanine, 
G=Glycine, H-Histidine, I=Isoleucrne, 
KFLysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glu tannine, R=Arginine, S=Serine, 
T^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, V=possibIe 
nucleotide insertion 


8565 


22466 


A 


8627 


97 


373 


QHGFLKKKKKKKKKKKKKGGPLLKKPLG 
GPNFTGDGKKKFFPKKGGIKKPPGEFLK 
KNLFLGGKKNGKNPPKKLNPLGEKKNFK 
GKKGKNPP 


8566 


22467 


A 


8628 


2 


188 


RYGCPPPYHTFEEPVYIKSRQKRKEWNP 
PKLVSSQPHGLHDFFKKKKKKGAAVLKD 
PSGGPS 


8567 


22468 


A 


8629 


287 


68 


GEEGRTQKGERNGGGGSERKEGEEHEAR 
RTGRGEGGGGEAKGRPRSARRRGRGSRG 
SKSQTPSRHLPAHI ITN 


8568 


22469 


A 


8630 


364 


136 


VSLLLGLEYSGTITAHYSLNFLLFIVET 
GSHYVAQTGLKLLGSSNPPTSDSQTFEI 
TGVSHGAQPKVQFWSKNLDT 


8569 


22470 


A 


8631 


355 


70 


LFPPRFFPFFSPLSPLKFFFSPKGFNFF 
REFFPIFSPPKKRVLSKNSPGGFYKPPL 
RGKI FTFPPPVKFGPPRGLFKGPPPFFF 
FFLFFFFFLVI 


8570 


22471 


A 


8632 


2 


266 


AFTISLLGILVYRSHLISSLLCLEGIIL 
SLVI IATVITLNTHSLLANIVPIAILVF 
AACBAAVGLALLGS I SNT YGLDYGHNLN 
LLQC 


8571 


22472 


A 


8633 


1 


186 


LTHH INN I KPS FTRENTLMF IHLSP I hh 
LKKKKKKKKKKKKKKKKKKKKGGALKKK 
PGGGQN 


8572 


22473 


A 


8634 


2 


80 


VLE I AVAL I QAYVFTLLVSLYLRDNT 


8573 


22474 


A 


8635 


713 


333 


EGPPPPRSKKKGTQGGGKDPPS PPEPKT 
PGNPQGRKTGPPPPGRPYTGPRPPGSGP 
TRGGGSRS SSSNTNAPGEKI FFS KNPGR 
KIFPPRAILVFFSPFPLKNFFFSLRLLI 
FLGGCAPFFPPPK 


8574 


22475 


A 


8636 


389 


1 


FIFSFLFLIIFFFLNINFFFYIFSSLSF 
FFFLNFNLLLKHTPPIILFFFSLSPIFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFGTKKELY 
LLRARGCKQYKNQKLIWYLTF 


8575 


22476 


A 


8637 


2 


192 


VRTLGLRTDAIPGRLYQTTFTATRPGVY 
YGQWYE I CGANHS FMPI VLELLALTI FE 
IGPEFTL 


8576 


22477 


A 


8638 


1 


87 


PTRPLE I AVAL I QAYVFIXLVS V YLHDN 
T 


8577 


22478 


A 


8639 


2 


132 


INLASTLIIFTILILLTILEIAVALIQA 
YGVTLLVSLYLHDNT 


8578 


22479 


A 


8640 


2 


113 


LFFIAEYTNIIIINTLTTTIFLGTTYDA 
LSPELYTTY 


8579 


22480 


A 


8641 


375 


22 


FIFISPLAYLFFLPFPFFFIPPLCFFFF 
IPLFFLFLSFLFSLIFFLFSLILIFPPS 
LFFFFPPPFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFC WLPHR VMIAKVG I S WSKKI 


8580 


22481 


A 


8642 


180 


34 


WFISDSLWLGMVAHTCNPSTLVGQGGWI 
TCGRSRPSVPKRTPSOTCAPA 


8581 


22482 


A 


8643 


338 


133 


NLPLGLFYSLQKKKKKPFLRPGTLGAPK 
GNFQGAPPLFGKFAFFFFFFFFPFFFFF 
FFFYG FFFFLFK 


8582 


22483 


A 


8644 


156 


287 


LTYPI KPPLLCSQKKKKKKKKKKKKKKK 
KKKKKGGAFKKNPWGAQI 


8583 


22484 


A 


8645 




325 


LHHCH THN F VNKW C F I R KKKKKG KKKK 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
corrcspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, ! 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leu cine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Pro)ine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *~Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KXREKKKKKKKKKKKGGGPLKKPPGGPK 
FSGGGKKKIFPQRGGYKKPPGGFLGKTL 
FLGGEKMGENPPKKIKPLGEKKIF 


8584 


22485 


A 


8646 


370 


3 


PVRAS RLLCL PKQAWAMAGAPP PASLP P 
CSLISDCCASNQRDSVGAGPSEPGAGYN 
LVMHCFLSPSEKHSIWVGVTRFSRCCPS 
PLSIiTRKGNSLTPCASQVRQCLALLRLV 
QGAGTHRTRG 


8585 


22486 


A 


8647 


341 


88 


KKKKIFSPPIKRGPPSVYFICPPPPFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFRMGCDRWHGELYNKI 
PIWHKLCGDL 


8586 


22487 


A 


8648 


1 


108 


PTRPRTRGVASVLYFTTILILIPTISLI 
ENKILKWA 


8587 


22488 


A 


8649 


3 


96 


EICGANHSFMPIVLELIPLKIFEIGPVF 
TL 


8588 


22489 


A 


8650 


457 


297 


SPSFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFLFYFALF 
FIYLIIKDNYMFQC 


8589 


22490 


A 


8651 


84 


352 


YEKTDVKI I S I LFNTD YMLE YNVLH I LG 
QIKMLLISKKKKKKKKKKKKKKKKKINK 
FFWGGLILTLLGIVLYSPLEALKISPSF 
GFLEK 


8590 


22491 


A 


8652 


2 


111 


NNIKPSFTRENTLMFIHLSPILLLSLNP 
DIITGFSS 


8591 


22492 


A 


8653 


1 


305 


RQQQQGQQLRNLRDFLLVYNRMTELCFQ 
RCV P S LHHRALDAE E EACLKS CAGKL I H 
SNHRLMAA YVQLM PALVQRR IADYEAAS 
AVPGVAAEQPGVSPSGS 


8592 


22493 


A 


8654 


129 


268 


LSEVLYLFPPKKKRLIILCLFFRNPGSS 
NLQKITKEPI IDYFDVQD 


8593 


22494 


A 


8655 


281 


117 


CI ITLSFINTLQAY I IWLWFSEVCL YV 
SLS I YLS IYL*S I YVFEYRY I KISFHL 


8594 


22495 


A 


8656 


3 


177 


DLHAHKLGVDPGNLNLLSHCLLETLAGH 
LPG E FT PAGHAFLD KFLDF VST VLT YK Y 
R 


8595 


22496 


A 


8657 


3 


270 


RRRGRAHCSLDLLGSRNPPASASRVAVT 
TGTRHLAW L I WGG FLF F VLRD VL VR FH 
ATDKD I PKTGQFTKERGLMDLQFHMAGK 
ASQS 


. 8596 


22497 


A 


8658 


107 


11 


QFEHTKPTPFLPTLIALTTLLLPISPFI 
LIIL 


8597 


22498 


A 


8659 


21 


136 


VPIEHDVPVPNSRAELLKMFGIDRDAIA 
QAVRGLITKA 


8598 


22499 


A 


8660 


1 


395 


LVTALYSLYIFTTTQWGSLTHHISNLEK 
XKKGKGKKKKRGGALFXGI FGGSHFFGV 
WSLLFFFLKGGIKKTILGFLGKTLFFGG 
GLLGAPLPRKIKGLREKKNFKGELGVKN 
RVFFFFGNFSSLGVYLKKY 


8599 


22500 


A 


8661 


1 


255 


RTRGRTRGRTRGLTRGKKKKKKIMKKKK 
KKKKKKKKGGAFLKNPWGGP I FWGLPKF 
YFLPNKGSFFNLIGDFLKRPFFWGGAYF 
G 


8600 


22501 


A 


8662 


136 


358 


FHI VKGVS I I KFKC V ILK KK KKKKKKKK 
KKKKKKKKKKGKKLKKKKGG KKKKGGGK 
NKI FFWVGFF YKRVWG YF 


8601 


22502 


A 


8663 


198 


368 


LINLVQFTCILGTLLGILFIYLFLRRAI 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

etb 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, ]=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R^ArginineyS^Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possibJe 
nucleotide deletion, \=possible 
nucleotide insertion 














TMLARLVLNS W PQAVLLPQP PKVLGLQA 


8602 


22503 


A 


8664 


407 


82 


KFLPHLGFPPFKRFFIKKEPFQWFFFLF 
GFFFFRRSFFLLPRVQCNGAISGPCNLC 
LNVKVRLS FKKKKKKCWD YTFE PLCPAL 
HFSFINKTEDCLFVCFTVYWPHGF 


8603 


22504 


A 


8665 


166 


293 


KI IAKHNIARTNPYTFCIMKKKKKKKKK 
KKKKKKKKKKKKKKGGAL 


8604 


22505 


A 


8666 


1 


164 


RAKAKRRNTTVS CRMRHLKI VYRRFRYR 
LYVGI IIGPGGLNETLVFTCKKITVP 


8605 


22506 


A 


8667 


295 


85 


YI YMSHI CTLSLSLSI YKRVKCFCFFKT 
WSHS VAQAGVQW RDLGSLQPPPPKWI LC 
ISVFFTVLFVMKKN 


8606 


22507 


A 


8668 


2 


131 


INLPSTLIIFTILILLTILEIAVALIQA 
YVFTLLVS LYLHDNT 


8607 


22508 


A 


8669 


423 


107 


HCALPKKKKKKKKKKKKKKKKKKKKKKK 
KKKKKKKKKKKKK 


8608 


22509 


A 


8670 


432 


301 


SQILGRFSQKQHLSLGGGGCSHPRSCHC 
TPAWARGRLRLKKF I K 


8609 


22510 


A 


8671 


1 


106 


LKLLTS S DLPAS ASQVAE I TGMSHCAR P 
LSRIYMC 


8610 


22511 


A 


8672 


383 


252 


CPANFVFSVEKGFLHVGQAGLELPTSGD 
PPGCWFCFLQQQLHW 


8611 


22512 


A 


8673 


560 


135 


SSWGMDS I PTSSNMEETQQKSNLELLHI 
SLLLIESRLEPVRFLRSTFTNNLVYDTS 
DSDEYHLLKDLEEGIQMLMGRVEDGSHL 
TGQTLKQTYSKFDTNSHKHDALLKNYGL 
LPCFRKDMDKVETFLRMVQFRSVEGSCG 
F 


8612 


22513 


A 


8674 


2 


326 


SLLMCVHRCECVCMRACLCAGVCMCIAS 
CLG L PMWWE C Y TWRVLV FHQ FQDE ELH 
DTVDLET I PLERQPRDVQHPVSTRILYL 
HVY FVAVTLTL I R I LQLWTE AFS P 


8613 


22514 


A 


8675 


171 


272 


TQCCLLKAEKKKPKKKPKKKKKKKKKKK 
KGGPF 


8614 


22515 


A 


8676 


60 


409 


LVWLESHSDVGPLIKDSKQEKKKKKKKK 
KKKKKKKKKKKRGGAFKKKPGGGQNLRG 
GEKKNFFFLRGEKKNPRGNFGKKTFFWG 
GENWAKPPPKKKSPEGKKKILKGKGGKK 
YLNFLGE 


8615 


22516 


A 


8677 


106 


286 


LFWNKEGNNWKKKKKKKKKKKKKKKKKK 
KKKKKKKWGGALLKI SGGGPI FPGGGKN 
FFFFFGGGF 


8616 


22517 


A 


8678 


1 


97 


I I GQVAS VLYFTTI LILI PTI SLI ENKI 
LKWA 


8617 


22518 


A 


8679 


1 


141 


NEHQPGQHGEMPSLLKIQKISWAWWSMP 
VIPATQEAEMEKESQISIM ! 


8618 


22519 


A 


8680 


499 


140 


GVFFFSRRKRCGVSPPPPPKPVFFPLPG 
FFFWWGGCPSAPPPPGGGFPPNPPRGFF 
YPPPLKGNFFFPPPAGGGPPRGFFWAPP 
PPFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFS 


8619 


22520 


A 


8681 


60 


224 


KQTKY IKEI TICK I CT WQGR YHDHEGGPP 
RVKLIHCTPDlXiLTPVISPDVGNSTTV 


8620 


22521 


A 


8682 


252 


383 


SQXKKKKKKKKKKKKKKKKKKKXXGGGL 
LKNFWGGPNLTGGGKI 


8621 


22522 


A 


8683 


405 


145 


KFFFSGGCFFFWGPPGKNVPPPKKMFFF 
LIPPPKKKXPPPPKKIFFPPPGVFFPPP 
PPFFFAPPPFFFFFFFFFFFFFFFFFFF 
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Predict- 
ed end 
nucle- 
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residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I^Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptopban, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














FFF 


8622 


22523 


A 


8684 


1 


406 


LYYFLTKSTTTYLAVPQPFPPTPSTPSR 
AKKKKKKKKKKKKKKKKKKKKKKKKKKK 
KKKKKKRGGGPKKKTR KGGGGQKKKKGR 
GPEKNRGGGETTPGEKKKKKKKGGGGKK 
KKKKSGGGKKNGGKPGRGGGGGR 


8623 


22524 


A 


8685 


391 


265 


GRP F FF FLDGVS LLLPRLE C S G V I LAH C 
NLCLPGPSDSPSSA 


8624 


22525 


A 


8686 


2 


177 


DTALYSLYIFTTTQWGSLTHHINNIKPS 
FTRENTIaMClHLSPILLLSLNPDIITGF 
SS 


8625 


22526 


A 


8687 


479 


366 


GLPKCWDYRREPPHPAANFCI FSRDGVS 
PCWPGWSCL 


8626 


22527 


A 


8688 


35 


161 


SNPPSSAFQVAGTTGVRHHDQLTFFFFF 
FLGGTGQILCRGGD 


8627 


22528 


A 


8689 


1 


122 


TKHINNI KP S FTRENTLMF I HLSP I LLL 
SLNPDIITGFSS 


8628 


22529 


A 


8690 


2 


152 


NIILAFTISLLGILVYRSHLISSLLCLE 
GIILSLFIIATLIYPTPSFSCY 


8629 


22530 


A 


8691 


541 


85 


IFLRQFLSPRLEYSGVI KAHCSLNLLGS 
SDPPASAFKVAGTTGIRHHAWLIFVFFF 
AETGFHYVAQGGLELLNLRNLPATAFQS 
AG I IGISHHDLLGIQITTWLLQKLS YLQ 
KKICKDQETGLLHAFHLHPYPYPMPSSP 
QLLQGPDILTNG 


8630 


2253] 


A 


8692 


122 


38 


GRVDHS FMP I VLELI PLKI FEIGPVFTL 


8631 


22532 


A 


8693 


276 


77 


GVSPCEPVFFFILGRCKPPAWGVPKGRD 
FFVLS FFFWF Y I LLFFFFF? FFYFFLF 
SFYCFNGSNK 


8632 


22533 


A 


8694 


3 


70 


FMP IVLELI PLKI FEIGPVFTL 


8633 


22534 


A 


8695 


230 


3 


GSGIPSHPGHPSLPYHLEREWLAKTGIR 
DTSNSVHLLELCLRSQRHGRARWLTPVI 
PALWEAEAGGSRRQEIEPI 


8634 


22535 


A 


8696 


401 


240 


PPPPPPPCGSEPRPPHPSLS PPGSREMF 
VLALSQESQMTFFFFQLRKKLHLYL 


8635 


22536 


A 


8697 


103 


313 


EKKKKKKKKKKKKKKKKKKKKKKKKKKK 
KKKKKKKKKKGGGGPKKKKFEAIFGKTP 
GGVSQGGGPPPLNFFFLNPGPPFF 


8636 


22537 


A 


8698 


354 


110 


KKGTLFEVSESF YTTEGVSAKINRI RS I 
VS VNKVARCIGEI FTDAVQS AFQKE VGG 
VSDSMVHKYEYAGCVDVKAVCSHGL 


8637 


22538 


A 


8699 


2 


164 


FLVETGFLHDGQAGLKLLI SGDPPA.SAS 
QSAEITGMSPHACNSSTLGGRGGRIT 


8638 


22539 


A 


8700 


191 


371 


PPDSRWAIQGGHLELTPTTRQDLAQAPR 
RKLALLEGSFQGQAQWLTPVI PALWEAE 
ASGS 


8639 


22540 


A 


8701 


2 


170 


ARAGLYHVGQAGLKPLTSGDPLASASQS 
VRITGVSHRTRPKI I FKLI FFLFFFFFF 


8640 


22541 


A 


8702 


31 


229 


MEKYNVHPHSGI LRSHEKEQAALFT I AK 
RWRQPSYPS IDE W INIMWSSHTVEYYTA 
MKRNKQLYSQ 


8641 


22542 


A 


8703 


3 


126 


LQELRDPTLTFRLLGSPRPWVETRPVD 
DPTAPSNLYIQE 


8642 


22543 


A 


8704 


438 


31 


SEPCAGYNLLVCRFLSPLEKHW IRVGVT 
RFSRCHLSPLSLTRKGNSLTPCTSRVRQ 
CLALLWIAHSALHPLSCMKCLALPSEMN 
PVPQMEMQKSPI FCIAHAGSCTPELFLF 
GHLGSTPLTOLLSLRINVFLRTSE 



T 
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NO: of 
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SEQ ID 
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peptide 
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NO: in 
USSN 


Predicted 
beginning 
nucleotide 


Predict- 
ed end 
nucle- 


Amino acid sequence (A=A)anine 
OCysteine, D=Aspartic Acid, 
E>=Glutamtc Acid, F=Phenylalanine, 


sequence 


sequence 




09/515,1 
26 


location 
correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


G=Glycine, H=Histidine, I=Jsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unkno>vn, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


8643 


22544 


A 


8705 


429 


319 


PGAI IAHRSLNLPGSSDPPTSASQVAGN 
TGVSYCARP 


8644 


22545 


A 


8706 


426 


245 


RAAT P S P I KLT KKKKKKKKKKKKKKKKK 
KKKKKKKKKKRGAPLKKTLGGPQ INRGK 
KKIFFFF 


8645 


22546 


A 


8707 


1 


108 


KHPRGGSFGYCFALAWVAFPLALVSGI I 
YIHLRKR2 


8646 


22547 


A 


8708 


3 


146 


QI EGHT I SALGDGAAWPVQGLI RHFRPE 
LEERMQRFAQQHQARQAAS 


8647 


22548 


A 


8709 


2 


83 


LVLE I AVALIQAYVFTLLVS LYLHDNT 


8648 


22549 


A 


8710 


1 


81 


RPLSKTVRFNVLKVTKAAGTKKQFQKF 


8649 


22550 


A 


8711 


156 


4 


RPLS FHPGGKKRLFLKKKKKKLCAVAHA 
CNP S TLGGRGGR I MRSGDRDHPG 


8650 


22551 


A 


8712 


440 


233 


TPSYFVLLVERGFSMLPRLVSLTSGDPP 
ASAFRSAGITGMSHHAQPRTGRSKAYLP 
FTiEEPDLKHIFPF 


8651 


22552 


A 


8713 


92 


302 


AMHPTMGPQDQGHTGGLYAHTPLHPHSH 
THANTLIHSHIYAHAHSHTIPATCPKGP 
TQPYLCLLSPKIEL 


8652 


22553 


A 


8714 


2 


166 


KKINSFFSQGLTQAKVHGGRPSSLGPQT 
PVLKRSSCLSFPSSWDYRHEPLYPAN 


8653 


22554 


A 


8715 


1 


129 


GFRKVGQADLELLTSGDLPASASQIVRI 
TGVSPCAQHEFFKSN 


8654 


22555 


A 


8716 


381 


105 


RRSLLHS VLNGAQAGVQWRDLGS LQPPP 
PSSLPWPPKVPRLQPLPGRHPVWEVRSV 
SARPPIVWDVRSPSAWLPSLESEERLCL 
AAIPSGK 


8655 


22556 


A 


8717 


385 


125 


FFFFLPPAFLLSGXFWFFFFYIFSFFFF 
FFFLGSPPPPIFFFFFFFLNKIFFFFFF 
FFFFFNNLFFFFFFFFFFFYFFFFFFFF 
FFF 


8656 


22557 


A 


8718 


409 


264 


KTDQRWSSTSSSKIMSQSQVSKGVDFES 
SEDDDDDP FMNPS S LRRNRR 


8657 


22558 


A 


8719 


3 


269 


KKHVKRVLSHLKAS CP EELLHFS S WPQL 
LELQI^IGGGSHTHMDQHTHTKMHPHIYT 
HAHTHTWAKLNELP WVEMNLLS LGAFQ 
QRLP 


8658 


22559 


A 


8720 


430 


141 


LFFFLSFFEDGVLLCHPGWSWAQSQLT 
ATSTSKKKKKKKATPPGIPKFLIGKSGK 
PPRVLLIGHWAPFKFFFFNPRLAFF 


8659 


22560 


A 


8721 


386 


290 


RWGLTMLPRLVLNYWLQAILLLWPPKVS 
GLQA 


8660 


22561 


A 


8722 


3 


157 


HEVFLVETVFLHVGQAGLELPTSGDLPT 
SALWEAEWGGLLEDRS SGPAWET 


8661 


22562 


A 


8723 


493 


186 


FSPQGGEKRAILGLRNLCPPGVKDFSAR 
PPKEVGNEGRVPKTKLFFVFLKKKGFPL 
IGRGGFKSRPWESPPRPPQKVGVQRLNP 
PPGPFFFFDLIWPNRNG3 


8662 


22563 


A 


8724 


3 


186 


GGGTPPFPGVWGKGNGGGTPPAKTLPPF 
PSPLFFFSPCPSPGEGGSREPLLSTPLA 
CICI 


8663 


22564 


A 


8725 ; 


370 


67 


FAPPKNFYKGAPPIFFIFLFFILFI YFF 
FFFFIFFKFLVFLLETGFHHVSQEDLDL 
VIHLPQPPKWGLQAWATTPGQFFVFLI 
ETGLHPVNQDGHKLLTL 


8664 


22565 


A 


8726 


966 


652 


LGSLKPP PTGFKR FS CLS LPS S Y RHA 
PPCPANFVFLEETGFLHVGQVSLELLTS 
GDPPTLAS QS AG I TGMSKRDWLGQIILI V 
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NO: of 
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sequence 
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M 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
diog to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M==Methionine, 
N=Asparagine, P=Proline, 
Q=G)utamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
\Y=Tryptophan, Y=Tyrosine, 
X=l)nknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














NHCYLFYQGHIKTTRSLLPPP 


8665 


22566 


A 


8727 


85 


425 


HMHS FAHELHTFLHTHTNSHTNSQNQS P 
TYTS KSHTHS YMR FTDSHDS QMQQTHRH 
S LLHMTQLYTCTHTHTHTRTVI PLHLHS 
VKGWLNTKVALGGRTSHGRESHIAGRLL 
A 


8666 


22567 


A 


8728 


393 


124 


LGSGE PQLFQS P PS GGPPGPKSH WNS WG 
NANPPVPPYIIESAPPPPHFPFGAQSKY 
S GGQAARLCPCP P FFL»KKKKKKA.PGSGT 
GTFSFQ 


8667 


22568 


A 


8729 


25 


217 


AVQP I RVQWRRS LQS PQEQ I I LAPSLAK 
VDMEMTQLTQENADFATRDRYHHSSLVN 
REQLMPHY 


8668 


22569 


A 


8730 


1 


611 


PGIFYSALLSLDTS ILNQLCFIMHRYRK 
NLTAAKKNELVQKTKS EFN FS S KTYQE F 
NHYLTSMVGCLWTS KPFGKGI YI DPE I L 
EKTGVAEYKNSLNWHHPSFLSYAVSFL 
hQTSSSSSSSSSSSSSSSSSSSSSSSSS 
SSSSSPSPEERTVNVSSIRGKKWSWYLD 
YLFSQGLQGLKLFIRSSVHHSSIPRAEG 
INCNNQY 


8669 


22570 


A 


8731 


120 


378 


VEKTQAS ILGCKCSTPRAALGFR2RTLP 
LRRCPCCGPPEAQMELEKRALGSQAQWL 
MPVFPAL^^EVGESLEARSFETSLGNI 
A 


8670 


22571 


A 


8732 


2 


68 


VYVCVCVCVCVCMCVCRAKVGM 


8671 


22572 


A 


8733 


382 


258 


NYHSSLHLETPGLKQSSCLSLPKSWDYR 
HSSPCPALIFNSL 


8672 


22573 


A 


8734 


479 


326 


PPLYGFFFFFFFWGGGFFFFFFFYFFFF 
FFFFFFFFFFFFFFFFFFFQTTF 


8673 


22574 


A 


8735 


422 


2 


GRRPPGLYFFFFPPGKKNFQGRGGFLFF 
FPPKGFFFFFFPIGFFFFSSELGKDWPP 
PKKKVFSQRFPFFFFPPPLFLFLFFFFF 
FFFFFFFFFFFFFFFSFFFFFFFFFFFF 
FFFFFFFNKQVFIERLLCARHCFRPPAS 


8674 


22575 


A 


8736 


7 


75 


I AVAL I QAYVFTLLVS LYLHDNT 


8675 


22576 


A 


8737 


2 


176 


NFGLLAETGFLYVGHAGLEVRSSGDLPA 
SASQRAGI TGVSTAPGFNRYFYKQTI YK 
YG 


8676 


22577 


A 


8738 


408 


111 


GSSPPPRAGGENFLKKNAGGKNFPGGEG 
GGGFYPLYPKKFFFSPKGFIFWGGGGGK 
WPPPKKGGFSKKPQKVFFTPPQKKXKFF 
WPPGGNWGPPKNF 


8677 


22578 


A 


8739 


5 


249 


YEGLKKLHMQASPFQRGHPVNHKKKKKK 
KKKKKKKKKKKKKKKKKKKKKKKKKKKK 
EGALGGKFLAQTS FKGEXQKVFI XY 


8678 


22579 


A 


8740 


1 


226 


FFLRQSLSVTQAGVQWCDLSSLQLLAFQ 
GSSNSPTSASQVAGITGVHHHIQLIFIS 
IiVETGFHHVAQAGLELNXjQ 


8679 


22580 


A 


8741 


4 


96 


DGLIMLPRLVSNSWPQGILPPWPSKMLG 
LQV 


8680 


22581 


A 


8742 


19 


112 


FGRILLVREKKKKKKKKKKKKKKKKKKK 
GGGPL 


8681 


22582 


A 


8743 


362 


234 


KMDQKCLSEVLQRWFPCCCCCCCCCCCC 
CCFSLPFIFEPSRS 


8682 


22583 


A 


8744 


3 


419 


LTLRRFQLNLTALTKKKKKKKKNLKKKK 
KKKKKKKKKKKKGGPLFKNSLGGP I FPG 
GGRFIFFFFWGGFLKPPGDFLKKTFFLG 
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correspond 
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peptide 

sequence 


Predict- 
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nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C— Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F^Phenylalanine, 
G-Glycine, H=Histidine, I=lso leucine, 
K=Lysine, L=Leucine, M=JVIethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














GEIFGPPPPKNLTPLGKKKIFKGVGGKN 
PPLFLRRKKFFSGGFFKKI FS PGLGF 


8683 


22584 


A 


8745 


444 


1 


KKPSKEILYPENSRFFSPLSPLKFFFSP 
KGFNFWGGGGPLCPPPKERFFSKNPPGG 
FFSPPLKEKIFFFPPPVNLGPPRDLFKG 
APPFFFFFFFFFLWEMFYWLSLSSGSQR 
LPQQVPTVEPSELGGMGAYFVSKRSTYL 
QRGWRPER 


8684 


22585 


A 


8746 


104 


222 


NKTFCLLKKKKKKKKKKKKRKKKKKKNK 
KKKKKNWGGAL 


8685 


22586 


A 


8747 


403 


201 


MEKYNWQPHSGILYSHEKEQAALFPIA 
KRWRQPSCLSIDEWRWIMWSSLTVEYYT 
AMKRNKQLYSQ 


8686 


22587 


A 


8748 


1 


90 


RTRGAVYAALERMGLDGCVEDLRSRLQR 
GP 


8687 


22588 


A 


8749 


444 


85 


SPPPPGLFFFFFFPKKKTSPPPTKKGFF 
SPPPPQKFFFFLKPFFFLGGLGPNFPPP 
KKNFFSKNPPRFFFFPPFKKKNFFFPPP 
FFFAPPKFFFLTPPPLFFFFFFFFFFFS 
FFFFFFFL 


8688 


22589 


A 


8750 


2 


309 


EFFPPYWEFLKINACMFSPEKKKKKKKK 
KKKKKKKKKRGPPLKKTPGGPQI PRGGK 
KKIPPLKGGQKKPPRGFLEKNPLFGGGP 
FGPPPPPKINPPEKXKNF 


8689 


22590 


A 


8751 


3 


76 


TASVSEGGGLQGITMKDSDEEEEG 


8690 


22591 


A 


8752 


433 


145 


SFFFFFFFFFSFFFFGKKSSFFTPb 


8691 


22592 


A 


8753 


2 


124 


GHLLMHXilGSATLAISTINLPSTLIIFT 
ILIKKKKKGRPF 


8692 


22593 


A 


8754 


80 


201 


F INANSKKKKKKKKKKKKKKKKKKKKKK 
KKKKGGGLLKKTPGGAQF 


8693 


22594 


A 


8755 


415 


1 


IYFPTPEKFGPPKETLKKGAPFFFFFKQ 
KFPFFGPGGKQKGGFGSLQTLPPGKKKI 
SRPNP PRKRGFKDAPPPPGKFVFLKKKG 
VFPGGQGGFKTPTPRDLAPPAPLNPGAL 
FFFFLRQSLSLSLRLECSGT3 LAHAS 


8694 


22595 


A 


8756 


3 


136 


FTATRPGVYYGQCSEICXSANHSFMPIVL 
ELIPLKIFEIGPVFTL 


8695 


22596 


A 


8757 


80 


326 


KKFSLGSQGGRAGVFPRPPLGKLQTTAL 
NSGAKGTPPFLFPEPRKRGGPPPAPGWE 
GIFCPHFFPMGQKGEGGLWHTGEGPH 


8696 


22597 


A 


8758 


1 


164 


SLISSTQGHKQCRRPQGPLPRKTRDPCS 
HVYLLTFPPLLS YDPAKS PSLRNTQE 


8697 


22598 


A 


8759 


431 


79 


FFFFFIFLKKVSTQSPPPQFYFFFFLKT 
FFLFQPLLKEGGSWGGRGDKPPPLAPFK 
KWGVFFFFFSFFCCFVFFFFPLFFFFFL 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFPRCVLTRL 


8698 


22599 


A 


8760 


60 


194 


ARPTCPATAVTQKKKKKKKKKKKKKKKK 
KKKKKKGGGVLKKKQGGRKY 


8699 


22600 


A 


8761 


383 


38 


CAS MTFP KKKKKKKKKKKKKKKKKKKKK 
KKKK 


8700 


22601 


A 


8762 


399 


164 


H I TVKS LLVPMDDPPKKKKKKKKKKKKK 
KKKKGGGFKKNLWGGQKLTGEKKKI FFF 
LKGGKKKPLGIF 


8701 


22602 


A 


8763 


353 


48 


NFFFKVFFFLKDFSHKGAPPKKGGPPKK 
TPPRKKFFFLKIKPLFFIAFFFFWSLFG 
FFLIFSLFFFFFFFFFFFFFFFFFFFFF 
FF I VFTI TLMHFK I YLH 
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nucleotide 
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correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A^Alanine 
C=Cysteine, D=Aspartic Acid, 
E=G!utamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leu cine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


8702 


22603 


A 


8764 


2 


93 


FAMLPGLVSNSWPSAICLPWPPKLLGLQ 
MM 


8703 


22604 


A 


8765 


398 


63 


PPPPVFFFFSGGFFFWGGGGQKSPPRER 
CFFFKMHPGFF Y I PPFVGKKVFFVLEGV 
VCPLGFFLSGGAFFFFFFFFFFFFFFFF 
FFFFFFGFFFFFFFFFFFFGFPSVSHL 


8704 


22605 


A 


8766 


3 


83 


I LE I AGALI QAYVFTLLVS L YLHDNT 


8705 


22606 


A 


8767 


450 


306 


QKKKKKRRNKPEGLGTVAHACNPSTLGG 
RGRWITQPGQQEGNFISKY 


8706 


22607 


A 


8768 


1 


111 


WSLALVAQAGAPGFKQFS CLGLPKSWDY 
RHEPPCPGL 


8707 


22608 


A 


8769 


385 


261 


FLAEMG FCRVGQAGFKLLNSGD P PAS AS 
QSAGITGVIAPVL 


8708 


22609 


A 


8770 


3 


293 


SLGS KP LGLLS I»S P VKWFR I FG KERNKV 
WGKDGGTDRNQSSSAFKHLQRGDSDPKQ 
NKIKACSSKFYLRRCVKRSFLliIKKKKK 
KKKKKKKKKKKKKKK 


8709 


22610 


A 


8771 


2 


281 


CVCVCVCARVYIYIVYVYICIYTYMCVY 
MYICIYIHVYIHVYTCVYVYIRIHMYIC 
ICVCIYTCIYTYIYTHTYKHSVYYLHNF 
YINPKLLQS 


8710 


22611 


A 


8772 


295 


161 


THTHTHTHTHTHTHTHTHTHTTCSLPPS 
LAQLLGNCC I KAAI S IL Y I L 


8711 


22612 


A 


8773 


1 


318 


FFFFVRWSFTLVAQARVQWRDLGSLQPL 
PPGFKQFSCLSLLSSWDFRHTPPCLANF 
VFLVETGFLHVGQAGLELPTSGDLPALA 
SRGAGITGVSHHPQPPLCFLFL 


8712 


22613 


A 


8774 


3 


70 


FMPIVLSLIPIjKIFEIGPVFTIi 


8713 


22614 


A 


8775 . 


369 


191 


TYKINKIGWAWWCAPLVPAAWKAEVEGL 
LEPGSLRPASVIQQDPHLKKGTYSRVLT 
THL 


8714 


22615 


A 


8776 


2 


124 


G F RHVGRAGLE FLT S GD P LAS A FQS AG I 
TGMNHHVQP I VE 


8715 


22616 


A 


8777 


1 


162 


LKYYTADENGKTSRLLPQRPSDECGAGV 
FMASHFDRHYCGKCCLT YCFI KPEBQ 


8716 


22617 


A 


8778 


346 


138 


YNSPPYKEKTIPLQARVNFGPPRDSLKR 
PPFFFFFFKRQSLTMLPRLVLNSWAKGI 
LLPWPPKVQVLQA 


8717 


22618 


A 


8779 


2 


117 


LNLDTTAVQVRN YPR I RES YKVS FLSAL 
EEYTKKLNTQ 


8718 


22619 


A 


8780 


2 


97 


WDLTMLPRLVSNSWAQVILPSWPPKVLG 
LLA 


8719 


22620 


A 


8781 


403 


276 


IISTVFQRGSCPIPSRKEVCSERPRRLQ 
KDSLTGFPHTVYKH 


8720 


22621 


A 


8782 


2 


119 


QAGLELLTSSDLPASATQSAGITGMSQR 
ARPH FQCKNVP 


8721 


22622 


A 


8783 


245 


101 


ELQFKMRSGWRRSQPISTOLTLVPRLIS 
NSWAQVILLTWPPJCVLGIQA 


8722 


22623 


A 


8784 


2 


124 


AKLEKKKKKTGRAKRRMQYNRRFVNVVP 
TFGKKKGPNANS 


8723 


22624 


A 


8785 


3 


355 


VGPSEPGAGYNLLVCRLLRPLEKHGIRV 
GVTGFSRCHLSWLPLARKGNS PTPCASW 
VRHELALLRLTVCGLHPLSCTHCLTSPS 
EMNPVPQLEMQKS PVFCVAHAGSCRLEL 
FLFGH 


8724 


22625 


A 


8786 


238 


9 


PVKLGAGPPQAPFWLRKWAPLIKPFFKG 
PIGSLQIPLFFFLRQSLTLPPRLECSAT 
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C=Cysteine, D=Aspartic Acid, 
E^GIutamic Acid, F=Phenylalantne, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
IS-Asparagine, P=Proline, 
Q=Glutamine, R^Arginine, S=Serine, 
T=Threonine, V=Valine, 
\V=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














WLTESSTSWVHAILLPQPPE 


8725 


22626 


A 


8787 


336 


10 


CLSLPSSWDKRRLPPHPASFCIS 


8726 


22627 


A 


8788 


1 


147 


I FYLLETGFH YVGQAGLELLTSGDS PAS 
ASQSAGITGVNHHALPEIHVF 


8727 


22628 


A 


8789 


3 


130 


DLEEG I QTLMGRLEDGSRRTGQ I LKLDH 
SSEFSKTRELYPVF 


8728 


22629 


A 


8790 


206 


3 


VPPSGPI KKGDKKKNIjYLI FL I KNVPNK 
KLRKGVS PLAI P FFFFFFVFLLETGFHH 
VGQAGLELLTS 


8729 


22630 


A 


8791 


124 


390 


RAGAQSNLMEPPPSGVKAIFFPNFPKKW 
GNGTPPPAPLI FGGLKKKGVSPCGPGGS 
EPPAPGGPSPLAPPKGGNNGRDRLPPPP 
DKFLN 


8730 


22631 


A 


8793 


2 


301 


HEERERERERERERERERERERERERER 
ERERERDALFAFFPPRGARPPPEIERGV 
FVGAGS VREI YPTLS IFFFSP PAHRGRE 
LRSRCVSHSLHLISRA 


8731 


22632 


A 


8794 


1 


285 


ARGERERERERERERERERERERERERE 
RERERERERERERERERERERERGAPPP 
PLFFLGRGGALRAHAISPPPFLGAPPYI 
YIFFHMRARPPHRYLCAQRETRPAL 


8732 


22633 


A 


8795 


1 


321 


ARGERERERERERERERERERERERVSR 
RIPRGDLRDQFPHPLARSRDSFCGSADH 
FGRGVFNKVSWTDPPTHRVTSSLGGGV 
ERDLLTLSGGGTYAPWKNMCATGEDQ 


8733 


22634 


A 


8796 


2 


140 


HEERERERERERERERERERERERERER 
ERERERERERERESLPPPAGCAKGAGWG 
E 


8734 


22635 


A 


8797 


2 


223 


SARERERERERERERERERERERERVSR 
RNLSRGGGGVPPPLQNVRIHSGGPARGT 
LHHIKKKTSLTDVGLAQ 


8735 


22636 


A 


8798 


1 


176 


ARGEREREREREREREREREREREKISF 
LGGGRGTQFAIERGVIQFCGRRYTTGRV 
RN 


8736 


22637 


A 


8799 


335 


27 


PPTKFFFFFFLVEMGFCCIAQGGLELLS 
SSSLPASAYQSAKI IGVSHHAWSVF 


8737 


22638 


A 


8800 


342 


191 


STSLSLPKCWDYRREPPCPANLSYFFKD 
PFSKCNPRLRYWGLGLQHRNFG 


8738 


22639 


A 


8801 


104 


332 


AVPLTMVKIQPLWKRVWRFHNKSKLELP 
CDPAI SLLSMYPKEMKSLCQRDVCTPRL 
STGPLTIAGMWNPPKRSSMD 


8739 


22640 


A 


8802 : 


563 


258 


FFLNLETRSWYVAQARVQWLFTGTVTVH 
YSLDLLASSHPPISASGVAGITDACCAQ 
LTLPFLKLGLLGLGGGGVGYPSYFGLEG 
FHNDCSRVRVLCMWSYQ 


8740 


22641 


A 


8803 


2 


199 


ARGLLVRRFLSPSEKRSIRVGVTRFSRC 
RPSPLSLTRKGNSLTPCASQVRQCLALL 
RLAHGACTH 


8741 


22642 


A 


SS04 


266 


108 


KLWSKCVALPPSFSSSCSSHIGCACFP 
FTFKYDCKFPQASQAMLPEQAVEP 


8742 


22643 


A 


8805 


1 


140 


KWW FKRPGVY YGQCS EI CGANHS FMP I 
VLELIPLKIFEIGPVFTL ' 


8743 


22644 


A 


8806 


2 


90 


SSAAAEENNDEKKEEAEKTEDDMEFGDF 
D 


8744 


22645 


A 


8807 


229 


34 


KKKI FFFFPGFFWPPQKFFFKAPPPFFF 
FFFFFFFFFFFFFF1>DLFIYLI ILSYTK 
LISIRAAAPI 


8745 


22646 


A 


8808 


43 


173 


NADSGKAQVJLTPGMPALWEAEAGGSQDQ 
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EIETILAGAGKPRLY 


8746 


22647 


A 


8809 


259 


134 


I YMI FKNKFFNRDRGLPMLHRLVLNWAQ 
VILLSWPPKVLGL 


8747 


22648 


A 


8810 


391 


193 


VLPKIPGIQFPPPCRFKKRPRPRFKKPP 
PKRKKISFSNPPKIWPPQGYFKRGPPRL 
FYFFFFWIL 


8748 


22649 


A 


8811 


304 


3 


TTPPTOIFFCFLFFIFIFLWYFIFIFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
QCFIGHDDQKMMVHTHTQTSSHLLQGGS 
NPQGKGQSPGWVLWCA 


8749 


22650 


A 


8812 


395 


268 


F LVE TG FHHVGQAGLELP AS SDL P ALT S 
QSAG I TG VS EHGWS 


8750 


22651 


A 


8813 


782 


1174 


LSRLFYFCVLFCLYMKTTQLPYFRGLVC 
LFVLRQGLTLSSRLECSGMIAACCSLNV 
PGSTDS PTSASRWRTTGVRHHTQL I F V 
CFVEMWFHYVAQAGLEPLGS SSPPALS S 
QGAG ITDVSHHTP LELCF 


8751 


22652 


A 


8814 


95 


236 


ATMPGLKNIYFLKNKDKGLTMLPSLVLK 
SWARVILLPWPPKVLGLQT 


8752 


22653 


A 


8815 


381 


58 


SCLLFPPFWGARGGGPPRAGGSNPPGPP 
GLTPFPPKKPKNYWARGGGPFI PPPREG 
WAGGFFLPRRGRVPLAPGSNNLELGPLP . 
SPPGYQKKTPFSKKKKKEKKKRKR 


8753 


22654 


A 


8817 


441 


158 


FF F FLRTDG YLTMLPRLVSNS W PQA I LL 
PQPPKMLGL 


8754 


22655 


A 


8818 


415 


342 


RLVLDSWAQEILLPWPPKVLGLQV 


8755 


22656 


A 


8819 


400 


171 


NFFFPPGVKFLGGGGPQFPPPQKRGFFQ 
KTPGGFFKPPPKKKKKFFPPPPKIGPPQ 
GIFKKAPPFFFFFFFFFFFF 


8756 


22657 


A 


8820 


2 


268 


INIILAFTISLLGILVYRSHLMSSLLCL 
EGIILSLFIIATLITLNTHSLLANIVPI 
AILVFAACEKKKKGGAVLKDPWGGQSLR 
VLAR 


8757 


22658 


A 


8821 


2 


258 


LCLPNQAWAMAGSPPPASLLPCSLISDC 
CASNQRDS VGVG P SE PG VG Y SL WRR FL 
SPSEKRSIRVGVTRFSSWVRWLRTVIPA 
T 


8758 


22659 


A 


8822 


250 


456 


GGGDKFGLIETFPPGLKPFFLLNLLSGW 
ELGPLAPPPFKFCFFKGRGFPFLPRFVF 
VANLLLTCCKKRD 


8759 


22660 


A 


8823 


492 


376 


QENCLNPGGRICSELRSCLCTPAWATER 
ACLKKQNKTNQ 


8760 


22661 


A 


8824 


1 


116 


LTPLPPSAPPSVDDNLKTPPEWVCSLPF 
HPQRMIISRN 


8761 


22662 


A 


8825 


2 


191 


FLVETGFHHFGRAGLKLLNSGGALTSAF 
QRAGI TG VI TGVSHCFEVNR VWTGTQYF 
LCYILKS 


8762 


22663 


A 


8826 


393 


75 


PRFFPPPPPKIFFFPPPPFFFLGGFPP I 
PPPPKNFFFPKPPPGFFFPPPLKKKIFF 
PPPPFFSPPPNFFFPPPPPFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFS I 


8763 


22664 


A 


8827 


81 


380 


KI ITKHNIARTNPYTFCIMNYLKKKKKK 
KKKKKKKKKKKKKKKKKKKKKGGGPFLK 
NPGGGQNNPGGKKKFFFFFGGGKKNPPG 
FFKKKPFFGGGKFGAPPPPKN 


8764 


22665 


A 


8828 


354 


13 


TKPKTPPLTKGPRRGFLAPRVPGGGGP I 
TPPPQNFLGEKGTLARKKVFPPGRPAPK 
R P PRKKNG FEMRP LKKNGPG PW KFLGF F 
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N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V= Valine, 
W^Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
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IANPPGFGPKKKKPENQTHKKKCQRRKP 
RF 


8765 


22666 


A 


8829 


1 


215 


KHSTTHSHCPRTIKLKKKKKKKKKKKKK 
KKKKKKKKKKKKKKGGP PFKKTLGGPP F 
SPGGKKKFFFFLGGL 


8766 


22667 


A 


8830 


1 


127 


ILFFWQRRGLALLHRLVSNSWAQVILPP 
RPPRVLGLQVIYFK 


8767 


22668 


A 


8831 


264 


55 


FFFXFFXFFFFFFFFFLFVFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFWCSCS 
YVLIFIQPFIKSFH \ 


8768 


22669 


A 


8832 


399 


85 


P PQKN FFFPQAPKI SGGGGPQ I AP PQKK 
GSFQKTPGGFFIPPQKKKKNYFPPPGKM 
GPPQGFFKRPPPLFFFFFFFFFFFFFFH 
NRHLFKVKHLQNS FHLVQLRL 


8769 


22670 


A 


8833 


47 


288 


NCLYRTKKKKKKKKKKPPYKKKTLRGPN 
FPPAGPPAPLPLSGGEKKPSRGLLRRPP 
TLGGAAKGPPPPPKLTPLRKKKIF 


8770 


22671 


A 


8834 


1 


146 


INI ILAFTISLIjGILVYRSHIjISSLLCIj 
EG 1 1 LSLFI I ATLITPSTLC 


8771 


22672 


A 


8835 


2 


79 


EDPKTSPKPKI IQTRRPGLPPSVSN 


8772 


22673 


A 


8836 


1 


156 


PTRPTIITPILLTLFLITQIiKILNTNYH 
LPPSPKPIKKKKKKKKKKKKGGGP 


8773 


22674 


A 


8837 


152 


292 


AKTFNFYKVEFINVFFNGLCLLI I KNTL 
PKKKKKKKKKKKKKKKKKKK 


8774 


22675 


A 


8838 


1 


122 


VASNS IHPKAKENTAPKTHTHTHTHTHT 
HTHTHTfiLFMAIFK 


8775 


22676 


A 


8839 


2 


106 


HVGQAGLELLTSGDLPTSASQSVG I TGV 
SHRAWP 


8776 


22677 


A 


8840 


364 


224 


APLCGRQICDCDCTYPSPHTYIHTHTHT 
HTHTHTHTHS 


8777 


22678 


A 


8841 


266 


1 


NSLSVEFLILFPPYNYSPKQCICSHQNC 
LNFWVIKCVNTPHPNFLNLRMPLLAKRG 
AHACDPNTFGGGGRWI TRSVDRDHPGQH 
NETP 


8778 


22679 


A 


8842 


343 


52 


KTGFRHVGQPGEIiLASSDLPASASQSAG 
ITGVSHRVRPGLHI LDNSS FLDTSFAD I 
FSVLYLKAGIASLLHIYIHFLIiPLRDLL 
LVSSLTVSFPTAV 


8779 


22680 


A 


8843 


256 


35 


LFIFSLSLQYIFCVCGCVCVFMSLCFLV 
CAWGHVQLHVCSCVRMCVCENDVCVCGW 
VQMCFCI KNFQKEVYQI 


8780 


22681 


A 


8844 


333 


11 


PP I KKKAPPPKGRAFFFFFLKKKKGGP P 
PPQKKTGGGGPKKRGGVKKPP PGFFPGF 
FGPLFFWGPPFLPPFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFNFISTKNLF 


8781 


22682 


A 


8845 


15 


288 


RCGLTVLPMLVSDCWAQVILLAQPPRVL 
GLQAGASASTP I TS VNCSQGAI YPEVGT 
TGSTIIAAASSGGEFDSVLQNDICMCFL 
TQQFHF 


8782 


22683 


A 


8846 


1 


397 


ECAHHTQLIFLFLIESSLHKVGQAGLKL 
PISSDPPTSASQSAGTTAVSHHA 


8783 


22684 


A 


8847 


1 


179 


ARERERERERERERERERERERERERER 
EREREREREREREREREREREKKKLGGG 
GGGPQFPIGGGLTQGGGGG 


8784 


22685 


A 


8848 


2 


233 


HEERERERDRERQRERERERERDRERER 
ERETFFSSSGGGPKETTLFFETGAPKEG 
YQNMHARLIPSAVPFKKRGRG 


8785 


22686 


A 


8849 


2 


180 


HEERERERERERERERERERERSRERER 
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ERERERERERERERERDFVFYRWGEGHI 
DRQYCSFLCPGWCHCPM 


8786 


22687 


A 


8850 


2 


79 


PEERERERERERERERERERERERERER 
ERERE 


8787 


.22688 


A 


8851 


1 


178 


ARGERERERERERERERERERERERERE 
RERERERERERERERERERARLFARGEK 
KKHTGGGPPLFYYLYI 


8788 


22689 


A 


8852 


3 


76 


TRRERERERERERERERERERERERESP 
A 


8789 


22690 


A 


8853 


5 


227 


EERBRERERERERERERERERERERERE 
RERDFFFFFLEKKYLFSPIGELKGRGGG 
SLTPLLTGKWNPCPPPPH 


8790 


22693 


A 


8854 


1 


189 


ARGERERERERERERERERERERERERE 
RERERERERERERERERERERERERERE 
RHSRLFPRESVCVARAGGASSPPRG 


8791 


22692 


A 


8855 


2 


271 


HEERERERERERERERERERERERERER 
EREREEPPRGEKYCIRVPPQKILSPACD 
SLLYVTHNLCVSHSTPRGERYVALFNSR 
ETRCF 


8792 


22693 


A 


8856 


2 


309 


HEQQQQQQQLRNLRDFLLVYNRMTELCF 
QRCVPSLHHRALDAEEEACLHSCAGKLI 
HSNHRLMAAYVQLMPALVQRR I ANYEAA 
SAVPRVAAEQPGVSPSGS 


8793 H 


22694 


A 


8857 


44 


203 


G YSTCVGMHAHTH I HTDTHTHAHTY IHQ 
CLIQIGLCTFATCTSQVNKSSMAEL 


8794 


22695 


A 


8858 


329 


182 


I I FVCLVERGLH YVGQAGPE LKASDDLP 
ALASQSAGTSGVSHCVSSKMS 


8795 


22696 


A 


8859 


1 


175 


GLTLP PRLGCHAT I TAHYS LNLPGS S N P 
PTSASQVAGTTHSRP CPE ITQTALQRG P 
HD 


8796 


22697 


A 


8860 


3 


350 


HEAIGLRSNIKNGLDHFLPLGTPTPLI P 
IIAMIETILLLVQPIALAVRLTANITAG 
HLLMRLNGSATLAISTISLPSTLIIFTI 
LI LLT I LE I AGAL I QAYVFTLLVGLYLR 
DNA 


8797 


22698 


A 


8861 


139 


3 


TFKEAD I KGGGPPHLASFCI FGRDGVS P 
CWPGWSRTPDLKRSTRA 


8798 


22699 


A 


8862 


3 


134 


S PTPPPSS KPSS I PRKSS VDLNQVSMLS 
PAALS PAS S SQRHES 


8799 


22700 


A 


8863 


346 


219 


RLVP PCLDNFLNFCRYSCLTMLPGLVSN 
SWAQVIFLPQPPKD 


8800 


22701 


A 


8864 


1 


147 


GTRFFFERGIDHVGQAGIQLLTLGDPPT 
PASQRAE I TGVSHHTWPNFI F 


8801 


22702 


A 


8865 


267 


332 


KKKFW P FF F FFLRDGVS LLLRLVSNCWA 
QEILLPWPPKLLGLQV 


8802 


22703 


A 


8866 


343 


215 


LAQGLKTP I PKRVPPHTQKKKFCSERPT 
RIQTDRSCWVSPLS 


8803 


22704 


A 


8867 


1 


242 


GTSKKIDKLDYIELKCFCTAKETISRVK 
RPVEWEKI FANSSPEKGLISRINKEAKK 
IiNSS AI AAHNCNP STLGGQVRR I T 


8804 


22705 


A 


8868 


1 


219 


GTRRPTWATWQNPASTKKKKNS FFFETE 
SRTVPQAGGQGGNLGSLQSLPPGLKRFS 
CLSMPGAPKGQSGSLMG 


8805 


22706 


A 


8869 


1 


181 


ATGG I LLLLD WS LAYE S KHLLEGAKSE 
SAEELKKRAQELEGKLNFLTKIHEMLQP 
GQDQ 


8806 


22707 


A 


8870 


3 


255 


AAAPPNAPGGPPGPQPAPSAAAPPPPAH 
AT -GGMDAEI.T DEEALTS r .ET ET GLHRVR 
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C=Cysteine, D=Aspartic Acid, 
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ELPELFLGQSEFDCFSDLGSAPPAGSVS 
C 


8807 


22708 


A 


8873 


383 


233 


ATTFPNPGFLFVRLFVLVQWPDHEPPGF 
KQSSHFSLPSSWDYRHPPPPPG 


8808 


22709 


A 


8872 


211 


294 


AXKKKKKKKKKKKKKKKKKKKKNI I WGG 


8809 


22710 


A 


8873 


398 


100 


QKERPPGFPPFKPGPPQKRGGKGAPPKD 
GEHYFFWVLEKSTGFPPKTKRVYYPSPV 
YPHPPAPQKEGFPGGSPPPRPIASPFGG 
KKKEKKSYIEKENLF 


8810 


22711 


A 


8874 


3 


214 


DAVfVLGILVYRSHLISSLLCLEGIILSL 
F I IATL I TLNTHSLLEKKKKKKKKKKKK 
KGGAFKKNPGGAQ I 


8811 


22712 


A 


8875 


2 


254 


PRVRPEGRNRTPQSWFQANPMACMTFSK 
KKKKKKKKKKKKKKKKKKKRGGGFIKKM 
GGGKNFREKEKKNFLLIRGKI KKTGGDF 


8812 


22713 


A 


8876 


2 


135 


TINLPSTLIIFTIIjILLTILEIAVALIQ 
AYVFTLLVSLYLHDNT 


8813 


22714 


A 


8877 


2 


296 


IiARGAEVLG YG SH S RGR VPG ALVGQGAG 
RLFTEHPGSS PATLAI YPTPEGTSVAVS 
ISAPPKARSRPYPPSRSCKNLSLAGSSV 
LVPPGPVHRWVGRP 


8814 


22715 


A 


8878 


164 


29 


RKIGPARWLTPVI PAL WETE AGGS RGQE 
IEI ILANTVKPGACEVL 


8815 


22716 


A 


8879 


381 


219 


GEKMTEEEVHMLVAGHEDSNGCINYEGK 
R 


8816 


22717 


A 


8880 


2 


156 


ANGNSFATRLSNIFVIGNGNKPWISLPR 
GKGIRLTVAEERDKRLATKQSSG 


8817 


22718 


A 


8881 


3 


402 


SGFNIEYAAGPFALFFIAEYTNI 1 1 INT 
LTTT I FLGTTYDALS PELYTTYFKKKKK 
KGGRPFFFFFFLKKKKIPPPKKKNFGKK 
GETLKGRGGAP I FQPKKILS P PKKKKKK 
RGGAFKKS LGGAKFNGGGRN 


8818 


22719 


A 


8882 


1 


406 


RYSTPSEGEVGERYSTPPGETLERYSTP 
PGETLERYSTPPGETLERYSTPPGETLE 
RYSTPPGETLERYSTPPGEALERYSIPT 
GGPNPTGTFKTYPSKKKKGGGRFKEPLG 
G PRFTG VGRVKVFS LWGGVIiNAR 


8819 


22720 


A 


8883 


484 


307 


KKILNRRVRWLTPVI ptlweteagssrg 
QEMETILGNTVKPPASASQSAEIKGMRI 
IIF ! 


8820 


22721 


A 


8884 


3 


131 


GSLTHHINN I KPS FTRENTLMVIHLSP I 
VLLSLNPDI ITGFSS 


8821 


22722 


A 


8885 


406 


106 


KGGAGPKIAPPKKKAFSPIPPPFFWPPP 
VFLKGPPPPSPFNFFPPPVFFFRGPLFF 
FFFFF FFCETWLPFRPSNLVFTLPHVF 
AWLI P I DQYLF I SHS C 


8822 


22723 


A 


8886 


425 


325 


S S THTHTHTHTHTH TERSHLLAFE PS AG 
ELWMAM 


8823 


22724 


A 


8887 


394 


278 


VCVCVCMCVYICIYPCWLPCWHMCLELY 
KVFKGIMKGV 


8824 


22725 


A 


8888 


2 


86 


YSRTRAAAL I QAYVFTLLVSLYLHDSAY 


8825 


22726 


A 


8889 


400 


62 


FLYLKSFPPPPERGVFSPLPPQKFFFSP 
KALNFLGGGG PKFP P P KKRFFFKYP PR V 
FLYPPHKKKNYIFPPPDDFGPPRVFFKA 
PPPHFFFFFFFFFFFFFFFFFTRLEWVQ 


8826 


22727 


A 


8890 


3 


100 


SLEPGTSGLKGFSCLTLPSSEDYRHEPP 
CP AN 
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8827 


22728 


A 


8891 


32 


142 


WPTRPGQVASVLYFTTILILIPTISLI 
ENKILKWA 


8828 


22729 


A 


8892 


3 


382 


TPKNPLSRPSYEPKPASTPSRAPKKKKK 
KNPKKKKKKKKKKKKKKKKRGGALLKKT 
LGGPKLPGGKKKNFFFFRGGKKKPPGDF 
LKKTLFLGGGNLAPPPPKKKNPWGKKKI 
FWGEGGKKNFFFFG 


8829 


22730 


A 


8893 


368 


199 


KKKRISPPSRKRAPQGDFLRPTPPIKNL 
YYYIiSFFFLILYIFFFFFFFFFFFIFFF 


8830 


22731 


A 


8894 


3 


136 


TINLPSTLI I FTILILQTILE IAGALIQ 
AYVFTLLVSLYLHDNT 


8831 


22732 


A 


8895 


1 


144 


IATINLPSTLIIFTILILLTILEIAGAL 
I QAYGFTLLVS LYLHDYP YN 


8832 


22733 


A 


8896 


153 


375 


PQKKKKKKKKKKKKKKKKKKKKKWGGGF 
LKNSGGAPI FGGGGKFFFFFLGGGFKTP 
RGFFGKNPFFWGGKKKKN 


8833 


22734 


A 


8897 


332 


400 


WLTPVI PXLWDAEAGGSFEVRSS 


8834 


22735 


A 


8898 


215 


359 


TELVPNAVLFCPLPCSSGHHLLCAIHAK 
RVTIMPKDIQLARRIRGERA 


8835 


22736 


A 


8899 


414 


73 


PRLLCFHKKAWAMAGALPPALLPPYSLI 
S DC CASNQRDS VGVGPS EPGVG YNLVVR 
RFLSLSEKRSIRVGVTRFSRCCPSPLSL 
TRKGNSLTPCTSRLRQCLPLLWLAHGAR 
TH 


8836 


22737 


A 


8900 


374 


88 


VSPPSPLKI FFSPKAFNFV7GGVGPKFPP 
PQKRVLPQKPPGGFFSPPLKKKKFSFPP 
PVKLGPPKGSFKGPPPFFFFFFFFFFGL 
DFMYTGSSNVW 


8837 . 


22738 


A 


8901 


433 


56 


PEGPPPPGGFYPGKRGFFPTPLWFGGP 
KRDPPPRGKKNPSWRSGGKQTPRFPRDP 
LLKAFFPGGPPQRLGKAQGPFWFFFGPK 
KGKTQTGPPFFFKKLGDPWGPKKKPPPG 
GAPRLGPPKKKFL 


8838 


22739 


A 


8902 


3 


73 


I AVALi I QA YVFTLL VS L YLHDNT 


8839 


22740 


A 


8903 


3 


149 


PNLS YI IGKDTWVEHWPEEDECQDEENQ 
KQCQDLGAFTESMWFGCPN 


8840 


22741 


A 


8904 


351 


137 


FPPEFFAAPKKSGILFFSKTWMEVEVLF 
LSQLMLGQKTQYCMFSLINGRIiI IKAPG 
SLKGDHHPPLGLLEG 


8841 


22742 


A 


8905 


142 


5 


KI FLQKKKKKKGLGTVAHACN PSTLGGQ 
GGWIMRSRDQDHPGQQW 


8842 


22743 


A 


8906 


367 


244 


HVPQARAKFFC I FSRDGVS PYW PGWSRT 
PDLMI CLLFI VEL 


8843 


22744 


A 


8907 


146 


248 


SQI LGRLRQENCLNLGNGGCSEPRWCHF 
LAWAKE 


8844 


22745 


A 


8908 


2 


197 


AAASTHHARLFLCVCLFAYLLVEMEFHH 
VAQAGTELLDS SNPPPLASQNAGI IGMS 
HHTQPQYIC 


8845 


22746 


A 


8909 


1 


127 


KKTRLERAQWLTPVI PAFWNAEAGTLLE 
ARSLRRLPWPPNVL 


8846 


22747 


A 


8910 


368 


280 


LTMLLRLVLNS WAQAI LLTWPPKVLGLQ 
A 


8847 


22748 


A 


8911 


355 


211 


QQQNLHLFW PGT VAHTRNS S TLGGQGMR 
I TGAYE FETTLGNTARPHL Y 


8848 


22749 


A 


8912 


325 


190 


KKKKKKGGGGPQKKKKNPPPPPTSKKKK 
KKKKKKKKKKKKKKKKKKKKK 


8849 


22750 


A 


8913 


1 


411 


VNVPAPRGGAYRGRQASFSSGGLHPVPA 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
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sequence 


M 
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od 


SEQ ID 
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26 
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beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M-Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T-Threonine, V*=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possibie 
nucleotide insertion 














YRPLCLPNQTTNSAMAGAPPLASLPPCS 
LISDCCASNKRDSMGVGPSETGTGYNLL 
VCHPLSPLEKCS IRVGVTRFSRCRLSPL 
SLTKKGNSBTPCTSRVRRCLALLRL 


8850 


22751 


A 


8914 


258 


9 


GAIPCLHCETNPSHISSTQELQTPEPQL 
PGVPLELPPPGSCFKCRKSGHWAKECPQ 
PGIPP KPCP I CAGPHWKLDCPTGPRL 


8851 


22752 


A 


8915 


326 


42 


E KI HRKYF CYS FFVFL VEMGFY FVGQAS 
LKLLASSDLPASPSQSARVKAMSKCVRP 
VLVIFKNKILQNTENSTKIYVYPPPJJNQ 
ILTFCHFCFK 


8852 


22753 


A 


8917 


3 


106 


GLELPTSGDLPASASQNAGI TGI SHHAR 
PIMTYS 


8853 


22754 


A 


8918 


320 


171 


AASTTDGSYKCLCLPGYVPSDKPNYCTP 
LNTALNLEKCPFGLPHLSGSS 


8854 


22755 


A 


8919 


1 


201 


VKPSDRYHLMPI FTAAYPHQNSTYNVPV 
STRMVMAEEFKQGKRLSLCSDLYRKDLR 
TIVDPWSCAT 


8855 


22756 


A 


8920 


3 


182 


LLRPLEKRGIRVGVSRFSRYHLSRLPFA 
R KGNS PTPCAS R VRRC P S LLHG IjH PL S D 
KPQ 


8856 


22757 


A 


8921 


142 


1 


SCHPGWSAWRPQLLRRLRYENCLSLGG 
SGCSELGSCHCTPAWTTEG 


8857 


22758 


A 


8922 


1 


117 


LFFNYAWGWSTTLLSRLSLNSWVHVILT 
PWPPKELGLQA 


8858 


22759 


A 


8923 


357 


121 


LGAFSDGLAHLDNLKGTFATLS ELECDK 
LKVDLSNFRLLGNVLVCVLAHH FGKE FT 
PPVQAAYQKWAGVANALAHKYH 


8859 


22760 


A 


8924 


343 


133 


LFFNFNFLETRLRE WL I FVFS VETGFP I 
KRAGLKLQASSDPPARGSQSAEISGMSH 
YAWPCXSNRAFKCS 


8860 


22761 


A 


8925 


32! 


44 


PDS GG S PAS AS QVAGTTGACHDAWLM F I 
FNMDININIPFSAKRDKLSMDKINEIIF 
SLLINRISSFQIMYFLCSSRIESRKRWL 
KQTGRKKE 


886] 


22762 


A 


8926 


335 


106 


HFVFLIETGFHRVGHAGLELLTSTDPPA 
LAYKSSGITGVSYRSFTKLHLCACVCLC 
IYTYLGLVLSFTPGSWEKKT 


8862 


22763 


A 


8927 


2 


160 


LSRVPPRSANLS ISVFAEKRCHFTMLPR 
PISNSWAQGILLPSQAPKVLGLQA 


8863 


22764 


A 


8928 


3 


140 


RV I TEEE KN FKAFAS LRMARANARLFG I 
RAKRAKEAAEQDVEKKK 


8864 


22765 


A 


8929 


126 


59 


NSGGGGCNELRS CHCTPAWATE 


8865 


22766 


A 


8930 


3 


104 


KNMAC YCRI PAC I AGERR YGTC I YQGRJj 
WAFCC 


8866 


22767 


A 


8931 


311 


112 


MENYLSKMQQELEKNITRELKEAAAELE 
SGS IASPLGSTDESNLNQDLVWKASREY 
VQVLKKNYMI 


8867 


22768 


A 


8932 


345 


134 


GGGSSPWFPPFWRPRRADSLRVGVLNpp 
GPGGGAP FFLKNPL I GWGGGAR LWFQL I 
LLRVRPENSFDPGR 


8868 


22769 


A 


8933 


262 


39 


RPRRPHPGNFFFFFNSGSHYVAQAGLEL 
LDSSDPTLPSQSAGTTGMSYQAWPLDHT 
FENRDCVL.LHCSS PV3NT 


8869 


22770 


A 


8934 


84 


22 


TPAWVWWLTP V I PALWEAKAG 


8870 


22771 


A 


8935 


277 


111 


ERIINHAAGSHGVSGIFMKYDLS3LMVT 
VIEEHMPFWQFFVRLCGIVGGIFSSTG 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H-Histidine, l=lsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, $=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine> 
X=Unknown, *=Stop codon,/=possib)e 
nucleotide deletion, \=possible 
nucleotide insertion 


8871 


22772 


A 


8936 


2 


106 


DSAI PRRLECLKEDVQRQQEREKELQHR 
YAEFVF 


8872 


22773 


A 


8937 


1 


193 


LMWDFSPSGLDGAFQRGYY WYYNKY INV 
KKGS ISGFTMVLAGYMLFI YCLS YKELK 
HERLCKYH 


8873 


22774 


A 


8938 


240 


69 


DSGNQLKRSFSGRARWLTPVIPALWEAE 
VGESLEVRNIIYILQNMSNIGNIHLIIK 


8874 


22775 


A 


8939 


329 


177 


RWNLAMLPRLVLNS WAQAI CPSWPLKML 
GLEAEPSCPARTNFKVTSPRLH 


8875 


22776 


A 


8940 


358 


120 


HPSNFFVFLVEMGFHHVGQPGLELFPAR 
YVPTLAFRSGGITGI SHCAWPKEWALPR 
KASPGLSGRRGSPHCATCSSKAW 


8876 


22777 


A 


8941 


333 


104 


PHVYSNGHICLSILTEDWSPALSVQSVC 
LS 1 1 SMLSSCKEKRRPPDNS F YVRTCNK 
NPKKTKWWYHGKQFEFVITF 


8877 


22778 


A 


8942 


1 


241 


APLVAFTQVNLEDKGGLSKLVEAIRINF 
NDR YDE I CHHWGGNVLG P KSGTR I AKLK 
KAKARELATNWI KCTLLS FLYIKR 


8878 


22779 


A 


8943 


1 


240 


APLVAFTQVNLEDKGGLS KLVEAIR INY 
NDRYDEICHHWGGNVLGPKSVTRIAKLK 
KAKAKELATNW I KCTLLS FLY I KR 


8879 


22780 


A 


8944 


306 


150 


AASTLCPRliECSGAIMAHCSLNLLiGSGD 
PPASASQVGGARGSLEPGRWRLQ 


8880 


22781 


A 


8945. 


284 


2 


GKSRRSPLDLCTSGFTGSTHFTLIICWV 
PRYVATCPPIGLNFVFIKTWWSHYIVQA 
DLKLMVSSNP P VSASQSAGNTGMSHKTW 
LENRLLTTKK 


8881 


227S2 


A 


8946 


1 


226 


WSFALFAQAGLEFLTSGDPPASASLSAG 
ITGMSHHAQPAVSLNS I FSASTFCATS S 
YCAKMELKELKFMLIVDAA 


8882 


22783 


A 


8947 


363 


96 


NTAPGYMPLLS I PFLVLSKAVFLFLLGM 
FTAALLI I AQRWEQPKCASTDEW INKMW 
HMHTMEHY STCCGVHAAVN I S I Q I SEWN 
CWEW 


8883 


22784 


A 


8948 


1 


208 


GRVGQPGLRLLTSGDLPASASQSAGVAG 
VSHRAQPTPALNCVLTCWVLQHYIKHNH 
GKRSEYFTI FNDM 


8884 


22785 


A 


8949 


69 


298 


SRGHQR VADAALLS FPDVI CLGLMLWGS 
SSIVCILHRHKRRVQHIHRTSVSPTSSP 
ESGATKTI LLEQERTRSPRS 


8885 


22786 


A 


8951 


2 


201 


LYDLLNMRSISLPLPSLVCIHSHTHEHT 
YTH I HTHAHTYTHTHSHWVLS LKQKKMV 
KHRPRLSVWI 


8886 


22787 


A 


8952 


1 


285 


GASPPASLPPCSLISDCCASNQRDSVGV 
GPSEPGAGYNLWRRVLSPSEKRSIRVG 
VTRFSRCRPSPLSLTRKGNSLTPCASRV 
RQCLTLLWLVH 


8887 


22788 


A 


8953 


341 


256 


LGLGGGG CS E PRWHHRTPAWVTE RDF I S 
L 


8888 


22789 


A 


8954 


227 


43 


I LFF IGKHTHTHTHTHTHTHTRAKCLFP 
PARFPTSPNSTTNCTFPTEPAANPRGLV 
RAGLG 


8889 


22790 


A 


8955 


310 


79 


RPRRPGLMERVNVFIFSILRESFYKIRI 
TNQAQCCMP I VPVTQETEAGRSLE PKSL 
RLAWAIQQNSISKKNNNNNFF 


8890 


22791 


A 


8956 


374 


198 


FFFFLPNRHGLAMLPRMVLHWVQAVLPQ 
PPKVLGLQG 


8891 


22792 


A 


8957 


138 


10 


RPRRPRLCQAQWLMPI I PALWES VAGGL 
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SEQ ID 

NO: of 

nucleotide 

sequence 


SEQID 
NO; of 
peptide 
sequence 


M 
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od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alnnine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R-Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














LEARS LR PAW F F FNT 


8892 


22793 


A 


8958 


354 


216 


AGPRVFPGGEGGFRPPPPGGS PPQGPPK 
GGESGGKPPPPAHWVFF 


8893 


22794 


A 


8959 


364 


223 


LQAARSKKKKKKKKKKKKKKKKKRGGGK 


8894 


22795 


A 


8960 


259 


11 


FEALKDPSNNLQSLFSFIVTVLTVAHVY 
EKLSTLTSLAARRGLAJyiLPRLASNSRTQ 
TILLPWPSKVLRLQARATEPGLGLLL 


8895 


22796 


A 


8961 


334 


213 


AASTKNFERMEWGLTTLPMLVLNSWPQV 
ILLPWPPESMRL 


8896 


22797 


A 


8962 


260 


3 


KGGFFPIPFIGIKPGFFFSPALIFPKVF 
I FSQNFFVFLKKI FFFFFVETVS SHWK 
AGLELLASSNPPAFTSQSAGMTGMSHHA 
W 


8897 


22798 


A 


8963 


425 


248 


ADEW IHKMRHIHMI E YYLAI KRNEMLI H 
ATAGMHLENILSEKDHMYDS J YMKCLAW 
ANL 


8898 


22799 


A 


8964 


1 


118 


GKLMELHGEGS SS GKATGDETGAKVERA 
DGYEPPVQESV 


8899 


22800 


A 


8965 


2 


130 


GR VGPRMALNS WAQVI HLP WP P K VLGLQ 
VAMKRKIMILSTSY 


8900 


22801 


A 


8966 


119 


364 


LGWYLIEKIQICCSPCL3ADPFQQMISP 
LRDGSLT VLPKLVLNS WPQVFLP PQTRK 
VLGLQAEVTAPGH I I FMI IMINTFK 


8901 


22802 


A 


8967 


238 


333 


TRCLRLLPRLVLNSWTQALLLPS PPKVL 
GLQV 


8902 


22803 


A 


8968 


297 


169 


NPPPPHIYKFFFIFNIFFFFFFFFFFFF 
FFFFFFFFMSKILIF 


8903 


22804 


A 


8969 


2 


212 


RVRPRDRPLVRVRVGRNKDGAETTPSFG 
LLPAHLTFPLDYHLNQPFIFGLRDTDTG 
ALLF IGKNMDPRGP 


8904 


22805 


A 


8970 


242 


357 


TLDI IKSFKKKKKKKKKKKKKKKKKKKK 
KKKKKKKKKKKKKKKKKKKKKKI I 


8905 


22806 


A 


8971 


191 


265 


I LMPXKKKKKKKKKKKKKKKKKKKR 


8906 


22807 


A 


8972 


398 


97 


MFLYNRILWFFSPHPPLRFFFSPRATIF 
LGGGAPHFPPPKKGFFS KKP PGVFLS P P 
QKEKIFSFPPPLFLAPPKDFFKSPPPFF 
FFFFFFFFVSTGSSVS 


8907 


22S08 


A 


8973 


370 


265 


YNSSLPSSWDYRHVPPCPDNFCIFSRGW 
SQTPDLR 


8908 


22809 


A 


8974 


3 


119 


GLKLLTSSDPPASASKTAG ITGVSHHTQ 
TIFVFSSHKS 


8909 


22810 


A 


8975 


1 


110 


VAQAGFKLVSSANIiPTIiASQSARVTGVS 
HRARSMKQ 


8910 


22811 


A 


8976 


3 


152 


WNQPKYPSTGEWMEKIWtfYIYTKLVQPMA 
HGPHVAQDGFEGSPTQIHKLS 


8911 


22812 


A 


8977 


375 


193 


FQDLPLGSTSNISFFKRWSPAMVPRLIS 
NS WPQE I LLPQTPKVLG I TGMSHCTWPG 
GHISI 


8912 


22813 


A 


8978 


2 


395 


RAALPTQASAMAGAPPPASLPPCSLSSD 
CCASNERESVGVGPSEPGAGYNLLVCYL 
LRLLEKRS IRVGVTRFSRCHLSPLSLTR 
KGNSLTPCASRVRQCIiALLWLTLSALYP 
LSCTHCQTLPSEMNLVPQL 


8913 


22814 


A 


8979 


120 


19 


ARVQWHDHGSTATLNSWPQVILLPWPPK 
VLGLQA 


8914 


22815 


A 


8980 


204 


48 


LNRDKGLTMLARFVRNS W PQAI PLPW PP 
KVLGLQALATAPCQMSFLRAPLS 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Gluta mine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unkno>vn, *=Stop codon,/=possible 
nucleotide deletion, \=possib!e . 
nucleotide insertion 


8915 


22816 


A 


8981 


335 


221 


FFFLFLFFCM I TSHCS VNVLGASDS PTS 
VSHIAGTIGT 


8916 


22817 


A 


8982 


360 


150 


LSHMPLLFVSLYPDHRAFMQVI FFFFET 
ESPSVTRLECSGAILAHCNVCLPGSSDS 
PASASYAGYSFYL 


8917 


22818 


A 


8983 


346 


116 


GQGRGVETESCSVTQAGVQWCDHHSLLS 
LEhLGSSDPSTS AS P VTGTTGGHYHTS S 
HFLFSVSSSYFPTPVCLAYR 


8918 


22819 


A 


8984 


350 


263 


LPMLPRLVSNAWPQAIFLLWPPKVLELQ 
A 


8919 


22820 


A 


8985 


3 


159 


EEYNLLVCRLLSPLEKCS IGVRVTQFSR 
CHVSPLSLTRKGNSLTPCASWVR 


8920 


22821 


A 


8986 


2 


253 


KHWGELKNDLS I CGTLHS VDRYLNI KL 
TNISVIDSEKYPHMLSVKNCFIQGSWR 
YMQLPADEVHKQLLQDAARKE PLQQKQ 


8921 


22822 


A 


8987 


2 


115 


GAR I VGHLTHALKQGE YGLAS I CNGGGG 
ASAMLIQKL 


8922 


22823 


A 


8988 


3 


159 


GFYH I GQAGLKLLM SGDLPAS ASQGAG I 
TGLSHCAQAGI FFFLP YDI FT VF 


8923 


22824 


A 


8989 


1 


123 


SHYVAQAGPKbLGSSNPPASASRSAGI T 
GVSHHAWPPSLFL 


8924 


22825 


A 


8990 


462 


303 


LARTPSRPTRPPTRPPTRPPTRSPTRPP 
FMLSVDCLLLY 


8925 


22826 


A 


8991 


382 


178 


PPFFFYFFFPSFFFFFFFFFQKGFKGFF 
PQKFFFFFFGGKPVKNFQKFFLRAKGKG 
GEKKPPQNKKVF 


8926 


22827 


A 


8992 


532 


432 


RDAGLTMLPRLVSNSWAQVLLLPGPPKV 
LTLQV 


8927 


22828 


A 


8993 


418 


172 


KKGKKKKKNIFLI FFNYFFFL I FFFIFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
F FFF F FGGAWS L FRGGDSGE V KGTDS MS 
RQCFQGF 


8928 


22829 


A 


8994 


2 


722 


AVRLNI S YPPQNLTMTVFQGDGTASTTIj 
RNGSALSVLEGQSLHLVCAVDSNPPARL 
SWTWGSLTLSPSQSSNLGVLELPRVHVK 
DEGEFTCRAQNPLGSQHISLSLSLQNEY 
TGKMRP I SGVMLGAFGGAGATALVFLS F 
CI I FVWRS CRKKSAR P AVGVGDTGMED 
ANAVRGSASQGPLIESPADDSPPHHAPP 
ALATPSPEEGEIQYASLSFHKARPQYPQ 
EQEAIGYEYSEINIPK 


8929 


22830 


A 


8995 


498 


323 


EPKAELNEGDIAVVHIKREEGWFKGTLQ 
RNAKTGLLPGSFVENI 


8930 


22831 


A 


8996 


21 


511 


LI IDGVEAYALNASGWNI IVFDPKGWA 
LFRSFKAVKEKLDTRRGSNSELETAVKD 
LGKAVS YKGMYGDVAI WYSGQYVENGV 
KKNFLPDNTMVLGNTQARGLRTYGCIQD 
ADAQREGINASARYPKNWVTTGDPAREF 
TMIQSAPLMLLADPDEPVSVQLA 


8931 


22832 


A 


8997 


178 


306 


GAINAINWPPFNNVGMGKTLGFPNLMAL 
KQ I PFS P VGEKTKRR 


8932 


22833 


A 


8998 


351 


211 


SIQSLRMQPVPCYISGRHHHYHSHHHIH 
HHHYSSYFLKYPQEYLRPL 


8933 


22834 


A 


8999 


1 


381 


FRLFKFALMELCS I EG ISDQPLFFVFGF 
FFFGKEAS FG PQGGREGGKFKLMEPLAK 
GVKGISCPKPPHPCLNPPKPKRGGESSP 
PEGPKKTGAPGPAKKGLFFLTKGKNPWP 
GPGKTRKKAVFPKNF 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Iso!eucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V= Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unkno>vn, *=Stop codon,/=possible 
nucleotide deletion, Y=possible 
nucleotide insertion 


8934 


22835 


A 


9000 


173 


399 


I KS E WAAKKK K KK K KK KKKK KKKKKKKK 
KKI FFFGGGRFFFFWGGDYFWGGGGVKK 
KGGGKKNPGPQKXNGGGKK 


8935 


22836 


A 


9001 


2 


81 


EFTPAVHASLDKFLASVSTVLTSKYR 


8936 


22837 


A 


9002 


3 


835 


DAVRANDDLKENIAIVERRNNLLQAELE 
ELRAWEQTERSRKLAEQELIETSBRVQ 
LLHSQNTSL INQKKKMDADLSQLQTEVE 
EAVQECRNAEEKAKKAITDAAMMAEELK 
KEQDTSAHLERMKKNMEQTI KDLQHRLD 
EAEQ I AIiKGGKKQLQKLEAR VRELENEL 
EAEQKRNAKS VKGMRKSERRI KELTYQT 
EEDRKNLLRLQDLVDKLQLKVKAYKRQA 
EEAEEQANTNLS KFRKVQHELDEAEERA 
D I AE SQVN KLRAKS RD I GT KGLNE E 


8937 


22838 


A 


9003 


429 


280 


RGFRGHKFLEPPGFPGFPPIMGPPSPPF 
GSPPQKNFPLQPKPPLGFKPPP 


8938 


22839 


A 


9004 


2 


108 


LS AYQGTPLPAN I LD WQALNYE I RG YVI 
IKPLVWV 


8939 


22840 


A 


9005 


3 


158 


SSTQGHKHCGRPQGPLPRKTRDLCSPVY 
LLTFSPLLSYDPVTSPSPRNTQE 


8940 


22841 


A 


9006 


3 


370 


LLLTGLNRLTTD L Y S L Y I FTTTQWGS LT 
HHINNIRPSFTRENTLMFIHLCPILLLS 
LSPDIITGIFSYTRHYVNSSTYIiETNER 
GGSYKNRLGGPESKGLRIKDDSLCWGPL 
HDTSAYYENK 


8941 


22842 


A 


9007 


363 


76 


KLFNVGGGGLDKPPPPKKFFFFKKNGLF 
FFYPLKKKKNFFFPRELLWGPPRFFIKT 
PPPYFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFVMIS 


8942 


22843 


A 


9008 


1 


87 


KMLKKPKFELGKLMELHGEGSSSGKATG 
D 


8943 


22844 


A 


9009 


420 


3 


RFYFPKPRKRGGGGVFFFFPPKKKFFFL 
NPPPFFFFPPPKKKKNIFFPPRKGGRGS 
KVQTMGPPLFFFFFFFFFFFFFLFFFFF 
FFFFFFFFFFFFFFFFFFFLRETSSNNV 
LAYQIENSLLKLDLQRQFCLPLPGRV 


8944 


22845 


A 


9010 


2 


74 


LHRLGGPEAGLGEYLFERLTLKHD 


8945 


22846 


A 


9011 


33 


132 


IDPRCRNSARAPLSLAGPHPGMGDASNH 
MGQM 


8946 


22847 


A 


9012 


83 


607 


RRSGTPCGYNVTSQNGTIYSPGFPDEYP 
ILKDCIWLITVPPGHGVYINFTLLQTEA 
VNDYI AVWDGPDQNS PQLGVFSGNTALE 
TAYS STNQVLLKFHSDFSNGGFFVLNFH 
AFQLKKCQPPPAVPQAEMLTEDDDFE IA 
ITGEGKPTRMPQSRKKSCHI ISVLISAK 
LNRAFL 


8947 


22848 


A 


9013 


483 


364 


PPPPPPFLNPAPGIFFPPPLGGIGARPP 
PPSFFFFFFFFF 


8948 


22849 


A 


9014 


74 


180 ^ 


LLSTYVGRLSARPKLKAFLASPEYVNLP 
INGNGKQ 


8949 


22850 


A 


9015 


488 


317 


HVGQAGLVLLTSGDPPTSASQSTGITGV . 
SHRAQPLRSFSFVIiSLLQKRIVLC 


8950 


22851 


A 


9016 


149 


13 


DERGGLQAVAHTCNPSTLGGRGGWITRL 
GDGDHPS CR ILAGDASQ 


8951 


22852 


A 


9017 


20 


107 


FDSTADSDQVNPIQGLASKWDYEKNEWK 
K 


8952 


22853 


A 


9018 


448 


239 


VDHQQVIWNRERISNSQNGIVKEIKGAD 
TFI FGHTPAVKPLKFANQMYIDTGAVFC 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid. 
E^GIutamic Acid, F=Phenylalanine, 
G=Giycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Prol ine, 
Q=Glutamine, R=Arginine, S-Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possibIe 
nucleotide insertion 














GNLTLIQVQGEGA 


8953 


22854 


A 


9019 


3 


209 


CPPLSPGCTNSARAEPSVHALPDALNNL 
RTHEGSGDGPSSSVDWNRPEDVYPQGIY 
VISAPSIYAREA 


8954 


22855 


A 


9020 


504 


20 


NYAQMSYSTARASANESWAYFMGRRKFV 
ASRQASQMFLCWLEEAIVRRVVTLPSKA 
RFSPQEARSAWGNCDWIGSGRMAIDGLK 
EVQEAVMLIEAGLSTYEKECAKRGDDYQ 
EI FAQQVRETMBRRAAGLKPPAWAAAAF 
ESGLRQSTEEEKSDSRPAAGS 


8955 


22856 


A 


9021 


3 


168 


LPRLEAAARIRHEERERERERERSRERE 
RERERERERERERERERAPPPTLCET 


8956 


22857 


A 


9022 


2 


89 


LCSLVYLLTFPPLLSCDPAKSPSARNTQ 
E 


8957 


22858 


A 


9023 


508 


371 


FFCRDKVLATLPRLLANAWPQVI FLPQP 
PKVLRLQPCTTPSLKDL 


8958 


22859 


A 


9024 


2 


170 


RTHSAGRPYQCNQCEKAFRHSSSLTVHK 
RTHVGRETIRNGSLPLSMSHPYCGPLAN 


8959 


22860 


A 


9025 


3 


185 


PPAATPQGPSPINSPPTKKKAKNKNRTK 
KTKKKKKPPPSKKKGPQKTPKKTMGGPL 
KTTT 


8960 


22861 


A 


9026 


104 


319 


LVFLSKYTPPLLVSLWIHGLSLLSFLPS 
LPSFLPSLLTYLLPSFLPSFLTSVINPL 
PSLFFLVFKLSQFWT 


8961 


22862 


A 


9027 


476 


364 


SLKRLTSGDLPASASQSAVITGVSHRAR 
PIIYFKYVQ 


8962 


22863 


A 


9028 


36 


154 


FLVGQAGXjKLLTSGDLPASGSQSARITA 
VSHRTWRTHYF 


8963 


22864 


A 


9029 


2043 


1237 


WLRTRVAPALPERLGACTQLGPVLPCSQ 
PYWCRQCPEYRRQAAQPPHCPAPEGEP 
GAPQALGDAPSTS VS LTTAVQD YVCPLQ 
GSHALCTCCFQPMPDRRAEREQDPRVAP 
QQCAVCLQPFCKLYWGCTRTGCYGCLAP 
FCELNLGDKCLDGVLNNNSYESDI LKNY 
IiATRGLTWKNMLTESLMALQRGVFliLSD 
YRVTGDTVLCY CCGLRS FRE LT YQ YRQN 
I PASELPVAVTSRPDCYWGRNCRTQVKA 
HHAMKFNHICEQTRFKN 


8964 


22865 


A 


9030 


15 


389 


IVDHYPCGGIPTSGSVTQAGAQWHNLGP 
LQPRPLGLEGSSCLSLICSWGHRHAPPH 
LARGHFSSEVEQLRSRLQHGAVFVKQRK 
GTPEDPRQGLRTLISSCLLKLVAIiALWF 
SLPFHGPRIHMQ 


8965 


22866 


A 


9031 


3 


243 


RHSLAFNRFSCLSLPSNWDCRHPPSCPA 
KFCTFVEMSFHHVGQAGLELLTSGDLPT 
LiASQSAGITGVSHHAWTRCCCCF 


8966 


22867 


A 


9032 


1 


245 


PTRPAGI AICICMCMKNHRATRVG ILRT 
TH I NTVS S Y PGP P P YGHDHEME YCADLP 
PPYSPTPQGPAQRSPPPPYPGNARK 


8967 


22868 


A 


9033 


46 


124 


QELMTHGAXSPDGTRVHNAFLFVTTV 


8968 


22869 


A 


9034 


235 


435 


KKKKKKKKKKKKKKKRRP FKLKG P FKPP 
LKPQGGLFEAPPFGGPPPPGFFFAGGGA 
PPRGFFKKKGK 


8969 


22870 


A 


9035 


333 


70 


CPIYQIPILKDQVNLSSISSPFSHLIKD 
LWKTLFISYITELIFVCFVEMMFHHVAQ 
AGLELLG S SNPP TS TFQSAR I TG I SHHA 
CLNI 


8970 


22871 


A 


9036 


297 


411 


NKTMS FAATWMQLEVI LS KLMOEQKTKY 
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ding to 
last amino 
acid 
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Amino acid sequence (A-Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, lr=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaJine, 
W=Tryptophan, Y^Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














CMSSLISGS 


8971 


22872 


A 


9037 


461 


167 


NKNYPLLGGPPFGS PS PENLNGG FPLTP 
GGGGPINPNSPPPPPPGGRRPPFLKKKK 
KKKKEKETNTYILQDTCDRCSAVLPIMQ 
VKIVMICLIMHRLF 


8972 


22873 


A 


9038 


466 


292 


NGCLLEPASWNSLANI FIFCREEGLTML 
PRPQVILPPLPPKVLRLLPPPLSS 


8973 


22874 


A 


9039 


34 


124 


GIRNIKFSSLSTFIAHKKYYCSSHAAEH 
VK 


8974 


22875 


A 


9040 


54 


239 


RRRRKLRRWGLAMLPRLVSNSWHQAIIiW 
PPKVLRLQACATTPGLTRRWLHGQIWSP 
GAGSV 


8975 


22876 


A 


9041 


469 


223 


P WAR VCVC VCVS VCVCACVR VCARAS E S 
QTCAAVPSLGPAGAGRPALGCCAVCEDE 
VGSCPHARPLPRAAVDKAAGRRGLCF 


8976 


22877 


A 


9042 


491 


380 


SPGWAHSYNPSTLDGRGSHITRVRKFM 
TCLATEGAV 


8977 


22878 


A 


9043 


144 


360 


IHSHWGYWLDVSLIAAHLAINPDCFMYV 
YIFSQCTDKLTEKEFIEGTLANKEILRL 
IQFEPQKVKEKMKNA 


8978 


22879 


A ' 


9044 


23 


263 


RWNSDGKGSLRPAWATWPDPVS IKNTQN 
THTTHPTPHSPHTPHGQQPFLSPTPFFW 
PVSSPSPFPPFSHTCLSCLLPPGA 


8979 


22880 


A 


9045 


461 


287 


PPSLANIYLFIYLFIGRDGVSLFPRLVS 
NCWAQAILLPWLPKVLGLQGEPPNLALK 
SL 


8980 


22881 


A 


9046 


485 


6 


NN KNS WAWWRVP VLP ATREAGAGE S LE S 
GRRRLR 


8981 


22882 


A 


9047 


475 


307 


LFLRDGSLTVLPRLVSNSWPQAILLPWL 
LKGLGLQARATLHMACSSCINSTQNILQ 


8982 


22883 


A 


9048 


483 


338 


LREGERQREQERERERERERERDRERAH 
TTAFTQLLFQYVAI ILLPCY 


8983 


22884 


A 


9049 


680 


303 


NDLLFGSLYVKQFAASGTGRVTLPEPSE 
EQWCLRCFQDLFGDDHNKNGFKMFDSS 
NYHGQDLLFKDATVRAVPVGEKTTYRGW 
LGLDYVAALEGMSSQQCSGAGRTGPPCR 
LPWCYSDPFTPEP 


8984 


22885 


A 


9050 


26 


165 


AFGYHRVGQAGLELLTSGDPFASASQSA 
G I TGVSHHARSQYFFVYV 


8985 


22886 


A 


9051 


461 


83 


HKINLPSHKKKKKKKKKKKKKKKKKKKK 
K 


8986 


22887 


A 


9052 


475 


209 


MAAPPKKKKKKKKKKKKKKKKKKKKI 


8987 


22888 


A 


9053 


531 


404 


SVTQAAMQWQKHNSLQGSTLEFKQSSLL 
SLPTIWDYKHADLS 


8988 


22889 


A 


9054 


66 


416 


KKKTNKI KMGGGPLPFKISPPFLKKTLS 
WKXGAPKTPPLEMAPFFFPQGGKKNS PF 
KGGKIFFFFFPKKTFIKGPPGGEHTPPL 
GGGPQI FSLPPPKGVPQXNPPPLLRGGL 
FFTI 


8989 


22890 


A 


9055 


1 


104 


ILAVNGETLEGVTHEQAVAILKHQRGTV 
TLTVLS 


8990 


22891 


A 


9056 


3 


106 


LKRWGIAMLTGLVLNSWAQVILPSWPPE . 
VLGLQV 


8991 


22892 


A 


9057 


1 


299 


PTRPINIILAFTISLLGILVYRSHLISS 
LLCLEG 1 1 LS LF 1 1 ATR I Y P I AP PL P PL 
EPEKKKKKKKKKKKKKKKKKKKKKNTKK 
RKI KAKR KNLrGGGGL 
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nucleotide 
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ing to first 
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sequence 


Predict- 
ed end 
nucle- 
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ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E^Glutamic Acid, ^Phenylalanine, 
G-GIycine, H=Histidine, Wsoleucine, 
K=Lysine, Lr=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possib!e 
nucleotide deletion, \=possible 
. nucleotide insertion 


8992 


22893 


A 


9058 


482 


146 


LTSSKKKKKKKKKKKKKKKKKKKKKK 


8993 


22894 


A 


9059 


2 


290 


YFLFAYTILRSVPNKLGGVLALLLS ILK 
KKKRGGPFKRTLGGPKFNRGGQGNIFFL 
MGGGLKPNLGILGRNLSLGGGKNWPNLP 
PKIKGFRENKNF 


8994 


22895 


A 


9060 


413 


48 


PQKKGFFQKNPKGFLKPPPKKKKKKFPP 
PGKIWPPQKIFKKPPPPFFFFFFFFFFF 
PKFFFFFFFFFFFFFFS I RNP PSS YTKW 
LRSLDVCHLLP P I TPQVPGPTP VLLFST 
LPDCNHSLL 


8995 


22896 


A 


9061 


2 


274 


FTISLLGILVYRSHLISSLLCLEGIILS 
LFI IATLITLNTHSLLANIVPIAILVFA 
ACEAAVGIALLVKKKKKKGRPFKRILGG 
PQINPG 


8996 


22897 


A 


9062 


42 


255 


QEFRTRRERERERERERERERESSGWR 
AKFRSNFSAGAIAHTIRVMLPPSGIYSN 
EKS INKCGFALLFF 


8997 


22898 


A 


9063 


20 


413 


YALRNSARAGDPHGQLGRRDGAVGGRGW 
LMPREALHPWWANVQAWAGS PGLFLGAP 
WPQGGVWLPWLCVSMSASSSS INGLPPQ 
KKKKKKKKKKKKKKKKFWGGGPGPQSPF 
WGGLKKFGGRFLKRGGGEN 


8998 


22899 


A 


9064 


6 


306 


ECEHSTTYSHFAGGGGPVMVGGDADARS 
KALLG VCVGSG TEAYVLVLD PH YWGT P K 
SPSBLQAAGWVGWQEVSAAFDPNSFYNL 
CLTSLSSQQQQRTLD 


8999 


22900 


A 


9065 


463 


126 


AHYQLVYTCKVCGTRSSKRISKIiAYHQG 
WIVTCPGCQNHHI IADNLGWFSDLNGK 
RNIEE ILTARGEQVHRVAGEGALELVIiE 
AAGAPTSTAAPEAGEDEGPPSPGKTEPS 


9000 


22901 


A 


9066 


23 


244 


SCHAGIRHEERERERERERERERERERE 
REREREIYFLALFSQKGRVHGFFKTPRG 
GQKRELLSNRVNTLCTE 


9001 


22902 


A 


9068 


396 


143 


KDIQNFGQAWWLTPVI PALWEAEAGGSR 
GQE I ET I LVNMVKPHLS VLLAKNSSSHC 
PMCWLAELQWYDLCS FQKI FFG 1 1 LI PG 


9002 


22903 


A 


9069 


828 


99 


VDREPVDREPWCHPDLEERLQAWPAEL 
PDEF F ELT VDDVRRRLAQLKS ER KRLE E 
APLVTKAFREAQI KEKLERYPKVALRVL 
FPDRYVLQGFFRPSETVGDLRDFVRSHL 
GNPELSFYLFITPPKTVLDDHTQTLFQG 
EPLPGRSGCNLGAEEPAGVYLEPGLLEH 
AISPSAADVLVARYMSRAAGSPSPLPAP 
DPAFKSEPAAEEGALVPPEPI PGTAQPV 
KRSLGKVPKWLKLPASKR 


9003 


22904 


A 


9070 


20 


427 


DGTRRERERERERERERERERLYLTEDT 
LHSLS LEGARSGAS VCVQSVYPTGASAP 
HS I INGHKRVLVLCVETEHTFPLAEKDR 
AFLGDTSVFVGAAPVGGANTGASAGGAP 
PSRGE PKEESGESHEDMGFGLFV 


9004 


22905 


A 


9071 


297 


126 


CS I YDKRI PTFGLGEGVSERQRLTLLL P 
CLPPALTETPTS SDWQSTDATPTLTNS S 


9005 


22906 


A 


9072 


117 


286 


WLRGSEKMFQVQE INEQYLQCI L I KG I 
FLVKKKKKKKKKKKKKKKKKKPRPEYI K 


9006 


22907 


A 


9073 


3 


302 


PPMETLRQWTTIVHSQEEEEEHFEVIKD 
EVKWARKHGQPGTPVAIATQLPPRTSA 
AFPAQQQPLQPLAPTTVLSVSADWNALL 
QRPNKYI WGLQYLEM 
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nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 
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Amino acid sequence (A^AIanine 
C=Cysteine, D=Aspartie Acid, 
E=Gtutamic Acid, ^Phenylalanine, 
G=Glycine, H^Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparogine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 


9007 


22908 


A 


9074 


437 


79 


NQKREEEEERRKKRRRRRRKRRRRKKKK 
KILRQKKKKRRKAGSREKKKI LRHLPEK 
NKRSKKHYELINALSRLQDTRLIYQKMH 
FYILAKNNWKIKLKIQYHLQALENKVPR 
HKWBTYA 


9008 


22909 


A 


9075 


51 


185 


CRADFMFFHDKI IVYLCSLVYLI/TFPPL 
LSYDPAKSPSARNTQE 


9009 


22910 


A 


9076 


452 


310 


QFSRCHLSSLSLDRKGNSLTPCTSWVRR 
CLALLRLTLGALHLLSDNPQ 


9010 


22911 


A 


9077 


438 


326 


LLHCWWDCKLVQPLWKSVWRFLRDLELE 
IPFDPTLHF 


9011 


22912 


A 


9078 


37 


169 


GIPGFRHAGQRRLQLLITGDLPTSASHS 
AGITGTSDRARPPSTF 


9012 


22913 


A 


9079 


1 


408 


RKGRLSKEEIERMVQEAEKYKAEDEVQR 
ERVSAKNALES YAFNMKSAVEDEGLKG K 
ISEADKKKVLDKCQEVISWLDANTLAEK 
DEFEHKRKELEQVCN PI I SGL YQGAGGP 
GPGGFGAQGPKGGSGSGPTIEEVD 


9013 


22914 


A 


9080 


31 


168 


VRRN PRLE HTVT I TAY CSLELLG S SNL P 
ASASQVARTTGLQLLNLQ 


9014 


22915 


A 


9081 


460 


319 


YFFVEMGSHYIAWAVLELLGSSNPPASA 
SQSAE FTDAHKPTDCRPLR 


9015 


22916 


A 


9082 


465 


326 


PCVAEFCTHTHTHTHTHTHTFKCVGSHL 
HTNND 


9016 


22917 


A 


9083 


33 


269 


PGNSRPISSVRGIHPPCLANFCFLNVFV 
EMGSHYWLAGLILGSSDLSALAVQSGR 
VTGMSHCAWPE I F FLLENW 1 1 L 


9017 


22918 


A 


9084 


47 


300 


DLITSLFLLKQSSLATMNIQTPFKNIYF 
DLIIKTLFLRFFIFCFIYFYFLFFSYGG 
FFSFFFFFLKGSSFLPPRLNGGAKIWVN 


9018 


22919 


A 


9085 


437 


220 


NIAIKSLSLSLSLSLCVCMCVCVSACVC 
MCLSLSLFSPSSLSHSLFPFSTPNMDIS 
HLPILSTQENMLLLLQ 


9019 


22920 


A 


9086 


227 


23 


DCPKANILGLTPRLVSNSWPQVILTPWP 
PKGQSLLCRQGEGDRIWTWHPWWLLHPP 
PQARFPPRTHFH 


9020 


22921 


A 


9087 


206 


326 


PHWAQW PT P V I PALW EAE AGKS QGQE FE 
TSLANTVKPRLY 


9021 


22922 


A 


9088 


364 


123 


SRWLPLKAPGEDTSLPLPASGGSRCSL 
AWGS I PP I SAS I PTWPS VLCP CE I FPLL 
FLLYGDCNLGPTLI PDDLIMRSLP 


9022 


22923 


A 


9089 


373 


205 


MPPRIxANFCYFSRDWVSPYWSGWSQTPC 
WDYRREPPFPPCVRTSYSVQYCACYAGL 


9023 


22924 


A 


9090 


283 


2 


REFS FS AWLAFKI F WKLFS P P F WRVR P 
EDWFIPGGCGPNEPNFPPCPPGVGKKGE 
PRFKKKKKKNFI SSQAQWLMP VI PAFWE 
AKAGGSLEP 


9024 


22925 


A 


9091 


16 


261 


CPEDLLVGQNVKVEDSDMGDPDRRPMCM 
FACFYLLVYRRCLAMLPRLYLNSWAQVI 
LLPQPPASLELLQAHATTHPALCLSF 


9025 


22926 


A 


9092 


17 


109 


RQGLAMLPRLISNSWTQANLLPWPPKEL 
GL 


9026 


22927 


A 


9093 


282 


114 


KKGGKGALLVFLKTGRKP FF FFLRQGLA 
MLLRLVLNSWAQAILLPWPPKVLGLQA 


9027 


22928 


A 


9094 


205 


337 


NANLHGRMQRLTPCLPE I WEAETGGSQG 
QE I ET I LANMVKP CL Y 


9028 


22929 


A 


9095 


62 


287 


EHSCAKLFCFAFESGPYSVAQAGVQWHD 
HS3XQSQTPGLKQSSCLSFPASAGFYKE 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G^GIycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














SPS FYKDKSCHHLGKHSGA 


9029 


22930 


A 


9096 


170 


9 


AASTGYCIGRFCTKLKIQSGGQVRWLSP 
WP KREAE VGGLLEARSSRP I WATW 


9030 


22931 


A 


9097 


2 


104 


RRPGGRAEPRARVRGDRDPPVLLRHYPH 
IFEECG 


9031 


22932 


A 


9098 


350 


142 


LCLSCPKGGGYRAMAPWRGNFCSFGKNG 
GFPMLGRVVLKWPQMGPLPQPPKRVGFQ 
GLSPPPGPPPFLV 


9032 


22933 


A 


9099 


494 


342 


YFYFYFVEMVSHHVTQAGLELLASSNPP 
ALISQHAVITSGSHHTQPKLYF 


9033 


22934 


A 


9100 


30 


232 


GGIMLCHGRCACFPFAFCHDCKFPKVFQ 
PCFLYSLQNCSFLYKIPSLRSCRLQLAE 
LLSIIYLENPIi • 


9034 


22935 


A 


9101 


26 


171 


RNSRRLRHKNRLSLGGGGCSEPRSCQCT 
PAWQTARLSLKNKKQQQKKP 


9035 


22936 


A 


9102 


486 


239 


HHTCPHQVCVCVCVCVYRVCVCVCVCVY 
IYMYMCIYIYIYIYICIYICRERERERE 
REMWFHHVAQAGLELLGSRDLPAPAS 


9036 


22937 


A 


9103 


487 


344 


TGFHHVGQAGIELLTSSDLPASASQSAR 
ITG VSHHAQPLFMI FCAKC 


9037 


22938 


A 


9104 


470 


221 


TPGLKRYTCLSLPKCWDYRCEPHARQGA 
FLCAFSPLSSQQPWGKRPHFHFSYETNK 
VERGKRIiPLRSPSTEQGFDFRCVLFQT 


9038 


22939 


A 


9105 


468 


259 


YVFF CRDGGLT I SSRLVLNS CPQQS PCL 
SLPKCTDYRHEPPCPSCCSNLLLMFHFT 
NVFLIDSECSHFTP 


9039 


22940 


A 


9106 


466 


329 


RVAGTTGTHHKAWL INI PS VETGFHKVG 
PQVIQPPRPPKALGLQA 


9040 


22941 


A 


9107 


409 


306 


AGFELLAS SDLPAS ASQSAE I TGVSHHA 
RPSLIF 


9041 


22942 


A 


9108 


417 


269 


MWAQADMESSKEPGLFNWI INDSLVQ 
AYPBLKEALYEEIKKAQRRGA 


9042 


22943 


A 


9109 


533 


399 


RGSLRHETRLNAEARRCSELNSRQCTPA 
WATERDAVSKKTKQKA 


9043 


22944 


A 


9110 


16 


239 


LVHFEVRWNSHEVSLTADSRKSHLIFPQ 
PNYADTLINQES YEKS EPLL I TQDLLET 
KGEPRQLQVSFFPPKREE 


9044 


22945 


A 


9111 


68 


219 


EPPHLANFLQRLVFTMLLTLILNSWPGV 
ILPPGPPTVLGLQDKLPVLLPS 


9045 


22946 


A 


9112 


192 


57 


DRVPLCPPDWSAVARSWLTAASVSQVKG 
ILSPQPTSCAPPGNST 


9046 


22947 


A 


9113 


21 


201 


TIVSWEFMNLGGGTCGEPRWCHCTPDW 
GTERDSVSGEKKEDKDVARRHWVGSFYN 
GPQ 


9047 


22948 


A 


9114 


2 


144 


TFTIDRCAKDLFVAKQVGTKLSVNEPLS 
FS VES ILXRPSSAI TRVSQ 


9048 


22949 


A 


9115 


156 


316 


MILN KKKKKKKKKKKKKKKKKKKKKKKK 
KGGGPFKKTLGGAKITPGWKNYFFFFMG 
GV 


9049 


22950 


A 


9116 


1 


84 


TKPTPFIiPTLI TLTTLLLP ISPFILIIL 


9050 


22951 


A 


9117 


1 


176 


HLNLGGGVCSEPRSCHCTPWATSILSQ 
KKKKKNGFKFWEPRPQKEGGGGRKKPPG 
VP 


9051 


22952 


A 


9118 


1 


184 


AAAGYPGQQQNFHSVREMFESQRIGLNN 
SPVNGNSSCQMAFPSSQSLYRTSGAFVY 
DCSKF 


9052 


22953 


A 


9119 


69 


374 


I I FS KKKKKKKKKKKKKKKKGGPP KKKP 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A= Alanine 
C=Cy stein e, D=Aspartic Acid, 
E^GIutamic Acid, F~PhenylaIanine, 
G=Glycine, H=fiistidine, ]=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














WGGPKI FVLRRVLS FFIRGEKKNTFGDF 
LKKTFFWGGEKLGKPPPKKKTPWGKKKI 
FKGERGKKPPFLGGKKK 


9053 


22954 


A 


9120 


405 


93 


EFSPLLLTTNQKRGFFPPSPPKI FFFPK 
RIYFLGGGGPKFPPPQKRGFPQKNQKGF 
FFSPPKEKKYNLPPPGKFCPPQGFFFPP 
PPLFFFS FFPFFFFRCQIK 


9054 


22955 


A 


9121 


2 


380 


WDYRCPPPHPTNFCI FSGDGVSPHWPGW 
SWPQVICLPQPPKVLLTPWVRDRTRDLE 
GGDL WYGACLGACLHLGPQAASFGHG I L 
G PASLLCC I PKHVHMDTLS PGVKLQS CD 
TGVALRVGEKREGS 


9055 


22956 


A 


9122 


2 


534 


DMIREQKIYHKYLAQRREEEKAQEKEFD 
RILEEDKAKKLAEKDKELRLEKEARRQL 
VDEVMCTRKLQVQEKLQREAKEQEERAM 
EQKHINESLKELNCEEKENFARRQRXiAQ 
EYRKQLQMQIAYQQQSQEAEKEEKRREF 
EAGVAANKMCLDKVQEVLS THQVLAQN I 
HPMPQGMPQ 


9056 


22957 


A 


9123 


389 


46 


GPNFLPPPIFGFFSPFSPLKFFFSPKTL 
IFLGGFFPIFPPPKKSFFPKNSPGVFFC 
PPFKKKNFLFPSPLNFAPPRVFFKPPPP 
FFFFFFFFFFFFNFYRLYFIWVNGLAKV 
VW 


9057 


22958 


A 


9124 


422 


59 


AQQKKKKKKKKKKKKKKKKKKKKKKKKK 

K 


9058 


22959 


A 


9125 


333 


168 


KGP P C I WCM VE FHKIiEAGSHS VAQAGA 
QWCDGNLLQTLTLGLKRSPHFSLPSR 


9059 


22960 


A 


9126 


175 


89 


KPPPHFFFFFFFFFFFFYFFFFFFFIFC 


9060 


22961 


A 


9127 . 


368 


250 


CFNYLSRDRGLPLLPGLVLKSWPQAILL 
PQPPKMVGVQG 


9061 


22962 


A 


912S 


392 


157 


FFSPKRLNFGGGGGPKMPPPKKKVSPQK 
PHPGFFS PPQKKKKKFFPP P VKMGPPKS 
FLKSPPPFFFFFLISSIRGLFL 


9062 


22963 


A 


9129 


396 


13 


FPPPPPPIF FFS PRRFFFRGGGS VALS P 
PPKVFFLLNPPPVFFFSPPFKEKTSFFP 
PPFFCPPPQILFLAPPPYFFFFFFFFLF 
VFFFLFFFFLLPLLFLGKTSFLLLLFLF 
I YLETGS CS VAWARVQ 


9063 


22964 


A 


9130 


55 


180 


LLYNKLI FTLKKKKXKKKKKKKKKKKKK 
KKKKKEKYEKTKAY 


9064 


22965 


A 


9131 


2 


95 


SFVHMEHEKAVLLLKSFQNTVDLGIQRE 
LTV 


9065 


22966 


A 


9132 


172 


404 


KKKKKKNKKKKKKKKKKKKKKKNWGGGF 
FKNSWGGPKKRGGGKIFFFSLLGGKKKT 
LGVFLEKPLFLGGRKMAPPPP 


9066 


22967 


A 


9133 


1 


125 


QPVSYPFTIIGQVAYELYFTTIIilLIPT 
MSLIENKILKWAY 


9067 


22968 


A 


9134 


1 


120 


QPVSYPFTIIGQVASVLYFTTIIilLIPT 
ISLIENKILKWA 


9068 


22969 


A 


9135 ; 


7 


269 


IATGQPRLQKLTLDPTLLLKRFLLTKKK 
KKKKKKKKKKKKKKKKKKKKKWGGGGHK 
KKPPGGPKT YPVKKKI FFFFKGKI XKPP 
WGI 


9069 


22970 


A 


9136 


1 


64 


LVSI SNTYGLDYVHNLNLLQC 


9070 


22971 


A 


9137 


400 


259 


SQPIiRRLRHKNILNPGGGHYSELGSCHC 
TPAWDTEQDSIHSWLLSSV 


^9071 


22972 


A 


9J38 


3 


310 


E C FPK S PNMNGH CNLLVQS TS P S ISTMP 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
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od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted* 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E-Glutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, JR=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /-possible 
nucleotide deletion, \=possible 
nucleotide insertion 














CPKYVALEKVFLLNVLWPFYLKYRDNRP 
GSVAHTCNPSTLGGRGGWITGAQEFETS 
U3NIARPSLIKRKKYKLF 


9072 


22973 


A 


9139 


34 


142 


NCKKLNTHTHTLTHSHTHMLTHSDTFTH 
THTHTKYK 


9073 


22974 


A 


9140 


146 


2 


PPPPPPEGVSLCGPGWSAVARFRLTASS 
ASRVHAILPPQPPQQLPTRP 


9074 


22975 


A 


9141 


3 


102 


PPAS ASQSAGVTGMSHHAQP I EPLRCSS 
HRTT 


9075 


22976 


A 


9142 


245 


51 


NREGDLLCCPRWSQTFGLKQSCLGFPKC 
WEYRRGPPHLAYIMSMKKNILIQMIFYY 
NNPCLLYS 


9076 


22977 


A 


9143 


2 


108 


GAKCVSDRIKHAFLIKEQKI IVKVLKAQ 
AGSQKAK 


9077 


22978 


A 


9144 


3 


200 


MGFLHVGQAGLKLPTSGDLPASASQSAG 
ITGVSHRAWPVSGFFCSEAKECFWEEQL 
INLDKNSQK 


9078 


22979 


A 


9145 


405 


227 


WFFLGFPLKGKLGPPKMFFRAPRVYSPF 
FPPFFFWPPPPRIIFPPGFFKFFVFLAG 
PIW 


9079 


22980 


A 


9146 


387 


220 


PLFFVGCFFFFFFFFYYFFFFFFFFFFF 
FFFFFFFFFFFLGGCERNFI IRKNINN 


9080 


2298! 


A 


9147 


539 


220 


VINFKHILIKTVIQSLFFFERGSCLPPR 
LECNSMIMAHCSLNLLGSSHPPASASQV 
AGTTGAYQHTQL I FI FPAETGSCYVAQA 
GLQLLGSSNPPPWPPEVLGLQV 


9081 


22982 


A 


9148 


3 


141 


GGGGCCE PRWSHYTPAWATSAKLCLKKK 
KKKRFLEKSCLTPLLKA 


9082 


22983 


A 


9149 


413 


240 


FIFCRDGGLTMLPGLTSNTWSQVILPPQ 
PPKVLGLQAYHADELPHLATWGVLYGI/F 
V 


9083 


22984 


A 


9150 


395 


193 


LLFGPCLLNLITQFVSCHLLDI KLQMTL 
S EG Y F PLNI Q E S P F YRAPLDD PS VRHAR 
GEIbPISLGPG 


9084 


22985 


A 


9151 


1 


77 


LVAAI VLGGTGWGFKN IRRGRS GGA 


9085 


22986 


A 


9152 


365 


181 


I VDRPDP CRDADAQREVFTLS AR YP KI W 
VPPGDPAREFPFIQSAPVMLLAAPDELV 
SAQLA 


9086 


22987 


A 


9153 


373 


251 


FKKIFFCRDRALNMSPRLVSNSRTQAIL 
PPQPLKVLRLQA 


9087 


22988 


A 


9154 


204 


95 


DYHWSQPGMVAHACNPSTLRGRGRNITR 
SGDRDHPG 


9088 


22989 


A 


9155 


44 


280 


AELFGTLMREDGLS LGGGGCKE PKS CHC 
NPAWATEPDCLRKKRKESPYLLWTPPD 
SSYSPFFFLLLPAPDNNDYFLS 


9089 


22990 


A 


9156 


2 


117 


ERLRHENCLNLKGRGCSEPGWCHCTPAW 
ETKQDSISKK 


9090 


22991 


A 


9157 


3 


207 


HASAHAS ADAWD YRR VPP CPANFCI L YV 
SHVGKAGLNLLTSSDLPTSASQSAGITG 
VSHHARTPTYV 


9091 


22992 

i 


A 


9158 


117 


267 


NRVLLCCPDWPQTPGLKGSSLSLLSSWD 
YRHIPPRLAARSSYLGPGFYH 


9092 


22993 


A 


9159 


415 


167 


LFFFPIKNXFFFSPPLFFFFFFFPPPPX 
XXFFFFFFPFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFLELKQNF 


9093 


22994 


A 


9160 


237 


41 


SSQIHRDRNRMVRQGAVAHACNPSTFGG 
KGEQ I TRAQEFKTS LANMVKPRLYYKPC 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine, 
G=GIycine, H=Histidine, I=lso)eucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine ? 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LY 


9094 


22995 


A 


9161 


341 


164 


FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFYVCF 
VSVELKKWQNYSFDVRIILRVEP 


9095 


22996 


A 


9162 


190 


402 


IHTGKHP KCYRN S YS VLFLAS KKKKKKK 
KKKKKKKGGGFKKKXNLNRGGGEI IFFF 
WGPKKIFGGGGLKKG 


9096 


22997 


A 


9163 


108 


209 


LS VKRCAKNPKKGKVAPG I KE I RTRSHQ 
DSEDI 


9097 


22998 


A 


9164 


2 


145 


KHCGRPRPQGPLPRKTRDLCSLVYLLTF 
PPLLS YDPAKS PS ARNTQE 


9098 


22999 


A 


9165 


170 


402 


ISRCHFPSLILSYVKITKTIKAPKKSKK 
KKKKKKKKKKKKKKKKKWGGGPFKKKNF 
LGGGGGE I FFFLGAPKKKRGG 


9099 


23000 


A 


9166 


399 


113 


WRLRWEDCLSLGGRGCSEPGSCYCTPMW 
VTEQDQSSGYEGFQRPSSLSKARIPSVA 
AERSMCQPACVCWLSVEHLSLYSCLtLAQ 
QGKALYKLLNK 


9100 


23001 


A 


9167 


3 


138 


SWITGVCHYAWLI FAFLVETGFHRVGQA 
DLELVTSGDATTLRSH 


9101 


23002 


A 


9168 


134 


301 


SKLTQSHSVTQAGVQRLNVKSLRPPPPR 
FKRFSCLSLPSSWDYRNLETVLQEETH 


9102 


23003 


A 


9169 


201 


1 


GATILRFKTKKKNVFFKKKKKKKKRKKK 
GKGWSGTESRWNLSGEGSSGLRRRKLQA 
PAGPQEFEKLA 


9103 


23004 


A 


9170 


2 


191 


FVPQPLPLGFKRFSCLSLLSGWDYRCTP 
PHLANFCG FFFLKF FF FFFKKKNFFS PR 
GRGGALI 


9104 


23005 


A 


9171 


2 


191 


GGTES CFVAQAAVQGCDLGSLQS P PTGF 
SETASQTVDPDKVKDPGKLRTMFVAELP 
KTIVTDS 


9105 


23006 


A 


9172 


3 


143 


SWFLYVGQAGLELLTSGDPPALASHSAG 
ITGVSHRARLQTG I FRAV 


9106 


23007 


A 


9173 


361 


174 


VLGIILSMSETEI FNTKCLDQSGTVAHT 
YNPSTLGGQGGRV I RAQEFETSLGNTGR 
PHLFKI 


9107 


23008 


A 


9174 


493 


351 


LVETGFNHVGQAGLE LLTSSDP PASASQ 
SAGITGVSHLALVLISRKSC 


9108 


23009 


A 


9175 


I 


115 


FRKIWYIYTMEYCIAVXKKEMLSFTITW 
MNLEDFMLVR 


9109 


23010 


A 


9176 


286 


96 


PLFFFNRWGLAICLGWSAVAIHRRDHSA 
LQSQTPGLKRSSCLSLLSSWDYRRVPPC 
LANE SHY 


9110 


23011 


A 


9177 


399 


237 


RNTKWLFFLRRSLTLSTGLECSGAISAH 
CNLRLPGS SDSLANG I FKDLALLDM 


9111 


23012 


A 


9178 


427 


101 


grtgrqgrgaph: SHDGRPGRDAPRFPD 
VMAAGKRRSSLPRWDGGRAETLLTFQTG 
QPGRGAPHIPDDGRPGGDAPHFPDGWA 
GQRLQSQHFGRPRQAAGRWRL 


9112 


23013 


A 


9179 


193 


41 


IVFLFIYLFCFFVGTGPCYVAQAGLKLL 
VS SGPLASAPQHVG ITGMSHHT 


9113 


23014 


A 


9180 


292 


119 


K I TKINKQ I LNLS YTLDQMDLTD I CR I F 
HPTATEYRFFSRAHGTFFRSGTKWSAWH 

T 


9114 


23015 


A 


9181 


3 


279 


ALSPPYSSECCKLGSFNRELSSPSPGGQ ' 
KSEIKVSAGPRPLPRVQGDPSPPLPASG 
GPRCSWACGQITAVSASGVTHPSPPLLH 
VCCVCLI 
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SEQ 10 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, . 
E=GIutamic Acid, F=Pheny!alanine, 
G=GIycine, H=Histidine, I=Isoleucine, 
K=L.ysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \-possible 
nucleotide insertion 


9115 


23016 


A 


9182 


402 


238 


QMQWHNLWPLQSLPPEVTPLFCLNLPNN 
WGYRWPPPPPTNFFFLKGVSIiGRPGSV 


9116 


23017 


A 


9183 


3 


207 


E FHHVGQDGLELLTSGDP P AS ASESAG I 
TGVSHRAQQACRFLHFFEGTCFLSQELS 
YLKLGRDTIIC 


9117 


23018 


A 


9184 


3 


106 


YPHYVKSIVASTFI ISLFPTTIFMCLDQ 
EVIISN 


9118 


23019 


A 


9185 


no 


248 


GVLVGSPLRTRFHSQLQAEGAEEIGWT 
WMGDGVLPVCWLEVDV 


9119 


23020 


A 


9186 


925 


1132 


KCPSAWAAHPTSEIRISQGGSQAAWLV 
LFFLLKNPGDVIPALWEAEAGGSQGQQF 
ETSLANMAKPRLY 


9120 


23021 


A 


9187 


399 


292 


AS Q VAGT TGAHHHARL I FF VFL VETG FH 
HVSQDENF 


9121 


23022 


A 


9188 


7 


82 


PCDPPASVSQSAGITGVSHHAWQIF 


9122 


23023 


A 


9189 




117 


LPISLPSFLPSFLPSLLFSFLPSSFLPF 
FFPSSVTPYLK 


9123 


23024 


A 


9190 


238 


46 


NKAFRIKAFVSPSNKKFSSFFFFLNEMK 
SHYVDQAGLELLSSGNPS I LASQNAKI T 
GVSKHVRP 


9124 


23025 


A 


9191 


2 


282 


LI FYFVDNMLVRAGLELLSSSDLPALAP 
KVLRFTGVSHHTRPELYFQRLLAPGTLL 
INSQAAGTNFPIGLSRIKPTQSGHGMLQ 
EVWGTSGLR 


9125 


23026 


A 


9192 


44 


136 


CLCVCVCVCVCVCVSCKQTLMCCFLENF 
DV 


9126 


23027 


A 


9193 


469 


347 


PCPANFVFLVETGFLHVGHAGLKLLISG 
DLPNSDQKFYVN 


9127 


23028 


A 


9194 


427 


238 


PPARPTAFPYIEFFVEMVFRYFGKAGLK 
VLISGDLCASASPGVGITGVSHHAQFK 


9128 


23029 


A 


9195 


34 


178 


SLVHWEFSRGRAEWLTPVIPALWEAEA 
GGSRGQEMSTILANTDFWTG 


9129 


23030 


A 


9196 


22 


338 


CGPYSMWWNSIKRERKEGKKRKKKRRRK 
EQKERERKERKERREGGREGKERGRREG 
KKRKKNKRKKVGLIGSPCIVFTWQAAYC 
EESGGVWPGGETEHTQEGLVQ 


9130 


23031 


A 


9197 


3 


91 


QGLSLGGGGCSEPGSCHCTPAWATERDS 
I 


9131 


23032 


A 


.9198 


1 


203 


RTRGKKS I DDTSNFDDFPESDI LQPVPN 
TTEPDYKSKDWVFLNYTYKRFEGLTQRG 
S I PTYMKAG KL 


9132 


23033 


A 


9199 


2 


144 


PTRTMPCCCCCCCCCCCCCRCLFMTGVL 
FTCCKNQFENHSGCLPRQI 


9133 


23034 


A 


9200 


1 


99 


YAAKELVHDRYRPTIEMERRRGLWWLVP 
RLSLE 


9134 


23035 


A 


9201 


3 


149 


LNAVQRMEWQLKVQELDPAGHKSLCVNE 
VPSFYVPMVDVNDDFVLLPA 


9135 


23036 


A 


9202 


284 


21 


KGPPLFFFFFFLRQGLVLWHRLEYSGMT 
TAHCNLELLVSSDPPTLSSQNAETTGVS 
HCAQPASLI LSKPYIGFLHQIQKSCLPV 
TFL 


9136 


23037 


A 


9203 - 


1 


77 


LRLVLNS WRQAILLLWPPKVLGLQA 


9137 


23038 


A 


9204 


529 


340 


GVLLWRQAEAGELSLGGGGCSELKVRHC 
TP AWATRAET PS QK I NE LI KS MAVE LTD 
DFIT 


9138 


23039 


A 


9205 


413 


241 


HPPPPELKLFSCISLISRWCKRHRPPCP 
YVFVLLAETGFHVGQAGHKHTYCCVSLP 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyiaIanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine,M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonme, V^Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop cod on, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














T 


9139 


23040 


A 


9206 


33 


352 


QLQGLESWGMERCALPGYPWFTNLCKY 
RSWIEETMRDK 


9140 


23041 


A 


9207 


70 


402 


HVDWEALGNDPRLVNLGICKLKASGIHE 
WVHTRTCAHTHMHTHTHTCTHCRS PRW 
VLNLG CTFELPGALKNARAYPHPNATKP 
MTSESVGIRPTRQVFKAPRAEQHCSTA 


9141 


23042 


A 


9208 


46 


262 


GGIPPAAVRGGRSWLRDVLSRLAAAGRR 
RRGTRRGRGRAGGAGGGRDRVGSRREPA 
AARTAAEGERAVRGKC 


9142 


23043 


A 


9209 


15 


206 


PIEPLSERTNSAGAPPPASLPPCSLISD 
CCASNQRDSVGVGPSEPGDSDMQTSLWA 
KVPVSKD 


9143 


23044 


A 


9210 


475 


110 


NLLAPHLLVSLEKHS IRMGVAQFSRCHL 
SQLPLARKENSLTPCASGVRRCLALLRL 
MLGGLHPLSCTHCPTGPSEMNLVSQLEM 
QKSPVFCVAHAGSCRLELFLFGHLPL 


9144 


23045 


A 


9211 


420 


132 


PPPPNYFGGAGFFPPPPFFLRPPPPFFF 
L.PPKKKKIFPPRGPKKIFFLKAPPPSFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
LVRSHFILTIM 


9145 


23046 


A 


9212 


353 


87 


FFFXKQXFFFPSLFFFSPPXXFSFYPF* 
YLKYPLLPPSFFLFFFTPPPPXFFFFFF 
FFFLFPPPFFFFFFFFFFFFFFFFFFFF 
FFFFD 


9146 


23047 


A 


9213 


2 


276 


FIQLLKRRLCELGYEVLPHPSYSHDLLP 
TNYHFFKHLDNFLQRKFFHNQQDAESAF 
QEFAE SQSMEF YATG INTL I S PWQKCVD 
CSVSYFD 


9147 


23048 


A 


9214 


402 


276 


LSCLSLPSSWDYRCPPPKPGYFFYFFLF 
I>VKTGFPHLAQAG 


9148 


23049 


A 


9215 


386 


37 


GFKKRVCSPFFFFFFFFFFFFFFFGREG 
FFLQKKKyTCNNPSSNFCLLTLGLQGKKE 
IQPALMKKNSISQS 


9149 


23050 


A 


9216 


1 


142 


KDFRVQVWYNHSSLQPQPPGFKQSSRLS 
LPKYWDYRHEYLHCIQLAP 


9150 


23051 


A 


9217 


402 


181 


FFFRXFFFFXXFFXXXXFFFFFXXXXXX 
XXXXXXXXXXFFFFXFFFFFFFFFFFXM 
FQQEKLLFIFYVNIWTF 


9151 


23052 


A 


9218 


2 


87 


SGSLI FGLI YICMCVCVCVCVCVCVYPC 


9152 


23053 


A 


9219 


420 


271 


YYCI FRETGFHKVGQAGLELLTSGSPPA 
SASQSAGITATAPSPFLTFLLV 


9153 


23054 


A 


9220 


139 


413 


SGYHSKTLFQKKKKKKKKKKKKKKKKKK 
KKKKKKKKKKKKKKKKKKKRGGGAFKKK 
I FFRGGGGNFFFFLGPQKNI GGGGLKKR 
GGGKKPPPPQIFFFVG 


9154 


23055 


A 


9221 


134 


236 


LHRWGPAMLPRIILNTWPQVILLPRPPK 
VLELQA 


9155 


23056 


A 


9222 


364 


280 


STHLSLPKYWDYRHEPPCLAKKDAQSFS 


9156 


23057 


A 


9223 


1 


147 


TLLPMLESSGTIMAHCSLNYLGSSDPST 
SASQSARITSVSHCTQPPLPF 


9157 


23058 


A 


9224 


1 


208 


AGAPPPASLPPCSLISDCCASNQRDSVG 
VGPSEPGAGYHLWRRLSQSEKRS IRVG 
VTRFSSLLLSSHA 


9158 


23059 


A 


9225 


1 


145 


FQHVGQAGPKLLTSGDLPASASQSAGIT 
VSNRVRP KTKNF YTGNTRPN 


9159 


23060 


A 


9226 


396 


210 


KCWD YRHEPPY PALTNLLF I EMGS YG VA 
RAGLKT..T.^SSNPPI.SASQSAGVPGLSHH 
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SEQ ID 
NO: of 
nucleotide 
sequence 
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NO: of . 
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beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AJanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leu cine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














AQPIF 


9160 


23061 


A 


9227 


2 


154 


FHHFGQAGLELLTS SDLPTS ASQSAE I T 
GMSHHTQPRMVFFSNFSSTAQC 


9161 


23062 


A 


9228 


1 


238 


FFLLALG I F I LHAI YLS I Y LS I HP P S MH 
PSIiSIYLSIYLSIYLSIYLSISLSLSLS 
IYLSIYLSIYLSIYLSDFLSSAR 


9162 


23063 


A 


9229 


3 


65 


RGD LITE PFLPKLLTKVKG S 


9163 


23064 


A 


9230 


25 


188 


YFRTAWNQAWW YVP I VSATWEAEAGGLL 
EPTSSRLRCAVIAPVNSYCTSTWAVS 


9164 


23065 


A 


9231 


44 


232 


YDIiK FNNHWFY FL FETE FHS LSKVECN 
GAISAHHNLCLCDGARLHHRRGGKKTKK 
QKTLPS 


9165 


23066 


A 


9232 


3 


195 


FFVGLGSPYWQTGLELLCSGNPPASAS 
LNARI TGMSHQVQPNTGRLS FYLGSS I Y 
GSIHAMI 


9166 


23067 


A 


9233 


380 


244 


SVPLAEMGFHHVGQASPELLPPVIHSSQ 
PPKAIGLQARATAPSLH 


9167 


23068 


A 


9234 


11 


101 


LLL INTLMF IHLS P I LLLS LNPDI I TG F 
EG 


9168 


23069 


A 


9235 


411 


226 


FFFFFATNSQGGPTSFWGPRNNAQVAGP 
LRNKI FTP VGS PKKI FPS WETP VF 


9169 


23070 


A 


9236 


404 


262 


QGLAMLPRLVSNSWTQAILLPWPPNQSA 
GGSGVSHHTQPIFISENIH 


9170 


23071 


A 


9237 


369 


152 


HAFFVS IWRSFHVSNSYMTVTVQMYHNL 
SIPLPDPHFFFRGGGLAMLPRLVSNPWP 
QAILPPQPPKVLGLQA 


9171 


23072 


A 


9238 


3S9 


190 


RPKKKKKFFPPPGGKGVFFKGAPPPYFL 
NFI FFFFFFFFFFFFFFFLIYTHNFLIL 
FLNCDKLYIT 


9172 


23073 


A 


9239 


330 


3 


GGMEPPFFPPTGAQWGYFGSLQNSPPEF 
KQFLCLRLPNVLEYKSIPPCPTNFCSFY 
KNGIFPCWPGWPFKFQKKKKISWVWWCA 
LAEAGGSLEPRWRWRLQSAKMAPL 


9173 


23074 


A 


9240 


122 


248 


KKKKKRGGLTPVI PPLWEAKGGGSRGQK 
I QT I LAGGGKAPL Y 


9174 


23075 


A 


9241 


2 


286 


FFVQTEFHHVAQAALQLVSSRDPPTSAS 
QS VD I TGVSHCAQP V I VSCDDQVTLVQR 
RQVTVTS L IAFCQVAYPKGAKS QWPQS F 
FFSSPFLFSL 


9175 


23076 


A 


9242 


2 


95 


HMKKCSSSLAIREMLIKTTMRFFFFKLI 
TCY 


9176 


23077 


A 


9243 


377 


293 


PLSPRIiVLSSWAQMILLPLPPKVLGSQA 


9177 


23078 


A 


9244 


422 


147 


KKGGTQGFSPPRVFKRGPRNFFFGPPKK 
KKNPPPPRGKIFFFLKGPPSFFFFFFFF 
YFFFFFFFFFFTHK YFLLGDLLL I YHLS 
KKNYSFF 


9178 


23079 


A 


9245 


402 


208 


TIGRCHHTQLNLKFFVEMGSHYWQAGL 
KLIiDSTNPPSSAFQSAGISGVNPYAWTE 
IMFLYSIPI 


9179 


23080 


A 


9246 


314 


168 


EPYHLHFLPPHHYYYCRWGLIVLPMLVS 
NSWAQVILLPWPPKVQGLQV 


9180 


23081 


A 


9247 


3 


320 


WQAGLKLLSSCDQPASASQSAVITGVN 
HHARPLI FLYEMDYGSRTVLI I RKWDDN 
EQSNRTI FLKKDELTGNRSACLKYTPKD 
INVNSGLGWRYRFGMHHQCRW 


9181 


23082 


A 


9248 


22 


303 


KGSYFSFTYEAQTESCSIARLECRGAIS 
AHCNLRLLDSSNS PASASRVAETTDHKT 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E^GIutamic Acid, F=PhenylaIanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, MNMethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine,R=Arginine, S=Serine> 
T=Thrconine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LVPERSCFTPKERNVIQKGQEESEEMSN 
HLFTWLPILH 


9182 


23083 


A 


9249 


13 


174 


LGLKVRHRLWLI FVLSVETEFHHLGQAG 
LELLTSGNPSAGITGVSHCARPTSHS 


9183 


23084 


A 


9250 


3 


141 


HVGQTVLRLLTSGDPPTSASESAGITGV 
SHRPRQIFISLFTICIR 


9184 


23085 


A 


9251 


56 


262 


LRI PLTFGGWKGEEEDRAVMHLRLCGD 
DSPMSESLHFIKLPRTVLVDEDARPEKK 
KKKKKKKKKKKKK 


9185 


23086 


A 


9252 


401 


275 


PADLFIFIWEDGGLTMSPRLVMNSWSQA 
TLQPWTPKVLGLQV 


9186 


23087 


A 


9253 


411 


258 


PKVFKYRCDPPRLAWFLFFVQSLLPRLG 
LNSWAQAFLPPWPPRVLGLQA 


9187 


23083 


A 


9255 


391 


167 


GLKILTSGDLPASASQSAGNTGMSHCAR 
PLMSFKIRYFLFTIKMKLKLLWPVWSA 
GVHPPHFSCP I VI TNKPRI 


9188 


23089 


A 


9256 


9 


255 


FFCRYGGLAVLPRLVSNSWTQAILSPWP 
LKVLGLQACTHHFFLMVNSNAAR IKT I E 
MLLSEDVNFFVALLCPSFTISHFHL 


9189 


23090 


A 


9257 


487 


42 


RYF I FLF YFI FKTETGS YYVAQARLRLL 
GSSNPPALAFQSARITAVSHRAQLTCIK 
YTFLFTHIP 


9190 


23091 


A 


9258 


1 


243 


ILSSGGSFISLTAVILIIFMIGEAFASK 
RKVLIVSEPS INLEWLYGCPPPYHTFEE 
PQP KKS E I YS KCFFFS VFVHFGG WG 


9191 


23092 


A 


9259 


411 


2 


FFFFSPPVSLSIiCVLESSPPPFFFFSPP 
PKKGFFSPPFFFFSPRVFPPPVFFFPPP 
PFFFFPPLKKNFSPPPPGKI FFFFFRPP 
PFFFFFFFLFFFFFRPFFFFSFFFLRWE 
EVLYFIMCHHISDAYPPNVDMNKE 


9192 


23093 


A 


9260 


417 


219 


SVEMGFLHVGQAS I KLPTSGDPSASGSQ 
SAGITGLSHCARPHSQLSIVCCYSVPVQ 
CLHSLAARY 


9193 


23094 


A 


9261 


408 


172 


CPNHFRKWGHRGCPPPRVFFFVGRDGVY 
YAQAGLELLDLSNS PASASQSNGTTGMS 
HLGQPLLFILNDVIAVIPLQGSF 


9194 


23095 


A 


9262 


401 


132 


HQDNKPAPPGFFFFSPPPKKGFFSKPFY 
FVSPRFFPFPPFLNPAPLFYFWGPKKKI 
FFFPPPAIKFFFFKRAPPLFFFFFFFFF 


9195 


23096 


A 


9263 


393 


220 


LFFLNKIPFFPPGGSKGGDFGPHTNPPP 
GVKPIFRPHPPEEGGPKGPPPRPREFFL 
GF 


9196 


23097 


A 


9264 


206 


438 


FP I LKTHFRLGTVAHTYNPTTLGGPGGR 
NTCSKFKTSLGTWGDPFFHKSENCLGWG 
VPNCGPCFWGAQQKSAWSEGL 


9197 


23098 


A 


9265 


333 


48 


ALLAGPTPFFFLGKKKTRFGNRALFFFF 
FFFLNEVLLCNPGWIMWAIHRCDHGTL 
QPPTPGLKRSSCLSLSSSWNYKLKPLHP 
AGVRFLDFVS 


9198 


23099 


A 


9266 


3 


275 


TGFHH IGQAGLKLIjTLGDP PLSASHSAG 
I TGVSHHAWPMHLES YWRVQYPFMPQVI/ 
YFWAPSGAQHCAQI IENAIPTLLSRPQE 
EPASEA 


9199 


23100 


A 


9267 


49 


226 


HGPE PCMDADAQREG INVSARYPKNWES 
TVDPARE FTM I YSAPLMLLADPDE FAS V 
QLA 


9200 


23101 


A 


9268 


159 


236 


PRLVLISWPQAILPPWPIKWGLQA 


9201 


23102 


A 


9269 


1 


208 


I FNYCVFS SGVAIiAYGI YKQDT..P ALKE K 
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USSN 
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Predicted 

beginning 

nucleotide 
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correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
VV=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














PRGWFVDMGHSAYQVSACAFNRGQLJCV 
S I YFFSRRLLWW 


9202 


23103 


A 


9270 


1 


209 


IFNFCVFSSVVALAYGIYKQDLGALEEK 
PRNWFVDMGHSAYQVSVCAFNRGKLKV 
SI YFFSRRLLWW 


9203 


23104 


A 


9271 


379 


263 


ESCSVGRLECSGAVPAHCNLCLCLPDSS 
GSPASSSRLC 


9204 


23105 


A 


9272 


312 


3 


FRREGLAALQPWWPNKVSS KKKRKKERE 
RKKERKKERKKERKKEDGATADGISEAE 
VIRVHPYPVGFGVLSEGDIWTEDMRAGR 
RPHAGESGDTVMLPQAQE 


9205 


23106 


A 


9273 


405 


246 


PTKKNKYLHAPPLFHFFFIIPNPNFFFF 
FFFFFFNFFFFFKTKGCEFLYFSY 


9206 


23107 


A 


9274 


2 


227 


LVETGFRHIAQAGLQLLSSSYPPTSASQ 
SVGITGVSHQPGPSFHFERKSLGLKTLL 
LCVKVGVRACFALFFQGSS 


9207 


23108 


A 


9275 


70 


304 


NLFLKASTYPSEHLYHLPPTPIVYIRKR 
LDAMAHACNPGTLGDRGGW I ARTPDLKC 
S AYRGL S KCWD YRHE PPRPVLV 


9208 


23109 


A 


9276 


2 


260 


TVF YHVGQAALE LLTSGGP PALAS ES AG 
ITGVSHHAQPFLLILVISLFPCNQSFHI 
SPYRFSFTQLKHGFNTVLYPLVI NISEI 
FF 


9209 


23110 


A 


9277 


406 


330 


PRLVLNSWTQTILLPWPPKVLGLDV 


9210 


23111 


A 


9278 


2 


160 


ELLTSSDPPASASQSARITGMSHHTRLK 
YI VYRVSRKSKNI PLKHIG IQS FP 


9211 


23112 


A 


9279 


1 


91 


GLTVFPRLVWNSWAQMILFPWPPKMLGL 
QV 


9212 


23113 


A 


9280 


3 


107 


LGETGFHHVGRAGLELLNKVMHPPWAPR 
VLGLQA 


9213 


23114 


A 


9281 


336 


234 


AHYKLLLKIKLPGQAQWLTPVIPTLWEA 
SAGRS 


9214 


23115 


A 


9282 


3 


241 


DRPE PCWDADAQREG I NASARYPKNWVT 
TGDPAREFTMIHTAPLMLLADPDEIESV 
QLAQSWPFGAIASLGRSPCRKRN 


9215 


23116 


A 


9283 


404 


277 


IFFVETRSHHIAQAGLELLGSSDPPYFF 
QVFTEMSLSQRALF 


9216 


23117 


A 


9284 


399 


231 


HYVAQASLKLLYSGDPPASAFQSAEIAG 
VSHCTWPTGSLNKIQNKIHSRISSM 


9217 


23118 


A 


9286 


394 


231 


AFFFFFFFGERGFHHVARTVLKLLDSSN 
? P AWSQS AG I TD VS HDFLI SL YR I Y 


9218 


23119 


A 


9287 


420 


311 


LNFCRDGIiAMLFRLVLNS WAQAVLS PQP 
PKVMGLHA 


9219 


23120 


A 


9288 


398 


173 


LLSFFYLFFYFFFFILFFLFFFFFFFFF 
LMFKALLFSCHYFPAMVTFKFSLS ISGV 
VRTSGYIMLFSYLCLLKTM 


9220 


23121 


A 


9289 


270 


23 


TPEQCRG I KLFI YSLILYVRDGVLLCCS 
GWPWAIHKCDHSTLQHQIPSLKQASCL 
SLLSSWEYRYAPLHLASANFLKVTW 


9221 


23122 


A 


9290 


273 


46 


HLLNMPLFTEIWTERLFLFYILCFVFLV 
ETGFFYITQAGLKLLTSSDPPAYASQSA 
E I IGTSHCTRPICYVNVTNI 


9222 


23123 


A 


9291 


424 


230 


CLPFTHSLSLSLSLSLSHTHTHTHTHTH 
TIYMNK 


9223 


23124 


A 


9292 


2 


146 


WLFLVETRSHHVARAS FKLLSSSWPPTS 
ASQSSEITGASHRTWPEVFF 


9224 | 23125 


A 


9293 


393 


202 


GRERERKREREREKERKXERKKERKPER 



1104 



WO 01/64835 



PCT/US01/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQJD 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 
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Predict- 
ed end 
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ding to 
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residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














PSGSLTYCVREGITAANLGKRRHKASKT 
LISHKNKN 


9225 


23126 


A 


9294 


252 


193 


GFCCCCCCCYCCCSDKRHPG 


9226 


23127 


A 


9295 


409 


269 


SQYFGRLRRVDCLSSGVQDQPGQQGETP 
SLLKYK I Y I HI S V7AG WHV 


9227 


23128 


A 


9296 


2 


213 


FCCGLLIHLIYCKTEYSGQSTVAIHRCN 
HSVLQPQTLGFKQFSHLSLLSSWDYRYS 
PLCPAYIFNLSIGY 


9228 


23129 


A* 


9297 


169 


408 


RFARHMNSPHEERLPPPAGIRVRPPPQP 
A^SLGPPRMAPSISRRFQMTPYYKHNDF 
AGQAQWLMPWPALWEAEAGGSSE 


9229 


23130 


A 


9298 


199 


51 


LHPP PPSLSLSLSHTHTHTHSHTHTHTH 
TLWLGRPGPAQDWLVWGNLT 


9230 


23131 


A 


9299 


1 


176 


MGSHSLAQARMQWCNQSSLQPQPPGLKQ 
PSPFSLPSSWDYRHVPPCLAS IFFLKLN 
YH 


9231 


23132 


A 


9300 


412 


209 


MGFHHVGQSGLELLiTSGDPPTSASQSAG 
ITGVSHRNWPHCGFNLVFLMPDDVKHLF 
MCILAICVSSL 


9232 


23133 


A 


9301 


195 


347 


KIKGFRPGATAKAGNPSALGHQGGRTAI 
AWEFT I S MSNMGRP CLYRKY KN 


9233 


23134 


A 


9302 


386 


190 


GFLFFFPPPKKEVFPHPFILGFPGFFPP 
PPFFKRAPEFFFWGPKKKKNSPPPRALN 
FFFFKGAPP 


9234 


23135 


A 


9303 


27 


173 


S FQLWGHLKVPR I RP KRADKLRSGVQDQ 
PDQHGKTLSLLKKYKISRAW 


9235 


23136 


A 


9304 


374 


121 


QWCDHGSLQPQTPGLKQSSCFSLPKRWD 
YRREPPHPTSLSPYIVAWFCTLCPPPLS 
ALWKHLIHQKCDGVMVLQVPGRTHNVTL 
T 


9236 


23137 


A 


9305 


70 


196 


PQVIKPGQAGLELLTSGDLPASASKSAG 
ITGVSHHAQPISHF 


9237 


23138 


A 


9306 


378 


256 


QKWVS I CCPGWSQTFGLQES S CLGIi PKS 
WDYRRQPQPCLTL 


9238 


23139 


A 


9307 


206 


3 


PCHLGWSAWQSWLTAASNSWPHAILPP 
QPPKVLGLQASLKFLKAGITQVTFSDYN 
KILNKEIKRGR 


9239 


23140 


A 


9308 


338 


16! 


HFKKKIYIYIYVYIHTYIYIYTHIHIYT 
HTYTYIYRYIHIYIHTYIYMYVYIYIPS 
LPF 


9240 


23141 


A 


9309 


503 


288 


FFLNLDKSQEWWLMPVI PTLREAEAGGS 
LEVRSSRPASSDPPTSASQSAGIIGMRH 
HARPWLRENGRIQN 


9241 


23142 


A 


9310 


418 


309 


FFCKNGISICCLPKLVSNSWPQAILQPQ 
PPKVQGTL 


9242 


23143 


A 


9311 


442 


275 


VSPPPLGGGFLSPPFKPPPFFFNLKGGP 
PFPTIPFLIIKGKKFCPPPPFFFPGLF 


9243 


23144 


A 


9312 


2 


95 


HENRLSLGGGGCSELRLCHCTPAWATER 
DFI 


9244 


23145 


A 


9313 


318 


154 


FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFI 
FILFIYSLLYWCVGVLGS 


9245 


23146 


A 


9314 


388 


199 


EQPKCVHR I KQIWH I CALE Y YAAFKKKK 
VLPYAVIQIKLEDTMLSEINQSWGKYCM 
IPLIGGI 


9246 


23147 


A 


9315 


2 


97 


STPGLKRSSHISLPKYWDYRRGPPGPAS 
ILF 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, HHHistidine, l=Iso!eucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V= Valine, 
W=Tryptophan, V=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possib)e 
nucleotide insertion 


9247 


23148 


A 


9316 


3 


235 


TMWWAEGI P I RTTLDNS TTVQ YACLLH 
HLTMKAKSTVRDIDPQNDLTPLRIRSKK 
HEIMVAPDKEYLLIVIQNPCE 


9248 


23149 


A 


9317 


253 


71 


LFFFFLFFVDTKFHHLAQDGFQLLSSSN 
SPTSASQSGGITGRSHQAWPYVHLLTLQ 
KLNY 


9249 


23150 


A 


9318 


425 


95 


CLLFCPKKKKKKKKKKKKKKKKKKKKKK 
KKKGG P I KKKKKQ P 


9250 


23151 


A 


9319 


25 


214 


VRIDDRPEPCMDADAQREGINSSARYPK 
NWVTTGDPARECGMIHSAPLMLIADPDE 
FVSVQLA 


9251 


23152 


A 


9320 


50 . 


231 


GHRPEPGRDADAQREGI IASARYPKNWV 
TTGD P AR2 CTR I QS APLM LLAD PDE FVS 
VQLA 


9252 


23153 


A 


9321 


401 


226 


QWLFTNMIIGHCSLELLGPSNPPISASR 
IAATTGTRHHAQPSKFRNLKGWLCAKRA 
RT 


9253 


23154 


A 


9322 


410 


159 


LGSLQLPPPGSKRFSRLSLPSSWDYRGC 
SELRLRHCTPAWVTEQDSVPPKKKKQI F 
RPFPFSYGESLGVGLRHLCFSKPSRRF 


9254 


23155 


A 


9323 


310 


25 


KWGGLFFFSPPKKRGFFKQVFFCVPPVF 
FFPPFFKTGPGDIFLGPLKKKIIFPPPG 
GKKFFFKRGPPPFFFFFFFFFFFFFFFF 
FSEVECVCLCR 


9255 


23156 


A 


9324 


406 


289 


PGFYHAGQAGLKLLTSGNPSASASQSAR 
ITSISQRTRPH 


9256 ' 


23157 


A 


9325 


-j 


116 


LHSVLLPFFYFCVCVCVCVCVCVCVKHK 
YVRAGGAPGR 


9257 


23158 


A 


9326 


2 


158 


F I FLFT VGGUTG I VLANSS LD I VLHDT Y 
YWAHFHYVIiS IGAVFAI IGGFIH 


9258 


23159 


A 


9327 


3 


107 


RGCS E PRLRHCTPAWVTEGDCVS KKKKK 
VPEAWG 


9259 


23160 


A 


9328 


257 


2 


GFFSNISASVIIPPPISRKNLFFPFRVY 
FGPPPKTFKGPPPFFFFFFLS PRLECSG 
VIMAQCSLTLPGSSDLPLSASCVAMTVS 
I 


9260 


23161 


A 


9329 


66 


384 


GNLCGWMKSMAWDNPWRKADWGQIRES 
LECQPNTGMLKKKKKKKKKKKKKKKKKK 
KRGGPFLKISWGGPILGGRGKIFFFFFG 
GGILTPRGVFWKSPFFW3GKKK 


9261 


23162 


A 


9330 . 


1 


98 


IIGQVASVLYFTTILILIPTISLIENKI 
LKWA 


9262 


23163 


A 


9331 


33 


412 


NLFYVMAEILHTHIQLCCQVKLHLSVPR 
PSLPLLCCWNFSLVLHEHSQPASEKKKK 
KKKKNKKKKKKKKKKRG3VFKKKPGGAP 
FCGGGENFFFFFFGGGLKKPRGVFGKNT 
FFWGGEKRPPPPPY 


9263 


23164 


A 


9332 


401 


311 


PTRPVDLLVDDSVKESLIGYGPTEDSKE 
FS 


9264 


23165 


A 


9333 


5730 


452 


TRSAPRAASWLEDPREVRSVCLSATFFT 
LSLQKPRLliLFSPSWHLGVPLSVGVQL 
QDVPRGQWKGSVFLRNPSRKNVPCSPK 
VDFTLSSERDFALLSLQVPLKDAKSCGL 
HQLLRGPEVQLVAHSPWLKDSLSRTTNI 
QGINLLFSSRRGHLFLQTDQPIYNPGQR 
VRYRVFALDQKMR PS TDTITVMVENSHG 
LR VRKKEVYMPSS I FQDDFVI PD I SEPG 
TWKISARFSDGLESNSSTQFEVKKYVLP 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 
C=Cysteine, D=Aspartic Acid, 
E-Glutamic Acid, F=PhenylaIanine, 
G=Glycine, H=Histidine, I=lso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glu famine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unkno\vn, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














NFEVKITPGKPYILTVPGHLDEMQLDIQ 
ARYIYGKPVQGVAYVRFGLLDEDGKKTF 
FRGLESQTKLVNGQSHI SLS KAEFQDAL 
EKLNMGITDLQGLRLYVAAAIIEYPGGE 
MEEAELTSWYFVSSPFSLDLSKTKRHLV 
PGAPFLLQALVREMSGSPASGIPVKVSA 
TVSSPGSVPEVQDIQQNTDGSGQVSIPI 
IlPQTISELQIiSVSAGSPHPAIARLTVA 
APPSGGPGFLSIERPDSRPPRVGDTLNL 
NLRAVGSGATFSHYYYMILSRGQIVFMN 
REPKRTLTSVSVFVDHHLAPSFYFVAFY 
YHGDHPVANSLRVDVQAGACEGKLELSV 
DGAKQYRNGESVKLHLETDSLALVALGA 
LDTALYAAGSKSHKPLNMGKVFEAMNSY 
DLGCGPGGGDSALQVFQAAGLAFSDGDQ 
WTLSRKRLSCPKEKTTRKKRNVNFQKAI 
NEKLGQYASPTAKRCCQDGVTRhPKMRS 
CEQRAARVQQLDCREPFLSCCQFAESLR 
KKSRDKGQAGLQRALE I LQEEDL I DEDD 
IPVRSFFPEKWLWRVETVDRFQILTLWIi 
PDSLTTWEIHGLSLSKTKGLCVATPVQL 
RVFREFHLHLRLPMSVRRFEQLELRPVL 
YNYLDKNLTVS VHVS P VEGLCLAGGGGL 
AQQ VL VPAGS ARP VAFSWPTAAAAVS L 
KWARGS FEFPVGDAVSKVLQ I E KEGAI 
HREELVYELNPLDHRGRTLE I PGNSDPN 
MIPDGDFNSYVRVTASDPLDTLGSEGAL 
S PGGVAS LIjRLPRGCGEQTM I YLAPTLA 

asryldkteqwstlppetkdhavdliqk 
gymriqqfrkadgsyaawlsrdsstwlt 
afvlkvlslaqeqvggspeklqetsnwl 
lsqqqadgsfqdpcpvldrsmqgglvgn 
detvaltafvtialhhglavfqdegaep 
lkqrveas i s kans flgekasagllgah 
aaaitayaltltkapvdllgvahnnlma 
maqetgdnlywgsvtgsqsnavsptpap 
rnpsdpmpqapalwiettayallhlllh 
egkaemadqaaawltrqgsfqggfrstq 

DTVIALDALSAYW I ASHTTEERGLNVTL 
SSTGRNGFKSHALQLNNRQI RGLEEELQ 
FSLGSKINVKVGGNSKGTLKVLRTYNVL 
DMKNTTCQDLQ I EVTVKGHYE YTMEANE 
DYEDYEYDELPAKDDPDAPLQPVTPLQL 
FEGRRNRRRREAPKWEEQESRVHYTVC 
IWRNGKVGLSGMAIADVTLLSGFHALRA 
DLEKLTSLSDRYVSHFETEGPHVLLYFD 
SVPTSRECVGFEAVQEVPVGLVQPASAT 
LYDYYNPERRCSVFYGAPSKSRLLATLC 
SAEVCQCAEGKCPRQRRALERGLQDEDG 
YRMKFACYYPRVEYGFQVKVLREDSRAA 
FRLFETKITQVLHFTKDVKAAANQMRNF 
LVRASCRLRLEPGKEYLIMGMGLDGATY 
DLEGHPQYLLDSNSWIEEMPSERLCRST 
RQRAACAQLNDFLQEYGTQGCQV 


9265 


23166 


A 


9334 


46 


324 


SKYLPPVITFQHEIWVGTNIQTISDWVL 
ICCPGWSAVTQSRJaAAASTSWAQAMSSS 
SCLGLSSS CYYRHTPPCIoANFVAFVETK 
LYTMLPGLS 
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SEQ 10 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoieucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X^Un known, *-Stop codon, /=possib!e 
nucleotide deletion, Y=possible 
nucleotide insertion 


9266 


23167 


A 


9335 


338 


178 


EVDIAVSHRRKQAFNLCLLKCKPHNNCL 
NLEGGGS S ELKLCYC IPAWATDQSL 


9267 


23168 


A 


9336 


396 


254 


KLSTDCTSKEI INRHPTEWEKIFANYAP 
NKGPTSTIHSSCLMFPYLF 


9268 


23169 


A 


9337 


454 


359 


SSWDYRHAPPCPANFVFLVETI FIQGEI 
GLL 


9269 


23170 


A 


9338 


540 


328 


IKKRGAPRRGVLFFFTRKKKFFSYYKKS 
TKERVS FLFVAPGCI QGGKKKKPPTKKK 
KKKKKKKKKKKKKKK 


9270 


23171 


A 


9339 


21 


209 


AEE FPDFKLYYKANQNTYRGQWNRTETS 
EITPDIYLGIQLVSNSWPQVTLIiPWPPK 
VLGLQA 


9271 


23172 


A 


9340 


345 


2 


KIFFFLIYILFFFNFFFFFFFFFFFFFF 
FFFFFLENRWFLMENFIKGRGHVKGPLG 
QARDHDI SGRRWWRREQCRASQLPGVRR 
S LLQVQAGGPVH I R I SGLQDARQGPRVA 
GA 


9272 


23173 


A 


9341 


35 


252 


LERVIRQSAALTDSFFNFILLILFIYLF 
FETGS PCVSQAGLKLLGSRDPPVSASQS 
AR 1 1 GVSHRAQ P ALTDFP S I S F 


9273 


23174 


A 


9342 


484 


276 


RHAPPCQANFFVQTRFCHVAQSGLKLIR 
SRYLFASAFQSVRITCMSHCARPLYFLM 
FKRLQYLKSPTKS 


9274 


23175 


A 


9343 


464 


326 


LVETG PHHVGQAG PELLAP CD P PAPAS Q 
SAETTGMSHRARPPANFV 


9275 


23176 


A 


9344 


486 


280 


SFSHSHSCSLPSSWDYRCTSSHLANFFF 
KRWGFTILPRLILNSWPQVILLPQSHRH 
EPLCLETLKTLNV 


9276 


23177 


A 


9345 


276 


179 


DRDFTMLPRLVLNSWPKTILLPL.PPKIL 
RLQS 


9277 


23178 


A 


9346 


45 


143 


RWNSRQENCLNLGGGDCSEPSLCHCPPA 
WATE 


9278 


23179 


A 


9347 


229 


73 


PPPHMIFFFFFFLFFFYFFFFFFFFFFF 
FFFFFFFFSRLQVSVYCSSVLNKNVD 


9279 


23180 


A 


9348 


391 


139 


I PSLKRSSHLSLAKCWDYRHKPPCQVTF 
S FLNVR I YATKFFLS TPLAAWS KYWYLV 
FLLS FSS KYYLTPI S FLFLTHNVSSSI F 


9280 


23181 


A 


9349 


53 


191 


IYVTYIWYSEVSHAYNSSTLGGQGGRIT 
GAQEFKTSLVNIAKPLLY 


9281 


23182 


A 


9350 


373 


235 


EEGELLEPGVQGCSELSWCHCTPAWGTE 
QDHCVSYNNSNTHNNVNY 


9282 


23183 


A 


9351 


476 


82 


YLATYLPTYLSIYLPTYLPIYLPIYLYL 
GSHY IAQAGLKFLGWVGHNGSCL 


9283 


23)84 


A 


9352 


286 


142 


ELQFKMRFGWRHSQT I S WGLSLLPRLI S 
NSWAQVILLTOFPKVLGIQA 


9284 


23185 


A 


9353 


3 


102 


FFETESHSVARLEYSGAISAHCTPVIKL 
YLYT 


9285 


23186 


A 


9354 


3 


270 


TAGAPPRPANFCI FSRDGVSPCWPGWSR 
TPGLRCDPPASASQLLGRLRHENCLNLG 
GEGCTGAIIiAHCTSTLRAQAILQPQPSC 
PAPQ 


9286 


23187 


A 


9355 


1 


112 


GEGSAHCNLCLPGSSNSPASTSQVAGIK 
GVSHRARPN 


9287 


23188 


A 


9356 


1 


63 


VALIQAYVFTLLVS LYLHDNP 


9288 


23189 


A 


9357 


244 


140 


QRRHTHTHSHT YTHTHTHI PNKK I PILT 
ItiKCIA 


9289 


23190 


A 


9358 


464 


181 


LGAGGMTG I h FSLG AS M VLGGVAQMLAP 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 

NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E~GIutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V= Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, ^possible 
nucleotide insertion 














KARTRRIQTTDNGKQNTYFSSLDNMVAQ 
GNVLPVLYGEMRVGSRWSQEISTADEG 
DGGQWVIGR 


9290 


23191 


A 


9359 


11 


469 


IDLRCRFRHEERERERERERERERERER 
EREREREREREREREGRAP PP PLFFGGG 
GPPARAHALSPPHLGRPPLYTYFFSYVR 
AP PTERSARTQREARRALKSVCWRERE 
THSLPRARCVCSHPTGCGGAPGARVSLS 
QSAHCFLCVCGAG 


9291 


23192 


A 


9360 


22 


172 


GLSHEERERERERERERERERERERDTV 
CPLFCPTGGFILREARVKSGCLCF 


9292 


23193 


A 


9361 


15 


345 


PPNRPPAEFRTRHLISSLLCLEGIILSL 
FI I ATL I TLNTHS LLANI VP IAI LVFAA 
CEAAVGIiALLVS I SNTYGLDYVHNQKKK 
KKTWGGGRAPNS PYGGRNNNPRPAV 


9293 


23194 


A 


9362 


61 


298 


SCDSPLQECCIHTHTHTHTHTHTHTHTH 
LFSDIKTHTWREWALSLFLLTEEEGGGT 
LFQGTLI FGHTLFWDKPLWQKEF 


9294 


23195 


A 


9363 


430 


2 


GGAGPPPPCWTQGPGVPPPTLSPRVNGP 
SLCSDFPSPKFYKEKRKKKGNKTPKRKG 
KTLLPTET I VLHVS PG CES KPGAGSADR 
RAPGVRDGGS DAS S S GTRAS E LGDG PVR 
VNFVLGDPGEAGRGFSMLGHPPGTSGPR 
TS 


9295 


231% 


A 


9364 


432 


291 


GITGLRHHSQLIFLFLVETGFHHVGQAG 
LELLIHQHRPSKVLRLQV 


9296 


23197 


A 


9365 


343 


507 • 


GQGLTIiVPQVGVQWGNLSSLQLLPPGLK 
QSTCLSLPKCWDYKCGLPPPASFCFVL 


9297 


23198 


A 


9366 


417 


128 


MGWWPGRGAP PFLDEMAAGQRRS SLSRV 
GSQAEGLLTSHTMGGPAETLLTSQTGWR 
LGRGCNLGSLEGQGRRLGGGGCNEPSSR 
HCTPAWAPLSTE 


9298 


23199 


A 


9367 


500 


285 


SLPRSWDYRRAPPRPANFLFLVEMGFHH 
VGKAVLELLTSGVELLTSGDLPASASQS 
AGITGVSHRAWQIRE 


9299 


23200 


A 


9368 


1 


229 


I VDLTT WG I PTHT I HKYTHH I HKY I HTH 
AYTHHTHRHKHI YT Y I HTDMVWLS PHPN 
LILNFSSHNPHVSWEGPNGR 


9300 


23201 


A 


9369 


451 


88 


KQTNI YTC I YM YVY I YTCI YTYVY I YTR 
TYTCIYVCVYTCIYVCVYTCIYVCVYTC 
IYVCVYTCIYVCVYIRVYTCVCIYVYIR 
VCVYTCIYVYVYIRVYTYICIYTCISTY 
I QCI YTHI YKVTKCSG 


9301 


23202 


A 


9370 


1 


208 


ADETIEQRGGIPKFVHDNYVIRRTAVDR 
G I PLLTNFQETKLFAEAVQKSRKVDSKS 
LFHYRQYSAGKAA 


9302 


23203 


A 


9371 


354 


202 


PAAQREG I TAS AR YS KN WVTPGDPAREF 
TM I QS APLMLLADPDE FVS VQLA 


9303 


23204 


A 


9372 


1 


336 


KSLIKYKGYQVAPAELESILLQHPNIFD 
AGVAGLPDDDAGELPAAVVVLEHGKTMT 
EKE I VDYVASQVTTAKKLRGGWFVDEV 
PKGLTGKLDARKIRE I L I KAKKGGKS KL 


9304 


23205 


A 


9373 


2 


187 


NILVTALYSLYIFTTTQWGSLTHHINNI 
KPSFTRENTLMFIHLSPILLLSLNPDII 
TGFSS 


9305 


23206 


A 


9374 


96 


240 


ACWLMPVI PALSEAEAGRLVQPRRLRAA 
WATWQDLI STKKKKKGGPF 


9306 


23207 


A 


9375 


3 


293 | STI SAFTTS TTS ATTTS TTSATTTST I S 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Meth ion ine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V^Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














APTTSTTLS PTTSTTSTTITSTTSAPI S 
SNKLHKKKKKKKKQKKKKKRKNDDTKQD 
TITKlHSGGGSh 


9307 


23208 


A 


9376 


69 


160 


EHTHTHTHTHSHTHTHSHLPERFFCVLN 
SL 


9308 


23209 


A 


9377 


336 


199 


EGITASARYSKNWVTTGDPAREFPMIQS 
APLMLLADPDEFVSVQLA 


9309 


23210 


A 


9378 


321 


128 


EPGFTMWAS LAPTS S LQVIKRLGLPKCW 
DHRREPPHPAYHSLYESGPSVQNGEEKG 
INPIIKMG 


9310 


23211 


A 


9379 


440 


286 


SSDPSASASKSAGFTGMSHHTQLLTSYS 
EGECLSYLVFLVLV 


9311 


23212 


A 


9380 


392 


154 


KGSSPLSPPISLAGGKPPPPRKFFLNFF 
FFFNRDRVSPCLPRLVSNSWPQAILPPQ 
PHKALGLQAYATTAS FI FLI VTV 


9312 


23213 


A 


9381 


403 


178 


GRRHEPCRDADAHLEGI IFSAFYPKNWE 
TTGDGAREWTMSNSAPLMLLADPDEFVS 
VQLA 


9313 


23214 


A 


9382 


303 


206 


DPVSTKICSLQPPPPGFKRFSCLSLPSS 
WDHR 


9314 


23215 


A 


9383 


215 


116 


SNNFWCCCCCCCCCCLRWSLTLSPRLH 
SCGTM 


9315 


23216 


A 


9384 


2 


285 


ARHLLMQLSGGATLAI S TIDLPSTIil I F 
TLLI LLT I IjE IAVAL IQAYD ITLLGSL Y 
LGDNTYKKKKRGAAFEETKFTSAGMQS K 
NFFLGTLNWR 


9316 


23217 


A 


9385 


12 


123 


G YEN FLLPTLHTQTHAHTHTHTHTHTH I 
HSKDPMEY 


9317 


23218 


A 


9386 


3 


230 


KCWGYRCEPLRLAPLRCFFTPGGISLTS 
GPALLSNASALAEVRGQQLCPPTPFLPT 
LIALTTLLLPISPFILIIL 


9318 


23219 


A 


9387 


590 


396 


ARVQWPDliGSLQPPPPGFKRFSCLSLLN 
S WDYKCAP PCRADFC I FSRDGVLS CWPG 
WSQTPYLK 


9319 


23220 


A 


9388 


224 


79 


GLG PVGHACP TLWEGEAGGSRGQE I EAL 
PANTVKPRFLKRKKKGKKKG 


9320 


23221 


A 


9389 


249 


72 


LVYLI TVKKI LFPVH I RTFNKI DHMIGH 
KTCIiNKLQMTEILQSVFSDYNRI KLEAS 
NN 


9321 


23222 


A 


9390 


400 


254 


LRQPGSTNS PVSASRVAGITGTHHHTQI 
I FVFLVEMWHACNPS YSGG 


9322 


23223 


A 


9391 


3 


64 


PPQKSDPPYKLVPVHLDSIL 


9323 


23224 


A 


9392 


266 


106 


LTANFLFLVETGFPHVTRVGLELLASSD 
LPTLASQRAGITGVNHLQNFWPGNF 


9324 


23225 


A 


9393 


430 


348 


FGKPQPPPKGKSPPPRGNFPPQKAPKG 


9325 


23226 


A 


9394 


4S1 


311 


LSLSEKRSIRVGVTRFSRCCPSPLSLTG 
KGNSLTPGVSQEIHSPPWFGIAYSVCTH 


9326 


23227 


A 


9395 


3 


136 


VFPRLVLNSWAQMILLCWPPKVLGLQAC 
TTRPSHFLPLLNSKYK 


9327 


23228 


A 


9396 


143 


347 


NFVFLVBTRFHHVGQAGLTLLTSSDPAS 
QSAGITRVRPEILTLKRYHFPSGILGTE 
GMVIEMAYLLKV 


9328 


23229 


A 


9397 


2 


231 


LELLTSSDLPASASQSAGITRGSHHARP 
SFTFRCVIHLEFNFMRGAREERLHFYCR 
VJV I RRLTTFGRGYPS PAAS D 


9329 


23230 


A 


9398 


383 


200 


QH YLNRRLS LAQWWM PWPATQEAEAGG 
LLE PR S S S LQCAMMLP VN SHCTLAWATH 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspa rtic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=GIycine, H=Histidine, I=Isoleucine, 
K=Lysine, LHLeucine, M-Methionine, 
N=Asparagine, P^Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaJine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \-possibJe 
nucleotide insertion 














REPHL 


9330 


23231 


A 


9399 


3 


416 


GFLHVGQAGLKLLTSGDLPTSASQSAGI 
IGVSHRAGCLKS I YLTFPTRKFFVQMS I 
CDFKVEIEGKIGFISVPISLQSILHIVN 
NFFFFLETGSFPPGWRAWEKFGLIAPLA 
SGVKGIFLPLPRENLGWRPSPIPPA 


933J 


23232 


A 


9400 


400 


151 


FGYPQVFPLPPFLKPGFGIFFLGPLKXN 
FPFPPPGLKIFSFKRAPPFFKFVRKGLI 
MLLRLVSNSWAQVI IPPPPPEVLGLQA 


9332 


23233 


A 


9401 


324 


154 


FFFFFFKXXXXXLLFFFFFFFLFYIFFF 
FFFXFXFXFFFFFFFFFFFFFFFFFFFF 
F 


9333 


23234 


A 


9402 


413 


312 


PKLVLNSWLQALLLPRPPKVLGLQARAT 
ITQPK 


9334 


23235 


A 


9403 


363 


3 


FKKKNFCLGGAGWFFFF YDPPPKKNI F 
YFPSKKIYFLSPASSSFFFFLCFPFFIF 
FFFFFFFFFFLENRWFLMENFIKGRGHV 
KGPLGQARDHDISGRRWWRREQCRASQL 
PGVRPRV 


9335 


23236 


A 


9404 


401 


82 


selphdp inpttg yi ypkkmks vcftd i 
ctpmfttalfs iakiwnqpkclskriti 
fmflvtysqlfcthypilpypffpapls 
sshnyhytlylyenilldstye: 


9336 


23237 


A 


9405 


382 


219 


WARWHGFLVPTSPGGEAGGSLEPGKRRV 
QLTQISPLPSSLGGKVGLPLKKKKNN 


9337 


23238 


A 


9406 


3 


195 


YLYSLVETGFHHVGQAGLELLI SRDPSA 
LASESAEITGMSHSIQPKSAFYIQIPET 
FWQPAFK 


9338 


23239 


A 


9407 


371 


1 


FRRLRLKNRLNLGNGGCSE PRLCHCI PA 
WETRAKLGRKKKEERKREKGTKMYNKMY 
KMYLELNIjTCPEPASPKOMTRQIiEVLSS 
NPSPGTFQFICKKQFKFSCCFALRVSIY 
LLMSVFEYCSC 


9339 


23240 


A 


9408 


326 


229 


RQGLTMLPRLVSNSWAQVILPPWPPKVL 
GLQA 


9340 


23241 


A 


9409 


81 


231 


LESASFPPGKVAHTCNPNTLRGCAGWIA 
WAQKFKTSLGNMAKPCLYKIK 


9341 


23242 


A 


9410 


367 


256 


HPGKEWWLVSVI PTLREAKVGRSLEPRS 
LRLAWDTWG 


9342 


23243 


A 


9411 


2 


192 


VFLVEMGFLHVGQAGLELPISGDPPASQ 
SAG I TGVSHHAQALTFLNI PINPSHPPK 
ACCSKQL 


9343 


23244 


A 


9412 


3 


83 


KNCLNLGGGGCSELRLCHCTLTWATE 


9344 


23245 


A 


9413 


371 


301 


EDHPRMV I LRLLRY I VRLVWRMH 


9345 


23246 


A 


9414 


486 


337 


AGLELLGSSNLPALTSHIAEITGMCHHT 
WLAYI FVCAFGWVLLLDLCLAF 


9346 


23247 


A 


9415 


378 


254 


IVAKCSLDLLGSSDPLTSASRVAGPTGM 
CHHVWLIWFFVC 


9347 


23248 


A 


9416 


160 


309 


WRYSSNEFLLFLLFYLFKSNFTVFCCC 
YCCCCCCWRQSLTLPPKLECGV 


9348 


23249 


A 


9417 


162 


339 


RPSSFCEFLHDSVPELALPMKARFEKSR 
RSQVSWLMPVIPTFWEAKVGRYQARSLI 
PA 


9349 


23250 


A 


9418 


1 


147 


GGGGCSEPRSCHCTPAWATRVKLGLFKK 
KKKINFLF VPPF I FLKS KTGV 


9350 


23251 


A 


9419 


283 


126 


KMMYKGQAQWL I P I I PAFWEAKERGLLE 
VRSLRPAWATEQDSTATKNKNKNF 



1111 



WO 01/64835 



PCT/US01/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E^GIutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, MNMethionine, 
N=As para gin e, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine> V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 


9351 


23252 


A 


9420 


133 


3 


ELMFS WAWWIMPVI PALWEAEAGGSRGQ 
EFKTSLPNMVKPRLN • 


9352 


23253 


A 


9421 


3 


133 


LRGLYSGAGI IRFEAKKWNALALWAWDI 
AGDNCAICRNHIMDL 


9353 


23254 


A 


9422 


409 


291 


ILILGGGSCSELRSCHCTPAWATRAKHR 
LKNKSPINVY 


9354 


23255 


A 


9423 


1 


210 


RFFHLGLSKHWDYRCESPHAGTRNEVLI 
YATIWMNSENIVPNEI IQTQRTNI VS FY 
LYQVGRIGKFIDRR 


9355 


23256 


A 


9424 


479 


306 


GVPTNQPWSNRVKTPSPTKNI KI SWPWW 
CMPCNPSCSEGQGRRIALNPGRWRLQ 


9356 


23257 


A 


9425 


1 


209 


GFFLFQGGGEHAGQAGLELLTSGDLPAS 
ASQS AG I TGAGPNQG I FVALES PMESAA 
LLHLHWLSSPCS 


9357 


23258 


A 


9426 


628 


433 


RQGVAHLPGMECSGTITTHCSLELLGSS 
DPPASASQSSRLAWSHCAQPSLLFSVL 
CLKPIFCCA 


9358 


23259 


A 


9427 


417 


125 


FLRQSLTSSPRLEYNGAIPAHCNLCLPG 
SSDSSILASQSIRITGVNECTWPDAAVL 
RQNFFFSEKPQFYSEGFQQTHI I KSNLL 
YLKSTKCTCYPYL 


9359 


23260 


A 


9428 


366 


256 


SFFYRDRSLATLPRLGSNAWSQAILPPG 
PPKVLRLLA 


9360 


23261 


A 


9429 


3 


140 


GFCHVGQAALKLL I S S DLPTS PSQSAG I 
TGVSHCARLTRLFS SHE 


9361 


23262 


A 


9430 


368 


3 


VRASQLLCLPTQASAMAGTPPPASLPPC 
SSISDCCASSEQGSMGVGPSEPGAGYNL 
WCSSLRTLEKCS IRVGVPQFSRCHLSQ 
LPLARKGNSPTPCASWVRRCLTLLQLML 
RGLYPLSCT 


9362 


23263 


A 


9431 


3 


13J 


LTMLPRLVSNSWAQGSSHLSPPKCWDYR 
CEPLHSGLKDIFLE 


9363 


23264 


A 


9432 


384 


274 


LSFAMLPRLVLNS WAQESAHLGLP KCWD 
YRHEPLGLA 


9364 


23265 


A 


9433 


431 


273 


PPPLGYFFIVKMEFPPVVQDGFKLLNSS 
DCSTSAS PSGRI SGVNHSTRPR I F 


9365 


23266 


A 


9434 


211 


473 


CVCTFFSFLSMEI FFLFLVGEAESHCVT 
EAGVQWHDLGSWQPPPPRLKRFSCLSLP 
SSWDNRRVPPYPANFCIFSRTRVWSCWP 
GWS 


9366 


23267 


A 


9435 


391 


188 


LPGSSDSPASASRVAGTTGGCRHTVLLG 
FEFKDNIRWIKLVEKAHTQYGNIYICVI 
KYMPLFYPYHH 


9367 


23268 


A ; 


9436 


331 


115 


SRDRGFAMLPRLVLNS WTQAI CSLLDSQ 
SAHLS LPKCWNYRRE PLCPASGE I LS F I 
KPSSSKSFHNSTKKRQ 


9363 


23269 


A 


9437 


3 


109 


IYGSYSLPACFFPSFLPSFLPSFLPSFP 
PFFLSFF 


9369 


23270 


A 


9438 


2 


167 


E CNGAI SGHCNLCLS GS S DS PAPAWATR 
VKLRLIKKKKKKKNQILFFYPPRKTLF 


9370 


23271 


A 


9439 


648 


310 


VCPCILRHPAFLFAKSGNPLESSLLDHS 
SPrFKLTDRENLRHVKCTSHSQSLPTSG 
FQPGLPDSTLCAW YPTRS PKLS PAGWVG 
G YRAPVGRRGPGS RNS WRLHWT P I TS VG 
D 


9371 


23272 


A 


9440 


1 


170 


YEIWFLDVGQAGLELLTSGDLSASGSQS 
AGTTGMSHHARPLKLFTISVSSNKKVKC 


9372 


23273 


A 


9441 


133 


244 


L FG YQTRGLAMLLKLDSN S W PQAVL P PQ 
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SEQ 10 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E- Glutamic Acid, F==Phenylalanine, 
G=Glycine, H=Histidine, l=lsoleucine, 
K=Lysine, L-Leucine, M=Methionine, 
N=Asparagine, PHProline, 
0=f»luf amine R=Ar< J ininp S=Sprinp 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \— possible 
nucleotide insertion 














PPKVLGLLA 


9373 


23274 


A 


9442 


365 


176 


RGSMDMVPSESCTGYNLLMCPLLRPLEK 
RSIRVGVTRFSRCHLSQLPLARKGNSLT 
PCASQVR 


9374 


23275 


A 


9443 


368 


231 


VEAGFCHIAQAGLTFLSSTDPSALASRS 
ARITGVSHHARPI FLKE 


9375 


23276 


A 


9444 


88 


194 


KMLEAKKKIEQQGGFTFENKGVLSAFNF 
GTVPSNN 


9376 


23277 


A 


9445 


3 


349 


I RS G S YAC YFC Y PRE CYAL FH C YSM I VS 
YGPYYFMNIDQRHESRLSTIYIFKFEIG 
SKKVAQAGLELLCSSSPPAFTSQSAGIT 
SVSHHTRPQKIFNVIYSCDLHFHPGIEG 
PQR 


9377 


23278 


A 


9446 


3 


141 


HAECSSMMIAKCELELLGSTDPPASASQ 
SAETTGISYHAQPYSTF 


9378 


23279 


A 


9447 






ICPTRSRFPPRVLKNGRKKDRLHCSGAT 
SANCSLCLPGSSDSPASASRVAGSTVAE 
EDS TRRAVRHQGS S RSAPSGRGS WAGKQ 
ERADRAQQS 


9379 


23280 


A 


9448 


307 


430 


GLAWWLTPGIPILWEAEVGGSLEVEFET 
SLASMEKPSLYYK 


9380 


23281 


A 


9449 


2 


253 


IQCVCVCVCVCVCVCLSVCLHFIAILGC 
SPWGSRLPCSVSSPSGRGPGGEDLSPLP 
PTMSVSPEADFPPLAAAAPGNIMTAIT 


9381 


23282 


A 


9450 


] 


166 


NTGGGGEE I R VFS S CFAE DR LO KGKD R P 
GAVAHACNRSTLGGRGGWITRSQIFFC 


9382 


23283 


A 


9451 




123 


NTRGLVMLPRLVSNSWAQVILQPLPPKV 
LGLQAS VTVH INN 


9383 


23284 


A 


9452 


396 


291 


SLCI KRVCACACVC VCVCVSGAKLRKLV 
LDFMRS 


9384 


23285 


A 


9453 


1 


147 


FAIRARFSNTYTKIGTIHTYIYIYTYIY 
TYIYTHTHIYILSQVLMEYEF 


9385 


23286 


A 


9454 


394 


235 


TRSCYIAQSWQWLFTGTLI IHYNLQLL 
SSRDTPVSASQYLGLQAHTTTPDSC 


9386 


23287 


A 


9455 


|_1 


85 


TNPVATLLEWQDALCFI PVGSAAPQGS P 


9387 


23288 


A 


9456 


339 


165 


KTHWWGKKSFLKNPFFFFFFFETKSRSV 
TRLECSGVILAPKQKTKKQNDKTTINKT 
I 


9388 


23289 


A 


9457 


j 


209 


LL I LFS I KMRNR P DS P ALLS TQb P P I F I 
YIHTEIYLYIYTHIYIFIYIYLYIYLFA 
YIHIYVSLFPLK 


9389 


23290 


A 


9458 


374 


156 


QAGLELLTSGDPP I SASQSARITGLSQR 
AQLSELTLTVFTTRHCTCLCAHFLYYTV 
FLEGRNDDEFISIPST 


9390 


23291 


A 


9459 


287 


3 


EAGLLWRKATHK YAHTHTHTHTHTHTH I 
RGLQNM INCKYI TYQE I MTLAQAKNKGV 
IPDSFPFTSSPPLTYPPALSFHFQKHIP 
NKTTTYHGHC 


9391 


23292 


A 


9460 


3 


72 


KTSLKMKFTNILESS FLMNNKS 


9392 


23293 


A 


9461 


327 


92 


ISLFLWGALFFPKKI I YPPKKKSPLFF 
PPPPPQENFNPPSFLAPPVFPPQENPKF 
AGGRKIFCPPPPVFFFYFFFFF 


9393 


23294 


A 


9462 


3 


173 


ACRGDS GG PLVTR DL KDTW YL IG I VS WG 
DNCGQKDKPGVYTQVT Y YRNW IAS KTG I 


9394 


23295 


A 


9463 


258 


39 


LKVILPSDPSSIFVCLFLENESRSVTQA 
GVQWHDLGFKRFSCLSLLSSWDYRRVPP 
PPAKFCIFRKHGVSPC 
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SEQ IB 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/51 5,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 


9395 


23296 


A 


9464 


104 


336 


NHVSQCEE I FLFDEYYI FS RAVASMFWV 
DAELGSDIYLDGKLKNSFLLSFLGFFFF 
FFLGKKVFFFSQNGGLGRPGQ 


9396 


23297 


A 


9465 


410 


0 


LNLGGGGCSEPRLCPCTPAWATE 


9397 


23298 


A 


9466 


254 


61 


KKKKDAFFTI PLYPSSQNLFAFTQMDPD 
TNISQQLTWTVLPQGFRHSPPTFLDKPF 
KQTSINFA 


9398 


23299 


A 


9467 


403 


255 


PAPQLGGPPMMPMMAPPSSWMMPMGCAP 
AMRLPIGGDI IWLCVPTQISP 


9399 


23300 


A 


9468 


1 


1428 


PARFKDLNLAGTAEVGLAG Y FMDHTVAF 
RDLPVRMVCSSTCYRABTNTGQEPRGLY 
RVHHFTKVEMFGVTGPGLEQSSQLLEEF 
LSLQMEILTELGLHFRVLDMPTQELGLP 
AYRKFDIEAWMPGRGRFGEVTSASNCTD 
FQSRRLHIMFQTEAGELQFAHTVNATAC 
AVPRLLIALLESNQQKDGSVLVPPALQS 
YLGTDRITAPTHVPLQYIGPNQPRKPGL 
PGQPAGPRVGYKALQPWPAQRLGPVGRP 
PCRPEGQPLSCCCCGCSCWGRLTRCLAE 
WTEVLTVGQVQRGPCTALLHKELCGTE 
PHGCASTEEKGLLLGDFKKQEAGKMRSS 
QEVRDEEEEEVAERTHKSEVQEQAIRMQ 
GHRQLHQEEDEEESKEERKRGPMETFED 
LWQRHLENGGDLQKRVAEKASDKETAQF 
QAEE KGVRVLGGDRS LWQGAERGGGER R 
EDLPHHHHHKHQPEAEPRQEKEEASERE 


9400 


23301 


A 


9469 


265 


170 


GTESHSVTQAGGQWLNLSSLQPLPPGFK 
QFS 


9401 


23302 


A 


9470 


289 


110 


RDLELASFLSFFPSFLPSFFPSFLPSFL 
PSFLSFSFLISFFPLPSFFSFISFSLSL 
FFE 


9402 


23303 


A 


9471 


67 


210 


TVTTGQALTHAYNPSTLGGRGGQITRGQ 
DFETTLVPGDLNFSRVLASS 


9403 


23304 


A 


9472 


2 


186 


TSLWRQAKKNVEKKTDKYTEVLKTHGLL 
VCTQKSCSFLKNKNKAWLGKQANAPARK 
VEIRC 


9404 


23305 


A 


9473 


417 


265 


FLSS FLFFFFFFFFFFFFFFFFFTV 


9405 


23306 


A 


9474 


430 


120 


ASQKKKKKNKKKKKKKKKKKKKGGAL 


9406 


23307 


A 


9475 


1 


150 


PTRPWQAGFKVLSSGNPPALVSQCFRI 
TGVSHRARPHLLNLRAPNLPVL 


9407 


23308 


A 


9476 


404 


109 


P P PLF FNFG P F FLVEARLHHVGQAGLKL 
PTTSDQPDSASQSAGFTGVSTRAQPTLI 
TLY 


9408 


23309 


A 


9477 


391 


235 


PPFFFFCRDRGLTMLPRLISNSWAQVIL 
LPWSPKMLGLQARATVSGLNKRYS 


9409 


23310 


A 


9478 


438 


277 


KRAPFFFFCRDRGLTMLPRL I SNSWAQV 
I LLP WS PKMLGLQARATVSGLNNRYS 


9410 


23311 


A 


9479 


399 


279 


PPPPVFKNPPPKKNKGAPKKKNISPPPG 
RKKYFFFKGPP 


9411 


23312 


A 


9480 


247 


26 


LKAGDDSRTVLGKVHQSPCFSLLLFFIQ 
VS CRDEGLTVLPRL VLNSRPQAI LLP QL 
PKLLDHRCEPPHLANHG 


9412 


23313 


A 


9481 


2 


176 


HAPPHPANFLFFEEKESCYVTQVSIKLL 
DSSNPPALTSQSAGITGENQHGHPMNCL 
SG 


9413 


23314 


A 


9482 


2 


178 


HAPPHPANFLFFVEKESCYVTQVSIKLL 
DS SNPPALTSQSAG I TGVSHHGQPMNCL 
SG 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, {^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleuctne, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W^Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 


9414 


23315 


A 


9483 


2 


151 


GRVGEGPELVSLLVLDQHPVGTSDHRDA 
TQLGFQPIjMSDLGSTGSTEGA 


9415 


23316 


A 


9484 


420 


325 


FFVFLLETGFHHVSQDGLDLVICPPRPP 
NTFV 


9416 


23317 


A 


9485 


400 


104 


LFFFLFFFFFFFFFFFFWNPPKTFFKKV 
FGLNFPTLLKLKFNPPPPGPPKXKKKEL 


9417 


23318 


A 


9486 


117 


280 


SSTNQANNCFFFFKKRGLAILPRGALNP 
WAKG I LLP WPPKGLKLRGGAI GPGQK 


9418 


23319 


A 


9487 


76 


353 


VLWNTLEFCLI YLDLLFLLFFNKE ALI R 
KKKKKKKKKKKRGGNKKEKKKKTPWQEK 
AGLFLGDT 1 1 YSRGAVLKNSGGETPRHT 
PSGGLWPR 


9419 


23320 


A 


9488 


3 


122 


HASEDNVFLHCSHSDLGYVLLAGTASLQ 
DFSFFQKSPPP 


9420 


23321 


A 


9489 


1 


155 


PTRPTVCSGAMLAHCNLCLPGSSESLAS 
ASQVAGTTACTSTPDPFFFFFSL 


9421 


23322 


A 


9490 


3 


212 


ETGFHHVGLSGFKLLTSGDLPTSASQSA 
GITGMCHCVWPQFAYFCLLLVYLDHLKF 
TIIMLVLCLLFYD 


9422 


23323 


A 


9491 


2 


163 


ETLFRHVGQVSFEPLTSGDLPASASQSA 
GITGVSAMPSWMIKFFIAFYSCINC 


9423 


23324 


A 


9492 


474 


296 


SQPAGTADCLLPLPPGSCFPTI PCPAAL 
VPLPSFPSARLWPPQSLHQLLPLPGTCS 

S.CT 


9424 


23325 


A 


9493 


7 


151 


LCAVEFVYMKPPAHEEAKAPSRGFWRY 
APWTASSSEKGRDDGMGPRA 


9425 


23326 


A 


9494 


52 


363 


TTAPSLSVFLFTFTGELFVYWWVGVTLE 
TSFISFFLFFFSFLEKDFSFSPAGRPWP 
NFGL I QPLASGVGG I S RPNPPS RWELRA 
LPPPPGGLGFFFWKKGGPPG 


9426 


23327 


A 


9495 


383 


118 


WKPRPWVPPGAGVQTPPGPQGETPLFLK 
KKKKRGRGGPLFGPPSFGGGGGEIPLTP 
APKGQEIYFPPPPPPPGGKKKSLFQKKK 
KRIR 


9427 


23328 


A 


9496 


554 


419 


FFFETKPCSVSQAGVLWRDFGSLQPPPS 
QVQRDS PCPQASQVAWD 


9428 


23329 


A 


9497 


428 


281 


GSYVQLTFKYFS I YI YVHTHLHVYLYTH 
IYVYVHICIYKEGYITGSVLA 


9429 


23330 


A 


9498 


144 


16 


KVKKKVSQCPGIVAHNCNPSTLGGRGGR 
IAWGQEFETLSLQKN 


9430 


23331 


A 


9499 


2 


155 


NLALLGSSDPPVSASPVSGIRGVHHHAQ 
LI FFFFFGKRAWFNPPGGRKGEE 


9431 


23332 


A 


9500 


2 


184 


FIYGRDSLTMLPRLVLNSWPQAILPPRP 
PKVLGFRCEPPHLASLILF YRDI LFTLG 
LKSV 


9432 


23333 


A 


9501 


465 


153 


PHKVFLNKGPFETPQGELSRGEWKTRPP 
PPRERSRPFHRKGGFGSPPKLRQGPPFV 
FFSWPPWCPPPKKGARPQQKGFFYFPPP 
FFNPPFFFFFCLFFI FEIV 


9433 


23334 


A 


9502 


429 


11 


CRTLLGADCVCVCVCVCVCVCVKAFPPI 
SYSCHLVPSDLPLLGVQFKCWVHFGLSF 
LTCPFPSPPWFGGAQPGSCYWGLGCLCL 
FACKLGARLTEQTLGI PASGPGTWATPL 
GSCASVLRRLPGGSERDTDLKVSVLSG 


9434 


23335 


A 


9503 


436 


202 


LPGGYIFQGEGVEKTKFSPLLRENNFFS 
NFFPPKKNNGGGPRIPPPPPPPLFKKGV 
FFI PKFPAPPRPKKKFPKKKKK 


9435 


23336 


A 


9504 


3 


112 


S FDLLS S S DP P ALAS QS AG I TG VS I LSKG 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E-Glulamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Argtnine, S=Serine, 
T=Threon ine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














QKVYICNM 


9436 


23337 


A 


9505 


426 


208 


VGHNLLVPRF L S PS EKRS I QVGVTRS S R 
CRLS PLSLTRKGNSLTP FASRVRQCLTL 
LRLARAPTDLRPLSGTP 


9437 


23338 


A 


9506 


441 


189 


CPANFFVFLREMGFHHVGQAGLLTSSDP 
PASASQSAGITGLSHCARPTVGSDVLSL 
PSGQCTSALTGFMLGSTTVHSNAAARA 


9438 


23339 


A 


9507 


117 


278 


NKALLCH PGWNH I KKVKGRLGMETHTCN 
PTTLGGQGQEFENSLGKIGRPHLYK 


9439 


23340 


A 


9508 


3 


126 


■ FFVEMGFRCVGRAGLRLLTSGGLPVLAS 
QGAG I TGVSHC I 


9440 


23341 


A 


9509 


421 


134 


FLVERRSHHAAQAS LE I LS S S DS PTS AS 
QCAGI TGMSHCARPfCYTSFFRLHRHMPM 
ATPTKIiPPTLASAS PQVKVCLYPLIiPST 
SKGEIFTLQLY 


9441 


23342 


A 


9510 


2 


200 


PQPPTFYPSFLPSTPEPPAFYPRLPPST 
PASRLLPQPPAFYPSACHLLPQPPTFYP 
SLGNRARLHL 


9442 


23343 


A 


9511 


414 


204 


ERDRKKSRKKEGRKERERERKKEKKRKK 
KKERRKEGRKEGRKEASKGQRVWIMVD 
T 


9443 


23344 


A 


9512 




211 


FHHVG WAGLGLLTSGDLPASASQSAR I T 
GMSHHARHREG FDNLSLCALAPS PTPLW 
RRHRRRGHFTVPEK 


9444 


23345 


A 


9513 


390 


265 


VFLVEMGFHHVGQAGLELIiTSGDLPPLE 
SQSAGITGVSQQFS 


9445 


23346 


A 


9514 


438 


180 


IGFRHVGHAGLDFLPLGDLPSLASQNAR 
I TGVSHRAWPAMAFVNCHGAGGS VAMRT 
TRGHS CRHLG I GGFWPASLLQP VLSAGS 
L 


9446 


23347 


A 


9515 


126 


11 


RNLDFFLSIiGLAMLSRLVNSWPQVIFQA 
WPPKALRFQV 


9447 


23348 


A 


9516 


2 


249 


FLHVGQAG FVL PTS DDFR PTHT P QS SG I 
TGVSHRTRPLRCYFLRLNNI PLYVYAYY 
ILFIHSVISGYVGGCTAHYCHPRKW 


9448 


23349 


A 


9517 


403 


57 


PEGALVPQKKKKKKKKKKKKKKKKKKKK 
NIKI 


9449 


23350 


A 


9518 


406 


140 


PHYNHTFVYNGVRLEDLQHMCLELTVWD 
REPLASNDFLVGVRLGVGTGEI PSPILT 
CLKASCLFYSYYHSRPPTQSSSVMTWF 
FPFS 


9450 


23351 


A 


9519 


3 


75 


G I E WMVN CWRNVHR PPRQRDI T 


9451 


23352 


A 


9520 


183 


6 


LLTHLILTSALIGEIMVAPILQKNSGSE 
RECRLPRLVLNSWPQGILLPWPPKVLGS 
QV 


9452 ! 


23353 


A 


9521 


89 


339 


LQCLQKSGKLEPGMTYTKLIDADVNVGN 
ITSVQFIWKKHLFEDSQNKLGAEMVINT 
SGKYGYKSTFCSQDIMGPNILQNLKPC 


9453 


23354 


A 


9522 


3 


329 


PSLMFPWLXLSFPGRPRGRQTYSRYQTL 
ELKKEFLFNPYWTRKRRMEVFHALGLTE 
RQGKIWFQNRRMKGKKENTRDNLPEAQN 
EEKVBEKGIEEKEKEEEEKEENRD 


9454 


23355 


A 


9523 


1 


156 


AKTSMNVNE I FMAI AKKLPKNE PQNPGA 
NSARGRGVDLTEPTQPTRNQCCSN 


9455 


23356 


A 


9524 


1 


272 


AKNSARALLGILGYRSHLISSLLCLEGI 
ILSLVI IATLITLNTHSLLANIGPIAIL 
VFAACEAAVGLALLVS I SNT YGLDYVHN 
LNLLQC 
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Predict- 
ed end 
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correspon 
ding to 
Inst amino 
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residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine, 
G-Glycine, H=Histidine, I-Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


9456 


23357 


A 


9525 


1 


,62 


GNDAWVCTNDMFRLALKNPG 


9457 


23358 


A 


9526 


1 


111 


GDI I HNENMRQGRD K Y KTLRQ I RQGNTK 
QRIDEFEAL 


9458 


23359 


A 


9527 


427 


340 


IPIGNIMSCDSHSHHELVMSAQLNAIFH 
D 


9459 


23360 


A 


9528 


439 


27 


PRRTDKWLTVEMIQDGHQVSLLSGELTV 
DQRAS I IQRFRDGKEKVLI PTNVCARG I 
D VKQ VT I WT FDLP VKQGE E PD YE T YLR 
RIGRTGRFGKKGLAFNMIEVDELPSPMK 
IQALFDGRNKQLNAEVPCIQAAMED 


9460 


23361 


A 


9529 


1 


236 


REVL I TTNVCARG I DVKQVT I WNFDLP 
VKQGEEPDYETYLHSIGRTGRFGKKGLA 
FNMIEVDELPSLMKIQDHFRKS 


9461 


23362' 


A 


9530 


437 


240 


QQVSSARPEPEAGEVSPPVGAGVNSNSW 
AFKYGPGNPKQSGPEPKKQTQVSFLLRR 
KGEASQPRQ 


9462 


23363 


A 


9531 


413 


260 


LFDQWKVASPSGLRTSTVRYGDPEKLK 
LEESLRQQAEKEGSALSVRISNV 


9463 


23364 


A 


9532 


431 


260 


ASSGQMFSVKQQAPPVRDLPPFGISLFG 
GQIYDNISPRPPQGIVAPPGGRGNIPNL 
G 


9464 


23365 


A 


9533 


2 


134 


IHGVCVHMCVCACVCVYACVCVGGYDGR 
IFLALSSTVFSMVNIM 


9465 


23366 


A 


9534 


419 


53 


LLLLTNGDRQTQREKI EACACQS YFDAV 
WGGEQREEKPAPSIFYYCCNLLGVQPG 
DCVMVGDTLETD I QGGLNAGLKATVW IN 
KNGIVPLKSSPVPHYMVSSVLELPALIiQ 
SIDCKVRMST 


9466 


23367 


A 


9535 


439 


109 


NCI VI IDSTPYRQWYESKYALPLGRKKG 
AKVTPEEEE I LYKNRSKKI QKKYDERKK 
NAKISSLVEEQFQQGKVLACIASRPGQC 
APAD AYV I EG KE LE FSLRK I KARKGK 


9467 


23368 


A 


9536 


1 


185 


NTTSSHHFTI PCIiVLYSCTSTHTHIHTH 
THTHTHTHVRVFS Y I PSQDVKEGCIPAP 
SVFSW 


9468 


23369 


A 


9537 


168 


280 


NPNS I RLTSCCCCCCCCCWKS P YNVYNR 
LLRTGFTFP 


9469 


23370 


A 


9538 


142 


324 


KIVKMGFPHAGQAGLELLTSGDTPGSTS 
QSAGITGVSHRSRARSIFLYPYLPSVYL 
PWRRV 


9470 


23371 


A 


9539 


3 


139 


YIGGMGQAWWLMPGKPRQNDYLS PGVQD 
QPGQHSETPSLQKGFKN 


9471 


23372 


A 


9540 


383 


318 


MSTQNWIQS YP IGEDEESESD 


9472 


23373 


A 


9541 


372 


207 


YGLNMCRQCFRQYAKDIGFI KVGVCRRT 
PCCLGREGCGGYFFRERQTDRSSLEW 


9473 


23374 


A 


9542 


2 


360 


ARGDDDHRSSDRKDGVLEEQ I ERLQEKV 
ESAHSEQKNLFLVI FQRVIMILTEHLVR 
CETDGTSVLTPWYKNCIERLQQIFLQHH 
QI IQQYMVTLENLLFTAELDPHI LAVFQ 
QFCALQA 


9474 


23375 


A 


9543 


379 


75 


TNLQQFAMSHGSAQVKGHGKKVADALTN 
AVAHVDDMPNTL S ALS DLKAHKLRVD PV 
NFKL I SHCLLVTLAAHLPAE FTPAVHAS 
LDEFLASVSTVLTSKYR 


9475 


23376 


A 


9544 


380 


56 


RDLSMVRMKSMFAIGFCFTALMGMFNSI 
FDGRWAKLPFTPLSYIQGLSHRNLLGD 
DTTDCS F I FLYI LCTMS IRQNI QKILGL 
AP SRAAT KOAG G FLG P PPPSGKFS 
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sequence 
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NO: in 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino arid 

residue of 
peptide 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine > D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H-Histidine, I=lsoleucine, 
K=Lysine, k=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
0=G 111 tannine Tfcs=ArQininp S=Sprinp 

\^ VIUiaiHIIItj ftv /TL 1 gill k7 Uvi IIIC, 

T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 

X=tJn known *=Str»n rndnn /=nAccih!p 

/» VIIIVIIl/fT II) kJll/p VUUvllj / pUOJ ft Lf 1 W 

nucleotide deletion, ^possible 
nucleotide insertion 


9476 


23377 


A 


9545 


304 


92 


S KS RVTNNTLR S TR KDTM I TLNTHS LLA 
NIVP I AI I VFTACEAAVGLALLVS I SNP 
YGLDYVHNLNLLQC 


9477 


23378 


A 


9546 


394 


108 


KEIDFSLVDVTADDAGNYSCMYYQTKSP 
FWASE PSDQLE I LVTVF PGTTS SNYS LG 
NFVRLGLAAVI WIMG* FLVEAWYSRNV 
SPGESEAFKPE 


9478 


23379 


A 




I 


jjj 


GNAVGLNLHR INFDKYHPG YFEKDGMKH 
YHLHRNQSFCPTVNLDKLRTLFSEQTRL 
NAAKNKTGTAP 1 1 DAARS VY YKVLGKGK 
LPKQPVIVKAKFFGSLIFCFRIWILID 


9479 


23380 


A 


9548 


326 


.188 


NFLRMGTVAHASNPSILGGLGGRIAGAQ 
EFETSLGNITRPCLYFKK 


9480 


23381 


A 


9549 


381 


152 


VTLISLSLLLQWIGVLLWIARLNLNE 
VEKQWRLNQLNNAATILVFFTWINVFI 
TAFGAHKTGFLAARASRNPL 


9481 


23382 


A 


9550 


170 


65 


IHRSKNKWKFHLKDGIMNLNGRDYIFSK 
AIGDAEW 


Q489 










JO 1 


RRLROENRLNPEGGGCSKPRSCHCTPAW 
ATEQDSSSKEKKERKNLKGNTVKSVPLtP 
LTSHA 


9483 


23384 


A 


9552 


369 


181 


GVLGATSGDMQIHTFGSMI IGS ITGMVS 
ELGYKFLTVSTADVMIQTLNLHLFSKMS 
HQQKKSY 


9484 


23385 


A 


9553 


1 


516 


S LVHWE FGQA I ARRLKP FG VQRFL Y TG 

WACS LT PATEG L CNKDF FQKMXETAV F 
INISRGDWNQDDLYQALASGKIAAAGL 
DVTSPEPLPTNHPLLTLKNCVILPHIGS 
ATHRTRNTMS LLAANNLIAGLRGEPMP S 
ELKL 


9485 


23386 


A 


9554 


386 


56 


LFPACVLRWQFAS DGGDIG FGVFLKTKM 
GEQQ S AREMTE VLP S QR YNAHMVPEDG S 
LTCLQAGVYVLRFDNTYSRMHAKKLSYT 
VEVLLPDKASEETLQSLKAMRPSPTQ 


9486 


23387 


A 


9555 


375 


150 


IWGKTSFHVSIPLSLFFPGDVLYDLLQ 
HILKQSKPRILFSPFFHPGNSIHTQPEV 
ILHQTHEEGTGRGFS FLA 


9487 


23388 


A 


9556 


3 


354 


CLPTQASAMVDTPPPALLPPCSLISDCC 
ASNEQGSVGIGPSEPGAGYNFLVCRLLR 
PLEKHSIRVGVTQFSRCCLSPPSLTRKG 
NSLTPCTSRVRRCLALLQLTLSELPHCP 
VPSV 


Q4R8 




j± 


yjj / 






PTKVEDPEPVKKPPKEORSTKEMPFTTr 
DEFNGVPSYMKSRLTYNQINDVIKEINK 
AVISKYKILHQPKKSMNSVTRNLYHRFI 
DEETKDTKGRY F I VEADI KE FTTLKADK 
KFHVLLNILRHCRRLSEVRGGGLTRYVI 
T 


9489 


23390 


A 


9558 


338 


104 


NFKKKKKRRTKKKE IRTEEKGRGANKKT 
FRGPKKKETRGSNRSPE3KKNHGAKKEG 
ERRPRKKKKKKKKLARYDFTCL 


9490 


23391 


A 


9559 


1 


152 


NTCVC VCVYTGFCHVAQVAHELSS SNLP 
SSASQSAGITGISHHAWPGFGF 


9491 


23392 


A 


9560 


517 


238 


SGPGESVPHPWSAVRTRPAKKPGSQCLL 
QSAFPLWCSSRRTTWAEERKLNTETFGV 
SGRFLRGRSSRGGFRGGRGNGTPRRNPT 
SHRAGTGRV 
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Amino acid sequence (A=Alanine 
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eth 
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ed end 
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E=GJutamic Acid, F=PhenylaIanine, 
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ing to first 
amino acid 
residue of 
peptide 
sequence 


otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


G=Glycine, H=Histidine, I=Isoleucme, 
K=Lysine, b=Leucine, M=Methionine, 
N=Asparagine, PHProline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=<ryrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


949? 


j ~j y j 


A 


9561 


709 


280 


NTS PALTAPCGR PRGI RFMQC WARVPAP 
SSACPAWQLPSLCLRSTVPAPHPSPAAH 
QPCELSQHHRGGLEPALFGTRQRLCSRT 
WHRCVRFLPGAQEAGELGPAACATGRIiP 
VSHSTDMEQAPGLTSLHQEPHSTARGGT 
ATH 


9493 


23394 


A 


9562 


3 


130 


YRGFCHVGQAGLELLTSGDR I TS ASQS A 
EITGVSHRAQPQMS 


9494 


23395 


A 


9563 


409 


160 


NPGVKRSPCLSLSSSGGYWSMPPCLAYY 
KKKKRRFLKDGGLTMLPRLASNSCAQQF 
SHFGLPQSWDYRHEPLHIAQFFLFSLL 


949S 


23396 


A 


9564 


] 


241 


GICLGSNIGSGWMGTSSLCNPYSNTLNQ 
NLLS TTKP FP VPS VGTNCG I EPWDSGVT 
SGLGKMLFLYYYYLLLFSQMTPQS 


9496 


23397 


A 


9565 


2 


215 


IHWMNGWMDGWMDGWMGSCVRAVTGPGT 
SSPHTSTCGHAVCVCVCVCVYTCAHVFL 
ALFGVEKNNYFLWKA 


9497 


23398 


A 


9566 


244 


117 


DHQHLNWS VF I S FS FKVI KARKNKKG I L 
NPDSSMETSPDFFF 


9498 


23399 


A 


9567 


414 


77 


FGGVPPGTLVPPTWLPPVRVPPPPAFCL 
GELS P P P FFWRKGLAPPLLNFFFSPGQR 
GFPSPGGKGPRPLPRPAGFPGFVAKPPF 
RPPGFSSTPSDCGGLPFFFFFFFKKKVF 


9499 


23400 


A 


9568 


418 


198 


P P VS LTE V I VS KSHKAELVF WNM PGTLE 
HPEGDNNYMEF I EVLTEALERVLLVRGG 
GSEVITIYSYPTLNARA 


9500 


23401 


A 


9569 


549 


415 


RQENRLNPGGRG CSEPRSCCC I AAWATE 
GDCLKKINNNKPKNEQK 


9501 


23402 


A 


9570 


546 


1 


KLiE CRGA I TAHCS LNLPG P GD P P TS ATQ 
AAGTTGTCH 


9502 


23403 


A 


9571 


415 


215 


SRLRKKI Y I Y I YMCVCVYVY I CVY I C I Y 
VCVY I CI YVCI CVYI CI YVY I CVYMYI Y 
MHEFITFFSI 


9503 


23404 


A 


9572 


374 


138 


LAASAAIQELFKRISEQFTPMFRRKAFL 
HWYTGEGMDEMEFAEAESNMNGLVSEYQ 
QYQDATAEEEGE FEE EAEEEVA 


9504 


23405 


A 


9573 


370 


233 


GSPPVRSVHQSGVSLSGTQVDEGVRSGS 
KRMVAPPGGRYNITSLS 


9505 


23406 


A 


9574 


3 


399 


YSSPGPIALALRDPCAHGGPSPNSSLSP 
AQGHPARAQQGPQPQQGPPSHLGSDLRP 
HVAHTRGALHGPAPAPASPAPGGSGSGV 
AASMCSAPRPGRAGGTLGSVLTAHHPAS 
AGCAHRDAYVCKAPGRAVF 


9506 


23407 


A 


9575 


314 


48 


SILPHSDPERCFSIQPEEGTIHTAAPLD 
RE ARAWHNKT VLATELQS SAQAS RVQVA 
IQTLDENDNAPQLAEPYDTFVCDSAAPG 
QVSN 


9507 


23408 


A 


9576 


505 


264 


I F WEME FHHFG QAGLRLLTSGDM P ASA 
SQSAGVTDMSHCTRPDTPFNFKEIiPGAV 
VTFLLCGMQVGRDELEDDWMGAWS 


9508 


23409 


A 


9577 


508 


314 


PATWKGCLSNFSSYLTKPADFIEHQVLS 
WEQVPDGFFIFNERFKSFTWVLNNVAE 
FVCKYKLL 


9509 


23410 


A 


9578 


512 


1 


RMREAI FS PGRL P RGAAPLRG P AGTLVP 
WMPTPGECWLSLSVSAAGEKPYKCSVCE 
SAFNRKDKLKRHMLIHEPFKKYKCPFST 
HTGCSKEFNRPDKLKAHILSKSGMKLHK 
CALCS KSFSRRAHIiAEHQRAHTGNYKFR 
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peptide 
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Predict- 
ed end 
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otide 
location 
correspon 
ding to 
last amino 
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residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G-Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Melh ion ine, 
N=Asparagine, P=ProIine, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptopban, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, V=possibIe 
nucleotide insertion 














CAGCAKGFSRHKYLKDHRCRLGPQKDKD 
PV 


9510 


23411 


A 


9579 


3 


487 


WDLIYVGRKRMQVEHPEKAVPRVRNLVE 
ADYS YWTLAYVI S LQGARKLLAAEPLS K 
MLPVDEFLPVMFDKHPVSEYKAHFSLRW 
LHAFS YE PLLI YPTH YTGDDG YVS DTET 
SWWNWEHVKTDWDRAKSQKMREQQALS 
REAKNSDVLQS PLDSAARDEL 


9511 


23412 


A 


9580 


1 


840 


S KYL YDEGE EE EV FNDEWGAAGG VAVPT 
LNALERG FLS AMDWHL YTD PRE I FEVLS 
WLESCVAEQQGRWRGWYTYTDLCVLLEQ 
PTWQIiALGSLCQRLVKLSCLLAVAYVSS 
VALAVASVAVIHQSLGLSCTPTPGPPDL 
GLTSRCLLEPCIPSVPQCLPSLANVSSC 
LEGSMGLRSLWGSLLASLTPPPLPPPDP 
PAPPTLLHNCHLCQKIiQRDS PTCHACLH 
PNRTVPTALSSPWYHTYGLAPPWPWSPV 
PLSLPQPQQCSLFSVMELARLKSFVFPG 


9512 


23413 


A 


9581 


2 


158 


IHAGFISVSVLDNIIYICMCVCVCVCVC 
AHMCACI SEHITHAPRLPEVFYTP 


9513 


23414 


A 


9582 


3 


144 


YTAGVISVSVLDNIIYICMCVCVCVCVC 
AHMSACFSEHITHAPMLP 


9514 


23415 


A 


9583 


503 


3 


GKCVCLSAVWGSLLSAFRGSLSVCCLGQ 
FVCLSAVRGSFSVCCPGQRVCLSAVRGS 
VSVCLSAVQGNVSVCLSAVRGSLFVCLL 
S GAVCLS FCLLSGAVCHLCRAP TP CLS R 
SLAHRRCSVNVCEMTQLISVSYFFCWGW 
WRHRNN I HLQKCRLR I LMAYTGQGPP 


9515 


23416 


A 


9584 


2 


1358 


AGLDS LHKFQ VKI EKEKVYVRAS KQALQ 
LQRRTKVMAKCISPSAGYSSSTNVLIVG 
AETLRQEGFSDRIVLCTLDRHLSYDRPK 
LSKSLDTQPEQLALRPKEFFRAYGIEVL 
TSAQWTVDVRTKKWFKDGFKLEYSKL 
LLAPGSS P KTLSCKGKEVENVFT I RTPE 
DANRWRLARGRNWWGAGFLGM2VAA 
YliTEKAHSVSWELEETPFRRFLGERVG 
RALMKMFENNRVKFYMQTEVSELRGQEG 
KTiKEWLKSSKWRADVCWGIGAVPAT 
GFLRQSGIGLDSRGFIPVNKMMQTNVPG 
VFAAGDAVTFPLAWRNNRKVNI PHWQMA 
HAQGRVAAQNMLAQE AE M S TVP YLW TAM 
FGKSLRYAGYGEGFDDVI IQGDLEELKF 
VAFYTKGDE VI AVASMNYDPI VS KVAEV 
LASGRAIRKREVELFVLHSKTGDMSWLT 
GKGS 


9516 


23417 


A 


9585 


360 


239 


Q CR KGLGKLVS FGGLG I WS L I D VLL I G V 
GYVGPVHGSSYV 


9517 


23418 


A 


9586 


160 


1156 


MPRLTFAPKGWPHPPTSLHPGQVTDQTT 
WWLFQELPTPSENSMPPGLSTPTASQEG 
AGPVPDPSQPTRRQIRLSSPERQRLSSL 
NLTPDPEMEPPPKPPRSCSALARHALES 
S FVGWGL P VQS PQALVAMS KE E KE S P FS 
SEEEEEDVPLDSDVEQALQTFAKTSGTM 
NNYPTWRRTLLRRAKEEEMKRFCKAQTI 
QRRLNE I EAALRELEAEGVKLELALRRQ 
SSSPEQQKKLWVGQLLQLVDKKNSLVAE 
EAELMITVQELNLEEKQWQLDQELRGYM 
NREENLKTAADRQAEDQVLRKLVDLVNQ 
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C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














RDAL I RFQE ERRLS ELALGTGAQG 


9518 


23419 


A 


9587 


419 


130 


RRDPHLPRIMGRTRLNRKNAETRINAQL 
PLTDKGRMARHVLDNSGEWSVTKRQVIL 
LHTELERSLEYLPLRCGVLTGLAAIASL 
LYMLTHYLLPYA 


9519 


23420 


A 


9588 


2 


399 


FSCEFNMEANQCPLVVEPSYPDLVINVG 
EVTLGEENRKKLQKIQRDQEKERVMRAA 
CALLNSGGGVIRT4AKKVEHPVEMGLDLE 
QSLRELIQSSDLQAFFETKQQGRCFYIF 
VKSWSSGPFPEDRSVKPRHV 


9520 


23421 


A 


9589 


19 


205 


LYCLV E WHDLCS LQ PLS PGFRQ FSCLNL 
LSSWDYRHMPPRPTIKLSSYSSIKLIYY 
LFMLVL 


9521 


23422 


A 


9590 


500 


407 


KLTNEVRSVKRSQLFPESFMCKEKLGNT 
Nil 


9522 


23423 


A 


9591 


727 


236 


RRSRGLEGREALALCPGDGSHLLCRRTD 
SSFSFMAFFFTFMAQLVIS I I QAVGI PG 
WGVCGWIATISFFGTOIGSAVVMLIPTV 
MFTVMAVFS F I ALS MVKKFYRGSGGS FS 
KAQEEWTTGAWKNPHVQQAAQNAAMGAA 
QGAMNQPQTQYSATPNYTYSNEM 


9523 


23424 


A 


9592 


381 


278 


HPSVYKVASGLKEGLSLFGILNRCHCKW 
GEKLLR 


9524 


23425 


A 


9593 


412 


156 


SQRCLSLGCKEHLANAYAI I I CDNGFSA 
LLGP PLPGWI YD I AQKYDFS FY I CGLLY 
MIGILFLLIQPCIRIIEQSRRKYMDGAH 
V 


9525 


23426 


A 


9594 


263 


50 


KHAAPPASLSLSLIjLHHGQKRACFPFAF 
CRDCQLLEGSPAMLPVQPAKLLVLLEQV 
CLLCALLIPPSGSRI 


9526 


23427 


A 


9595 


432 


309 


GTFSERGPPLEPRSQTVTVDFCQEMTDK 
CTTDEQPRKDYT 


9527 


23428 


A 


9596 


457 


29 


. TLLPGWITAQASEGEIGASLPSSLPPLL 
YLPSWRCSPSGALTHPAACPTGQRSSSA 
LLPSLPQAPPGFCGMTGLRPLPSPSVGC 
CSAPSPQLLCPTQLPPAPESSEGGCSES 
RCVANVKYTRDLGDFLEKCTNGEASALE 
YP 


9528 


23429 


A 


9597 


864 


407 


PSRRLSPTGRTTHGSSAACAPPPPRLER 
RSRTSRPPWGPPHSASARTQRACTTWYS 
RSWACSCSWITSRPSWMTTKRTCMIFST 
I PTT P R T ALWRG L S L WKD A W LLTRL K LA 
PSSPTSMPPTTTCTACSSTWMTCGSPAR 
WASTPMPRSSTT 


9529 


23430 


A 


9598 


421 


145 


LTQHNGDAAASLTVAEQYVSAFSKLAKD 
PNTI LLPSNPGDVTSMVAQAMGVYGALT 
KAPVPGTPDSLSSGSSRDVQGTDASLDE 
ELDRVKMS 


9530 


23431 


A 


9599 


1013 


604 


PGRPTRPDICLLLERMEEQVKNVMKTFR 
EELYNIEKAFEVERQELLASNKKKWEQA 
LQAKNAKELEYLNNRMKKVEDYEKQLNR 
QRIWDCEE YNM I KI KLEQDVQ I LEQQLQ 
QRKAIYQLNQEKLEYNLAGAEEER 


9531 


23432 


A 


9600 


409 


191 


HHVGQAGLELMTSS DLPSLAS QGAGI TG 
VSHCARTVYGLSIVFFSSHRGFASVRTP 
PSNENFPDRPISIAHP 


9532 


23433 


A 


9601 


367 


1 


PPRYSPPLGGVRGGVPPGPGVLAPPFPK 
GKPPFFFKNPNLPRPGGGAPYSPFFGGV 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L= Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Ty rosin e, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














GRG I S FNPGGQGS PGQKKPPG P PRRGTK 
GFFSKKKKKESTKLPVTSHTTHCWFLPE 
ALDLELDPLY 


9533 


23434 


A 


9602 


1722 


394 


GWNGS WNDNL VDT S PL KRDPLQD I CRR Y 
MEDLKKICFYRELNSKTTLKFVHTSFHG 
VGHD YVQLAFKVFGFKPP I PVPEQKDPD 
PDFSTVKCPNPEEGESVLELSLRLAEKE 
NARWLATDPDADRLAAAELQENGCWKV 
FTGNELAALFGWWMFDCWKKNKSRNADV 
KNVYMLATTVSSKILKAIALKEGFHFEE 
TLPGFKWIGSRI IDLLENGKEVLFAFEE 
S IG FLCGTS VL D KDGVS AAVWAEMAS Y 
LETMNITLKQQLVKVYEKYGYHISKTSY 
FLCYEPPTIKSIFERLRNFDSPKEYPKF 
CGTFAILHVRDVTTGYDSSQPNKKSVLP 
VSKNSQMITFTFQNGCVATLRTSGTEPK 
IKYYAEMCAS PDQSDTALLEEELKKLID 
ALI ENFLQPS KNGTGSGRSCLGVPPNTV 
MTLCGAYGNRATRRNCHTLEPCG 


9534 


23435 


A 


9603 


14 


356 


DFVERTQYTHTQTHTHTHSHAVRLSPRP 
VQSRPEASQTGPTRTKPALGPAHPSPRG 
AP C P DD PGSLRMLG YGSRASHNSRRGRW 
GHGRCKDQRAMGPHSHMWEAALPPGALS 
SP 


9535 


23436 


A 


9604 


477 


264 


LVETGFLCVGQAGLELLTSGDPPTSASQ 
SAGITGVSHRTWAALSTFTVLCNYHYLF 
LEVFHHPKLKLSPH 


9536 


23437 


A 


9605 


1 


378 


EGINFSDNLRRCWMVGMPFPNIRSAEL 
QEKMAYLDQTLPRAPGQAPPGKALVENL 
CMKAVNQS IGRAI RHQKDFAS I VLLDQR 
YAR P P VLAKLP AW I RARVE VKATFG PA I 
AAVQKFHREKSASS 


9537 


23438 


A 


9606 


417 


254 


MVSLTQELCPVAMRVAEGHNKMLSNVAE 
RVTVPRNFIRGALLEQAGQDIQNKLE 


9538 


23439 


A 


9607 


404 


272 


PPKFLLESGPIRWDWWSLGALMYDMLT 
GAVGAQLKAAC I IGLC 


9539 


23440 


A 


9608 


409 


205 


HMNGSLGSGDIDGLPKNSLNNI SGISNP 
PGTPRDDGELGGNFLHS FQNDNYS PS S L 
PCSWSLFYSLPF 


9540 


23441 


A 


9609 


449 


263 


VWICPCDPPASASQSAGITGLSHHLPKC 
WDYRLE P PRPAHC FCF YS FTMRNRNLLS 
LVKYSR 


9541 


23442 


A 


9610 


366 


75 


TPGLKRSSCLSLMSRWGYRIEKKRTIIR 
S WEAI KEQDGRE VDWEYFYGLLFTSEN 
LNLVHIVCHKKTTHKLTCDSSSIYYPQT 
RLKRKQPVRKRQ 


9542 


23443 


A 


9611 


1 


801 


PGYCGSWVFTCGALRQLSGGRDLSRGAR 
MGNSALRAHVETAQKTGVFQLKDRGLTE 
FPADLQKLTSNLRTIDLSNNKIESLPPL 
LIGKFTLLKSLSLNNNKLTVLPDEICNL 
KKLETLSLNNNHLRELPSTFGQLSALKT 
LSLSGNQLGALPPQLCSLRHLDVMDLSK 
NQIRS IPDSVGELQVIELNIiNQNQISQI 
S VKI S CCPRLKI LRLEENCLELSMLPQS 
ILSDSQI CLLAVEGNLFEIKKLRELEGY 
DKYMERFTATKKKFA 


9543 


23444 


A 


9612 


179 


329 


KGLAFCPPPGRGGGPPPPLWPPPPSYKK 
IFGLKTPKSGALSPPPPPPGNF 
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nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
\V=Try pto ph a n, Y=Tyrosine, 
X=Un known, *=Stop codon, /= possible 
nucleotide deletion, \= possible 
nucleotide insertion 


9544 


23445 


A 


9613 


410 


298 


PRFTPFSCLSLRSSWDYRCPPPRPANFL 
SLSSVLHGT 


9545 


23446 


A 


9614 


1 


130 


NPGSHCVAQAGLQLLSSGNPSASASQSA 
RITGVSHCAWPNTTI 


9546 


23447 


A 


9615 


1 


146 


NTWRLHMVPRLVSNFWAPG I Lft S WPPKI 
LGLQKRATTMPGPE I YTS IN 


9547 


23448 


A 


9616 


552 


26 


RLSIRNLLTHAFFAEDTGLRVELAEEDD 
CSNS SLAI RLWVEDP KKLKGKRKDNEAI 
EFSFNLETDTPEEVAYEMVKSGFFHESD 
CKAVAKS IRDRVTPI KKTREKKPAGCLK 
ERRDSQCKSMGNVFPQPQNTALPLAPAQ 
QTGAE CEATEVDQHVRQQLI QRE PQQHC 
SSVTGDL 


9548 


23449 


A 


9617 


420 


48 


FSQILRASYEFDSPFWDDISESAKDFIR 
HLLERDPQKRFTCQQALRHLW I SGDTAF 
DRDILGSVSEQIRKNFARTHWKRAFNAT 
SFLRHIRKLGQIPEGEGASEQGMARHSH 
SGLRAGQPPKW 


9549 


23450 


A 


9618 


190 


285 


RQGLTMLPWLVSNSWPQWLLPWPLKVL 
GLQV 


9550 


23451 


A 


9619 


392 


308 


SPIKPKIPLSAPRKNTNSVKYRLKFRFG 


9551 


23452 


A 


9620 


166 


23 


FCNPIPSSLPSSLPSFISPFLPCFLTPF 
LPCFLPPFLPFLS FMTLT F 


9552 


23453 


A 


9621 


49 


374 


DRRG I R I MAAAL FVP LGFSLLGTHG PS G 
AAGTVFTTA * YLG SKI LLTC S LNDS ATQ 
VTGHRWLKGRWLKEDS\LPGRKPEFNV 
GPHPPNGDKSSFVFPPPSPCPRLTF 


9553 


23454 


A 


9622 


85 


472 


SHVFPPLRI/TLTLHSMAVPETRPNHTIY 
INNLNDKI KKDELKKSLYAI FSQFGHNL 
D I LVS R I LNMMG QAF V I FKE VS S ATNS L 
RSMQGFPFYDKPMRIQYAKTYS YI IAKN 
XGTFLSC/DRKLIK*TPQ 


9554 


23455 


A 


9623 


170 


2 


I FFFFCFERGSCSVA* AGVQWRS YSLL* 
S *TPG\SSNTASATSVAGTTGL\HHHAE 


9555 


23456 


A 


9624 


2 


469 


RRLCS DRGLQRS LGSMSAAVTAGKLARA 
PADPGKAGVPGVAAPGAPAAAPPAKEI P 
EVLEDPRSRRRYVRGRFLGKGGFAKCFE 
ILDADTKEEFAGKIVPKSLLLKPYHR* K 
MFMEI FIHRSLGHQHVL*FHGYF*EHDF 
MSEELEL\CRPRSFLF 


9556 


23457 


A 


9625 


661 


1393 


ASPPPSQTTPAAPGCSAGEVSLA\GLCL 
QERRLVDVAGFSIFIPSRYLDHPQPSKA 
EQDAS I PPGTHEALLQTALS PPPPPTRP 
VS PPQKAKEAPNTQAQPI SDDEAS / RWG 
RNPG*CS*SGSPKQWPLPHLP*GRIi*A* 
PS PCCPG * VG/ PVHS PQARRRP I TTS I S 
QRSWKFWPKPTVFRETSGGPWAMPRPSM 
PSRAS I SLS PRTRSRKQPRPLTLGCCVW 
HWHTDGERATCGDVRRAHHSPR 


9557 


23458 


A 


9626 


1 


286 


PTtMAKLIALTLMGMELALFWNHQASYQT 
RLNALREVQPA* LPNCNLVKGF * TGSED 
F\EILPNGLTFISSGLENPWI*RLDP\N 
SPGKMLLMDLNE 


9558 


23459 


A 


9627 


173 


491 


EGPLPLESSSNWGQADLDKKSRELLWKT 
VYLL*LNIQLPYNPWPILGINPRETHT 
C\HQEMCTRIFIATLFTKAKA*K* prcp 
*AREW/ IK* / I WNIHI I3HY 


9559 | 23460 


A 


9628 


351 


1 


VLGDA I QSRRGSSRKAGS YLLSRS P / CS 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q^GIutamine, R—Arginine, S=Serine, 
T=Threonine, V=Va!ine, 
W=Tryptophan, Y=Tyrosine, 
X^Un known, *=Stop cod on, /= possible 
nucleotide deletion, \=possible 
nucleotide insertion 














EGGRGGCWAWNLGTMSSVASGHGAWVPR 
ELTS*AWPEPGPAGQR2GRPPPTPCPPT 
CAHGP PAQP * PRGKAAYPG/ CLGLQWPD 
*SAFNP 


9560 


23461 


A 


9629 


599 


n 


KM WR * * H * AKSQ I RNA I PFT I AHK/ RDL 
KS LKI QLTXEAKD I YNE>TYKI PL /NEIR 
DVTNKWKNIP\CS+ IGYLYSTYKYKCIC 
LVNIYL 


9561 


23462 


A 


9630 


226 


1 


KCDNLGYSTFFFFFFPFFFFEIGSYFVA 
Q VG V * WRN Y S N S / DAS A FQVAGTTGVCH 
HAQLI FNLFVEVGSHYVAQA 


9562 


23463 


A 


9631 


241 


1 


VSGCPLPRG*RPPPQPQ3PPPSTPGTSD 
HPCTPCR/ PALRR * SP * * LPCLDLPPAS 
G?PSTSGNSTGTMQPT*PKLNSSSS 


9563 


23464 


A 


9632 


201 


1027 


MTLTPAPGQGQQSSQCLWRRPL/PLPPG 
PDSEEPGSGAAG/PGSSL*SPDLPPAAP 
PKP*TSS*EMNTTVGSTQEGSGLAQRTTQ 
HCPVPPST/RVDILGTCLQPFPALCLTS 
APWLVRMPPSQRSSREIP*RLEAF/SLR 
TPIPAHSAOPSHOEKEP / PS PR * DHGD * 
P P P * P PKPTHQTIiAGQAPRSGQGLARPT 
LPYPPSPLRVTPRICRVHV*GHLRRPRG 
PPPSSARRPTWDWRVGLPWEDLEWLCR 
GPAWLLARTSAFFSSLPTHACTGKKTTM 
PF 


9564 


23465 


A 


9633 


103 


379 


YHCITIIVLKCIFFFFFLKPCLILSPRL 
EARG 1 1 LGH * NL CLT \ SQLLRRL I RKS P 
LTLEGGVCSDPRSPHCSPAWPTKGNFFS 
KGKKKILD 


9565 


23466 


A 


9634 


155 


405 


VPAFRSVTNISISVSFFFFETEPGFVTQ 
AEMQGLDIGNGQPPCPKLSSFA*ASQVM 
GTTGPRH/HC*ALIFLVKTGFPHVAKTW 
F 


J J oo 


7^467 




7UJJ 


A 


Hi j 


GAMRGDRCRGRGGRFGSRGGPRGRFRPF 
VPHI PLDF YLCEMAFPRVKPAPDETSFS 
EALLNRNQDLDPNSGEQASI LSLVTKMT 
YE I DNLTVAPGTLD VL I E ERQQGG P Y I K 
GTMTT*RH/ SVADLTVILQILTTWESVA 
S 


9567 


23468 


A 


9637 


2 


492 


GSRAIGFGLAHELERHLIKIWLASIVR* 
DLA* LMNWS G\SKKDLAGKYRP I LEKAI 
QLSGSEQLQAFKAFEESMVNENVSLVIS 
RQMLTDFCTHLPNLPDSTVKEIYHFTLE 
KIQPIVISFEEQVASIRQHFASIYEKEE 
DWRNAAQVLVG I PL * TG QKQYNVD YKLE 
TYLKIARL YLEDDD PVPAY 


9568 


23469 


A 


9638 


1 


417 


WQGLGLVMGSPSRRLQTKPVI TCFKS V 
LLIYTFIFWITGVILLADGIWGKVSLEN 
YFSLLKDKATI VPFI \LI ASGSAI ILFG 
I LLCI FS V * HYPV * LLLVTFL I * LLVFL 
NKWWISPHFLILTFNYLLFDI I IDNSS 


9569 


23470 


A 


9639 


125 


451 


I YRKPLKS IRRVGGLHLNKIFFFFLKQI 
WGWARGLM PC* HFGRLKXGDCLTLG I GD 
/SRG/RYDHDPALQPRQQSKTLSLKINK 
IG WAQWLMPG I PAP WEG * AGRS FGAG I 


9570 


23471 


A 


9640 


670 


927 


GHVLLPRRAGCAQPGTG PARAAPAAVQR 
LPGGGVQPSGPGGRAGQGQGLRGT*AGG 
NPGTGF/ GGPHASALLPPSEAPGSLLPS 
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nucleotide deletion, \=possible 
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QPR ■ 


9571 


23472 


A 


9641 


115 


460 


ISWIFLI ICVGFFGGGGGARVPLCKLSP 
RLECSSAIRARCGFDFPGSSDP*ASAS* 
VAGTT/G/SCHHVQLIF 


9572 


23473 


A 


9642 


1 


431 


AMRAEVLVYCCRGALASGCVQAFRARY\ 
WVL I I VDVPDNEQANAS I IVKLTDS FTE 
QADQLTAEVGKLLGE * KVDAI LCVAGGW 
AWGNAKFKSLFKNCDLMRKQS I WTWT I F 
SHLATMHLKEGGLLTLAVAMAVLDGTPG 
MIGD 


9573 


23474 


A 


9643 


154 


408 


IIMNAQSVEEDSILIIPTPDEEEKILRV 
KL*EDSDGKYGSKIPWNHLPEPENFRLL 
FKHVGYLD*SGLY\EILTKF* ILCVLCY 
Y 


9574 


23475 


A 


9644 


3 


403 


YSQFFGGPKLEDPIRPGVFSQPGEHRGH 
PFQI F* KVGLGNGARPGFPVLGKVKCKD 
NLKLGGQGCSRVEMHPRTSAWAIERDPV 
LKKKKRQR\ ehqcsneqkhtn 


9575 


23476 


A 


9646 


3 


399 


ARAM VL S PAD KTN G KAA WG KVG AHAG E Y 
GAEALERMFLCFLTTKTYFPHFDLSHGS 
AQVKGHG KKGADS LTNAVAHVDDM P S AL 
VALSDLHAHKLEVGPVNCKLLRH* LLVT 
LGG\HFPAWLTPGGIAFLEKF 


9576 


23477 


A 


9647 


22 


407 


APSAWGMCHFTEEDKATITSLWGKV\NV 
EDAGGETLGRLLWYPWTQRFSDSFGNL 
SCASAIMGIPKVKAHGKKALTSLGDAI * 
HLDDL*GTFAQLSELHCDKLHVDPENFK 
LIX3NVLVTVLAIHFGKE 


9577 


23478 


A 


9648 


243 


2 


QIATTTLGPGSEGVGKRDLGSSSRHHSQ 
• VES VKH\ PRDLAQDA*GKGSAPRPLCP P 
PAQSWHCLRGPCLAPHHPTLCEPSR 


9578 


23479 


A 


9649 


283 


196 


SYFI * I ITTTHSLI \ ISI I PLLFFNQIN 
NNLFSCSPTFSSDPIiTTPLLILTT*LLP 
LTIMASQRHLSSEPLSRKKLYLSILISL 
QISLIITFTATELIIFYIFFETTLIPTL 
AIITR*GKPTPLIQ 


9579 


23480 


A 


9650 


10 


354 


QLRVI TRLTWNN* TSA* SRGVS YP*AH 
\SSDALHL*RELTEQRFIVSVNCAIAHC 
RHQSASEDHVKLRNEVTEFAQTWDADES 
AENCDKSLHTLFGDKICTVATLRETYGE 
MAD 


9580 


2348! 


A 


9651 . 


2 


404 


AFGTTKWVT Y I S VLFLCS SAYSRGVLRL 
DAHKSEAAHRFKDLGEENFKALALIAFA 
QYLQQCPF\EDHVDLVNEVTEFAKTCVA 
DESAENWDKI \LHTLFGY* LCAVACFRE 
T\YGETAAC*AKHEPVEDE\CFLR 


9581 


23482 


A 


9652 


83 


396 


NIPGVGGGGVPIPLVWKYNISHTQR*RE 
PYIQSLRSGNWSERMSHMVAR * RQRHHP 
YYl\LLQAQliSMA\SDPANVMMDSVITA 
G VLVA/ S RRARR I S KLGHR I 


9582 


23483 


A 


9653 


67 


954 


REGNHNTERNCKRPPQDTGPTQ/RPGPP 
PPEVPWQDASSAPTPAAAPVGKLLPS*G 
P PGSAS CQGR * TLGSVHWL *GS PLPSPS 
AGGRALPYGPAQGQSQGHWENTGRGTPL 
PGSRET* GHELTTGVEHQRHKAPREPQA 
GQPRLCPWVQSLPPT\ARSPSWSQGERP 


9583 


23484 


A 


9654 


1 


252 


RPPQLLASDSLPPSRPPSFPPSLLLSLR 
PPhSRShChSmhCPPhShlQQhTSFPD 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
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beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














TFSPPKSTTSGPPQPEDSPSS*PPTPSP 
P\PPSLLSSLPPSLSSAPPF/YVHSVSP 
TISAPLYP * YNS * PHFPI PFPPRKVQHL 
ARPS PKTARP PWTIRR I 


9584 


23485 


A 


9655 


1 


394 


SLRIYCFVIEVICFLYLL/FIMYSIS*G 
I*LLMNFTILCIGY*LLQYFFFWVKSP 
YFNF IRIPFI LFLYL I CLLVF * L I F FDL 
S YF/MCLRLI I FNLFY* LFLLYS I CLWN 
MVLLLISKLF* F* FCGRFALIQ 


9585 


23486 


A 


9656 


122 


371 


S P YL LLLQGGVS W I V WLLN V E STNS S FI» 
FPETGSHCAVQGGVQWCGRGSLQS * TPG 
/ SQS * VAGITGMCHHAS +LEKFFFVEMG 


9586 


23487 


A 


9657 


425 


3 


KTSMMNS I VPH I S I PTLNVNGLNVPLKR 
YRIAE* IKIHQPSICCLQETHPTKKDSH 
KLKVKGWE* IFHANG\QGVAILISDK/T 
DFKATTVK/RDKEGHYI 1 1 KGLVQQENV 
TVLWLCAPNTGALKFIKXQLLLDLRNEI 
NANT 


9587 


23488 


A 


9658 


179 


373 


VHSWLLWGSQSFVLFRPLTD*MRLTH/ 1 
MKGNLLESKSTSLSVNLIQKHPHRNIQN 
V/AQNIWVSWNS 


9588 


23489 


A 


9659 


103 


373 


LKHVLRLCLQSERKLHYLFNIDQWGKFY 
TWDDL YEC I S S KFTTH FFFSFL/ FFS F * 
SQGLALSPRLDCSDAIIAHCILEFLGSS 
HAPTSAS 


9589 


23490 


A 


9660 


216 


424 


TDINVTTKTIKPLDENIGITLYDLGLGS 
GFLDMT*T/AQVANEKIG*WDFIRNLKIi 
TCIGHYHEVKKTNPW 


9590 


23491 


A 


9661 


276 


2 


PRIGKLPTNMIEPLIIKEMLINLAMRFH 
LSPVRMAI / 1 KI KKSNRCW*GCREKGI \ 
IYHYWWECKLVQPI»/WKSSLK/FIK*LP 
IDLSFVPAIPLL 


9591 


23492 


A 


9662 


239 


356 


ICIKRRK/WSGTVAHTCNPSSLGGQGGR 
IA*VQEFQPGQQ 


9592 


23493 


A 


9663 


1 


340 


RHELMTPHAFGAMKRVTFI SLLFLFNLA 
YSRGVISRDAHRSEDAHPIKDLV*ETFK 
ALVMIAFAQHLQQRPFEDHV* LLNE\VT 
EFAKTCVAEESAENCDKSLHTLFGDK* C 
TV 


9593 


23494 


A 


9664 


759 


1039 


KRI PFGRPRRVDHLRSGVRDQPGLHDET 
PS\LKIQKLAGQGGICL*SQLLRRLR*E 
NHLNPGSRGCSEPRSCHCTPAWATE*DS 
ISKKKNTKIS 


9594 


23495 


A 


9665 


1 


355 


ITDLYSMFHFG+VDLATITSLWGKGNEQ 
KAGR* TLERLLDVYPWTHRFFDSFGNLT 
SASAIMGNPIVKAHGKKVLTSLGDAIMH 
LDD/LKGTFAQLSELHCDTLHVDPENFH 
ENPCDGS 


9595 


23496 


A 


9666 


364 ! 


3 


APNIPPSVTRIPALLAFRSQCLCGSCPL 
PLATE PVGLLLSMS FQAAPCFTWWVTEA 
WLTG ISWPPDFLI SLLCI / HINYNAVI F 
RKYRPRVGAVAH I CNPSTLGG *GGWI A* 
AQEFETSLGN 


9596 


23497 


A 


9667 


1 


357 


PISNAMRHtXSQEETATITSLWGKVNrVED 
AG*ETLGRLLVI YPWTHRLFDSFGNLS S 
ASAIMGWP*V\KAHVKKALTSLGDAIKH 
LDDLKGTFAQL I ELHCDKLFVDPENFKL 
LGNVLAT 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 

I 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Pheny!aIanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K= Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possibie 
nucleotide deletion, \=possible 
nucleotide insertion 


9597 


23498 


A 


9668 


24 


369 


APRPDAMGHFTEEDKATI TSLWGKGNVE 
DAGGETLARLLGVY PWTQRLFDS FGNLS 
SASAIMGNPKVKAHGKKVLTSLGDAI *H 
LDDLKGTFAQ\LSELH*DKLHVDPENFK 
LLGK 


9598 


23499 


A 


9669 


12 


365 


LLLMDGERK* FLETDSAPCEDAMNTVEM 
TTKNLE*SINLVDKAVA/ SGFERIGFNF 
EKSSTLGKMPSNS I ACCRE I FHERKS/S 
MWQTSFLSYFKKLRQA/PPAATTLNDHQ 
PSALRQDPSP 


9599 


23500 


A 


9670 


346 


2 


YPYDPWSLINNMFFLFKRFQNFFLKEGT 
LTFPFKGFRGDLSLFFWPPYKLFNLKSF 
L/CKFLE I * R YF FG * 1 1 FFFPFFFFFFL 
RQSLTLSPRLECNGTISAHCNLCIiPD*S 
NSP 


9600 


23501 


A 


9671 


121 


364 


HKKKTAGRVQW P LLV I PAL W E AKE AD CL> 
SSGVQGQPGQHSKTPSLP* PSKSWDYRC 
TP PHLAS F CNF FS F F FGRDG VLL CCPG W 
P * TPELKQSA/ CLGLPKCWDY /RARATA 
PGLLFF\ FMLKYVLVKPAVN I ANAN* KE 
LSEPDVKNEICFV 


9601 


23502 


A 


9672 


437 


223 


EME / FSLLiLPGLECNGTCLAH *NLRLLG 
S SD FPASASPVAG I TGVRLHAQLELYFL 
NLLGFFFI ISLWYLS 


9602 


23503 


A 


9673 


100 


338 


KSRPSAVANLTPVI PALWEAEAGGSPEL 
KQFSLSLLSSRDHRFATPC/RG*FFIFF 
VEMEFHCVAQCGFELLDSS S PPTSA 


9603 


23504 


A 


9674 


161 


1 


S FLWKLCLRGAP S CMRCQLA\ LL * DVSQ 
LGY/SGVRDPLEEAVCPFSDLKLRAGR 


9604 


23505 


A 


9675 


304 


1 


VP PLAS QLGDKR R I F FQ KK KK KKKNKDK 
IHIIISIXLKKFDKI*YSLIIK\TL*KL 
GME*TYLNIIKVIYDRPTASIILSGEKL 
KSFPLKSGR*QECPLL» 


9605 


23506 


A 


9676 


111 


346 


SKEGSKEGSLEIWGFLiVLAFPFPFPFPF 
SFFFF\ *RQAGRPRLECRSMITAHCNLE 
LLGSSDP PAS AS \QVAGTTGPTS P 


9606 


23507 


A 


9677 


3 


349 


ARAGRI IKDLWLGFFFFFFFFYPKKTPL 
FTRDKGGGKLQTAKKNNPHPVKREKKKP 
PPGGTRGGKNPPGNPFWGFGPQRFFFGG 
KTFGGGAHPPDPTLG/PSPQS*KRPFGK 
RGPPP 


9607 


23508 


A 


9678 


38 


368 


QASLLKESESERKSPTSLTLSQKLGMIR 
LCEEGMSKAKAD*RLGLLH/ qvsqlvnt 
KEKLFKEIKSAISVNTIiMIRK/RNSLTA 
DMDKVWAWIEDLSSQNNPLSQSLIHSK 
A 


9608 


23509 


A 


9679 


187 


345 


PFILFFETESYSVA*AEMQWCDLGSLQT 
PPPG\SSNPPVSALWEARAGGSRSQ 


9609 


23510 


A 


9680 


192 


468 


ERTSLFTIHLLKLLTITNSAAMSNHV*V 
' FV* T* I FLSLGQMPKS VTAGSNGQSLFR 
FFK\NCQTLFHGGCAMLH/SHTQLNAIP 
LFRNLWQHLS 


9610 


23511 


A 


9681 


141 


367 


GS WAAS PCS CTMDDFAKE S FT WD YVLL 
ENCPNMGD YWAPQ FMTDNYVRVTQLN W 
DGVGTQ* KDYI /SSERKL 


9611 


23512 


A 


9682 j 


193 


3 


IFGERR* I LKI YKLTGHVGLRL * S Q I LR 
RI /RIiNLGGGGCSEPRSCHCTPWJATER 
DSISEKKEK 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S^Serine, 
T=Tbreonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


9612 


23513 


A 


9683 


1900 


1988 


NLVHFEEHFPSLFTPSPSLCHIF/CSRD 
RVSPY*PGWSRSPDL/RYLP/CLDLPKC * 
WD YRRE S P CPAS S R FLMNFLY F * NTTLW 
Q* KT I PLLF* SGDC / S* FF* YNFLAHTS 
STMVNSGI SRNSRPVFKLADKAPSHS PL 
SIIAFGVEFF/NLGSLQPQPLKLK*ASH 
LNCLSSWDNRHAPPCPDNFFFNIL*RWG 
FPILSRMVLNSW/PLCSLSTSASQSSG/ 
ITGVSHCAGPILVF 


9613 


23514 


A 


9684 


69 


358 


ILIVKVFFPHTQKPNFVLRLSLQSIQQM 
LLPERKWTHAQKTC VS SENTERCHEKQV 
ITSHWLGMVAHSCNPSTLGGGGGWIT*G 
QVFET/SLANTVKP 


9614 


23515 


A 


9685 


66 


344 


IGE KLLLTDKKRNW VLETE * APGKDS VN 
I DEMT / TR / DLE Y Y I N \TDKAVTG FERT 
DFNFKSS I RGQSL* YSI I / CAKGSQQKQ 
QISLVPYFKKLPQ 


9615 


23516 


A 


9686 


269 


1 


PKIQNGHCKTLPKNPSQSDTKTFPLFTS 
GSSPQPHKLSFCFSAGPTLPSKTQLKIH 
L\WQGTVPHAYNPSTLGSRGGR±T*GQE 
FKTIQ 


9616 


23517 


A 


9687 


229 


361 


DRYYYKSHFKPGAEAHAC*LVIiASFFMC 
YRAIMINK*HLYHRKL/VYQVIRH*KYR 
D / 1 * THVTLTTTL * DRYYYKS H FKPGAE 
AHACNPSNLGG*GG* I I *GQEFKTSLAN 
MA 


9617 


23518 


A 


9688 


318 


46 


KLYALNDMASNTCKEYDNLIQYYRYKIY 
KRLIQHD*VEFIPEME/NWFNI*/RKSI 
NVIKY IHSLWEEKNYMI ISLDVADVSDK 
IKLTFLIF 


9618 


23519 


A 


9689 


168 


3 


GNSNTMIFFTILYLFFLYFF/LFFIFDM 
KSHSVARLECSGVILAHCNV*LPGSSNS 
P 


9619 


23520 


A 


9690 


192 


3 


WHKASLSNPQPAGCMQPRMALHEAQHTF 
VNFLKTLWAG\ P VAHAYNPNTLRGRGGR 
IT*GQEFK 


9620 


23521 


A 


9691 


388 


1 


CRSAGVCWRSTPDPVCLGITSSGCRTAE 
I VACS FLWKIiHSRGAPARCQPELSCMRC 
\R*TLL*GVSQSGGTGVRDPLKEAVCPL 
AELENGARRSTALFRASRQDSLSLLKLR 
PQLPLLPGAIiSQ I EG I 


9621 


23522 


A 


9692 


226 


3 


WFLLPQVKLAIMTPRLYFPNLPFFFFF* 
DRV/ CTVAQAGVQWRDLS SLLLS LLGSS 
SS PTSASRLTGTI I LQHHAE 


9622 


23523 


A 


9693 


305 


1 


NHAITTVN* FGLIRHLVTKAAFNSGEVD 
I V/S I KDPLI ALNYTVYMFHHDSTHGKF 
RGTVK 


9623 


23524 


A 


9694 


320 


1 


PRDP PASASQSAG ITGI SHQAWPDMILY 
IENPKDSSKNPLGLINKYSKVAGYKINT 
QKSAAFL * TNN YLKN* P/ MRT J PFT I AT 
KKKYLETYLTMEVKDLYTENYKM 


9624 


23525 


A 


9695 


263 


I 


VRILLFSESLALSPRLECSGVISAHCTL 
CLiPGSRGAIjSLjLSRVDY/Y*VFLVKAGF 
RHV/GQAGIiEFLSSGDPTVPSLPDFWDY 
RCKPR 


9625 


23526 


A 


9696 


262 


359 


IRKIHKIWSGWAHTCNPSTLGGR*TKA 
HEGSLYCLRVGKI VSNKVGTRS * FFRTQ 
K*V/KLFFS*VYHINGSH*SRTSL* irk 
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SEQ ID 
NO: of 
nucleotide 
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NO: of 
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sequence 
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beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AJanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














IHKIWSGWAKTCNPSTLGGRVGWIT*G 
Q 


9626 


23527 


A 


9697 


166 


473 


TQDTSQPWKNIWISFGSLKVFENVDIHK 
L. YLE LDNRHRS WMTD I LLF FY F * DGS FT 
VT \ RLE CNGA I LAHCNLHLPGS SNSPAS 
AS * VAE VGGTLE PRRLRLQ 


9627 


2352S 


A 


9698 


456 


59 


LFLWKLHLRGAPGYVRCRLA\LLGGASQ 
LGYSGVRDPLEEAVCPFSDLKLHAGRTT 
TLFKAVRYGHLCLQKFLLPFVRLCPAPT 
GGVFTVRQASMSCDGLLIV* \VSERH*L 
PKPQQFWELVPKFVRPGLKSL 


9628 


23529 


A 


9699 


4 


375 


VFCS S VTI VSE I AFLMWLSAQLLLVYRN 
ASDFCMSSLYPETSLTKVTQGQAL/H*N 
G * LT * * EQ YWVKTDQKAVI / CQQSTGS S 
GQSS QARE I KGPGAVAHTCNPS TLGGRG 
RWIT*GQEFETSLA 


9629 


23530 


A 


9700 


239 


361 


RNDTFFFFETESNSVA\RLQCTGVTLAH 
*NLCLPGSSNSPA 


9630 

7UJU 




A 


9701 


169 


1 


I FYWKAI FFETESHS VAYAGVQ * SNLGS 
LQLP S S \GSSDS P I S ASQEAGTTDAHH 


9631 


23532 


A 


9702 


37 


34S 


HAKNFDVSLLPYTKTHEHFVKEDIWIAN 
KHVKRCSATLVTKEI PPQS /TLKWLKH* 
PPC*QGNGRTGSPI P /C/WWECKIVQLW 
KIV*PFPKKLSIYL/PY/DPPILLLGI 


9632 


23533 


A 


9703 


16 


339 


ARLKTFAMAAKC*SERKS/RN/SLTLNH 
KLQM I KL I EEGMLKAVNGQKLRLLYQ / T 

sqfanakdeujMD r i satpatv* mig * q 

NSLIADLEKVRW*LDDQ/TSHNIHLSH 


9633 


23534 


A 


9704 


276 


2 


GISRGGLSKPPFFP*DFPLDYLGQSPFC 
KVKRNS /RFRTTKS PLDFP IPPLFFFFF 
EKVSQAI S AHCKLHLPG* CHS PASASRV 
AGTTGTRH 


9634 


23535 


A 


9705 


352 


2 


CNSR S DFFQLMMNCFLRWAVS PRVECS / 
GVITAHCNLNLPDSVDPPTSTS * VAGTT 
GHTP\HTWLI 1 1 1 1 I VAGIFLFLFFLFS 
FFGRDGVSPCCPSWSQTSEFRQSACLSL 
PKCWDY 


9635 


23536 


A 


9706 


6 


351 


IWNSRPRRPLGGQGGRIQDWPKLHGKTR 
S L/LKKIiQICLARRGRAS LWSQVLKRLRL 
TWVDCLSLGG*GCSQP*SH/IHCSPAW 


9636 


23537 


A 


9707 


76 


359 


CFLFVCLDFVIFIiAYFCN*YLFPFLISH 
TF FF FLKKGLI FAPGVNLRDQNKFYFTL 
FDVFFNI Y YC* I FFI F/CLVLLFF I FFF 
FF YCFVMRF I VL 


9637 


23538 


A 


9708 


342 


3 


GEELLLMDEQSKWFLE \TPGENATNI VD 
LTTKDLE YS I S WDKAA^GFER I GSN* N 
STLELKTVN/SVPHAR* IFCKRLQSVWQ 
TS LLP I LRN \LP * S PHPLVS AML 1 1 QQS 
STLK 




23539 


A 


9709 




350 


RQDIALLPRLE*SGTITTH/CDPQLSGS 
SEP PTSASLVAGTTGACHYV 


9639 


23540 


A 


9710 


253 


3 


VKCFSREHMLIRRSAIGGKKHIEKGTLI 
MSFWWYVSQSEH/NLCNHYAY\SHIKK 
WLGTVADVYNPSTLGGQGGRIT*GQEF 
E 


9640 


23541 


A 


9711 


80 


324 


LITLTMASMCSNERKSHTSLTSNQKLEM 
SKPTEEGMLKAKTS * KLGLFHQT/SQLV 
KTKKKFLEE I KSTT PLMAPLMRK* NS L 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 
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SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A~A!anine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H-Hist id ine, I=.Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 


9641 


23542 


A 


9712 


246 


3 


QLAXHGLLFFLFFEMGSH* LRLECSL\ T 
ALLSKISAHCNLNLiPGP\SDPPTSASQV 
AGTTGACLLAQLI FKNF WETGSHYVA 


9642 


23543 


A 


9713 


2 


359 


AAASTTTKEQDMLKFYFLFYFFETESHP 
VTQVGVQWCNLCSLQPPPP\GPSDSPAW 
VTEQDSVSKE* KRKKLIKINVYLQVTTH 
P*KSPQHIVRVLSVLN*LNEKINRRVTK 
SSSKEKDR 


9643 


23544 


A 


9714 


278 


53 


KLFRGLNNHRKFQTRCGGLFFFLFIYF* 
DGA/YALSLLECSGTIMAHHSLDILGSS 
HPP I SAS LAA\G WRTCS PS C 


9644 


23545 


A 


9715 


166 


402 


TLSLKKNENISRAWWHVPVVSGVWKAEA 
R*SLVPRSLRL\HCTPAWA 


9645 


23546 


A 


9716 


16$ 


1 


VDCI IMRRSIAKSPRRKCRGTISARCKL 
RLSG\SGHSPASAS *EAGVTGVGHDPR 


9646 


23547 


A 


9717 


] 


553 


TS I RLFFLLS I FFFSRARNR I LFKI YCN 
GVRAKGRGGGWGTCRGHRNTMTWPGPQL 
LLSWGRGMKRQDQGWSWGGRGEGDTGCI 
PPPPGSTSRPWTPRSPWPLRLHLFSASG 
PPGSSSCSP*LFPLTDSQVGRCSGPDPN 
PPYHLMKVQPLPALLLPLLKPHRFPSLG 
Q\PDPPPSSPPSP 


9647 


23548 


A 


9718 


13 


364 


PPNR*RNPLIELTNHSLIYLPTPS\GIS 
A**NFGSLLGACLILQITTGLFIAMHYS 
P I RLQLAFSS I AHI TRDVNYG * IIRYLH 
ANG AS I FFI CL FLH I GRGLY YGS FLYSQ 
T*NIGI ILLLAT 


9648 


23549 


A 


9719 


82 


351 


GEALEPELPMPTUjSH/HAATLIiAPLPTT 
PFVGTKVTI VGQAQ W /HQAL * SQHFER P 
RRVDYLHLFSRNGQNPISTKNTRISWAW 
WRVPVIPA 


9649 


23550 


A 


9720 


179 


365 


FSLFLASLKAQIIDSKPFFSNTIiENI*K 
YEVINFFLYI \ I KLHFMNLTSLCFYCHS 
VCTILL 


9650 


23551 


A 


9721 


2 


320 


PIPPGNESSPPTAQEDMQAANKH/IRRY 
STSLAIRET/QL/KTTTREPYAPITMAT 
TETSANTTCWREYGESGSYCWVGCKTLR 
PLCKTVWQFLKKPSM*LAYDPAITLF 


9651 


23552 


A 


9722 


1 


344 


PLPTQNWPQGQSFLHYPVLLPAVSVLWS 
SFFFFFWXQILLLP?RLEG/NGQNSG*W 
KFPLPGPSLFSCPSFQTSGNYGPPQQAR 
AI FWKFKI KTGFHGVTREGLNFLTSGSA 
?LGS 


9652 


23553 


A 


9723 


156 


3 


VTEAYFI LFYFETESHSV* WHDLGSLKS 
P PPG \ S SNS PAS AS Q VAGTTGALH 


9653 


23554 


A 


9724 


76 


337 


YLSIYLSIYLAIYVSIIYLPIYHPSISI 
YVSSIYS INHLPLYYFSTHLSSTHSSIY 
LLS ISILS INYLLLS I SLS I YLS I * LS I 
YDSS IHLS IHLSHLYI YLI * YLS I YLS I 
YLAI YVS I \TI YLS I IHLSLSMYHLF IL 
S I IYLCI I FLPI Y/LSTHSS I YLLSIS I 
LSINY/HTSIYLSIYLFVY/HLYIYHPT 


9654 


23555 


A 


9725 


249 


436 


ELAG* IQDHP/GQHHKTLSLQQIKNLAR 
RGGRHL* SELFRLRQEDHLTPGVRGCSE 
L*SHHSTP 


9655 


23556 


A 


9726 


177 


3 


HLN**FSNLIFFETESPSVT\RGIISAH 
RNPRLPGSSDSPTSASRVAGTTDT 


9656 


23557 


A 


9727 


272 


233 


RKNQRI YQ IAR KRLNEMAR I S PLRS Mil 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Pbenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine 5 
N=Asparagine, P^Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \-possible 
nucleotide insertion 














LNVS GLNF P LKR CRLAE WT E KKK \ D P 1 1 
CCL/QKTHFTG/KDIYRLKIKGWKKIFH 
ANGSQ*Q 


9657 


23558 


A 


9728 


368 


380 


RKGQRISEIAIKRLNKMARISPLISIII 
LNVSGLNFPLKRYRRAEWTEKKK\DPI I 
CCL/QKTHFTG/KDIYRLKIKGWKKIFH 
TNGSQ*RR 


9658 


23559 


A 


9729 


294 


2 


KGNLSPKKKKNLKNFKVTPQNFGLKDQK 
TQGKPTLKVSQNQKKPKFK*NLKGKRPL 
KLFPKKKKLNPSRSWFYBKKKK\INKLD 
GTLVRQINKKRKNQ 


9659 


23560 


A 


9730 


7 


284 


SQGLGRLRRENYL* FETESHSVARTGVQ 
WRNLS S LQS PPPG\ SSNS PASASQVPV I 
T 


9660 


23561 


A 


9731 


1 


322 


RSLKKKKRREKKGKNQPGQHGKTLFLPK 
TPKI*KIKKIYPG/RGGTHLNSQLLRNL 
RKENHLNSGGKGCNDP KLGHC I PAQMTK 
TP FKKKGGAKQNGKKLLQL I YG I K 


9661 


23562 


A 


9732 


168 


2 


GFYRKSLSCLSVWICFKDFFFFFEMVSR 
SVA*AGVQWHDLGSMRPPPPG\SGDCPA 


9662 


23563 


A 


9733 


43 


317 


LKLQPRDTHCIWLSTTSPFIYFFFETES 
CSVAQA*MQWHDLGSLQPPSPG\SGDSP 
ASASCTGMHHHTWLMKCINSFDVPDMPL 
GLGLNSHI 


9663 


23564 


A 


9734 


8 


296 


NWSACSLNGLVKGRLSCLTLNQIL*MI 
TLTTERFLKAERGQKLGLLHQTA/QWN 
/AKE I FLNEVS SATPVYEG 1 1 RKLNSL I 
ATMEKAC WW I EDQT 


9664 


23565 


A 


9735 


3 


300 


TRPSNSPAALSKCSSRRKT/R/TCLTLN 
QKLEM I KLSKEG I LKAERGQKLGLLHQT 
A/QWN/AKEKFLKEVRSATPVYEGI IR 
KQNSLIANMEKA+ WW IEDQ 


9665 


23566 


A 


9736 


352 


1 


PFFYLLNFPTFFKGPPLIHFFLESFLEN 
FGFQGAFFKLQNFFAFLFSLLNGPFLEN 
PFYFKFPINC\KFLGQNTFYCPYLGFF* 
RSFFFFFFFFETESRPVA\RLECSGAI S 
AHCKLG 


9666 


23567 


A 


9737 


130 


3 


KIFFFFFFFFLRQSLSVAQAGVQWC/N* 
LGSLQPPPAGLKRFLV 


9667 


23568 


A 


9738 


167 


3 


ISRSVFCFFFETESCSVTQV*EQWRNIG 
SRRPQPP\GSSNYCASACRVARFTGLV 


9668 


23569 


A 


9739 


140 


419 


INVKYLSRHFFKGNIQSPNKHRKAIREI 
QI S \ * TDNNKCW * G CGEVQYF I HCCMCE 
MVHSLWKIVW*LLTGL/HIELSDDPAIL 
LGG I PPEEMKT 


9669 


23570 


A 


9740 


93 


371 


KLCNGYYQESKREPTERN* IFANHKTKG 
CI S KT Y KE LQLNNKNS / NPTQKWAKBLD 
I S PQTMKQTAKNQ VKRCSTS LVI REMQ I 
YTKSIPLHIH 


9670 


23571 


A 


9741 


1 


288 


GTRDHLRSGVRDQ \ PGQHGETRSLLKI Q 
N*GCGHGGSCLSSQ\LTWRLRQVNCLS 


9671 


23572 


A 


9742 


296 ! 


563 


FI KHLNS YMLGQNLATKLSE I CKLPFY I 
EK* IQAGYGGTCL* SQLLRRLRHDNCLN 
LGGGGCSEPKI\HCCTPAWATDGGSVSK 
KKKSRR 


9672 


23573 


A 


9743 


32 


400 


DALVPHWSSYSPICLDLNVLGIYLFSDR 
VSLLVPRLECSDAI r4AQHS LDLPRLR * S 
SHLSLPRNW LM FV I LVETG FCKVA * DG R 
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Amino acid sequence (A-Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L^Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Giutamine, R=Arginine, S^Serine, 
T=Threonine, V=Valine, 
VV=TryptophaD, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KLLGSSNPPALA/ FLKC/ WRjNMRKSQL* 
PLLLGLLS ADHS D 


9673 


23574 


A 


9744 


60 


356 


HSEVDSI I IPILQGRKQISEVRPQRMCE 
REGGLFISETLLFLCPAPSCS/HFLTPA 
Q* S Y F * KNVS WLGAVGDTCNPSTLGGRG 
GWI\RGQEIKTILANMV 


9674 


23575 


A 


9745 


2 


446 


PMVNKHVKRCSASLVIRAEQIKTIMRSH 
CTPNS *M* KTDNTKSW/*GCGATGTPI H 
C/WMKYKMVQPLWKKAQQFL\KHIHLPY 
DPINLLLGIS*EK+NHLC/YKKTYVRI F 
IAALF 


9675 


23576 


A 


9746 


257 


1 


VWLRLSFPSFLLPILSPSSQGDLVCCFL 
MVF*RDGVSLPIPAGVQWLDRSSL*P/P 
SPGS S S PPASAS * VAGTTGMHLHAEP YA 

sc 


9676 


23577 


A 


9747 


148 


3 


I KLCEQ FHK* TI HVLNHK / L / W PGMVAH 
AYNP S TLGGRGE W IT* GQE FDS C 


9677 


23578 


A 


9748 


769 


I 


YPQLFKIAKKKKKSNVPIKS IMVSQYGH 
* ENKKWVLNHKGATLQ/EGKGNFGDTFK 
VTLKDKIAVAVKTRQERLPQELKLKFLL 
EAKI LKQYNHPSNVKSIGVHTQRRPI YV 
I KELVPGGDFLS FQRKKNELKL \VKFSL 
DADSGWCI SSKKCIHRDLAVRNCL/VGE 
NNVLKISDFGMSRQEDGGVYSSSDLKQI 
P I KWTAPE ALNYGR YHSES DARS FG I LL 
WETFGLG\ VCP * PGMTNQQPQEQVERRY 
WMSVSWQC 


9678 


23579 


A 


9749 


607 


986 


SFSSSSSPEFISYGCDLLPMRDPHVFIT 
HPAFNLPLFLALGMFFSLFSPSPLFIYF 
LPPPPRPPC*HSPP\PPPPLTNSLFVFL 
PPPPGVPSSPPNTSPPHLPVPRPPPSPP 
TSTHLSSPPPPSPTH 


9679 


23580 


A 


9750 


313 


339 


AR*GLSCSG* S * TPGFMQSS CLNLPKC W 
DYRC/RATVPSLCFVF 


9680 


23581 


A 


9751 


213 


453 


NLAVNVDE INLFFFPLiFL I VTLAS KFCS 
/HLLVGR* FIQSCHYFLFFIFDTESRSV 
T\RAEMSATVLAHYWLCLPGSPDSSA 


9681 


23582 


A 


9752 


368 


3 


PLSSPRLPFPLFDPSYIFLISISSIRFL 
SPPCYFLCVFLSISWGFSLVSQDGSDLL 
SS*SAFLGLPKCWDYRL+PPHPAWRELS 
FFFS / S FFYLR * GS LS PSLECSGMI IGH 
CSLDILVSSARA 


9682 


23583 


A 


9753 


2 


335 


ARAGFSGGDWNVGMQSTASGLSLFHCFI 
S*GELFIILYILLKCYITL* IN*LMFGS 
FYFRDLQHFRLP / TS I V3KKRPGTVAHA 
YNPNTLRGRGGWITQGQEFETTLANMVK 


9683 


23584 


A 


9754 


467 


120 


SQLFGRPRQTDHLKSGV*DQPG \QHCET 
PSPLKIQKLAGRGWV/ CAPIGPRLLGRP 
R* ENGLNSGGEVCSEPRSRHCTPAWVRE 
RDCUCNONKTKS G VLFHSPHQ CFLI SNT 
SRNLL 


9684 


23585 


A 


9755 


137 


368 


DGVYLWTHRPYCGLGSIiNFGSVI IVLP * 
VKAYGWMVLTSLGDAIQPLADPECSF\G 
QLRELRCDMLHVDPEDFRLLGK 


9685 


23586 


A 


9756 


172 


267 


SKSGNKPNDYHLMNGSVRWCIRTV*QLL 
KRLTTELPYGPAVAFLGVYAREMKAYVK 
MKTYSLQKMCT* I FIAALFG I VKKWK+ A 
K* LS PDEW I S K/ M / WCIRTVE Y YLT I RE 
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NO: of 
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NO: in 
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beginning 

nucleotide 

location 

correspond 

ing to first 

amino arid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Iso!eucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, I*=Proline, 
0=GIutamine Tfc=Arcnninp S=Serine 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


9686 


23587 


A 


9757 


56 


318 


QFWGFSFFFFFFCFIDTETCSVSQAGVQ 
WHNQSWLTA\ P * SDPPAS \AS*VAET IG 
MCHHDQLIFFFFKQALCPRGWSAMAQSQ 
LTSSS 


9687 


23588 


A 


9758 


179 


1 


KDTDRVSRGNTQDTLKPPPPG\SSDSPT 
SAS * VARI IGVHHNTWLI FCILVEMRFH 
PRA 


9688 


23589 


A 


9759 


219 


208 


NFCNFQMPLYQKILFWVRVSLLSPRIiEC 
SGTIITYCSL/RTPKLK*SS*IjSLGVQG 
CSKL 


9689 


23590 


A 


9760 


133 


371 


AMAYQLYRNTTLGNSLQESLDELIQ\LR 
NRVNFRGSLNTYR FCDNVRT FVLNDVE F 
REVTELI * VDKVKI VSL\DGKQTGF 


9690 


23591 


A 


9761 


813 


3 


CGEG WAAGQDQGSRMGS CL P VPRAGRQL 
GSLFPAFQVYCSSCGTQRSSQSQWHKTE 
RSLPGNNPAPQDS / ASAPKYQTPGPVLG 
VQSPLNQQSCHTALSPEASSSAFFHTTP 
ATLLHQARTLS LTQGSRPTTW PVLLKMP 
A*RPRESRH\GR* EWTKRSQARSQEAIN 
ARPRKALGLSDPAPSLEPGSLEDSPKFS 
LGAPKAPPLPASSPTAQETQQSHISQQP 
TA TG FLGARLC P S POH I CORGG FPG VTP 
LLLLGEWGVGEHSFPDVLV 


9691 


23592 


A 


9762 


96 


377 


RYKTNMAAQI PESDQ I KQTGFHHVGQAG 
LELLTS \ FKE FLGT YNKLTET * FLDCAK 
DFTTRERKSEETTLSEHCLPKYSTMT\H 
RISLT\FTDNHI 


7V7i 




A 


9763 


164 


399 


TDEELLLRDEQ / RKWFLKI ESTAGADAV 
NIVEMTTGDLE* CINLVDKVAARLERTE 
TNFE/RCSTVRQ/MLSKSIACCTE 


9693 


23594 


A 


9764 


225 


25 


KCKITMNST* IE / TVFKDY YEHLYVCKL 
KNLEKTDKVLTIHNFLRLNQ + EPEVLTR 
AKMYKIYNDIE 


9694 


23595 


A 


9765 


128 


218 


FFKDLPDCFPKWSHCFTLLLAVYEGFTF 
ST*LVIREMQIKTP\I*YHFSPNRMVIM 
KKSNINKCWQGRGESKALIHC* *ECKTV 
* PLWKTVWQ I LKKLI IG 


9695 


23596 


A 


9766 


194 


551 


KNFFFLEMEF/SVIjLPRLECNGVISAHR 
RLRLPLSS YS PASSSQVAGDYRACTTTA 
G*ILYF* *RQGFHHVGQAGLELPTSGDP 
PAS ASQSAG I TGVSHCPQLKKS I LHETP 
KGLTGVTS 


9696 


23597 


A 


9767 


238 


3 


SFLWKLCPRGAPACLRCLSA\LLGGVS * 
SGYMGVRDPLEEAVCPFSELEHHAERTT 
ALFRAVPQGCLS LQKLSAAFCSC 


9697 


23598 


A 


9768 


343 


1 


RGAHHSRGDCVRFTGFCAPI PVLNHPKP 
LFPFFV*KSPIERSSLLGPPLKRSKNRF 
PFLR / VPRFFKNHPEFFFKKI FLFPP FF 
FFFEMESHSLA\RLESHGAILAHCNLSL 
PGSC 


9698 


23599 


A 


9769 


191 


2 


FRINH*TSH/LRGQKKKKKTNSKAIGRK 
QITKIRA*LNETQPPESIQRINET*S*F 
SGKIKLI 


9699 


23600 


A 


9770 


165 


2 


GKPPNCFYILNFKKKKGQDVFNRT/WLG 
WAKACNASTLGSQGGWIT*AQECKTS 


9700 


23601 


A 


9771 


97 


912 


VILSTGCSSGPLPGWQSVLHHSLTRCPF 
S FFLS S PTPEME / FSLLLTRLECNGE I S 
A HCNLLLTjGS SNS PASASGVAGMTG I / C 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=GIycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, MNMethionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine t V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possib!e 
nucleotide deletion, \=possible 
nucleotide insertion 














GLHAC*AS*FLYFHSVETGFHHVGSGLG 
LELLTSDD\RPASASQGAGITGVTPRV* 
PLNSFSRHGLSCFFKETRVLSCCPSWT* 
TPGLK* SSCLNS * E / C *GYRHTPAHPAQ 
DMVFI FI TY / CPFFVFPLL / VYIPRFLF 
LRWS FTLI VQAGVQWRNLGSLQPPPG\H 
KQSETPSQI * INKY*LLN*LITFFFS IK 


9701 


23602 


A 


9772 


3 


363 


HELRSSANEHVFAS FIGPTI LGLPGVLL 
IILYPPLLIRTS IYLISN\RL\VTTQQ* 
LIKLTSKLMITIHNSIGRS*SLILGSLI 
rilATTNLLGLLPYSFTPTTQLSINIiAM 
AIPL*AGAV 


9702 


23603 . 


A 


9773 


38 


385 


YLI I LDS PEKGL I CGCLCNCMHSSLLDH 
LNLT*SS*MCLYANIHIYVAISCIYFI* ' 
F*SI/YIFII*YFHYSIAYVYIY/lYIY 
M YTHAH I HMC I 


9703 


23604 


A 


9774 


13 


1350 


DRVSLLLLRLECNGS ISAHRNLGLLGSS 
DYPASASQVAGTIGVHHHTRPTFALFLV 
ETGFHHVGQAGLELPTSGDPPPL\ASQS 
AGITGVSHCTWPHLSTTGKILSSGTICP 
GIWGQRWWAFLIL\SLPWL*LWHPCLSS 
IMCLSLFFFLCQ/RCKPLISD 


9704 


23605 


A 


9775 


376 


1 


KYFELMYIPVIAICLFNSEVFLNK*QFF 
NLTSSSG IQNLNRLI ISNEVES 1 1 KSLP 
TVKNLGPDGIAATF\TYKEQLTLILKLF 
QKHEEARILSNLTSETSITLILRKQQPK 
KVPNEHTVKILV 


9705 


23606 


A 


9776 


192 


2 


NVYQPPPVFLYPPGVAGGV*PF*SPPVF 
RVK/LGF*GSLFFFFFFFETKSRSVTRL 
ECSGVILA 


9706 


23607 


A 


9777 


77 


350 


WLRKFGSRNSGKLTLYLNVYYLQTSFFF 
FFFKIGV*LSPTLECRGPIWVN*NLCFP 
G*RDFPTLTS*VL\GPRVILEFGFLEKT 
GFSHVPQL 


9707 ^ 


23608 


A 


9778 


133 


325 


PRWVRFPYLGDCTPRVSEMLGLQGWPS 
SSPFFVCFET*SCSVAQAGVQ/WWCDLG 
SLQPLPLG 


9708 


23609 


A 


9779 


145 


362 


RLAFGS CS TCF P LT * TVLY YLI* FNI F F 
VFEM ES LS VS \RLE CS GAI S AHCNLCT P 
AWTTERVDAAANSRSMN 


9709 


23610 


A 


9780 


260 


1 


QKQNQEKKRKKI FRNIKHQRDYNEQ* YG 
/NKLDNPEDMNKF* ETRYT* /PNLHQEE 
IGMLNRPVTNKVL*LVIKIPLTKKSPEP 
DGFV 


9710 


23611 


A 


9781 


230 


1 


SSDSKTGS S WLVACRFFFFLETGSGSV 
TQAEVQWHDHSSLQPPPPQPPK* / PGTT 
G A* S TLAS + NAE I TG VS HHAR 


9711 


23612 


A 


9782 


20 


322 


SQHFGNPKGNPKIFRNLEINFFRLRNAD 
HMI ISTDAQKVFDKI QHLFI I * TS * QT * 
TERI FLNL I KAS * KKPTANI I LNG * RLN 
I\FPKTGETKHIC 


9712 


23613 


A 


9783 


158 


379 


LVKIICNSFQYISFFFFFFLETKFCFFP 
QVBVQGGNFG*LNLPLPGLNRFS\ASAS 
RE PE I PGPPHHPG* ILVF 


9713 


23614 


A 


9785 


54 


312 


KVDKSTKIGRNQSKMAKNSKNQNASSPP 
KDHNS S PAR KQNW TENELDELTE VG FRR 
WVITNSSKLKEHV* PN/GKETKNLEKRL 
DQ 
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peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D^Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine T 
K=Lysine, L=Leucine,M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


9714 


23615 


A 


9786 


13 


613 


PGAGWARQHHGV* DQ PGQHG\KRPSLLK 
I EKLAGHGGRCLLS PAIGGGLKQGESLE 
TGGRDCSEPRLCHC\MPAWGNRNKTLSQ 
/ QNKTR 


9715 

i 


23616 


A 


9787 


794 


164 


FSFLFCFFFLRWSLTLSPSLECSCVTSP 
HCNLQLPGLSHS PATVS* VAGTIGTCHY 
TKLS* FF\FDDSL*APFLGVVDTETYYP 
DPPSSKGLLPHSGQHGQQHPPAISRAWL 
SC*VLPHLRS/STSI/3QP/RIW*IiSKVE 
V*RGFTFSPTQNTPTNNNCS IAPGRVD+ 
SLLID*LQWCDHSSLQPQTPVLK\HPPT 
LASQS AG I TGI SHHTRPD 


9716 


23617 


A 


9788 


256 


380 


YI FRQAKTQGLP PCALFREAVGENTP * V 
FAC/PKNVLFYVHP*LFG* I * IYQLKIT 
F * N YEGMVT / FVFRFLFETR S HLS PRLE 
CSGTI TTHCSLELVGSTNS S I SAS * VDG 
NTGVRHNASC 


9717 


23618 


A 


9789 


22 


226 


TKINSKGNKELNVRAKTIKLLEENIDRN 
LCDLGLGNCFLD*YCIL*TKINSKGNKE 
LNVRAKTI KLLEENIDRNLCDLGLGNCF 
LDMTPNAQTTKGR\ I DKLDF I KI *NFWL 
DT 


9718 


23619 


A 


9790 


288 


2 


QVWQE * AETESAIHCWWERKAI QLL* KM 
RWQFLKMTNV\ELPYGSAI PLVGIHGRE 
LER*ST*NIMFTAPLFVIGK/T*NLFKW 
PSPYEQITKMHSC 


9719 


23620 


A 


9791 


142 


361 


PPGEGKRVPFTTEPSPRIPSKAKSVSQG 
DTCTPMFTAALFTIARMWNRSKCPSFEE 
* I KKMR C I TT/ME C YSA 


9720 


23621 


A 


9792 


305 


346 


TYEYTDYGGLIL* *IiITFYDHAVLIIFL 
ICFLFL/YALFLTLTTELTKTTGGDAQE 
IETV* TILPGI I LGLIALPSLRRLYITD 
DAPDASLTI KS IGHQWY*TYEYTDYGGL 
ILNS 


9721 


23622 


A 


9793 


2 


317 


SRDRPRVRDR*LFSTNHKDIGTRYLLFF 
A*AGVLGTALSLLIRAEIjGQPGNIiLGND 
H I Y Y VI VTALAFVI I FFIDLP I L I \SS F 
G\SVYVLLILGVPCTAVFLHSS 


9722 


23623 


A 


9794 


384 


1182 


RIGKIKG/LCFLFV*WFLKFFFKMEFLL 
PRLECNGKI \HCNLLLMGSSNSPTSASQ 
VAGI TGMC * F\ VFL I EMRF FHVGQAGLK 
LLTLGD/PPRPPKVLGI 


9723 


23624 


A 


9795 


205 


1 


GLQIKCTMRYHFAPTKMAITYFFLKKGK 
G\NNKCW*GS/GTEIGILIHCVJWECGMV 
QLLWKTADAWADAW 


9724 


23625 


A 


9796 


307 


348 


QSARL* EAVCPFSDLQLRTGRTTALFKA 
VRQGHLS LQRLLLS F \ VCL YPAPRGGAY 
RGRQASLS CGGLHPVRASRLLCLPKQAW 
AMVGAPTPASLPPCSS ISHCCASNQRDS 
VGYDPSSP 


9725 


23626 


A 


9797 


193 


381 


I LLI * IHI LLSM I S S PFFFETGSHS S / V 
SRLECSGTASAHCSLDIiPGSGGSPTSAF 
*VAGTTGA 


9726 


23627 


A 


9799 


153 


359 


FGTYMCFADSYQLSYLFWGTFSWLEEE 
GV*WCICGSLQPQPPRVK*/FLPSLSLL 
SSWDYRCAPSC 


9727 


23628 


A 


9800 


2 


396 


ARAARAARELEELIKIFFFFFFFWGQIL 
AIiMPKGGGQGGILTYPNPPLPG*NNFPG 
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NO: of 
nucleotide 
sequence 
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ing to first 

amino acid 

residue of 

peptide 
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Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
. last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenylaIanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S == Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /"possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LTPPRTGINGLGPPGRVNFGIFKKKGGF 
PPGARGV*NPGPRGTPPPGFPXGR\DKA 
PHFGPL* TFEKVPRPPNFPQ 


9728 


23629 


A 


9801 


256 


347 


L Y I LLLNS IiK * I/VEKLR * NWLASLI I R 
YKVTTKRF* * FSKPAQSEELQLLTSAPG 
LNF*RYI ISFFFWIQSFALVA* VGVQWC 
DQGSPQPLPPGFKRFS/CRSWDYRHEPP 
R 


Q779 


23630 


A 




109 


401 


HLRRPCPTPPFALRTSGDQPLSPEARPD 
SGLLHSPLTSPRGLNGQCPPRGSTLNFN 
P*ASIPASPFPSGQPQASQEPPPVASEH 
PPDT*GDPACPPP\RPQDFP7GPATVPRS 
QAGQWP S PLPSDFS KGAQWP VPPTR/ PP 
PSTSTQSRWT 


9730 


23631 


A 


9803 


371 


2 


PGFPLFSFPEGNGPSKRQTGD* IRCLF* 
DGKWJECSPPKKKKKTKKRKKAVIF/CV 
PVQTKCI WEGGEETLVGDV* V* P* \GS 
FKHWAMFPE K/ DCLCTLYEAS FKTKES 
RRVDGFVCVRVGT 


9731 


23632 


A 


9804 


188 


] 


FbFFFFSET*SQSVTHAGVQGLEQSLPP 
G\SGNSRASASQVAGITSRRKHSWIIFV 
FFVETG 


9732 , 


23633 


A 


9S05 


112 


383 


VFINIRVFRLFTFSFFFKRGFNLSPRVK 
CNG* TKGHCI PDLPGVR* S FPFSLLTNW 
DYRCAPPRPNKFFF /SFETGSHSVTQA 


9733 


23634 


A 


9806 


599 


237 


FRDRSLTLLPRLHCHCVITAHCSLKLLG 
SSELPQASPLSSWDYRHSPPCLANFLFF 
VETRSHY/ ASRNSLGSSNPT* A/ fpkcw 
DWQV * ATAPSLAYWFSEAEKL I LRLLAL 
KVSDENI CTNF 


9734 


23635 


A 


9807 


168 


2 


PQQDVFFFFLETGSYSVTKAEVQWCDHG 
SVQL* PPG\STDPPTSASQEATGTHHRA 


9735 


23636 


A 


9808 


225 


3 


GPLGYLPPYFPQPLISRG*NPSDFLKIS 
RVFYGT/HYKFEFLFLFFFETESRSVA\ 
RLECSGVI SAHCKLCLPGS 


9736 


23637 


A 


9809 


541 


3 


RQLTGINSGRRQFQPQMSNYVIPYMMDM 
I ILKTKKSNNKYWQGCEKTELLIHCLWE 
CKMVQLVWKTV*QFLNRSDIEFPHDPGI 
PPL/GYKRKKMKTCLPKKLCVLVEP/RM 
FTAALFE VAK * * KQ/ PQTPITR* INTMW 
RIYTMEYYSAIPRKTPLIHPIA* LDTES 
II*TEKIQSQETTYY 


9737 


23638 


A 


9810 


213 


1 


FLCVDFVPCNFAELKLALVGLFVCLLFG 
SVTQAGVQWCNL\GSL*PPPPGLSDLPT 
SAF* VAGTTGVSHH 


9738 


23639 


A 


9811 


54 


388 


PARPL PRQ / WDERPNQQPTKKKKKRGRN 
LPTKKKTQNYSHQWPVAQRNPGGKTNQK 
KPKTNPKKKKGGAL*KKTQRGPKQTGGR 
K*KISPNKGGK*NPCGKLLEKNPFSGGE 

K 


9739 


23640 


A 


9812 ; 


429 


24 


RSKGAEFPICPCRSSKFSVLQQRQVRIV 
FQHPELGESVAR/ CRYSQLLGRLRQ*SH 
LNWGGRGSSEPRS CHCI PAWATRASS I F . 
CNFQASSVEVRRSARKKLFSDILKRHNT 
INWRVSGLLLVDSYFGRLATPVRTQ 


9740 


23641 


A 


9813 


149 


350 


CH FP C FHTTL / C E * E PG ED VTG P I GTKF 
FPAPGRTDGHTHTHTHTHTHTHTHSDPH 
HLCVWNTLIHFV 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 


9741 


23642 


A 


9814 


66 


401 


PENKKTVYRKPIATIIFNCERLNALPLK 
LETRP * YLL Y * LFNIMLEVLAS WRQEK 
E INLS I FT*DI V I K/MKI PKNLENKLLK 
LLS P F S KI TG Y K I S I Q KLT SSFLYYGFT 
V 


9742 


23643 


A 


9815 


82 


1 


AFPPARFFFFFFF*EKFSSFPPGEVQRG 
VFCPCNT/ CFP* G* KI FFL\KPLiKKVGF 
WPPPPGPKNFFYF*KKGGFPFWARGFFI 
PTPFVPPLHPFKKWGVKRG*APPPARFF 
FFFFFL*GRVSLCHPGWNA 


9743 


23644 


A 


9816 


256 


2 


QHFP*LPIPHIPIQT*QHFPQTRSPFQG 
PPLCQKPFSSPWGSKWFILIiSFFFFF* 
DRVS LCHPGWNAVM* SQLTAASNTW \ VK 
*SSHLSLLSS*DYRCVLS 


9744 


23645 


A 


9817 


53 


411 


TIYCSNYVYNFVSQHFRSLYS *KNCGLC 
L* FHLF VLQS * NFMQFHM * FCLKI * VDP 
IITFEV* FTYI * FF*TYSSVSFTDAYNF 
VKPHR / VYRGPGAVAHI CNPSTLGGQGG 
WIT*GQEFK 


9745 


23646 


A 


9818 


100 


417 


DEHSEKQDEPLQRPRSKLLPFPGTPPSS 
LLLLNHIGQESGLTHVRLASCLRNRGFC 
FDMKERLCLLPMIjECRGMISDHCSLI/R 
PG* DKP PASAYQ VGETTGTYHRA 


9746 


23647 


A 


9819 


136 


2 


IYMGMQPGAVAHACNPSTLGGQGG/WWI 
T*GQEFKTRLANMVKPC 


9747 


23648 


A 


9820 


257 


426 


IVPGLFLGTGSCSFTHAGVQ* S.NCSS VQ 
S *TPGP \ SD P PASASRVAGTTGAHHKAW 
L 


9748 


23649 


A 


9821 


151 


416 


LHKLCLMVMESGTAKTPS LS P I FNSVGL 
ILLLLE YS IDSLC* SVLCCYKEI PEAG* 
F I KKRGL / WLG WAHACHLS TLGGRG GW 
ITRSGN 


9749 
y /H7 




A 




Jv 


513 


RGDPRVRPRVRI IKLSEEGMWKAEIGRK 
LDLLHHTISQWNSKEKFLKEMKSATSV 
NTRM I RE * NSL I ADMEKVL W * DQTSHN 
I FLVQNI IKS KG IGLFKLMRAKRDEES / 
AEGKLDASKGRFVRLK/EKRSHLHNMKV 
QDEASVDI ETTAS YLKV 


9750 


23651 


A 


9823 


48 


406 


LI I WYTLTTRW IGHSAVS YPLLRPP YS 
LRHN I E I RPINNPTMAS KCS S ERKS LTS 
LTFHLKIKMIKLSEEGVSKAKTG*KLGL 
L/R/QTVSQVVNAMKKYLKEIKSATPVN 
T*MIRKQNNL 


9751 


23652 


A 


9824 


143 


444 


WNKKNNKNELADMILWGCGDKTIFMRKL 
TRDETKYGIPQLALRGHSHFVRDGVl\N 
LDGQVDLS \G * WDGTIiHLWDLTSGSTTR 
RFVNK/TKDVLSADFS 


9752 


23653 


A 


9825 


297 


2 


HRSPTKETKIIASI KKKTMITKDLNRQF 
SKQDKHLTSEYMKRCPTS /LREM* AKTI 
MRYHLTPIRRAPI KNKNK/NKSKCW * GC 
GETEILVHCW* ECKMVP 


97i^ 
y / j j 




A 


9826 


175 


400 


GGKGGKNFSLKGGEKKTKLG I FGKKP I F 
GGGTNGANPPPKIKGSKEKKNF*VFFPP 
FPFKNFFFP*NL* FLGGGWPHLSPPQK* 
V/CFPKIPKLVFFSPPLREKFFPPLPP* 
NLGPPGF/ SFWGPPLFFFFFFFF*RDKF 
S FYHPGWKAVI AAHFSLELLGS SNP PTS 


9754 


23655 


A 


9827 


64 


622 


MFSFFLFDLILNHLFFCFV*MFSFF3jFB 
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correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutnmic Acid, F-Phenyla!anine, 
G=Glycine, H=Histidine, I=Iso!eucine, 
K=Lysine, L=Leucine, M=Methionine, 
N^Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LILNHLFFCFVIRQQFSFLFF/ ISFFFS 
FFHCNVNTPELCDFSVCIHSDNSFRFLH 
LF/NF*HMTLCISELR*TLKQSCYSYTK 
IC**FYSVQLSFLTILY**CKS*QALSI 
ALQNE FLS V\Q YHKNMNY I TLT I EFSRN 
KFLS VTFLLFS I FSHNFGI S AS ISIFIL 
HVYIKNNPIGFFK 


9755 


23656 


A 


9828 


298 


1 


MRN C LTDDERKW F P EMAST PG EDAVNT V 
EMTAKDL5/Y* * Y INLVDKAASDFEM I D 
SNFERCSTV/NKMLSNSTACYRENFKER 
* GQS VQQTFVRVGERV 


9756 


23657 


A 


9829 


361 


I 


FLTGTQWGK/DSPFNTWC*DH/WNIHRN 
LDLYL I P YI KINLKQLTGPNLRAKT I KL 
PEQNIGENLCDL*LSRERYSTKSITRIR 
KL/DTLGFIKIKNKCISKDTIRKR*ATD 
WEKIFANHVLKG 


9757 


23658 


A 


9830 


402 


2 


RFHHR FS F VL FGP FAKVAF P I G AKNF I S 
NHYWAGVFPFWEKKNKIDSFLPPFSKNP 
ILGELKA* F*NFLF*GFQIFF*SLFFCP 
FGGK* I PGVFSFFFFFLVFFE IGSHSVP 
\RLKCSGTITAHCSLDIiLGSSN 


9758 


23659 


A 


9831 


2749 


3215 


FCQ* IKMYVMCICI YNFRRVYLCI YTYL 
HTHTFTHTHKKTPTQIPEKDS/QCSLSD 
LKGHSL 


9759 


23660 


A 


9832 


3 


386 


KLRLGQLNTSAMAAKC*SERKS / R/ THL 
TLNQKLEMIKLSEEGMLKAVIGQKLGLL 
YQ/ TSQ WNAKEELLKDIKSATPVTA * M 
IGKQNSLIVDLEKV*W*IEDQ/TSHNI 
SLSQSLIQN 


9760 


23661 


A 


9833 


1 


370 


RRCRWPPDPSRTVGRQIGKLVTHRPTVF 
QERGCP FPLTRQAGSHHGGGAFAQVI S P 
TKSISPCGRGGSRL*SQHFGRPMQVDKL 
R/LGVQDQ/RWPICGQYGETPA\LLKLP 
KISWAWWLAPVIPA 


9761 


23662 


A 


9834 


125 


409 


GQENRETWKIVHLV*VLYT*HIKRLNCF 
SHFI * CYQPTASQAHVHHDSNDSSTHV\ 
N*NSRWPGTVAHACNPSILEG*GGWIT* 
GQEFETSLANM 


9762 


23663 


A 


9835 


223 


1 


PKPQKMGTF*PPMEGS*MGPPHWEKMGP 
FLKKKEGHPFFFFFFEMESRSVT\RLEC 
SGA/TISAHCNLCLPGSSNS 


9763 


23664 


A 


9836 


377 


515 


FILFLRQVLTLLPRLEYSGAIVAHCSL\ 
AF*G*SDPPASASRV 


9764 


23665 


A 


9837 


274 


1 


AGEWKDQICVFKI PLMKNGLHGARVGQE 
SWKERNQQV*DRCQKQNLSDS*IQT*R/ 
M/WPGTVAHAYK/PSTLGGQGEWIT*GQ 
EFETSLANMVK 


9765 


23666 


A 


9838 


553 


114 


GAGVKTHPGHKGKTLFFFKFF FQKLPGV 
GGKTLPS PLSRGLGRENSFHPGGKGSNK 
QSSPPSFPG/WGKKGGGLPFQKKKKKEK 
RKKGKKNGVAPS E PPLKYSN/ WQATWG * 
RKLLNDTMRVQPTKSRMQKILQDKGPVN 
GIFTKTGRL 


9766 


23667 


A 


9839 


36 


434 


LP FPQCVTE FI I VL I S WCYI REDACKNL 
KHVAITI INVCIYICVCv/CPSTLYVYMY 
MLP/HLSDTLLTLN*VTLVEMLSSFQCL 
MFWYKGH I * K* HELDWAHACNP STMGG 
QGGWIT* CQEYKTSLA>?MVKPR 
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peptide 
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Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=G)ycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N-Asparagine, P=Proline, 
Q^GIutamine, R=Arginine, S=Serine, 
T=Threonirie, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


9767 


23668 


A 


9840 


268 


1 


RFPPFFCFLTILFWAPGWFSNFLGVRA 
PQPP* IFFPI*KKPSPLIFFFFFFEKES 
RS VS \ RLE YSGT I SAHCKLCVSGSQHS P 
AS AS 


9768 


23669 


A 


9841 


463 


86 


LYMHEFISGLFLLFCWSI/WSVLCQYKN 
C FDDKVYGLD YGNTS QVFT * NS KL I KL Y 
TLTMYS FLC I NQTT I KPSEM I QATKAN I 
DK*NYMKLENFCTAKETVNKMKRQLKKW 
EKVFSIHIPQEVNI 


9769 


23670 


A 


9842 


37 


400 

» 


VHSPPFKFWSLYVSVLKFPFCSLYFKFL 
ALYS I FLQMCQHLLV* AFFF* KRGLNS V 
T \ RLE CRGAI S PNCNL Ch/SGSS DNWA.S 
LPRKAEAGNSFT/ CR * SQVAGNP I PS I S 
TRLNQRKTPFES 


9770 


23671 


A 


9843 


42 


407 


NCLSFLITFFFFFLFENKVSFCPQG*G/ 
RGGP F W PHGTLP PRG * GNNP P * LPGRGE 
*RGAPPPPGYFWLFGKKRGFPL\GQGGS 
KPPALKEPPPLGPPKGG\NYKRNPPPPP 
GNFF * LP P YQ W 


9771 


23672 


A 


9844 


113 


379 


LGPWATSLRGRFLGYQLQQSKKH*NYP 
TVR*GCGEMRTSVHCW*ECKTA* ?L*KT 
VWQFLKKLNTGLPFDAETPL/ LAETPKE 
LKA 


9772 


23673 


A 


9845 


284 


528 


PLPKTMEIMLDKKQIQTIFLFKFKMGHK 
IAETTRNI KNAFGPGTANE \KCTAQWWF 
KKFCKA* E \ SLEDKEA\ * GHP SEVDTN 


9773 


23674 


A 


9846 


888 


1295 


LEQGCNLFLHKHS*LGEVFFFVCFRRSF 
TLVAQAGVKWRDLGSL/HKLPSSLSLPS 
SWDYRPPLPRLA/NFFVFLVEMGF\TVL 
ARMVS IS * PRDPPTSASQSAGI I GVSHR 
AG P VAG I LMFCRWNI SNKPKGAVFKKK 


9774 


23675 


A 


9847 


88 


387 


AYRMKI IDRISLSLYTAALWVTLFLFLF 
LFFFFKTEFCFAPQAGGQ/WGQFKLMDP 
NPPPPG*K\DFLVSHPRDLGIKGAPPQC 
GQNFVFYKEKGGWLLTAT 


977S 




A 


9848 


298 


438 


KI PRGAPNFPGGGKI FS PPLGGP IKTPR 
GALEKKTLF*VRGPWPGFVKKESPRPKK 
LG*QIFV7PPGKKPNPPMGG/SLDPPPPL 
ILSRPDPP * KKKKKKKKKKKGGGPLKNP 
PGGPKFPRGGKNFFPPFRGAYKNTPGGS 
*EKNPFLGGGKKRKQP 


9776 


23677 


A 


9849 


251 


498 


AI IKKMESKKFW* RHGEIGTLIHCYWEF 
KMV*PFWKTV*QFLK/D/LNMGLPFDSA 
FQLPD I CLGELKTYVHTENCTQMFMAAL 
F 


9777 


23678 


A 


9850 


478 


37 


ESRNKHSHLWSINFQKGLR*LNRGKQPL 
NK* CRHNS I S TCKRMKLDP YLT P Y I KI \ 
KSKWTKDLNVRAKTIKSLEENIEVNLHD 
LG * GND FLDMTPKAQTTK* N ID * LD 1 1 K 
I *NFCGSKDTINKVRRQSTE*EKI FANH 
VADKLK 


9778 


23679 


A 


9851 


2 


378 


RLEGLFLCALFCS I AI CMFFFFFFFF* K 
KRGPFG FQGGDPGGNFRQLD PLP PG I KQ 
FWGP\PPRKAGKTGGGPPPGANLGFWGK 
KRVPHGGQKGFKP/RNPRGSPRPGPPKG 
GVAFVFRPKPPGLEQ 


9779 


23680 


A 


9852 


229 


3 


FGP YKI FFKKKGAPCPLENPPVF \HQKL 
GLGFFFFSFFFF*DGVSLFLLRL-KCSAA 
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ILAHYKFLLPG*SDSAASAS 


9780 


23681 


A 


9853 


108 


420* 


GPRVCVRLSLRDDSDLLATEDSLAHTEG 
DPVTAS SHS QAPSLSVKPGAALEGVCWG 
TGAIAGTPEPQKP PQLTPGGRPA\ P I AP 
GLTWEPPSHPPPQ*RSPPPPP 


978 i 


23682 


A 


9854 


2 


256 


FFIFPLRYYIYCARFQFLSPILYLGK*K 
MD**RRVFQEKWEQAYFFVEV\SPMCLI 
CNQTLS VSKE YN* R CHYETNHGENFD * F 
TEKMHDENLLHLTTRQP 


9782 


23683 


A 


9855 


323 


3 


I IQMANKHRKRH\QSQ* * VRVMQKKI IM 
IHRYIPQWLKTSKSDMIKCCSGYGTTEI 
FTHYLWECKLLHLLWRMVWL\ ILLKFKT 
DT PHDPAI PLLGT YPAE CMHMYPRA 


9783 


23684 


A 


9856 


28 


385 


DGVTQAGTQWG SKFTAA* TA* YQGS I PN 
AWSFFSPPPVLKTPPPPQKK/YPPPKKK 
KIPPPKKK 


9784 


23685 


A 


9857 


436 


518 


AANRLENIYRHLIYDKD/EHYRIVGKDD 
LFNKWCWVNWI/ASNKEKNLDSYLIPHT 
K/LNSR* I IQLDVKD* I I KLLEDNLYLH 
DLGDRQKFLGRI/PVFTIKKKIGKFDPM 
LKLSTFVHQKTPC*RLNSA 


9785 


23686 


A 


9858 


153 


492 


RLAGSDPGVADVSVQLQRQEKSPCPSWK 
AVSQAESSSS SAGVSLVLFRLLADWMRP 
IPIKEENKL\SQSTDLNFNFIQKHCHRN 
TQNSV* PGGVGPPGTCGPFRRQSSP 


9786 


23687 


A 


9859 


2 


419 


TTGKLQVSHKKSTYSHFSKKPTHTTNKD 
MKRYSPSL/AIREMQSKTTG\MRYHFTP 
KYG\NNKCWLGCGETETLI YGW* ECKMV 
QPL WKAVWHFLK* LNI ES/PIY** NYS * 
YISYSWRKTCITIYDPVILLLGIYPREV 
KTH 


9787 


23688 


A 


9860 


88 


419 


TFFFFFGFWFLKTKPLFVPQLFEGPGPI 
LG* WNPP PRD * KNFS /G PNP PGGGE * RA 
QPPPPGYFLFFKKK\GVSPWGGGGSKPP 
TPGNPPPGPPKIRVIRGGPPPPGGRKHF 


9788 


23689 


A 


9861 


301 


401 

> 


KRA*GGQEPIKVCPFFYNRKVGNGSPPK 
KFKKGRRAAFFGLRANKRGLFVKKGKKI 
WVGKVGEPIT/DPFKEFGGR\LPEKDGP 
LV . 


9789 


23690 


A 


9862 


220 


415 


KIMGGAQIFRGGGGFFFFLEGWSKNNFG 
VSFRRKFFFGGGVFLPPPP*HKKNYFSS 
QRQYI SLGGGGRKTPPPKKNFLLKDTPK 
LFFSHPSKKKKNPPPPRKIWAPPMIF* I 
PPPPI FFFFFFFFFFFFFFFFFKN* P / S 
FFFNFKKPIFKTFLSPPFKVFPPPKKKK 
KKNPPISYRRWPLAI 


9790 


23691 


A 


9863 


116 


366 


GQEFETSPAMFCFETESCSIA*AWQWH 
DLSSLQPLPPG\SGDSPASASRAAGELL 
EPRIRRLQSVEITPLHSSLGNRVRLHL 


9791 


23692 


A 


9864 


619 


295 


FFFEMEFSLLLPRLECNGAISAHRNLRL 
PGS S DS PAS AGGLL * SQVAGI TRLRHHD 
+ LILY / FLVEMRFHRVG*AGLELLTSGD 
PPSSASQGAGITGMSHSAGHYGKI F 


9792 


23693 


A 


9865 


12 


432 


IADRRLFFTNHKDIGTLYLLFGA*AGVL 
STALSLLIRAELGQPGNLLGNDHI YNG I 
GTAHAFGI ILVI VI P 1 1 IGGFGN* LVTL 
1 1 GAP DMAFPR INNI S FGLLP TS LLLLL 
TSAILEARAGTC* TVYPCLA\GGYSHLG 
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A 


9793 


23694 


A 


9866 


30 


380 


LFSTNRRGIGPLYLLFGA* SGVLGPALT 
LLI * AELGQPGYLLGDDHTYNDIDTAHA 
FVIILFIVIPIIIGGFGD*LVPLIIGAP 
DMAFPRIN/NISF*LLPASLLLLLASGI 
REGRSK 


9794 


23695 


A 


9867 


256 


395 


EVLR KS EKS FETVC I INNKC/ WPGMVAH 
ACNPMTLGGRGGRIT*GQEF 


9795 


23696 


A 


9868 


458 


2 


PKKRFFSQKPPRGFYSAPLKGKNYI FPP 
PVNFGPPKDFFKGPPLFFFFFFFFFFFP 
RFFFFFLEKG*KF*NFFFT*NMVFFYIS 
/CSKKFPFQFVLTPIPF+KVPKPVWFFW 
DFPIFFKKIFFFFFFCDRVSLCHPGWSA 
VARSRLTASPRV 


9796 


23697 


A 


9869 


144 


425 


IPLCSRIYSFGPC*FSLISSSSCLSPFL 
SIPLCVLS**T**YLFFYISMREIYKTR 
TKGIY?GAQRNSLYTH\FSIEVQSTIKA 
ERFWPGAVARACNPSTLGGRGGQITWGQ 
EFE 


9797 


23698 


A 


9870 


138 


401 


DLRLKLNLSKNI IRLQGTVESSVLTIKE 
MQ I KAKVR YNL I PVKI KR/ SGNDRW W * G 
CGERGTLIRC* *ECKLVQPLWKTVWSFL 
KKKKK 


9798 


23699 


A 


9871 


179 


441 


PSGKGGIKGPPPPPKKIFGNFFFFLKKK 
GVPPGPPGGPKPRPLETPPPNPPKGGEY 
GGGP\WPPPPIP*FFKGPFKKKGPPWTN 
RGNP 


9799 


23700 


A 


9872 


57 


435 


FTQMRKNLKNNSGNMKK*GSITPLKDHS 
NF P AVD PNQNEM FE I PGKE FKK/ S D Y * V 
TQGDTRERCKPT * VLKT I LDMDEKCS KE 
MDILMKNQSELLEKKDTFRELQNAMESF 
MNRLDQ VEER I TELE 


9800 


23701 


A 


9873 


228 


443 


FSS KF I LI SVFLFLFLETGS CSVAQAGV 
QL*DYSSLQPQSPG\SGNPPISASQIAT 
TAGACLHALLI FVFFV 


9801 


23702 


A 


9874 


349 


2 


KKPKKKNFLPPKNFGFFSPFSP*KFFFF 
LKG FN F FRG FF PNF PP P KKKFFS KNS QL 
VFFLPPLKKKIF/ CFPTPVKFGP/SQRF 
FLKG PPLFFFFFFF FLDRVWLC YPG W S A 
VARSRIS 


9802 


23703 


A 


9875 


405 


2 


KKASRMENVRKKKKLEMIKLSEEGMLKA 
NIGQKLSLLAKQQ\A/NAKENLLKEIRSA 
IPLNTQMIRMQNSLIADMENIL/WWI/ 
EDPTNYNIPLSQSLIQSNALT\FNSMKT 
ERGEEAA*EKFDAISG*FMTFKERS 


9803 


23704 


A 


9876 


355 


394 


THP YYSHQEYQS P* P/LTGALSALLKTS 
GLAM*GHFHSITLLILGLLTNTLTIYQ* 
WRDVTR * S AYQGHHTPPVQKGLR YG 1 1 L 
FITSEVFFFAGLF*AFYHSSLAPTPQLG 
GHWP PTG I TPLN PLEDPLLNTS VLLAS G 
VSIT 


9804 


23705 


A 


9877 


47 


85 


TIYIHICVRIYACECVCVCAYIHICGVP 
AHLTEQNL\KPARTQSPPAPQKNPELPV 
DLVS I S LDREAQDVELNH YRLGNTCRTD 
VLPKVNTLCLRL/NFFPCLP I LEELQEL 
P*LENIPLFPQPIDYIYTYMRAYICV 


9805 


23706 


A 


9878 


2 


389 


GRGQFPPEFFYFFWGGSLVPSTPNKKKP 
1^RVT.,GGGWSNRAPKGWGGGFPPPILAF 
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KPKGPFGPKFVLKFSNVPPLLFFPKMRG 
PLPKGGGFLPFFFFFETKSRSVTQPGVQ 
WCNFGSL*PPP\TGSSNS 


9806 


23707 


A 


9879 


351 


1 


FLGPSKNFJSFPFPPQKWPFAFFPKSHRAR 
VNE I FS / RLSVINF* GLTKPQRLFFSLL 
NFSS\PIPPFQFPSRLIKEIRFSTFLTP 
ENSKI KKPLFFF * FFFF*DRVTLCCPGW 
SAWQSR 


9807 


23708 


A 


9880 


123 


387 


SFLWKLRPSGAPTCMRCLLA\LLGGVSQ 
LGYTGVRDPLEEAVCPFSELQRHAGKTT 
ALFRAVRQGCFFFAYF* IELFPCY/CSG 
RVGG 


9808 


23709 


A 


9881 . 


341 


659 


S FLGRVQ WLHACKS QHFGRLRQVDHLRS 
GV*DLPGQHDETLSLLKMPK/QKLARHG 
GMCL* SQLLRRLRQENHLNLGGRGCNEL 
RS QH C T PAWATE QD S I S KTKTKK 




23710 


A 


9882 


216 


] 


PKFFWVLRDSQIFKPGFGPQGFF/GGFG 
NPQGKKKPGFWGLG* KKKGFFFFFFFFF 
LRDG I SFCCPGWS I QW 


9810 


23711 


A 


9883 


2 


244 


GRVGAVGRREGENFIKCDELPPPPPEGK 
QQNLCKFLTGFSLPCQDKKP\CFFV*KK 
KKKKKKKKKKKK KKKKKKE KGGGGA 


9811 


23712 


A 


9884 


310 


2 


IFLQTLFPNTFENSLSEYCIIFSEYSFI 
**KQ\TNFILSSILLYYI*PSGSYISNL 
*Y*ELFSQNLSYQHFLWI+KKWGLGAVA 
HTYNPSTLGGQGRWIAWSPG 


9812 


23713 


A 


9885 


242 


358 


KTVWLYMNI *MANNHME*CSLPLVIREM 
* IKTPVTVH\TVGMAAI*NKK*NITR/C 
W * ACR ETAVLVY CWG E F YTQPL * KT VWL 
YMN I E / 1 P CDPA I PLLGMYLKE * R PGTV 
AHTCNPR 


9813 


23714 


A 


9886 


360 


2 


LLKLGFPFWAKKGKKKRLWVEGPLGKPP 
GGFCQKLFFSPGGGPKGFSPS I IILPKG 
* P FVWG P FREVF * KKRFKLKKKFRPTFS 
/Y/LFFFFETESHSVA\RMECSGAILAH 
CNLCRLGSSD 


9814 


23715 


A 


9887 


236 


347 


IKGFFFF*TESHSVT\RLECTST/SAHC 
NLCLPGSSNSP 


9815 


23716 


A 


9888 


310 


3 


NFFFFFETGSCSVTQARVQWSDQ/GSLQ 
PRS PGL \ SDPPTS AS * VAGNLKFLSGLC 
SSL*SLEIS*DTKKSQVQITDDLCYTW 
LCLFFEMESHSWQAGVQWH 


9816 


23717 


A 


9889 


119 


391 


APAYWNPHARDS PFFFF* TNFP FFPQVG 
GQWPNSGSPQAPPPG/SPPP/YSASASK 
ISGAPGAPPPPGPNFFFSFFSSPTFFSR 
DGVSPYEP 


9817 


23718 


A 


9890 


195 


3 


DGVSLLLPRLECNGTILTHCCLSPV/SW 
DYRRLP*RL/V* FFLFLVEMGFHHVAQA 
GLELLTSGDHP 


9818 


23719 


A 


9891 


3 


378 


RDGERNMLALGTASAKALKYSLALSLPR 
LECSGTIISHSSLNPP/GLKWSSCIiSST 
S S WDFRPAP PHLAKKTKQQ/HATFF * EG 
ESPYVAQAGFKLLASSSPPAFSLPKCWD 
YKL\DRHAW 


9819 


23720 


A 


9892 


231 


1 


PKPFFPKFFSP*N*KPPFQISPAKIQKI 
RVYQS \HKKFFS FFFFFHGVSLLLPRLE 
CNGTI SAQCNLCLSGSSDS P 


9820 


23721 


A 


9893 


2 


347 


APAKQE^A^C\^/CVCMC^/CI\^/LKUIIC 
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K=Lysine, L=Leucine, M=Methionine, 
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E VLC FY KMLGKKRS I YAY* LVY* Y I Y I H 
MCVYIYI/YIYV/YMYIYTYVCIYTH 


9821 


23722 


A 


9894 


28 


382 


TAHLE AGATGYRTFLLPAI GL LCL P WVL 
DDGAGQTAQ* SMLLDHAMLLAHRAHELF 
I /DTYYEFEETYIPKDQKYSFLHDSQTS 
FCFSDSIATPSNMEETQQKSNLELLRIS 
LLLIESW 


9822 


23723 


A 


9895 


93 


386 


FWKIISHYYFKYFSCSILISPSDLPITC 
ICMFHLLKLCQSSEFLHLLLFVWLVFET 
QSHS I KQPECS * * LS AA/CHL PGS SDP P 


9823 


23724 


A 


9896 


174 


1 


TLDWGPRKKNPFFFFFETRSRSVTEAGV 
Q*HNKGPLQPQPAG\SSDSPTSASQADA 
W 


9824 


23725 


A 


9897 


251 


386 


LQRLECSGTI/SAHCNLCLLGSSNPLAS 
AS* IAGTTGTLTGDVDST 


9825 


23726 


A 


9S98 


127 


393 


GHMGTLTGS PATLS I LALFFWGGR I STQ 
LDGINKS \ FSLFLYFI»RHCSLSLR1jE*S 
GAI ITYCRLELLGSGDTPASASQETGTA 
GTCHT 


9826 


23727 


A 


9899 


3 


334 


KI1RLGQLNTSAMAAKC * S ERKS / R / THL 
TLNQKLEMIKLiREEGMLKAVIGQKLGLL 
YQ/TSQFVNAKEELLKDIKSATPVTA*M 
IGKQNSLNCDLEKSCWLIQDQTG/HNI 
FLS 


9827 


23728 


A 


9900 


356 


1 


GRKPFSHLPKATLLPMGPVLGGRRALMG 
PDSRPGPVPSCSLVLLTPLAPLPLTARE 
SLCPCPPS * TPQPS VNP/H/GKLARRS P 
CWSGRQSLP*AEIVPbHAPPALGDRDE 
TPSQKKKK 


9828 


23729 


A 


9901 


118 


343 


IPYAKEKKKQENVLRFIHVNLCISNWFW 
FETGSHFVTQG*VHWCNLSSLQP/?NLQ 
GSGDP PTSAS RAAGATGVQH 


9829 


23730 


A 


9902 


321 


50 


SLRTRV*RPPSAPG*NPFCLKPQKI FQG 
GG\ESPLSQILKRVKQSNSYNLGGKGFK 
* PKLPPCPLTWATKKTSPPKKKKKKNYR 
TWETSVS 


9830 


23731 


A 


9903 


317 


8 


NCYDPNSGIVIPLLGILPKSTK\T+VRT 
KTCT * ML I VALC 1 1 TKKWK* S KCLS TDK 
QVNKI*YIHIMEYYSPIKGRKY*HTLQH 
/WMNLENIS * KRLD I KKPHI I 


9831 


23732 


A 


9904 


287 


3 


RGALNGRGG PRGVKS PGFAPFKLQLKPP 
GTFLNPK\QFPFFPNPP?*KKGAFFHFF 
LPLF*S FRWTEFYS VT\ KLECGGAI SAH 
CNLCLSCSSDSRA 


9832 


23733 


A 


9905 


2 


406 


PRVRTSSRSRATALFFFFFFFVFFWGKG 
EIWAPPLKNC/ IPPEKFCYFGGGAPGAG 
LPPPPGVIFFSLCRWKKKKKKGAPPGG 
VLGKKKKTPFTPKGGTFLTLGGF* KKSL 
FGKKTLLWVGGPLLLKNFF* EKFR 


9833 


23734 


A 


9906 


1 


296 


I WVGATECI FKNNPTI WGHQETHFTCKD 
T YRLKV/ KG WKK I FHT FHTNGNQK * AG I 
AI VISDKTDFKSKTI KR* KGHY/ IMTNG 
S I QQGD I LDW YKSNCS F 


9834 


23735 


A 


9907 


474 


41 


FMEYLTLSFIHVLLQ* FIHCFVHLFIHS 
FFYALMKSLIQ*FIHLYSCF/ICIDLLI 
SSCIYSFIYALVKSLANSLTHSFIHTFL 
Q*FVHLFTPLLICSFTHSSIH*PSHSLT 
FIQLHLFFHALVSGFJWSFIYLFCKINV 
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FLIDK 


9835 


23736 


A 


9908 


286 


84 


LKPPPPFFFFFQANFLCFIRD/GFHHVG 
QDGI>PLLTS*SASHGLPKCWDSRCEALR 
PASPLISGPHQTI 


9836 


23737 


A 


9909 


316 


63 


DQLLVRSIGFEALMSPIFFFQMEF/SLS 
L PKL E CNG AI SAHRN FCLLGS SD P FAS A 
AV* /PNANLLKLCASVSYCIELVLNELH 
WIK 


9837 


23738 


A 


9910 


370 


461 


F* F* FLFSETESHSVA\QLECSAMISGR 
CNLRLLGSGDSPATREAGAQESLEPRRQ 
SEPGLCHCTAAW 


9838 


23739 


A 


9911 


294 


28 


PGKNSTVQKEPGKWFCEMKFTPGENAVW 
IV* PTTKDLEY/W INI /DKTVAGCERTD 
SNFERS STMGKML \ SNS I AV7YTE I FRGR 
KSQLMGI 


9839 


23740 


A 


9912 


307 


463 


CKL* TLSNNDVIVGSLIVT/TCTTLVGV 
LIMGD/RLCMCGEKEYLGNLGTFLS IL 


9840 


23741 


A 


9913 


377 


28 


REMLTVKPLAPSFFRWSRAYRAIFFITP 
TWVSKSGKVTGPLFFLHIFFPGP/CPGL 
PI F FFI KTKGTRFFFGWFFFFERVSLCC 
PG*MECNGAISAHHKLRLPGSRYSPASA 
SEFHR 


9841 


23742 


A 


9914 


384 


725 


HLTAI RMTV I KDTENSKCERGC \ KTLMH 
SCWEYERVQPLWKTRW*F/LQQVE/LPS 
PCDPSIP/LLRMYPK*LNTD/CKRMC/C 
IPTS I AALFPVTK/ SWKQ 


9842 


23743 


A 


9915 


187 


3 


QPHKHLrGLDNIYIFFEIGFCSVAQTGVQ 
*RDKGSLQPRPP\GSRDPPTSGSRVAGR 
PRQENG 


9843 


23744 


A 


9916 


15 


167 


DGVSLSLPRLECNGAILAECNLR/LPGL 
K*FSCLRLPSSWDYRLPPPHPAHV 


9844 


23745 


A 


9917 


345 


477 


YCSRTSLPL/TLKKGSA* PGAVAHAYNP 
S ILGGRGRWI T * GQE FK 


9845 


23746 


A 


9918 


460 


42 


HQR PKVDKTTKMGEKRSRKTGNS KNQSA 
S/APPKERSSSPAMEQSWTENDFDELRQ 
EGFR* SNYSELKEEVRTHGKEVKNLEKK 
LDEWLTRI /SNAEKSLRDLMELTPKA*Q 
LPPERTNLS S * FNQLEERVS VMEDE INK 
MK 


9846 


23747 


A 


9919 


282 


21 


AAGYLQPSEFWLHSLGSRRGLPISVFPV 
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X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














RGRQAWLSCDGYHPVRPSRLLCLPTQA 


9851 


23752 


A 


9924 


254 


3 


PGGFYFPGFLPPPLGFPKGFL*RLWPPC 
PATFFFLYQISKFPFFFFGKYQKNFFFF 
FFFFPESKSRSVT\RLECNGAISAHYNL 


9852 


23753 


A 


9925 


309 


1 


YTPTIMLLGI YSNELKTNVHMKT* I *MF 
TATLFLIAETWKQPRCSSLSEWI/QQLW 
YIQTI /EYYSWRRNI K/LMKTW/MKIiK 
YILQSEKTQSEKTTYI*FQLYDI 


9853 


23754 


A 


9926 


237 


2 


RWNFNFTQL I LKC I FLRTKF FS * VS K* N 
FKYFLGFWFFFFLRQGLTPVT\RLECSG 
TISAQCNLRLPGSTDFSASASQE 


9854 


23755 


A 


9927 


244 


1 


KPQCLWRK*D*WHNTQKLDVEKGKPLIH 
YWWERKLV* PLWKTVWRLMR /NLKI ELP 
YDPVIPLWGNYSKEMK*LCQREDSRA 


9855 


23756 


A 


9928 


2 


644 


IVQVDQKIRLNIHCLQBTHFKCKYTYRL 
K\7NR*R*VYHVWINQEKALVAIDGAYFR 
ARKITRDKERHY / I MIMGSMLQKD 


9856 


23757 


A 


9929 


1 


364 


GTSGTRPINNTTVFSKCSSERKT/R/TH 
LTLNQKLEMIKLSGGGISKTDMG*NLGL 
LHQTVSQWNTKEMFLKE I KSATPLNT * 
MIRHQNSLIAAIGKVLWV/IDQ/TQHNT 
LLSQRL I WNKALT 


9857 


23758 


A 


9930 


88 


374 


ILKACGPCPGNPHPWGPL*GQGGLGLNP 
SKS PPHPHPRGDPTWGGTP IGD YGPKPP 
KLPHGAGLPKKPCSSGGQNN/ PKI SGP 


9858 


23759 


A 


9931 


384 


2 


LCSPRFFYRKIVJGGGQGSRLLPPPFWKP 
GGKDCFSPGVLDPPGEQRGAPCFP*KKI 
RWG*GAAPWPPPQEG* /VGRAPLTPKI 
PAPVS FGCSPALPGEGQSETPFFLKKKK 
KEKSEAMIPPPPGPRA 


9859 


23760 


A 


9932 


32 


359 


IFFFLFFF/C*KGGPPPVPQAGGQGPKT 
PPWPP I WAKKTKPLS KKKKKKKKGAGP P 
PPPKGGPPPLKKGEGPLLKRKFLKKNGG 
ENFKKKKIWAGGGGNPREPPPFGGAR 


9860 


23761 


A 


9933 


138 


1 


TGFFCCC* SFALVAKGGVQWCDLGSLQP 
PPPG\SSDSPTYASCSC 


9861 


23762 


A 


9934 


103 


355 


LAGQACCS * STSANRCVNHA\ PGYANGR • 
CDTITCILVFIRNSWLITPIALITOFCY 
HVG*VRPQEQAPGNRVSLSDLLL\PPVT 
C 


9862 


23763 


A 


9935 


29 


363 


AWSHNAFLLFFFKRELAFVPQRGERENN 
FVSLKPPPPG*SPFSAPNPPKGGNKGPP 
PPPQLIFFYF* /GEKGFSPYNPEGPKPP 
TLGKPGLF*NTKKICPQPRGLKKGGQHI 
W 


9863 


23764 


A 


9936 


173 


363 


PKKRPIISLKRAFNSNPGD*QKFLKRGL 
PPMGSQFKRGNYPYDKNLEKLCPGGGNK 
GSG/YHYGL 


9864 


23765 


A 


9937 


360 


3 


NQKMRKNQSSKTGICKNQSACPPPKERS 
SSLAMEQSWMENVFDELREEGSR* *NYS 
ELKEEVRTHGKEVKNLEK/NLDEWLSRI 
S /N* EKSLKDLMELKPKAQELHGECPSL 
SSQFNQLARA 


9865 


23766 


A 


9938 


194 


370 


VSNFFFFFETEFCSFAQVGAQGQNFR*L 
KAPPP\NCCLFPT*ASRWGTTGTCHHS 
WLI 


9866 


23767 


A 


9939 


426 


1 


SQEFKVAVSYDLTMHSNLSNSETLSQKE 
KTKQTKTKQKKTY* EGRKH I KRCPTFL I 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenylaIanine, 
G=Glycine, H~Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, IVNMethionine, 
N=Asparagine, P=Proline, 

v£ — vriuiaiiiiiie, i\ — /vrgiiiitic, o ocimc, 

T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucieoiiae aeietion, v—possioie 
nucleotide insertion 














IREIQIQTTVRYHFTHIRI/VLIiPQKQN 
KCWCRVEKLKPLVRC * WECETVOPLW I T 
VWW / FLKI LNI KLP FDLATPWCI YQME 
FSC 


9867 


23768 


A 


9940 


255 


16 


RVI YNI Y IMYILC I YTPSRLLVEIGFHH 
VGQDGLDLLTS * S I RLS LP K\ CWD YR YE 
PPCVNIYIYLHIYLIYMSDGVKY 


9868 


23769 


A 


9941 


206 


355 


FLQGHTAI KNGRG/ W PGAVAHTCNPSTL 


9869 


23770 


A 


9942 


3 


402 


HEELNPRS ATS S PALVLHVT I AHVTI AF 
AEGCFQHDFCFSQPENRHI PLVFTTAS K 
VRNT FYIiLCRGRGT LRFFIjVCIjKXDPAT 
PLMGIYPK\KSLYHKDTYIHMFIAAEFT 
VAKIRYQSKCPSTDD*IKKICV 


9870 


23771 


A 


9943 


79 


328 


cikillkflclfcilppclnfflrqscs 
varag vqw \ slq p * pprlkhtpts as * s 
aetrgthhhahlifsynfl*r*vshva 


9871 


23772 


A 


9944 


321 


414 


VPSWAG\PVAHACNPSTIiGGRGGRIT*G 

OTJ" H'h' 
VjiTiI* Xj 


9872 


23773 


A 


9945 


142 


480 


QE AKS FDFS DSTTME VMLDKKQI RVT FL 
FEFK\MGRKAAETTCNIENAFAPGAAND 
*OMPWWT .PTfEPKnPPQT.FDFRr'fiGPPRR 
VDDDQLRAI IEADPLTTIQEVAKELS ID 
H 


9873 


23774 


A 


9946 


191 


3 


F FLPRGGGDS VFKKP PKKKNGFPKI FP F ' 
CPGFPLAT/ INFPPQIFFFFFFF*DGVS 


9874 


23775 


A 


9947 


451 


461 


K* INK* KVKGW / RKEHHAN INQKKVDVT 
I FISDNMNFRKKNKITGDREGRYI IIKQ 
P I HORNT A T T.NVYVPNNR VAKYVK * KL I 
ELKKEIHKFTILVGDFNTILSTIDRTTR 
*KVS 


9875 


23776 


A 


9948 


186 


405 


NHTLAI SLF * PYKP * AI LKLGI YRKERK 

LVC *RDI CALMFI TELVTAAKL WNQPKC 

PCITTTK /WICMWYTPTMEYYc; 
it o a u s\/ *' xvi i iniiv„i 1 1 j_> x x o 


9876 


23777 


A 


9949 


18 


412 


PEFRITTILPEKKNLL*LTN*VSKFSGY 
NTMT*TWAFF\Y^NNKTSETGIKIVPF 
TILSl^MKYLGM/TLTKNMKDLHTENYK 
MLRKENEEYQMGNMGSWILKINIVKISI 
LHKVI YRVKFLSNFNAI LLKK 


9877 


23778 


A 


9950 


1 


396 


LAFDGRAERLKTGFHRVTQDGLELLTS* 
S VRLGtitiKCWDYWREP / PVPGLWD 


9878 


23779 


A 


9951 


307 


405 


G I RVRR* PQGLRPPNPDALVFKKMVWPP 
PTN\PRGWGGKGWEPPQGPQDPPPQAAP 
GFLPETWRGPWNSLPTRGAWPPNSNLPP 
WPPAPSRHPAQRAPGPSTAAPPPGGGGE 

V? xs tr sr VJIv V 


9879 


23780 


A 


9952 


2 


402 


YGRPKEGGSLRFWLTTPGPKGETPFFL 
KTQKITGGGGGGP * FPLPRRVRQENS FY 
SRGGGFN* PKRAPCALTWGKKQKFFLJCK 
V TTWTPnWPK'KT.Pnr'P TOY* THV / Q Q QT.P 

MGGTPGKRQ 


yoov 


/J /ol 




yyjj 


Any 


1070 


SSFISRFYFICIYLFETRSHSVTKARVO 
CCDYN*LQPQPPGAQEILPLQPP*SAG\ 
I TGVAPNTPQLI FYFFVE / SGGSTHVAP 
RLALiNSWAQNIAL/RLPKVLYFTF 


9881 


23782 


A 


9954 


75 


396 


GFKGRKRGLPLPLNQKLKTLNLSEKGLS 
KAJJTGQICLGLL/RPVSQWI^PInGKFLRE 
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Amino acid sequence <A=Alanine 
C=Cysteine, D=Aspartic Acid, 
J&=Glutamic Acid, F=PhenylaIanine, 
G=GIycine, H-Histidine, I=Iso)eucine, 
K-Lysine, LHLeucine, M=Methionine, 
N=Asparagine, P=Proline, 

VI U 1<j 111 IIIC, Jl\ — r\ I gilllllC, lj 'OCI tllCf 

T=Threonine, V=Valine, 
W=TrVDtoDhin Y=Tvrosine 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














IKSAF / PIEK* VI * KQNSLI PDMEEVLG 
V/WKTKKYSHNI PLGLSLTQKRVLTLF 


9882 


23783 


A 


9955 


181 


458 


RLWNDGLYNI FDLKLQ/S IHV* SLLLLF 
ILEPESHSVS\RLECSGAVTTCCSLNIP 
GWSDS PPQAFQVAGTTGVCPNAWVPSAF 
jj 


9883 


23784 


A 


9956 


67 


430 


LAP RL I Y S P IKFQ WS FMS VRQ I Q VQS LA 
IGNFHNPRLPILI CHCYAFHR I LLFQMV 
RNFILYSISISANKQESKCILKYILLYI 
NLRNW LG TVAHAYN / PSTLGV* GGRI S * 
AHEFETSOGN 


9884 


23785 


A 


9957 


77 


422 


LPLALWASCRPQLCLDTDFIHGPPTSGA 
ASQFLCFQTMQPPTCSSSRLYNL*PALV 
LDHTTSNWCFRSP/ SFFFLLPLPLPPA 
PLP\ FFFFFFFDRVLLCHPGWSAVARFQ 
LTAAS 




15 /oo 




QQCQ 
77 JO 


n 
5 




GGPSLGGFFFGPPGFPKIVKKKRGALIG 
FPFFCPKKKPKKPLFFPKPLGKPPRKKG 
F*IGEPWKT/EGPGFPKKLNFVFFSPWG 
GNFLFPPQKPGGKVGKILLQPKKKPPGK 

Q 


9886 


23787 


A 


9959 


370 


2 


PFSS WVANFREGDGEKCSVLQG IRFCMF 
CRYINVTPLVHFPFPISKGTSFFPLYFF 
SPEVM/HFP*LPY*FGWALLSRF*/MSL 
T t * T iFP<5 If F P FFP ^ F F FF * R O FHS <3 PR T .K" 
CNGAI SAHCNLCLP 


9887 


23788 


A 


9960 


256 


2 


AWEEETAPLHSHCLGNQARPWNHRHAP\ 
P * L I V FVFS VETG FHHVAQAVL ELLAS S 
DPPTRFNLLSSWDYRGLSPQPMSRKSAQ 
EG 


9888 


23789 


A 


9961 


269 


3 


HRQACGQVRSPIQCWWACKIAQPSLQIV 
RSSLKG* I *SPYDPVI PLLVIYPRELKT 
\SLHTKTCT*RFTAALFIIARRWKQLKC 
PSIDE 


9889 


23790 


A 


9962 


268 


324 


SGTLSH/ W I FS* VG IEITMWLKVPDQTK 
KISLRSAEAAIKYFLTQATASI I IRRAI 
LFNNRLSEQSSITNTTNQYSSLIIIMAI 
AI KVGMAPFHF* VPEVTQGS PVTSGLLV 
L 


9890 


23791 


A 


9963 


341 


2 


ITFLPTKI IRQSNFKTGKRKTDSJJDLNR 
HFKREDTQMANKYM* FI*KMQI \KTTMR 
YY FT KRTT I SR KTSN I OC * DTWS PGT F 
INCLLECKMVRLLV/KMTWQFLTEVHIHL 
VY 


9891 


23792 


A 


9964 


250 


2 


YHPVNLYFKYISPIDV*FGCFIYIFDYP 
LCSCFSLHPLVYLQ*L/LFFFFEAESHC 
VARLE CSG WS AKCNLRLPG S S D S PAS 


9892 


23793 


A 


9965 


3 


334 


RTLRHH I WL I DPS FE * HLS CFHLLAI VN 
SAAINI PC/HSYV* THF* YS *MYRGRTA 
GS YAT /SMLNLIiRNCSTLLHRNCTF*PF 
YH/WE*VPISPQSCQKLDFFIFLIIAIS 
EG 


9893 


23794 


A 


9966 


179 


352 


NQE* TE ILNRPMSNQ I ESVIKYLPTRKS 
HRPIIRFTAELYQTY\ I LLKLFQ KVEGEG 
LL 


9894 


23795 


A 


9967 


2 


350 


THPSINSFIHVFIHASMNSLIHLPTHSF 
S I YLFTKS F IHLF IKFFS FLFFLRPS FT 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E^GIutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=G!utamine, R=Arginine, S=Serine, 
T-Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X= Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














PVAIYSFIHPFTYSSIH*FIYPCVYSCI 

T"^T7 'CT'LJ O DTT T 7\ T? T TT T7TTT4.TT / TDTVOT7 

ISFLFF 


9895 


23796 


A 


9968 


228 


558 


CKQLRQELAI FTS FVI LQLFSGHLDVYM 

s~~\ 71 r.T s\ t\ tr voir r\nw J. riTnT Trvf ~r r\ utpt 

QAWAQRPDKYEYDNK* FI E I KKI IQFTL> 
ISKRMK/ YVG INLTR* VKDLHNENYKTL 
MKEIESDTSEWKDISCSVINGRINSVKT 


9896 


23797 


A 


9969 


196 


2 


BRKV FFRPLFI I FGLKKKI FPLFWFFPS 
/ IRGRG*IFFFFFFFFETESPSPRLECS 
ETISCHCTP 


9897 


23798 


A 


9970 


245 


1 


TQCDMCVF I YTHI Y I SH I CYMVY I Y I HT 
RTHTGILFTPKREKNSNTCTCATKDNP / ' 
CMTLC* /HKRRQTHTHTHTHTHTHTHT 


9898 


23799 


A 


9971 


163 


380 


NHRRQKKNGR/HNGNKYKTVTKMINSNP 
P I TRNNLNVNGLKTP I KRQKLSDS I KKQ 
YPSIYCL*KT/YLKYKD 


9899 


23800 


A 


9972 


328 


1 


AKCGFLKKLN IELFNS I PGHS CKT * TQM 
FAAAb Y 1 1 S KKYKQ PS C PSNDE * MNE I W 
HILTIGYSAVKKISWG\RWMNLENIARL 
SGRQRGHI LQDS VDMKS PE * AI PQ 


9900 


23801 


A 


9973 


200 


3 


MASKCSSERKSHR/ SITHNQKLEMIKLS 
EEGMSKAKIG*KLGFLCQ\VIHVVTAKG 
KFLKEIKSATP 


9901 


23802 


A 


9974 


111 


517 


NLEQPLIKRTIANKPIKKINTTILI ITI 
NVNGLNTLTK/ RQRWSD * I FKNQDPTM I 
/ C*KKN/HCKYKDTNTLKIKG*KNIYDA 
NMLTQIKKKAGVAILI/DKIDFRA/TDI 
TRAKEGHF I M I KGSVHQEYVTI LNVSAH 
AS 


9902 


23803 


A 


9975 


342 


3 


FLATKK I FF FFP VFPPQGPP PVLVAP F F 
W AFGGG P PKWAP KKKF FG PRG P P FENF F 
F*GFFFFKTGGFFFLRFSKKVFFFFFFF 
SETESCS VA\RLECSGT I SAYCNLCLPG 
SR 


9903 


23804 


A 


9976 


13 


326 


ILDHSNRPSSSTLKTPNAGKDVEQKQFL 
/LCCW*ECKIWYLGR*L/WQFLTKLNL 
Li * P YD P E I LL LG I YQKE LKT C I HTVACT 
* MF I EALFV I AKT * KQPRYS SVG 


9904 


23805 


A 


9977 


74 


396 


LAPRTQPLPCS PAPLRQCHTQSLLPPFL 
LRLIFYFFCFLKTRAHSVAQAGAQ*RDH 
GSLQT* TPTIiACTPASA/ VARTTGTHHH 
AWL I VFFL/ 1 EKVSHSLVCDKT 


9905 . 


23806 


A 


9978 


287 


2 


QPHNSSYTQIFTPKIIIKTPPPNLKQNN 
TTPIKKKTC FP/PSTInP/SYPSTHPPT 
HLFIYPLIHISPPPSPTHPRIHPPTYLS 
Tun OTuno Tunnel) 


9906 


23807 


A 


9979 


97 


332 


GNNDLFLYFFFFLETGSHSVAQARVQWC 
DHSSL*PQPPGSSGPPISTC/RLGLQAC 
MCHHAWIiIFNFFCKGRLSLFGLLR 


9907 


23808 


A 


9980 


168 


2 


STWLR WDYWKTFFKTSKS / WPGTVALT Y 
NPYTLGGQGRHIT*GQEFKTSLANTVKS 


9908 


23809 


A 


9981 


267 


1 


LKRERR*RRAN/SKTSRRKEIIKIRAEI 

DNPLERLRKKERD* ITKVRNGREDIL*N 
NDLIKVKKD 


9909 


23810 


A 


9982 


230 


412 


CHIYLC/D*PIDGIV*OTIYIYFGHKHT 
HTYTHTHTHTNTHPTC*VSK3LLKKKRR 
TCSVCA 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenylaIanine, 
G=Glycine, H=Histidine, Wso leu cine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 

— (~7 1 11 tci m inp T? — \ ret initio C— Cnrina 
\^~VJ»I U lalll lllc, JT\.~>4I glllHIK, O — OK* IUc y 

T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
/v — \j iimiutvii, — oifjp cuuun, / — pu^aiuic 
nucleotide deletion, \=possible 
nucleotide insertion 


9910 


23811 


A 


9983 


229 


415 


SYVFCHIRFCPLDSLNKEIVA/SQVQWC 
VS WSTT* VAEAGGSLEPPST.P PDWASO 
RDPISKN 


9911 


23812 


A 


9984 


2 


288 


RGYDSTSSAYTVGHFT*EDKATITRIjSG 
KENVKHAC5fiETI iRRTjTjDVY P * TOR TTDS 
VGNLNSA\SAMM\GPPRLPKVLGIiQAWT 
TMPGP*KYIHIFF 


9912 


23813 


A 


9985 


246 


3 


FLLD YQ YCALCCH CFLFN FANCGHFN I L 
LGGVKHTKFPKYS * KFSVL / WL>DAVAHS 
CNS S TLS GLGW I TRGOE FETS LANMVTC 


9913 


23814 


A 


9986 


342 


1 


FLNFGLTGFFPKGPPKPLGLRDGALPLD 
PPQFLGVFKRGAFFLGTQ* FHF\LEGFK 
FWFWEPKAFFFFF/ CCETISAHCNLCLP 
G^ 9 D c; A A A ^ W T AG T TGA WHY AP T P T » 
RQQN 


9914 


23815 


A 


9987 


330 


1 


TOPIxlWLVMASKCSSERKGYTSFVLSRK 
/ urfjci x xsj-io h.e» 0 vilm x v jl lo" r^ijVjxjxj^v x 
/ VQVNTKEKFLME I KS ATPGNKGTVTKS 
NTlxTADREKV* VI wtknktnhkt plso 

i^ijji/iyivwiw v x.t» x.x\x^xxxx«xxx\.x. t xjci\^ 


9915 


23816 


A 


9988 


209 


2 


I KG SHFPVL I * YRKI III FGGVKGKGGF 
YSY\LFFFFEKGYHSVTQLKCSGMILAP 
CNLCLPGS*DSPTS 


yylv 


zJol / 


rx 






Sj/ 


HI-ID T/JMT\7 r T 1 T?MT . WVT Z" 1 * T T V^TXTP /UTUTUT 
yrlKWrxxM I CiPlXJlH I l-i J. iivxtr / ill xx Ixx 1 

HTHTHTHTHTHTHT 


9917 


23818 


A 


9990 


263 


1 


IQIRVFENSHLWPSYVHFPHGHSYLFCS 
L* * AFFRH\G * FVLSVHFLKI FLNLKKI 

FFFTflV QWQVAfi APQfJMT Q ZVHT'MT PT.T CI 

SSDR 


yy lo 


93ft 1Q 


A 


GQQ1 


j 


J Do 


,nPP R ^ R T iHT ■ A K" T VPPR <3 <I T .RHP AP PC* 
ojji^rivrvoi^J-ixixxrvi\.x. v rrrvooxjoijivriurv, 

LIxKKLELFFETGFCCVAQPGVRWCSHGP 

LQPRIPG\SSNPLTSAT*VAG\ IAGMCY 

HTQLQLAF 


0010 






0009 


90d 




SGTVAHACNSSTLGGRGGWIT*GO /FSO 
TSLTNM 


9920 


23821 


A 


9993 


317 


3 


TDKELLL RDEQRKC F VEME FS PGKHAM S 
NVDMTKDLEYS INLVDKAAVELERADSN 
FERSSTLjGKMLSNSISCYKEIFLERKSQ 
LM * Q I PLLLM/ FKKLPQPP * PSA 


0091 
yyzi 


975279 


j± 


000 4 


t O A 

J 7^ 




LNLMNQ*DHSLSQSLIQSKSATLFNSVT . 
AERGDESAEEKLEASRSWFMRFKERSCIi 

hnikvqgeaartdr/eaaasypenlr/v 
kdeggytk*rifwdeiafcwktmpskt 

FT AREKS I PGFKVSKDRMTARrGR SCTi 


9922 


23823 


A 


9995 


82 


420 


sflwklcprgaaayvrcvsa\lledvsq 
lgytgvrdpieeavspfselkrcrertt 

AVFTAVRQGHLSLQK\ FLjLPFVQLCPAH 
RGGV+RQ*iVLLCCGGLSPVQAPLPLCLP 
1 w 


9923 


23824 


A 


9996 


376 


1 


CGRTDAPASTSPYAMTVRFLRPPQPCVL 
DGLKNCPARAPQEQNSLGEVDKRGPREQ 
TR\ PATAAPPRPLGS PS WICPGDW * * CR 
NSLOG FSIxAAS VAR GRDWALAGLPATVS 

*' L,iJ x Vi w-j*itkj vruwivij nrvjjrivijr x v 

ARFQEQQKMNTV 


9924 


23825 


A 


9997 


495 


982 


VQKFLRPNLAPKIOlQRKIxAFNSLQGRxjR 
S LPS P TVW CTMAP PTGVLS SLLLLVTI A 
VCLWRMHS WQKKHW * AS FKKTHD * TGLA 
EP\ S CARKQCSEGRT YSN AV I S PNLETT 
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SEQ 10 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

tl 111 111 IT CtvJU 

residue of 

Jf CUUU v 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 

f\ tno tfk 

last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
0=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 

v<» vjriu vain iiie, iv — /\rgiiinic, o — ocr me, 

T=Threonine, V=Valine, 

WnrTT** vtl f fk nhon V=TvrncmA 

tt i rypiupnan, i — jLyrusine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














R I MRVSHT FP WDC TAACCD LS S C DLAW 
/ WFFGR f*YT .V PPU Tf T?MPF P K"KW2 


9925 


23826 


A 


9998 


401 


341 


IASKCSTERV/SRSHTLKQKLERFKL* E 
KGM/SKNEIGQK*AIiLYQ/VSQWNAKE 
KFLKE I KVD \ LHAQM I RK * N S LTAD VKK 
VLWKIDQTL/PLGHSVIQSKALTLFSS 
MKAO+fiFFAA-frRX^ 


9926 


23827 


A 


9999 


30 


393 


DTTILKLRQLI IGAVATKHWSERKSLTS 
LNLNQKLGTI KLTEEGMLKAKKGQKRGIi 
SCQTVGQVMN/SKE KFLKE I KHAI PGNT 
* MISK*NSL/ 1 1 ADMEKVLVG/ WS KVI F 
*DQTSHNIP*SQN 


9927 


23828 


A 


10000 


133 


1420 


EQRRLPVTPGVASLQTLCCFLPRRGCSH 
G * ES AATH PPGPELVLLLLQGH * AGYLG 
/DFG/PRTATGSPG\PAGVQPGPCLVPQ 
LSRAAAAG * PDPT* *GAIFSYHCKALRA 
TAARHCPPGC*WPG\PAGSPPGTSRPLS 
HPSSPSPPWTGVSSFLLNLVLCP*ASDA 
QGHACYPGPVPALAGQGHDSAGGRLGDP 
GHHWQQAALHG/DQGPPGHQAHAGTQPS . 
AQGLGC * ATHPGEGG * VG YQGLPAG* GR 
S HPR F P I VTRPMGR S C P YAT VRPG PGSQ 

QEVTGRWWW*GKKPLGHYPPCGAVRCPS 
S * PHPDTMFKGTGREGCLPPTSPVGVTG 
QMS\EGSRPCEWTLTMSPWGE*SPRASC 
AMLHFCPGSRVGGYHGCPPLHHMGPQST 
AG QAGQPH TLD I KAS 


9928 


23829 


A 


10001 


1818 


6682 


IKFPEAEEEEIQEVLVLLFRVAEEKENP 
SRVGNSQREYENQWLYSICNQLRYRNN 
L/RHVKKDERGYYEELLNYSRDHLMLYP 
YRLLDI MVKGFS ITPFS YYTGIMENIMN 
S GHNFTAADCLRLLG I GRNQ Y I DLM I QC 
RS SKKFFRRKTAGDLL? I KP VE I A I EAW 
WW QAG Y I TEDD I K I CTWPEKCATDKTV 
DSGPQLSGSLDYNWHSLYKKGFIYLDV 
PMSDDSCIAVAPLEGFVMNRVL/NFDYF 
ETLLYKI FVS VDEHTNVAELANVLE I DL 
SLVKNAVSMYCRLGFAHKKGQVINLDQL 
HS SWKNVPS INRLKSTLDP* KMLLSWGG 
GESRRPVQEASSATDTDTNSQEDPADTA 
SVRSLSLSAGHTKHIAFLFDSTLTAFLM 
MGNLSPVQSTGEGEAQRYFDHALTLRNT 
I LFLRKNKDLVAQTAQPDQPNYGFPLDL 
LRCESLLGLDPATCSRVIiNKNYTLLVSM 
APLTNEIRPVSSCTPQHIGPAIPEVSSV 
WFKLYIYHVTGQGPPSLIjLSKGTRLRKL 
PDIFQSY\DRLLITSSG\HDPGWPTSH 
VLTMLNDALTHS AVLIQGHGLHG IGETV 
HVPFPFDETELQEDSC\NMGVHKALQIL 
RNRVDLQHLCGYVTMLNASSQLANRKLS 

IEEATIDSATKQTSGATTEADWVPLELC 
FGI PLFSSELNRKVYRKIATHGL\ * EKR 
AFKNLLHSSRKLSLQVLNFVHSFQEGAS 
TLDIHTEPSFSSLLSQSSFADMGVPLPA 
KKFN/SLKIVSY 


9929 


23830 


A 


10002 


439 


1135 


MAVDFYNFVTKLWTTGYLRISFLAYKF 
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SEQ n> 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


IVI 

etb 

od 


SEQH) 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

.1 III 1 11 0 <lCIO 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
uing id 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Aianine 
C=Cysteine, I>=Aspnrtic Acid, 
E=Glutamic Acid, F=PhenylaIanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine,M=Methionine, 
N=Asparagine, P=Proline, 

n^r^liilomina D — A rninino O— Carina 

v^ - viiuiHininc, rv — /\rgininc ; o— serine, 
T=Threonine, V=Valine, 
W=Tryptophan ; Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \ = possible 
nucleotide insertion 














FSFPFLDSLS/LLSPRLECSGVIPASLQ 
PLLPGFKRFL/ CLSLPSS *GYRRAPPPC 
PALLYF * VETGFHHVGQAGLELLTS GKL 
TP. T .C3 T . P KT* WD VP 


9930 


23831 


A 


10003 


216 


572 


LTLVS I FTAKQS VTLP I I CMLEMRKWLL 
NDTSAF/PI I * KVLFPKI FFFFFF* KGV 
LFCPPGGREGAQFGLTKPSPSGLKEI FL 
PNPPGGWKQRRVPP FAG YFCFFIKRRVS 
T.T.WPftT.PW 


9931 


23832 


A 


10004 


3 


354 


YSNN P KDRKGE TG INEKTNNKMVDLSS Y 
TSVITLNF*ML*/D/KIN*KFVKSP*KK 
PTLCCLIiETYFKYGIGGLKVKRWKKI *H 

nisi 1 ±\D±\iS.l Li V/iJ. Jji. J. I\w-1 or»JJO IrlilUj 

EHYILIKE 


9932 


23833 


A 


10005 


187 


455 


ASIHYSLGVRICIYIPGCIHPGEKSQKC 
NTCGKNSTQKPELVN/HR * LKGHRCMK* 
IKVyoL/ /ivJnlvKV X JL.e>KJ\.o ± is. 

C 


9933 


23834 


A 


10006 


134 


456 


IWNLPLGKPCAWKQVTSDLISPDAIFIN 

rtrtlnoJbor X? 17 Jc JC V Em x r^v i-ir v ry VVjVj.fVv.tl 

NLN*LKLLPPKL\SDFLVA*VSKKVGIT 
GPPHHPGLVIWGFKKKGGSPMLPR 


9934 


23835 


A 


10007 


210 


451 


CYYTIQTTEQPANDSMLGIKPHVSILTL 

XTT 7"\TM'T HTDD7DT r*UA r~U»l T 1/" /"P/^lTM!) A TTf" 1 ^ 

LQETS LTCNRTHR FKVKG * /WKKIYH 


9935 


23836 


A 


10008 


169 


462 


YPCWVDMAKNTSYNKCWQGCGEIRILPH 

LPIPLLDIDFKEWKTGVQTKTCR/RMFI 
AALFPTSRS 


9936 


23837 


A . 


10009 


490 


724 


SMPP * FMEKNFLAPGG\ VTPPGVNTTRV 

M PK"T Zi FT>Pin? Q <? P hT ,P VCnifPflT .T FfZT . 
l^J 1 1\ i. tW U if ivrv O 0 «j .tr r i\.o\J IS. r^vrlJ-L HuJj 

THLKTLILNPLKKDESGEIPVLF 


9937 


23838 


A 


10010 


306 


588 


KQF I RAFI EMS WWLTP VI PVLWDTKADG 
SGIRDQPSQHGETPSLLKIEKLAGHGGA 
GL* /SQLLERLRQENHLNPGGGGCSEPR 
SCYCIAAWVTE 


9938 


23839 




IUUl 1 


4/J 


55 


T CT ^'l 0 inC r Pr>TC*PT'NTT'DT3M7i'PVT CCIfOVC 
loruvl/o J. UXo J. J.1M Xr'Kl v LH.r i\JjO o JVK l\o 

HMSFSLNQK/L/EMIRLS/EEAC*KPRS 
QKLGLLSQS* PSCESKFLKETKSATPVW 
TQM I * K*NSLITNK\LKEVLMVWIKDQI 

ESAKC 


9939 


23840 


A 


10012 


184 


455 


FWLSLIKKPLSSVTQKKIKRET\ ILLTT 

ATLYS VFCVDSS KS ERGSVDPTQVFVET 
*DKCFDY 








1 Aft 17 ! 






TfPPT.VT^PPPTf CCTCIT J.T.MO^OV + T TTfT* 
IVDCtJUI lOKi!iKi\00 lOllbiiLNvoyiS. JUXIVU 

SEQAMPNANIGDRLKARPLSPNSHLIKT 
KQKFLKEV/KCATP 


9941 


23842 


A 


10014 


237 


483 


TLIRYI ILQGLFSYI PPFFFETESRSVA 
QAGVQW* DLGSLVPGSRHS PS SASQVAG 
TTGFKY/HAWLI FCLFCATGS 




Z3o4J 


TV 


IUUl j 


1 

1 J / 


Of J 


G S MAG I TT I EAVKR K I Q VLQHQ ADDAE E 
* AEHLQ* EAEGKRWAWEQAEAEVAS VNG 
RIQLVEEELDCAQECLATALQKLEEAGK 
AADESERDTKVIEIWALKD/E/EKMELQ 
E I QLKEAICJ 1 1 ACEADG KYEE VAC KLV 1 1 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=G!ycine, H=Histidine, I=lsoleucirfe, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R— Arginine, S^Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unkno\vn, *=Stop codon,/=possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














EGDMGCTEERAELAESRC*EMDEQIRLM 
DQNLKCLSAAEEKYSQKEA* CEKERKIL 
TDNLNKSPMR 


9943 


23844 


A 


10016 


579 


30 


CGISTKKNKWMIKQNSSELSTCVHNQIT 
LHKI AMATQWG\KNSFFNK* C/W /EN* V 
ST/CKKIKLSPSVTCTI'KQYLKWIKYL 
NIKI +QTS*IS\NGGKYDIGI»GTNFLLQ 
KTKNLMSFS*L*NQMHKEQIIEKWDYIK 
LQNFCTSKEKFSGVKMSPKKW/DEIFEN 
HI FDRS * FS QY I NNV*NSTKS E 


9944 


23845 


A 


I00I7 


44 


228 


EFVCRVRGLWW \C WLASQ1.L WEAEAGG 
SLEPKSLRLQ*AMIM\DCTPPW 


994 5 


71R46 

iJOHU 


A 


10018 


471 


40 


PGEGRFQQTRF P P FLP P WGP KGGP FFKK 
PKKKKRKEM I KLS * KPRQANQ * LGLLCQ 
/ TSQ WNAKE KFL KE I KNAT P VNTQRVR 
KLNSLIADIETVLWGTEDQTSHNIPFG 
QSPIQINILAVFHSMKAEQSKEATEEKF 
EARS 


9946 


23847 


A 


10019 


217 


486 


KKIS* P VYLSLYLS I YLSVCLS VYYLS I 
YLSIYLSIYLCIYLSTIIYYYLSIYLPV 
YLS INLLS I /CLS P I YLLTYHLSSYLLS 
YLPTYLP 


9947 
y y^ i 


23848 


A 


1OO20 


327 


1245 


TENQQGVLALAALSKIRMMPTRLKWFLN 
IDYTKFCEAFHVFKKRKKKKINKNIKQL 
KMPLTKRIYLAFTEYSTQ*QQNTH\FFL 
SADETHGKIE/HVLGPKTRHNKFKR 


9948 


23849 


A 


10021 


329 


490 


GE S P TDNHCRS P KS Q VEKV I YT FFE TE S 
RSVTPAGV* WCNLGS LQPP P P \ GTSD 


9949 


23850 


A 


10022 


3 


394 


YRVLE VLGYMLNI RF WLGCSS CGQAAQ 
FREGSRCTTACRHPAHTLHAAKMSRRKV 

ssakcr*retpkkrsarwsakpap\qse 
tkpkkaagkdksstss\fssssslkaek 
ggkekqaevadqetkdlsae 






A 


100?^ 


448 


477 


SQVFETSLAS * MSKNIL/WAGTGAHTCN 
PSTLRGQGGQ I T * S Q VFETSLAS 


9951 


23852 


A 


10024 


198 


455 ■ 


S LKTNRREI PEDFHGQPCGKLVHSEKPR 
VRMGQTESKYASYLSFIKVILRRGGVRA 
S/TRNLIMLFQTIEQFCIiWFPE*GTLDL 
KDW 


9952 


23853 


A 


10025 


469 


26 


PPRAVQPPFLPSSSLHTCVPLCRTVFSA 
TSRPDSLLPALQVSAKMSPVL*SSLLLP 
SLGQGFPMASIAPRL/PHRSLRSCWAA 
AHSRAELDRHEAFLQCSLGQWPRNDFLF 
SFFRDRVLLCHPGWSTAA* S * LTAASNS 
GAQVMEFQ 


9953 


23854 


A 


10026 


222 


489 


KKKKKKKKFCF I ARVKSNGAI SAPCNLC 
LSGSNYFPASTS LVSGNTGACPLARVNF 
FY*NL\FLVKMGFHHIGQESI 


9954 


23855 


A 


10027 


313 


2 


QDWE I S KL YS YCHS * VRRL/ 1 FLKI RLS 
LYYILQHSLNCNIIG*LQFESCYIKLAG 
DKIAFRFLTLKTNHQLGLVAHAYNSSTL 
GGRDG WI T * GQE FKPS LANM 


9955 


23856 


A 


10028 


49 


388 


TPKIRGVFPFKGWFLGVFGAFPFPQKDI 
FYFPRTPMGTRLVSPGKKKKKEV*RVLY 
GLKKAKKIFVLKVKFSHKRGFFGEIQFK 
KLFKW/HLKRNPPPKKKKR 


9956 


23857 


A 


10029 


394 


231 


GRQIASAQEFKTSWGNMAKLCLFI*RDR 
VSPCCPSWS*IPELKQSACLSLPKY*DY 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

etb 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, Wsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
0=01 uta mine, R = Arginine, S = Serine, 
T=Threonine, V=Valine, 
w— j rypiopnan, *— .tyrosine, 
X^Unknown, *=Stopcodon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














DP /PNTTPriT>*T-TTiT.PRnT.T.T TT.PcrvT.PP 

XvLM 1 X XT V3JJ IT XJ XI 17 Ix^-XjXj J. J. XJ.tr O Ilir f 

CKSFKLLYMDI 


9957 


23858 


A 


10030 


265 


2 


HDNMRGS D YFP ASNGLAVP FKRQVFT I A 
SNFLEKSPPRLAFT*LQVD*ERKWAG\P 
VAETCNPSTIiGGQGGWITRGQEFKTSLA 
NMPKP 


9958 


23859 


A 


10031 


235 


413 


KAGKLFFFTEITFKKGKKAGGGGSFLKP 

. rujrOKF^oijJMrlKVKlij^lvKij^ xJSJYlljKvv 

LKSLPCDSPPLASQKGGVSRKSPPPRPF 
CPF*RLFP*KKKAFRLFIPKKV* *ALV/ 
CST.PPTnPf2TfTT.PPPP* HP aT.T.PP Prtwwa 

VAPSGLKLSSCLSLLRS*DYKHVPPCLA 
MF*KFF/CRN 


9959 


ZJOOU 


Pi. 


IUU5Z 


lot 


A^A 


HPPPTiT PMfZItJMViV'PDTJQPPPPPPQPTMC 

LSVTQAGVQW*DITALSASWTPVISDSP 
VSASRVAG \ IGTTGVRHHTQL I FL 


9960 


23861 


A 


10033 


252 


3 


VEGCLSSGVQDQPEQHSETPSLQKKIFF 
•\ *t arjwrjQMrr.* cjot.t C2pt p wtttiWQ Q prz 

\ xjHoxIvjo l v Il~XJ ay JjiAiKJuK W & JJflo i3 irV7 

G*SCSEP*SCHCSPAWATE*DPISKKKK 


9961 


23862 


A 


10034 


175 


492 


CPTCPF VWI PLCSEQPVQLYMWHTHTH 

TTJTTJTPTT-rTT-TTT-rZi /WT T PQT QPQPT PHfl 

LSLSPTLWRSGMI / CGSTQSQSPRLKQF 
CSPRYSRG*GWRMA* /DPGGRGCSE 


9962 


23863 


A 


10035 


200 


448 


T VT UT T7r^\T OT3T HT7DTDT T~C C TM TC T CT 

CLSFS * FP* VCFFFLVSLGLF*GRVLLC 
RPGWSAWQSQLTAD/SDFPVLKHSSCli 


9963 


Zibo4 




1UU36 


Zly 


455 


MTT PTCWTQTrCT.ATTT.aMWDDTT MT PTP 

LDTGSCS VAQADVQWHNQGSL* / SELLG 
SSDPPTSAS 


9964 


23865 


A 


10037 


164 


422 


GLSSQLTDGHLLALLRVGGSEVGSREES 

T?p T7T7 7i fiP TTP TTfSPP P T> Q TfdfZ * P P P P P PP Q 

GAREEREKKEKRER/ ERDREEK 


9965 


23866 


A 


10038 


1 


490 


PNQ I QN A V L I M I S F S I CFVS V I F F KS E I 
vMr'ADVPtf cpppp*"npQpQ\/2\nan\/n 7w 

CG/LQSQQPRPPG\SSNP\PTSASGVAG 
ST 


9966 


23867 


A 


10039 


255 


446 


FFLRQSCYVAQARVQ/YAIHRCSHSTLQ 
t . p zxt . a c: cw p c a t .t , a PP\/anT anMnT-THisin 
L*YFFRIHS 


9967 


23868 


A 


10040 


157 


390 


DHTCPPPSTADYRLS * FLYFLLLLFRRS 
LALPPMLECSGAISAHCTLCVQETRETD 

YQDGH/DGS WWCDLRTHPPRS 


9968 


23869 


A 


10041 


290 


30 


TLKKRKEFSSVLCPRGYIVLLLLVHQFL 
FFQTGSCSVTQAGMQ/W/CDQSSLQP*T 
PG\SNNPPVSASQVAGPTGMHHGFLHTE 


9969 


23870 


A 


10042 


190 


929 


NHRY I S YLTRLVKI KNFANKS WLESLS P 
TxxKHS*/W*CK*V*SLWKTVWOTLVm\ 
KDDLTFDPAI P FLM I YL IEMCAQ VHPE I 
vaPMPTairr ppTPTTHn\xrMPNWT.MVTnT 

RESYKTIKIKE 


Q07H 

77 /V 


zJo/ 1 


rt. 




tfj / 


774 

/ err 


H FGR PRO VD P.T.S S G VODOPGOHGE VOSV 
LKLi\KKlxAGHGGAHL*SQLLWRLRHENH 
LNLGGRGCSEPRWCHFTPAWTTEQDSV* 
NNIVTISINW 


9971 


23872 


A 


10044 


111 


429 


KPDEDTTGRENYRP VFFTNTDANLS KI K 
CM YP C I S VH/HVNMYGKl PEMMFT * Y * * 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

9fninn sipIH 

residue of 

nentide 

sequence 


Predict- 
ed end 
nucle- " 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Iso!eucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 

0=02 In famine R=Aroininp Jv=Sprinp 

T=Threonine, V=Valine, 
VV^TrvotODhan Y=Tvrosine 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














*LFLSFFFYEMESGSVPRLECSGVISAH 
CNLSLPASSNPPPGFPR 


9972 


23873 


A 


10045 


30 


220 


LGVLYKKNS VFNKCYWDN* I S IWKRM \G 
LDPYLTPHTKINFKWIKDLNI ITKTLI C 
CSWMKTKA 


9973 


23874 


A 


10046 


203 


361 


TYKLLVI F I VI DHF/ LFSFLSFFFRDRV 
SFLLPRLECSGAISTHRNL*LPGSSD 


Q974 


23875 


A 


10047 




358 


IQRKCFTFFFFFEV* SCFVAQASVQYCG 
SPG\SSNLPFSASEVSKTIGWILPS 


9975 


23876 


A 


10048 


270 


272 


RKNQRFFKIARKRLNKMTRISPIiISIII 
LNVSGLNFPLKRYRLAEWTKKKK\DPI I 
' CCL/QKTHFAG/RDIYRLKIKGWKKIFH 
TNGSQ*QRSEFF 


9976 


23877 


A 


10050 


256 


1 


GVTTNLYFWKNFLNPPFLVKKCENCPIN 
JCVFP * KPPLLKKTLFFSKKKFCGV /HPT 
EKFFFFFFESLALSPRLQCNGTILAHCN 
LR 


9977 - 


23878 


A 


10051 


112 


359 


SKVSEEPTENEVEHNLKV*SFILPLKQY 
SM**QKKNIHIH\ISFYKK*ELTWPGMV 
AHACNPGRLGTEAGRSPEGOEFETSLA 


9978 


23879 


A 


10052 


346 


2 


RVSSYTLITLNLNRLNVALKRYRRLNGF 
FFLKKP*PN\ICCIQKTNLIYKSTYSLK 
VKGWKNIH\HANGLKKQAEVAI \LFISD 
KKDFKSKIVKRDKEGHYVMTKGSIQQ*D 
RTIL 


9979 


23880 


A 


10053 


376 


6 


TRPSQHPDFILLNKCLLFEATPFLVICY 
SGHRKLRPVAPPIRGSKSQIIPMPPAVL 
PRWHLS CLSFR I PS S FF* LRLPRLCKSR 
/ SAWGPVS PHLPQFLFFQAISFFFFL*D 
GILLCHPGWSAVA 


yyo\) 


Z JOO J 


A 


1 \J\JJ*r 


in? 

I uz 




FLSFFVDGKLCLEAKIWVLAPAVFIVTR 
VLLLLSTL/RQVELRNIYLCSNVHIHFH 
LYFCIYLSAYILQTMSLY*HL*YHSNV 


yyoi 




A 


moss 




51 


I S LVNWLF I LENMKLDP YLS P YTK IN I R 
/WQ*DLNVKNGTTKVLEEN*GTIYVMVG 
WKRPILACQQPKS 


9982 


23883 


A 


10056 


169 


342 


SQKQYSTCQNVFCFVLFLETRSHLV\TT 
LE CSGAI MAYCRLDLPG*RHPPTSAS* V 
DG 


9983 


23884 


A 


10057 


1 


586 


AAARPAQGKARPGLLLRRGRWELRAGF 
LTAKGAFRWWLSOKHVKMSYPPOSRCGC 
GGMGAAGPPSLTVHQELCGAYSPDGTTE 
ALPLAS WPRARPS PAKAPAYDTAKLPAL 
IGCGSRRPPGVNPGASSLKPGACVSEGA 
GPTGTLESAGSRPPTPLPPPV/CCPGPP 
ARADH*FCHVTPQCR*/PPPRPSP 


9984 


23885 


A 


10058 


393 


43 


HARPGAECERPAEGEAARADAFSGRHAC 
PHSLYCRL*RGFFVHKLHLNKA/RLIKR 
RKADNI KCWQRCGAPGTLMHC * WD YETM 
GLLEQTVFS FE IRSHCAAQAGVQRNNHG 
SLQEF 


9985 


23886 


A 


10059 


228 


466 


GLLHGPPIYDLPEPVSTTPSLYHPPIVP 
*ASPLGPSAILTLPFHSPPPA\PLFSPP 
HSVLGSSGPRSLPT/SPTHLPTLTPH 


99S6 


23887 


A 


10060 


293 


9 


DILQFYTFSPIQ\CPVYILHL*YISLQR 
L * FHQKYL I G I Y F I E TWS CS VARLE CSG 
RITPRCNLNFPGPTYPPTSASGVAGTTG 
VCDFLGLPWSR 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 
peptide 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 

H i n cf tft 

last amino 
acid 

ICMUUC Ul 

peptide 
sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=G)utamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I=lso)eucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 

OssfvllltAm inf» R=Aroininp fe^prinp 
\^ VJ 1 U 1*1 III 111 Cy S\ f\l gllllllCf O^OCI I IIC, 

T=Threonine, V=Valine, 
W=Tryptopban, Y=Tyrosine, 

\T=I In Lr rin wn *=Qlnn /virfnn /=n/tccihlp 
/V UfilvlJUnll, — olUJJ lUUUil, / — JJOjjII/IC 

nucleotide deletion, V=possible 
nucleotide insertion 


9987 


23888 


A 


10061 


199 


428 


TAYCRTLPLKKSNPTISGSFFFFLETGP 
RS VPG * RAGG\DLMAHCS FDLPGSRDS P 
AS PRQWGATGPCHHPWLI FE 


9988 


23889 


A 


10062 


191 


2281 


LQTRGKLFKYKEKGICTKLHRSEIIIGK 
QWKEAVCPPTGQAQSQGGLDPKSREALR 
GGHGFVPGGGVCGHQEL*LHSDGLKAVH 
PAGS E PS DPGGVQAAACQDAATGGT PLQ 
PMPETARGC/ TATAS SREAGPGMDP /RE 
A/GPEAWQLGKARS SWGGSSTSGGPAWV 
FIRDTPS WWGI PGTVSPPAGPRCPPPTR 
PQWEQLGSGRPG * MQTTAEETNAAPRAG 
QERTPSRGPSADPGVSQHVLSHRRSCPK 
CERRSVPPNG\AGPTLWTELHPSDASVS 
ALTPL * GDVEVGP YG\CEGY I KPGRRTH 
QHNWLDLDLRPAASRTCRKMLELPRV* F 
LLRLPELAETYS S T P PACAQRPRGTGGG 
/ PPQRCVLMSGSHQGGAS * KGRDAPSLG 
SGLERPREGGPGWLS/RQPSPHAVQPCK 
HMLCLGPAF* RGRGARGVPVPVRASHFP 
FHGAAWWPGDHMS QP * WQMLS R P ARGKR 
PSWQVRRQCTVLGALKPGLPDLKCPGPT 
AFLPG * GVS /EENGQ I PNVSTQL YLQNL 
P LPR EQN * RRF PRDLS SGGAGPGTG * CG 
cnncxxTs.^ cdtccptjt Daucirn'MCTATAriDT 

<jL?vj>^ri/\xjoir X Jr o K Ki-irCrtrl o INJJiXJ o Nty^ f JL> 

CCWMPAASGC\PGIGCNGVPPVAAS*QA 
AA*TPPWSEGSLPAG\*QPSAHRCVTRV 
PGGRI PHLLAQAVPPDDRAFPNCQASGQ 
LLGVHSRPSLSAAGCRQFLAVSGIGCNG 
VPPVAAS 


9989 


23890 


A 


10063 


65 


453 


ISREFLFLETKSLYIMLKGSIQQEDIIF 
VNIYTLNTGAPRY/ IK*ILLELKRQIES 
DTVI PEDDNTPLSALNRPSRQK/ VDLI C 
TIEQMDLIDIYRTFHPTASEYLFFSAHG 

O IT OxVTV X X rJUunX\.l v. 1 l~i X 17 LJVi 


9990 


23891 


A 


10064 


100 


466 


LGILPLQSRPSGPGGQCTPTLPFWVSLP 
TjPVRGCFLHLCWCLPYSGALGSPrHTGL 
VCWH I PCF * CCGVS P * R\S CLEVHL.YTH 
THTHTHTHTHTRFVLM*MRTPLRKGKRQ 
VMRLCGGGGRA 


9991 


23892 


A 


10065 


226 


377 


GLAGSPCISGSVSFCSGCIY\RGHGIMF 
ICNDCKVFRFCKSK*NCFKEHNP 










JLyz. 




KVSS I CLKMSFLFLFFFRTGSHCH* KVS 
S I CLKMS FLFLFFFRTGSH/ SS PRLECS 
GMI IAHCSLKLLDSSDLTASAS *VAG 


9993 


23894 


A 


10067 


252 


36 


TGFNPLHFIYYYYYYYY\FESKSCSHAQ 
AGVQCCDHGSLKPQP \ PSSSHPPTSAP * 
VAGS RNAHQANSTAWPMVS 






a 

ri 


lUUOo 


JO 


/I R I 


n T T .KNPTTMT .M T KT ,FRMR CJPVT .MRFm? V J 
\j j. jjxvivv^rxnxjii ix rvi iriivt ixvOv_ v xji'iurCjyrcrs. / 

WIY*DESTPGEDTVNTVEITINDLEYFI 

NLVEKTVA/ GVETVDSNFERGPTVGKM\ 

T VPYRF T PHRKTC^ PSMOfYT .\ VP * K^PHP 

LQPSAATTLVS *QP S \ S KQDPLSARRLQ 

FTAGSD 


9995 


23896 


A 


10069 


263 


435 


HFSFLSFFF* KTESHSVT/RVEGRGGIS 
VDCKLCRPGSRDSFASASRVPGTKGISQ 
GRG 


9996 


23897 


A 


10070 


116 


474 


RGAGAWGVSGRSSPPLSALPIQPPPHLS 
TPPPLETP/PPVT.H*PLKDLGGWGALLA 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenylaIanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, l/=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
O— GlutaminfL R=Arpinine* S~Serine. 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X— Unknown, *=Stop codon, /=possiblc 
nucleotide deletion, ^possible 
nucleotide insertion 














Q*RIiPG*HSPKKKKKKKKAPRPPYK*GP 
PKTPVDPPDGPFLDPPLFGWPPPP\VFL 
PGERPPPL 


9997 


23898 


A 


10071 


159 


377 


KSHMPLTLNQKLEMI KLSEE / GLMSTAE 
IGQKL/GLLLPNSQWNAKEKFLKEMKS 
TSPGNT* TI IKWPGAVADA 


9998 


23899 


A 


10072 


180 


473 


AIEIQLLSRPLCLWPSGRFPDAQVANIR 
VRVA V * GNPLR * VGPAEPGEQR PSPWGF 
PSIS/WDIS PTWS S S ASPPGLSADCKFT 
CHYRCRALVCLDCCG 


9999 


23900 


A 


10073 


98 


440 


GQLNKLSGP YPENVG YTLPVL I NPLAQ P 
VIYSTIFAGTLITALSSH*FFT*VGLEI 
NMLAFIPVLTKK\ INPRSTEAAIKYFLT 
QATAS 1 1 LLI AI LFNN I LSGQKKKKKGR 
PF 


10000 

1 VAA/U 




A 


10074 


JJO 


404 


PCLYTYHLHNSNST* SLRLVS IMIEIVT 
SNLHRTVPHSYTPPTQLSINLVMAIHLS 
AGAWI ISYPSMIKNALVHFLPQGSPTPL 
I P ILV 1 1 ETI S IiL I Q\ PRPLAVRLTGN I 
TAGHLLMHIj I GS ATIAI S T I NLAS TL 1 1 
FTILILLT 


10001 


23902 


A 


10076 


3 


419 


KWRKKMWCLQKMEYYSS* KGKKLLSKVT 
IWMDPEEVMLS/EIS/SVTEGQILFNST 
YV 


10002 


23903 


A 


10077 


335 


414 


DKSPF/ D * RFFLPPNSNGFLVKMPPSVK 
RLPFFFIFPFSFFFFETESRSVT\RLEC 
SGTGLAHCNLRLPGSRDS 


10003 


23904 


A 


10078 


279 


1 


KDTQLPFIQFFFETGSCSVPEAGVQWCD 
HSSLQPPPP\GSSGSSPLSL*VAGTIGM 
LI FKI FSRNPLNFFFFLRWSLALS PDWS 
AVARS W FT 


10004 


23905 


A 


10079 


397 


3 


KSSS L FQNHL W ANS S T YSHL I PTLGY I L 
NHGNQ FD PDTL / RDKM * N F FCT I VCPH Y 
ELPSLEQW/VSLGSLNYDTILPL.DLFCK 
RQGKWSE I PYAQDFMTLYQNLT I CQT / P 
QNPPPPPKESSKVELDIIDDPPH 


10005 


23906 


A 


10080 


281 


1 


IFLGEWGPFFPPQKKSFFPKI PQWVFFT 
PPYRKKIFFFLPR*NWA/HPKIFFKRPP 
PFFFFFFFFFFFGFKSNCFILLPPKEKI 
TQAPAWGFAG 


10006 


23907 


A 


10081 


207 


35 


QENRRC*RGRGES/GTLIHGWRECQTVQ 
PLWKTVWQLLKRLNTEFPYDLAILLLGE 
FH 


10007 


23908 


A 


10082 


2 


408 


IAPLHSSLGDRARLRLKKKKKKGKIRPL 
LALFFKNQNPSFKICNSHSFYGALMEP\P 
PPHIPPFFFWKVQPPLFPFRGTLEFPGP 
IGTCKLQTLGSPG* WDPPCPLPHQSPGR 
GGLIPGRKGEDPPSFRGLPPPHF 


I \J\J\JO 




A 


10083 


1QS 




ECKQRPQLEMVSFRRRSHEPFFFETGPR 
S VAQAGMQWH/DLSSLKPPAPG\ SGDLP 
TSAS*VAGTI 


10009 


23910 


A 


10084 


219 


I 


RLKIKLLKIIITTTIDSGICSFKSCKL* 
H/LKIQNIWLDKLAHAYNPSTSGGRGRQ 
IT*GQEFKTSLGNMVKP 


10010 


23911 


A 


10085 


364 


413 


KKKGGEKGPLLKKER* KTNGQFLVHTNF 
/RLPGLKYFF/CPPPPSKWGLRAPPPKP 
GD F F F FF F FFL VFLVETG FHHVGQAGLE 
LLTS* PI PLGFPKCWDYRRE 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
v/^vjiuiamine, t\— Argimae, ^— oerine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y^Tyrosine, 
X = Unknown, ^—Stop codon, /==possible 
nucleotide deletion, \=possible 
nucleotide insertion 




/.yj \ a 


A 


lUUoO 


zuu 


J 


WRDHGTLQ P QPLGL KQAI L P / ASAP * V P 
RITDVHHHARLN 


1UUJZ 






JUUo/ 


1 j4 




\RRTDPLRPAVQDQPDQRGETPSLLKKK 
KKK 


10013 


23914 


A 


10088 


157 


390 


NSSSFFPPSCGAQ/KSEVKKSSGPHSSS 

QQfI*PPl? /PT DT T VQPPQfcJUPT aVPQT.T 

PVSACVFT* PFSMSLCVYIQISVS 


10014 


23915 


A 


10089 


305 


1 


KKKRAQSLPGQKGGRGFSQPVTAGDRSF 
RPKIRKEALGLGQLGLTDLCRTFPPTAT 
vj irro o/ilr c ir\j \ 1 tr rv tr ± xVi\.i\_L\ i in j_hx 
*FTRIKI ISRIFSGHEK 


10015 


23916 


A 


10090 


329 


2 


RGFFQPLGFWDPGPISNLGGPPKNLLLK 
AEIMGYNFCVPPFPPGGLPTTHFFFPPR 
NPPF/CFPFTPSLGFFSPNPPGFFFFFF 
FFLSRVS LCHLG WS AVA* S WLS AAS 


10016 


23917 


A 


10091 


84 


408 


kKAGl±»NKJ-ir WGrJjiiijGIljr IbbiKtKI 1 
FCIjVG Y F I F I Y L FGEM E/ FS LLL PGL * C 
NGTCSVHHNLPLPGASGSPASTSGEGGI 

WHVLGS PQPSPAWCKRFSCLNLRRGWY Y 
RHAPAPPSLLNIQKI ILGFWLAPVY 


10017 


23918 


T\ 

A 


10092 


184 


359 


LLPPP \GS S DS PAS AS * VAG I TGLHVGL 
SHK 


10018 


23919 


A 


10093 


298 


386 


KLCEK* * ITPIRMAI I * KKREGENS \KC 
QFLKILGIEL 


10019 


23920 


A 


10094 


310 


1 


ICMNEMILYKQDFGRTIIIRKIVIYFWI ' 
FN* FGH* IRIY*NLLRYKTCML*RG* * I 
NLPQRSFKNLRLKI * * W/WPGT VAHAYN 


10020 


23921 


A 


10095 


40 


407 


EHTHTTACEPGGI * IRPVD*MVLISWL* 
HGTlTCKMVTLGETGQKPQGISLilllFF 

PPPA T?riQT.P\7T->P"NrfTt7"WPPf2DT <?2VP*KT,P 

LPGPPLSPT*AS*VAETTGACYCTWFEN 
W F FT .R T . PT ,f?P Y 


10021 


23922 


A 


10096 


189 


2 


PPFFFFSNLFKKSFYEQGLALSPRLKCR 
GIIIA/HCS/L*TPG/YK*SSCLSLP\S 
SWDDRCAPPRV 




J.5yZ5 


A 


1 f\(\Ql 
\\J\J7 I 


ZOU 




TiHT .P <5P.Vnr>* T /^PiUfiPT^ciT .T.lOJCiV T .FlfS 
iJXlUjXoVjr VyL/ UoyilVjii JL o O-Uii-ixv V yjvJjUU 

RGGLQLIP\RRLMHENHLN 


10023 


23924 


A 


10098 


187 


464 


WYMTRCGSLGPSSAAHE3HHPPPQQLMN 
P\PSPSSAAHESHHPPPQQLMNPTIPLL 
SS*IPPSPSSAHESPGRVGRPEFTGGAK 
\j I Juri ±jo Jr ir 


10024 


23925 


A 


10099 


397 


1 


FFGGAPCPPPRRKGKTPPPPRKLFPPFF 
PKKPPLPKKNC*NKNPLLGPKKKNKRPP 
PGRFFKKGFF*NPPKAPLTNFWEKKKKK 

\ CDl?I?irr , T7'T7T?DDOTrDV'V'DD'DT« , T?T?T?T7n , n , T? 
\olrr Jc V r J? ir jrv?£.Jr ivlS-ir Pirr r r t* r c c r 

FFFFPTRPPRVRPRVRPRV 








1U1UU 


ZjJ 


J 


KQKRKCQVL / WRGCGET3TLVACWWE CQ 
MEQPAPVEN/ r /7*HFLENIKIQLPYDPAI 
HE 


10026 


23927 


A 


10101 


121 


404 


VLASWS I FFFFFFWKGGLI FS PRLEGG 
G/LILG- LKPPLSGLGPFSCLTLPEGWE 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E-Giutamic Acid, F=Phenytelanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K-Lysine, L=L,eucine,M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R-Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LKPPPPDPGFFFFF*GKTGFPRGGQEGL 
NFLTWEPPPLAL 


10027 


23928 


A 


10102 

• 


468 


1829 


IRDTYTQNPBKAKGYFSCI FFE I EETTM 
SRKQNQKDSSGFI FDLQSNTVLAQGGAF 
ENMKEKINAVRAIVPNKSNNE I ILVLQH 
FDNCVDKTVQAFMEGSASEVLKEWTVTG 
KKKNKKKKNKPKPAAEPSNGIPDSSKSV 
S IQEEQSAPSSEKGGMNG YHVNGAINDT 
ESVDSLSEGLETLS IDARELEDPESAML 
DMLDRTGSMLQNGVSDFETKSLTMHSIH 
NSQQPRNAAKSLSRPTTETQFSNMGMED 
V/RPRHQ* KS * VP I FEKSVKDLQRCTVS 
LARYRVWKE EMDAS I KKMKQAFAELE7 
LINGSRKWALLAEMDKVKAEAMEFLLSR 
QKKAELLKKMTHVAVQMSEQQLVELRAD 
IKHFVSERKYDEDLGRVARFTCDVETLK 
KSIDSFGQVSHPKNSYSTRSRCSSVTSV 
SLSRPSDASAASSSTCASPPSHTSANKK 
NFAPGVY 


10028 


23929 


A 


10103 


254 


2 


KKKDFPLTPLGFGGLKIQLFFFLQKKKN 
FYFPP* IFPKIFFFFFFWFGTQSCPVT\ 
RLECSDRIPAHYNLHLQSSSHSRASVSH 
E 


10029 


23930 


A 


10104 


256 


419 


KYIELHTGR/ CAVGQHVGRPRRVDCLSS 
GIGDQPEQHGEPHLY*KNAQSSQAWW 


10030 


23931 


A 


10105 


153 


439 


TTVTSLCIYYTIYYYFRLLSTHAYTHIY 
THIYV* KLTIKQPQAGPSEGF* KKEF * * 
RRW/WVGTVAHTOTPSTLRCQGGWOT*G 
QEFETSLANMVKL 


10031 


23932 


A 


10106 


424 


157 


SSHLGRPRWVDHIiSPGVPDQPWQHGETP 
S L/LKLAGGACL * S QLI GRM / RLS SGGR 
GCSE P * SHHCI P VWETEQDP I S KNKQT I 
FRMFKN 


10032 


23933 


A 


10107 


213 


3 


NYKCP PTPNLNTLKAL Y FKKL I L S PKA * 
AIKEKID* /IIKIKHFYASNIINRMKRQ 
LTEWKKI FANHVSDKG 


10033 


23934 


A 


10108 


300 


1 


TPKQRGKKGSPREKPFPVNLPGKTLF/Y 
EV* FMNPLK*QKKKWFQGFFFFFLICGT 
VYHQFTSK* ERGVPGTVAHTCNPSTLGG 
RGGWIT*DQEFETSLG 


10034 


23935 


A 


10109 


120 


421 


KKIKKKNKNHMII\SIDAE*AFDKIQHR 
FMI KTLNKLGPEGKHI / KIMKAI YGRGG 
QDGQLHTAKK/EPLPLRENKISGKP 


10035 


23936 


A 


10110 


102 


418 


TLVCVCVCVCVCVCVSLSVFLSLSLCLS 
QAGVQWCDHSSLQPRPSG\SGDSPTIAS 
WGGITGVHCHIWPTFYFLFLIRLKSLM 
L* *TSSEMALTKMGVHSHGLFN 


10036 


23937 


A 


10111 


382 


2 


VNNATMPSKCSTEKSHMSLTSNQKLEMI 
KLSEKGMLKAKIGLKLGLbC/QVSQWN 
AKKKF*REI*WPTPVNIR/DMQKVLV/V 
KTEDQTSHNI PLNQ I L I QNKS LTLFNS I 
KAERSEEAA* EKCEDSRG 


10037 


23938 


A 


10112 


61 


427 


ENTI YQVQCS I KEE YLQRRTLKYAS LFQ 
KH I CG S LAFLHLL * P KL YHS RSNAE SDW 
1 1 QLFS VQ PNVKE I S KH KI G W * RGHACN 
/RQHFERPRHKDKLRLGVRDHPCQHGET 
PSLLSLLKIHI 


10038 


23939 


A 


10113 


71 


399 


NVLGKE ARSVG WE A VPGQ I SRV S IQMKW 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=PbenyIaIanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 

fl-^-f^l ■■ f n m ma D^-A mining C*CA»»jnA 

vz— Uriuiainme, i\— Arginine, oerine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Ty rosin e, 
X— Unknown, * =s Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KGLIKGEKLKMEKSQACICIYFFFFETE 
TNFIAQAGGQGHDLGFMEPPSPG\*SDS 

P A * APT-TFONGT, /TfJMT-THTJP'R VT F\7 


10039 


23940 


A 


10114 


362 


1 


SKKKSWGSHSGSKCHTSFLSYPTKEQKS 
YLS LFYKCNLQS LTFKE LLDHFRPVCLP 
TIY*NKNWP/RDLPLKPSFKTLML*NQT 

T . / WPfiWAHA n\IP CI T T ,CZClC\dCZV3 T TPfSOE' 

LKTIKANTVK 


10040 


23941 


A 


10115 


314 


3 


WRERRCGAHLMRCASNMAHDKPSRMLNL 
* KPKMAT/RNTDHDNFW * GHGETE I LNH 
YWWDYKIM*PLWKTI*QFPKMFNIKIiPY 

UQYTnT.T/lTVPPMT.IfTWTUTK'T 
vox J. yijijui i irKiMijivxiN jlxI llvl 


10041 


23942 


A 


10116 


44 


394 


PGFQRVSQDGLDLLTS* SAHLGLPKCWD 
YRHE PLRPAG I YS YPAVLFVLSTYRASN 
IT*MLMTPKPTSG\YSFLLSFRFSSFSY 
LRD I S AYMSLS CLKLNVFKS KP I IFLS P 

T.MT.TiV 
UVViiUV 


10042 


23943 


A 


10117 


398 


3 


MESCCFMGIEFVLKEEVLEIGCRTM* I Y 
LHY* SLHLRMVKMVTSLLCVFYHN*TI F 
KKS SVFLYAGNKQL* I * ILKNYI YVMI P 

LKE I TD \ * NKWRDTTFKHQN 


JUU43 




A 


101 Jo 


104 




xWir V WAi L.o ViUXUlKLiAoriVjOWKljWlKijJr 
WRLRRDNRLSSGSRGA\AAEILPLCHCT 
PTWVTEHDSITYKY* KKGAP CWRAS S * C 

TT OH A T . PPP A 


10044 


23945 


A 


10119 


258 


401 


YNYP FLTCVMKE K I GAGMAAHACNPS T P 
GGRGG* \TRGQHLETSPTNMV 


1 f\f\AC 

JUU4D 


zjy4o 


ri 




1 1 A 


jo4 


rj\/pp"PT.TPT.vnnTTfr;r^PDPPPPRT p*ra 

FLFFFF/ETGSHSVAQANCSGSVSAHRS 
FHLPGPGDPPTSAS * VTGTTG TCHH^ 


10046 


23947 


A 


10121 


245 


479 


TSSLYRKIQKRFLYMLALEVLKICNTYF 
MNTERFPLKLFGQFS/FRI ISI* FFPFL 
FPKT YHYFLRDRV* TjCHPRW^ AV 

r JET X\. A. i Li A C lu\Ui\ V JJ^.ntrUrHO/lv 


10047 


23948 


A 


10122 


268 


469 


NSVFWLG WAHA/ W * VNPSTLGGRGGW I 
TRGQELETTPANILPP 


10048 


23949 


A 


10123 


221 


423 


GQAGAVAHACNPNALGRPRREDSLRPGV 
GDQPGQHSKTPSL/LKSDNF*KMGWLGP 
WAWWI/PP T T PALW 


10049 


23950 


A 


10124 


73 


183 


AGGGWALTNTVINPT\PPSTPFTPCLSY 
SCTISTASAFRFPG\STSTAVLTHVRIIi 
KSTPDAPHPPLKI PRARRNIQRDHTLSS 
NLFYYWHQLNVTEELTYYDLT* INPTAT 

THaTHTMPT.T.RT.HWT.NPTQT 
irinin x utc xjxjv xtriiH i_»in v l o jl 


10050 


23951 


A 


10125 


342 


2 


PNLGPCPPPRGTKGGPFFPSFHPSI\IG 
PPNFFSF*RP*FF*MLKSLGVP/SPTGP 
IPKFPFFPGFKNYFSPPL\PKGPLLKPP 

RSSC 


10051 


23952 


A 


10126 


85 


442 


KfCVINIKCW*RVWRTGSLIHCSWG*KMM 
PHWRAVWQS FR/SLS M YLS YE PALPLPG 

NH 


10052 


23953 


A 


10127 


180 


484 


KS FF F FS QSAQP I KVI PAP * NFHFFGVK 
RSSCLSFLRKWGPRWVPPPPFFFFFFFL 
I FVEMRSHYAAQAGLKLLASS\VPPPSG 
IPRTLD*SYPD 


10053 


23954 


A 


10128 


376 


i 


WS PGAQLIiF I SCLYYNLWSL.HTGRTPAK 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

a minft arid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

npntidp 

UvUllUV 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 

III 1 41 1 J 1 J 1 J , Jtv /ml JillllllV, ij k/vl lllvj 

T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion \~nossihle 
nucleotide insertion 














PP*GWTLQKTTPLSCPAP*I J PPPLPSPL 

QQ idlkeklvftapacsl/ phspnsspw 

PQRRWPSVS /P /PVAHPAAPRLPPPPVC 
ARPLWLPEAPLLGI PPH 


10054 


23955 


A 


10129 


140 


454 


CLIRSSINSVHNNIYIHSTPDRHLGCFQ 
FLWTKNAAVSRLAYVF+ * I * IPTYDHI 
HTHTPAGY I PRHGIAGSQLTNXFNFS S * 
LSSNSF\QVGCTSYFLFPSHI 


1 UUJJ 




A 


ll/J Jw 


T40 


47^ 


AGARHSVLRPRPPLiAGSSCPSS /CGGHC 
VALGVSCPGGGCRAACVPGCCCLLCGLP 
SDVCVC/SVCWGGSMGSRLGGWAAPGF* 
GNSREGPGNCSSGRKAGLAGGGLGPAWA 
EQAPDILCCGPGPHWLGAPAQVPLAPPIi 
SPPPCLYGCLYSDRGDL 


10056 


23957 


A 


10131 


280 


2 


DSLALLPRLEYNGVNVAHCKIiKSSWIRR 
SSHLSLPKHLDYRSMSH\LPGLEIF*YS 
YCK*CLLTSPYFEFLMNSLEDISSCFIP 
IRVMPKCRMK 


10057 


23958 


A 


10132 

■ 


244 


3 


IISTDLISHSWTLYLISSLVYVFLKQHS 
LAVLPRIjDCSGMI iahc /klotpglkr / 
FL/CLSLPST*DYRHTPPNRANFYYYYF 


10058 


23959 - 


A 


10133 


272 


3 


vgkprvfpyriafnfgrvtsilgphikk 
yhfqcrggkhcff* kappffff\fnetg 
fcsvprlecsgti I alcslnlqgscns p 

TSVSQV 






A 




£.11 




fpr fr p l vs p alte lgc f / hfiitrgl * k 
KKPFPFLiFFFFSETESCSIA\RIjWGSGM 
. ISAHCNLCLPGSSRTRG 


10060 


23961 


A 


10135 


229 


3 


KNDYLKCWQGCGRTETQMLGCWDIK/LI 
QP /LWKI IWQFLLKLNMHLLCNPTLPLL 
GIYP*EMNVYVHTQTCICTDAW 


10061 


23962 


A 


10136 


84 


352 


RREFKTGLATWQNLVSTKKYKTTVRYHL 
TSTRMDIIKGKR*QVCW/RKC/GDKGTL 
LD C/ WMRMLVQP I WKI V * VWKFCRK* KL 
NLFFPPPPQ 


10062 


23963 


A 


10137 


280 


2 


KCWVPEERILREKSQHLGKIKQEGRLN 
AWLIQPGQHGETLSLQKI / QLAGHGGA 
PLRSQLHRRMRREDHLSPGVRSRSEP* * 
L»S LHWVTEQ 


10063 


23964 


A 


10138 


2 


386 


HIQEKQLPNKDSTLNPYFLIiIS ILKRKD 
EKIQMPSKHMKRCSTSLVIREMQIKIRS 
HFPH * VGKI LKSAN I KRW* MCRKRG \TL 
ITCW*TYK*TYKLVKTF*RIILQHPLLK 

THQSYYPAI PHLKETQA 


l UVU4 


iJ7UJ 


A 




229 


I 


TRS YHESLTILHTNHHLNCS P FPQAITP 
*SRFPYSNRA\WLGSVAHTYNPSTLGGR 
GRRNA*GQEFKTSLGNLPK 


10065 


23966 


A 


10140 


261 


429 


LSHLFHVALI *LGLVW /SFVFVFFETGS 
LPRLECSGVITAHCSLDLLGLSGLPTSA 
F 


10066 


23967 


A 


10141 


377 


34 


WVFGSSGPPPPGFKXFFSPPPPKNKNPR 
GPPPPPIFFFFFCKKGVSPLFLPPGGGP 
/HPFLS PPPPGVKPENFFDPGGGGFP * T 
KIFSRPPPPGGKQNFFFLKKKKKKKEHQ 
IKR 


10067 


23968 


A 


10142 


208 


424 


F FS LFMQT FH I H I R FY FFLAGGGG AGVD 
RVS / LLS PRLECRDT I S VHCY I HLPGS K 
Q FS CL S LL- S M * N YRHAP 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, FHPhenylalanine, 
G=Glycine, H^Histidine, l=I$oleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
^f=v»iutamine, K— Arginine, oenne, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X^Unknown, *=Stop codon, /=possible 
nucleotide deletion, \= possible 
nucleotide insertion 


10068 


23969 


A 


10143 


1 


297 


PTRSPQVSNLNHKSFISFYLFIDLLSVC 
LS VS VY FET S LTLLPRLE CSGA I TTHLS 

SGIELLSSSHLPTPA 


10069 


23970 


A 


10144 


185 


488 


SGFPRGTELIGAVCVCVCVCACVCVCI K 
LGYI*NHKRIYI*LLYIYSTYYIFNLSL 

LKTCLENTHTLKNTH EGG 


10070 


23971 


A 


10145 


283 


3 


LGWPGRKADGSGKLTIDYGRLDKLVSP\ 

AHAFFSIR\NQSQFSFIRDSPQYAFTLP 
SEEHLNSLDGQ 


10071 


23972 


A 


10146 


146 


380 


C FLWDNFFFF I FTEKTGSCS I AQAGAGT 
GTTGMPHHAQ VI YFFI FL * RQGL 


10072 


23973 


A 


10147 


275 


3 


I KGMLKAKTG * KLGLLCQ/ TSQVENAKE 
E FLKE I KS AT P VNT / RMI RKRNCL I AEM 

T? l^T 7-*- T 7«"PT TnnTPtlUTT OT n/^OT tinCVPT T> 

Ji tw * VTLi I DQTGHN I P IjoQS JjPQo KCLi I 
LFNSKKAER 


10073 


23974 


A 


10148 


281 


2 


KIFLPQPPQEMVFQGFAPPGG* IFLVF* 
*KQPFPPLGRFVFK/LPGGIPFFFFFFF 
FFFFFETESHAIT\RLECSGTISAHCHL 
RLSGSSDPTRP 


10074 


23975 


A 


10149 


413 


139 


SLRVWPI FGLLQANKKKFFWILFLS/IR 
PLFFFSRERGGTPLFPPFFLRWPGKKSF 
PPG* PPPPLKIQFPGKYFFTSRPAPFFP 
±jb C_QPPGKNIjNYWGIjGPGi«PK.l£ PPPP.L 
FFFFF* DRVSLCRSGWSKVARS * LTWGK 
AFFTWVS 


10075 


23976 


A 


10150 


100 


431 


GPRLTDHLlRSGVPNHPGQHGETSFLLI/ 
rLN x bGG * GRKIANKliNPKGOCjCSEPKIbC 
HCTPSWGKKRNFVSKKKKKKRE 


10076 


23977 


A 


10151 


134 


417 


GLAAPLVG/WGKRNPPP*PFG*GEKTGW 
GPHPGKIFLKKKLLFFFFF*KGGLSFPP 
GGRERAHFWLKKTPPSRKKEILPPPPPR 
RGGGGPPPPPrlj 


10077 


23978 


A 


10152 


403 


2 


LAFP I LQEVPQCFCNLLHSLRMYS STGS 

LQSPNTALLLKYL*EAALKKKSR*HITS 
SL* KVDLS FFLETKPRSI A\RLKCSGTI 

o/-\tia_o J-iKJ-ii-toxron o v-rii Hoy vx\\j 


10078 


23979 


A 


10153 


382 


297 


GLN*ESWSGEKGTL/LHCWWECKFVQPL 
WKMV*RL 


10079 


23980 


A 


10154 


209 


15 


S S PKF * FFFLSPKKI PLPFF* PI FFFTK 
GAITAHCSSTS 


10080 


23981 


A 


10155 


153 


422 


RTSGKNGNRGQIILVTMVTNWLFSFFFF 
PLKTGPHFVIQAGGQGRAHGSLQA* IFG 
\SGDLLAIjVSIjQAG i AGFHHHAQJuSFFF 
KKTGLF 


10081 


23982 


A 


10156 


257 


3 


RRLQNS KDCCLFFPLEAPRGATARCQPE 
LSCMRCLSA\LLGGVSQSGYMGVRDPHK 

s 


10082 


23983 


A < 


10157 

I 


37 


386 


DATRFR INSVGTAEGGLDRIEPNDLQ* Y 
INLVGTAVRRPDRIESNFESSTVGKKLS 
NNIVCYRSFFCERBSQLTQQTSMLYYFK 
KLPQLPQPSAITTLH/SRQDPPSARRLQ 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

ominn ariii 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

npntide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=G!utamic Acid, F=Phenylalanine, 
G=Clycine, H=Histidine, Wsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
0 = Glutamine R=Arpinine S~Serine 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \= possible 
nucleotide insertion 














LGQ 






A 


10158 


407 


272 


HKGGKRERKRKKKEKKE3GREKGQERRK 
ERE/RERKKGRKREMKKVNRYSKCTQ* * 
LLSN 


10084 


23985 


A 


10159 


248 


478 


VKWHRLLLDPSVEDPWKKSLKGPLQP* 
/RL* I FFF*NSGLLCHPSWRAVAQSRFT 
GVSTSQAQAILPSFSLPSSWDR 


10085 


23986 


A 


10160 


458 


129 


GFSVLNRLEDEDLRQENLLMPAV*DQPG 
QHSKT P I STKlsJNMN PAGHGGVYP * SQLL 
RTVRPEDHLY/ SNKPCSYLCAPAWATET 
LSLKNAFSCCSNHYGLSFQTSFHRILMF 


10086 


23987 


A 


10161 


82 


291 


YSKTKSLFFAGHGGTCIYSQLFRRQVQV 
RQEDHANPGV*GCS /DTMIMPMNSHCTP 
AWAA* * DPVS KKKKK 


10087 


23988 


A 


10162 


6 


409 


LR INI LPEP I SLS PLTAPAVFMP WTNYS 
S CDT FS LASMTLLPIjHFLSTS LALPWSIj 
FLAPS LFQIS KHQLYLS PVLLTLS SCYF 
LN PC FRNHI LMNLGRPRQAN YLRSGV* D 
QPGQRGE7PSL/LKNTKITWPWW 


10088 


23989 


A 


10163 


249 


415 


APPECSLPLLVLLLLLLLLLFLRWEDHL 
GPGG * GCS EPKS YHCTSAWVTR \GDPVS 


10089 


23990 


A 


10164 


1 


405 


RSEWRLRQPERQSETVKKKKKGFFFFSR 
GGGGGKIFGLMEPSPSGLGQFFLFTPLE 
NGGYRPPPLGGPFFFFFKKGGFPLCGPG 
GFEFPALGTPPLLFPKGLNFRGGPTPPA 
p PPFFWRPVR^VF^ PK"NT*NPPVyPOKNf; 
KNRP P KKKKPGRG \ WG P PR KFNP FGNKR 
GGVPRAGNSKPPGPQRGNPPFLKKKKKG 
PPKGGGR* PPFSRGVNKKNCPKPEGEGS 
INPKI FPPPPPREKKKKPFFFFLTVSLC 
RSGWRNLHSL 


10090 


23991 - 


A 


10165 


175 


404 


AIKMVWIQAVPSQCLHGVGFYLPRSVCV 
KTP * K KRHR QG WAHGCNPS TLGGRGG W 
I \ RGQE FETS LTNMVKP 


10091 


23992 


A 


10166 


317 


2 


P FKNS PLLLNEMTN I HFLE PTCNYVNVS 
PFLK*HKFL?F*EMESCSVT\RLECSVT 
I IAHCCLKLLGTSNPPTSASWAPGMI IG 
MRHCPAPKVCSHASAHASAHAS 


10092 


23993 


A 


10167 


325 


2 


KMGEKQGHFIKEEIRKANKHIERCSRLL 
AI EEM * I KATVT I /RKHQTKC * QG * RES 
GSLVHG* * KHKIVHPLWKILSVSYXTKH 
S I I I IGYSS VLLGTVLVHSGSDAW 


10093 


23994 


A 


10168 


245 


3 


GEPFLFCFPIKKPKVFFCPYFNQAVFFL 
ETHRGFFEFFFLF* KKKKKS/LGMVAHT 
YNPSTLGSRGGQI A*AQEFKTI LGNMS 




z.jyyj 


A 


10169 




1 


MNKMSQYTFPYKN* QWGG \S VAHP YNPS 
TLGGQAMQ I T* AKE FETS LSNMAKPC 






A 


10170 

11/1/1/ 


714 


419 
*t i y 


LS*PPPPFPGFKKFNRVLRFYGKSKPGP 
/FMFVALFOKGGPFYQNIPPPVKLEPFP 
PT I LRVS FFFL* DGVWLCHPGWN AVA* S 
QLTASSTSRVH 


10096 


23997 


A 


10171 


168 


1 


WTWWLFPGFS ITNNYCSHFSSLHLAI W/ 
LGT VAHACN ?S TLGG * GARGQE FETS LT 


10097 


23998 


A 


10172 


139 


3 


PAH/CES* PPGLKRSSYVSLPSG*DYR/ 
HTPPH\QLIFVFWEMGFHQPA/SA/FP 
KC*DYRS\DHHKWPT*HLK/SFHFIKK* 
N*LKRYLACSRLWYLFCLIRCMV* *NLD 
B LS I L YLK LNKFQFL F I YLF FEME S 
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SEQ Il> 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenylaIanine, 
G=Glycine, H=Histidine, I=1soleucine, 
K=Lysine, l/=JLeucine, M=Methionine, 
. N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 


10098 


23999 


A 


10173 


253 


2 


FPLKI CNTLPPQIFSTQRGLLSLPFTI F 
LLSLILLRPFFFFF*TESHSVPECSGMI 
S AHCS LRLPG SS DS CASAS \ QVAGKTCA 
R 


10099 


24000 


A 


10174 


300 


3 


GRAFFFKGLVKWVFLTPNP I PFLLKNFL 
KRVFFLAPSEKFVFFKNWPP*LSFLIFG 
EKNFFFFFFFFSEMES CSVA\RLECNGM 
ISAHCNICLPGSSDSP 


10100 


24001 


A 


10175 


117 


381 


E I LRY IVSIRI RRTKLKFQQEELRVN I K 
NNLSSLGNMVRPRL/ECSGVITAPTAVS 
APGFK* SSHPSLPSSWDHRRAPLHLGDF 
FL I E TRSHHVAQAGQV2 FNVYS KFLLLK 
FKISPSDPYRNNISQDFIAIGPTNMVPL 
QPYIVLCACRTK3S1NKI IDLDSEQSSRT 


10101 


24002 


A 


10176 


194 


412 


TLKNLLLSLHCLSFVPTDLYQPERLSKP 
PRP AKKYKFL I L / WLG WAHACN P S TLG 
GRGGWIT*AHEFDINLTK 


10102 


24003 


A 


10177 


100 


339 


VPVKSLLLCVCVCVLLL*RISDHLPTLY 
ANLG * CFFS YLVI SYS \NST YC I LQRHL 
LAIQKQP/CSFLRQ'/HLVGWLHKKKHTH 
THTH THTHTH KS NTHTHTH KS KDLTGTQ 
NTTEFLLK 


10103 


24004 


A 


10178 


229 


2 


KEKYIKSKLHFEARHKGSHL*SQHFGRP 
RLMDCL\VQDQPGKCGENPYLQKNTKIS 
QAWWLVPWPVYSHRASKTLS 


10104 


24005 


A 


10179 


368 


2 


SGSSPSTSPQGRPDPLGLFSAKIVLGQL 
ATIPLPLCFLLKIFHPLTLPPLLLLGGE 
FNLSPQQQDS I SLPWFLCYRYGPTYKAF 
FYV*ASVPEKI\FFFFFF*DTVSLCHPG 
WTTVAQS RLTA 


10105 


24006 


A 


10180 


398 


1 


QNIMLYTINTYNFICQLKKKTIQWGENF 
FFNKWC*ENWI FTCKRKNLHPYLIPYTK 
TNSKWISELNL\KSLKVNRGGNLHDFQF 
GNGFLGDTKSKTK/AK1SKLEFIKIKKF 
CVLNDI I KKVKRQLIE WKKYIiQ 


10106 


24007 


A 


10181 


252 


419 


GARFKGSNFTSAGGQGIIFFMGPFKLIS 
RPVF*HRGEGKTPGVTQLNRLEEHPLFA 


10107 


24008 


A 


10182 


101 


380 


FCWIYNSWLIGYLFFPFRTLNILAMOGG 
SCL+SQYFGRQRQADHLSPGVRDQPGQY 
getps/ltki SWAWWHVPWPATQEAEV 
GGSLEPREVK 


10108 


24009 


A 


10183 


2 


377 


PGQVDCFSSGVQDQPDQYGEIliFLLKLQ 
KLGGHGG/IR* EDHLSLGGQGCKEPRLY 
HCTPAWVTEQYLVSQKKKKKKKK 


10109 


24010 


A 


10184 


140 


3 


DGVLLSLPRLECNGAISAHRW \SPSCDS 
PASASQAGCCGTCP* SQL 


10110 


24011 


A 


10185 


364 


3 


SVCPPRLGGVSQSGDMGVRDPFEEAV* P 
L*AELERCAGRSTAFFRAIR*AHLSLLK\ 
FTRNHPFPHVPCPRVMSR* FGLLPFFQR 
CPAQRGE I WRRS LAAVALRS CGGRLQVR 
TCWQLCLHR 


10111 


24012 


A 


10186 


2 


249 


S VRLNKNKENTQ I RNKKGDT I NLTE I KR 
I IKEYSEQLFANKLH/SLDEMNKFLER* 
NPLR PGAEAQRLVS * RFWRATAI DHMRP 


10112 


24013 


A 


10187 


270 


2 


PWAQRGCVFKTNQKI FFGFFE I PHLD* P 
CVFQRPPS\F*VRLIFIGGGFKPFFFFF 
FEMKSCSVARLECSTISAHCNLRLPGSS 
DSPASA 



1163 



WO 01/64835 



PCT/US01/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

a II iino *IUU 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 

aing w 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid,'F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K= Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 

\£ — uiuidinmc, Jv — Argniinc, o — ocrine, 

T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 


1 ft 1 1 3 

ill J li 






JUloo 




1Q 


r ir ivjwa /Hi \jj i\ * vvoivixi J. r m jv»m » vv.cji\.i~ _l 

C/KKLRDSSLTSY/TRINLia-JIKDLNVR 
TK / S T KLLE EN I DI V/ 1 YDLGLG IG FLD 
MTLKARATK/AKIEKLDKIKIGNFYASK 
DPKKMKR* F I EF IEQGKI FTNHVSDDTS 
I 


10114 


24015 


A 


10189 


85 


243 


DRVS / LLS PRLECSGM I SAHCS FDLPG* 
SDP FTS VPKS S WlsTYRC P PAR KLS FT 


10115 


24016 


A 


10190 


96 


440 


LQSNHINTLNVNRLSSSMKRFRLSDEIF 
\KKHGPALCCL * EM YYKDKGTCTLKI KG 

WKNI yhanaknnkt/ gmvllsgkirles 

N /TVTRVKFR* TA7T /T"NIf?FT.T /OPnT.TT. 

LM / 1 v II\V IVEiU DV1 / XlNOJvXJ J. / ^dUJj J. i_J 

LFILNVYAP 


inn/; 






1A1Q1 


117 




AKDTiDRnF^KPnTHMTNRYMKTC/^nRO 
L I RETQ I KT TMR YP LT PLDLRKS KD KKS 
* * * YEQKGTPAHC* *ECKLVQPLQKI I * 
TFPKLLKINL 


10117 


24018 


A 


10192 


221 


72 


AKVK * WVF F C FLFFF* DG VS / HVWAQ 

E» O VJ.ri ± Oi-in v-li JJ^.JUJCr OOOl^O V in. V O 


10118 


24019 


A 


10193 


460 


121 


SDKTKYW*RCRAIKTLTHF* LKSKLVPL 
L*KTI * CYLAKLKIHIVFDSAI PL*DTY 
FR / E * FCI CAPKTYSNMVI E ALFVPANQ 
TSKKKNWK*LSCPQIVDNFLNIHLMKYH 

J_LCj 


10119 


24020 


A 


10194 


393 


2 


ATMPSYQRLLLTAVHELESPE*MDQFLD 
TYNLPRFNQEEIENLNSPIMSNEIQSVI 
KCL P FG PNPG PNG F 1 I EF YHTY KEE /LP 
ILL*LFQKIEG\ILPNSFHEVSITLIRK 
TH KT) AT + FTHlfP T Q T .MMTn A 

± L>XvL/A X Ci X ni\.2r -L O J_i: JJ.N X XJt\ 


10120 


24021 


A 


10195 


3 


353 


EAAPMPVKFFQLLEPSGPPGLSMSCLGD 
CNGYACFS VSLHQSANQS 1DQ/S I YQSA 
IHLLIY/VSIIY*SI\YLLL*SLI*PMY 
VPI Y PL S CL.S T YT»S FMPLS CLS FNNLS I 
TRKGSMCL 


10121 


24022 


A 


10196 


118 


395 


DFNAGLLiPSYQVLSLYFCF*NGV\CKCT 
P/RLGDRPRLR/ PPKKKKKKTPPPPLGG 
QPPGGF.PYTPRPGVGPPPPPFEHPPFFTST 
PGCFHPPPKAWP 


10122 


24023 


A 


10197 


437 


160 


FGYPRVFP LPALLKRG PG I L F WG PNKKN 
YSSPARGSKICFFKRGPPFFFFFYFFF* 
OTESCFVAPVGVOWCDHGSLOPOPPG\ s 
SGPPPSTS 


10123 


24024 


A 


10198 


259 


3 


RQGNHVTHVYS S RLQLI LS P PAS LAVF I 
QKIYVNLCTYIHIVNLYA*MCICVCICY 
TCIL\ ICI YGLVAVAHTYNPTTLGAKLG 
GS 


10124 


24025 


A 


10199 


1 


416 


I ISAHCNLHFPSSWDYSRTPPYPANFCI 
FSRDRVSPCWSGV*HHLEQESKMLSKK* 
NK I DMHGGMCL * S * LLK/RAE * DVLRQE 
NCLNPGGRGCGEPGSRHCTPAWATRAKT 
I.^KASVASSSSSPKKKKKKSPS 


10125 


24026 


A 


10200 


369 


3 


TAPPFFFFCRDGLRTSLI IFPSRATP YS 
LVTSGSFLPAGAGGRGLSLE IRVIiDLVQ 
DGCPCSQAWAAALGGPGWWAPFLCFFET 
ASCSVAQEGMQ*CNLGSLQPPSPG\SSD 
SPASWLWTSK 


10126 


24027 


A 


10201 


314 


2 


LHHMRPEV* PTPSKLGVTGYMVSNISYP 
WRCRAKK - S QVLVS LRHKKLF PLEMESG 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

4 IIIIIIO *JLiU 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D^Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine, 
G=Glycine, H=IIistidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 

\£ vjiiuutiniiie, tv — /vrginmc, o — ocrinc 
T=Threonine, V=Valine, 

AX/ — TfvnfAnhntt V — Tvmc i n r> 

tt — i rypiupimn, k — lyruaiiit:, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














SEVQL\DTNRCISANCNFCLPGSSNSPT 


10127 


24028 


A 


10202 


68 


413 


KIGILLKTH*PSIC/VSITNF*YNDTGE 
LKVK6 * KNI Y YAN INQK/ KAGMVMFVLD 
ETDFRAKK/DYQGH* IMIEVSVHKE/E/ 
L VI LNL YASNKR VTK YMK * KDTD I KGE I 

uj\o 111 VUUJJ 


10128 


24029 


A 


10203 


351 


1 


PKFPPLNPAWGTTIKIFFKKKNAHNWFS 
PRKBKALQKIHKKG / RLLAKE I S FLGG F 
* TD YKTNPDSQS IRTLGEKKKKIRPGTV 

MVGRV 


10129 


24030 


A 


10204 


2 


392 


FIjEGVLLLLPRLECNGAI PAHHNLHLPS 
PRFKRLSCLSLPSSWDHRHAPTRPANFV 

PT ,\7PTf3PTTWVnririnT.nT ,T,TT.* Q PPT .CZT.P 
r DV£i ivjr riri VVj^iJoJLtL^LiD 111 o it v— XjLtxj 

KCWDYRH\DHHAQPS F 


10130 


24031 


A 


10205 


264 


2 


PLSLSNLFPIPLNSILKIFFILK*L*IH 
GKLHKMYRKV/ LV* TS PS PSQC* HLR * V 
* HN I QNKKLMGLGAVAHASNP S TLGGRG 
GWIT 


JUJil 


24032 




JOzUo 


281 


2 


FYPPPLFFSPPP\PHFLWGFRPVPNFMP 
KIDPQTWAKNKDKGPNFPPPFFFILPPK 
KNIFFTKTYFFGPPPFLPPPPFFFSPPK 
KK/SYPFPLKKNSPNPPPS*K*AAISLP 

trtrC rirrrCCccOVtr LiZtlM 1 »" AJ'IM_L//iVV 


10132 


24033 


A 


10207 


340 


1 


STKKLVQKLGEHTQCWQGCEATGTLIKC 
pwppTnwnPT.wif / QawnTT.T3nT.*MPMWT. 

PYNPTIQLQGSYSRE\QHTCPYKDLYVA 
AtiFTIAKNC*QAKFLSTGE* INKLWCIH 
ITE 


1 A1 11 
JUl 3D 




A 

■t\. 


i A*>n& 
lUZUo 


1 JO 


i 
j 


TTNTPr:PPPPC:riV 1 17T7DF7 Dr.'F'P?P'B , Tr'F*nDV 

\YPSHPGWSAMVPSQLT 


10134 


24035 


A 


10209 


327 


2 


INKKYYEQLYA* KFDNLDEI YQFPGRYN 
L P KLTQE KANNLNRP VS LRETKS T I NNL 
PTn?Tf\7Pnp , nnT5*mPT.vr>TPK'pPTT , p\ t?q 

TEAKGILPNSFCEVTI IVRLFLLRQ 


10135 


24036 


A 


10210 


89 


4S5 


E I VPLLRFPLVI FI AS PTTF YTYGF VSG 
EKSQYKKREKLCDKTLSQSRSLLGMHST 
CGIALHPKREGQMCTLOVCFY* FFLKES 


10136 


24037 


A 


10211 


361 


462 


IQQIGSNTGNIPAMEIiALKRNSKNINRL 
AG WNG P * VFAC / P KNVL F YLH P * L FG * I 
* I YQLKI TF * NYEGMVP / FVFRFLFETR 
SL.LSPRLECSGTITTHCSLELVGSINSS 
T QARAEPLiORfJANT. 

-I- O^UU*U^ f JJ\^JX\.\j/-Mx U 


10137 


24038 


A 


10212 


62 


469 


SLLWKLRSSGASTSLRCLSA\LLGDVSQ 
SGYTGVRDPLEEAVCLLSELERGAGRTT 

Z^I.TTIP A^JHPOflPT CT.T7WT \ T.T ."D'FVrvTr'DZV'D 

GGGIYR/GQ*ALLSCGGLCPV*AS**/M 
CLPC/EA*TRLLKPPPGS 


lUi DO 




■t\ 




iy / 


i 


v v i \£Fi>* dwl v i iNivLyiAo r yyKv. ± ivjs. x j\ 1 / 
WPGMVAYTCNPSTLGGRDGWIT* GQQFK 
TSLANMVKPH 


10139 


24040 


A 


10214 


31 


213 


HASLDEGFHHVGQDGLN1jLTS*STCLGL 
PKCWDYRHEP/PAPNWWKKFLSSKAFKR 
*LGCY 


10140 


24041 


A 


10215 


442 


29 


PTINPTNIACKCLSKRKSHTSFTLJNIQKLE 
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NO: of 
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SEQ ID 
NO: of 
peptide 
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M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=G)ycine, H=Histidine, Msoleucine, 
KHLysine, LHLeucine, IVl=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R— Arginine, S^Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














T I KLS E EGMLKAEAG * KLDHVH / QMS Q I 
VNAKEKFLKEIKSATSLNI*MIRKQKRF 
IVNME/KKVLMFWTEDQTSHNI PLSQSL 
I RKR P * L FNSLKAE RS EEAPE KLE AS SG 


10141 


24042 


A 


10216 


195 


533 


LVWWLRVTYGVAAKMSSGTTI IGRLSW 
G WRLHF \ KMAHL * GYW WEAS VPLHMKLS 
QrunTn\ QT.p\nrnDT?t?PT Pfwrnnrvm 

OLnUiu \oJ_»K.V J^iJt'KtljKJjK.s^JVJjy^— i? XX/X) 

VLEVTWHFYCILFIRSKSLSPAHIQSK 
RI 


10142 


24043 


A 


10217 


274 


494. 


CVLKGLPFALPWFFFFF*DKVSPSPRLE 
GPPGLRR 


10143 


24044 


A 


10218 


142 


480 


VTIKIYLHPHDTTTPLRQGGSRLKEILV 
S Y* QQKVETRPSHVI FFTVISTQFKKTV 
T"T r Dcr , A\7aiJ r rr , Ki'Dc , PT driver* r\T /t /dp 

QEFETSMTNPVK 


1U 144 


24045 


a 




1 /TO 


A CA 


AVifoJ! VVKVL-JinUXiAoXiXr lilN ovjtvKUJNJs. 
FVFL VLQ AQGPAWRQGEVHKHLLP CV YH 
FAFWLTQQPQSFF*DG/QSHSLAR/LE 
CGGVI SAHCNLYLPGS SNS 


1U14D 


24046 




10220 




469 


T7-4- VWf"\M /TVTDPM17 A TJ T*/" 1 LTD CP T rTJrnPtiJT 

T*DQEFETSLADVAKPC 


10146 


24047 


A 


10221 


211 


473 


VTWLLLFFS I FRASSLSTDS VS YVI AFQ 
FYFYLFI YFYLFI FETESCSVA* AGAQW 

PHT fCDnDDDDPC \ OCPDOTT ^MTPrCO 

UX/Xj^oJryyir JrKljo \oc»ll.irc3 1 XiyMIovjoo 

NL 


10147 


24048 


A 


10222 


142 


394 


GVCQSLLGS LCQLGGTGVRDQLEEAVC P 
LAELEPCAWRTLLLRICCSLQSQQAGI F 
/ Kr L"oI XrxXL.Jrr Jry VXiXjooXM v Uj VXjoXK 
P 


10148 


24049 


A 


10224 


77 


396 


KRKTKKS FWTPSGFGIiRPP I PKKKKFFN 
SRGKQGGFGRGSGSRGGSGSTSGRGGGY 

GRTTIGI KGALKRLFA/NNEHDLR - 


1 AlilO 

iui4y 


24UjO 




WIZj 


i j j 


AA1 

44i 


oyAKilvjlijivoyri'r X^oirJroWr'XiXXifc'vjtjV 
FSPKFFFFFFETRV/ CAVAQAGVQWCNI 
DSSNSCASSS*IARITGAHHH\TELIFV 
F*TSPPSO 


10150 


24051 


A 


10226 


290 


485 


KRGEGPVQKKKFFFFFFETESCTVAQAG 
VOWHNT»cc7j*PPPPG\ Q^n^PAFFPR 


10151 


24052 


A 


.10227 


228 


424 


KLYQNLHDTLFGFELSH YEEVH* KMLKG 
SSWEFVMHSS/VSNKCILICSAKKKKKK 
KKKKKKKKKKKKK 




Z4U53 


A 

M. 






A ZA 1 

454 


uj_»r >i>^oyOijJ^yjjjji^j^iji v ii-ii. iuoiiifij.nl 

HTHTHTHTPWSLF*SR/ salslvc 


1 f\ 1 CI 


Z4U54 


a 






1 


r x x r o jxr x»x v wxjiN voriXjL/oirixv_.r ^i^o v 
1 1 SCFGVFY YYYY\ F F * RQSCS WHTGV 
QWCDRSSMQPQLQGQNDPPTSASQVAGS 
A 


1U154 


24055 




1 ATI A 


1 Q 

Jo 


AT) 

4/Z 


\3 Jr e*e*c\3 x KiXriKJjKiljKJiKCi 1 KJJoKliKV £»JK. 

VRERER* / RQRERERER3RERER 




1AC\^f», 

Z41D0 








DDI 


UP KTtfYYFO TKT T\7VT .yMTFTn F T OTf T.MTTf 
»V JvIVLM Ilryi IV J. 1 V I J iJZtlN 1 IVU I7 J. y rvLirl 1 xV 

CW*G*KEIGRAIHCTOECKM/VIQPVWK 
R/VWQFLRKLNMLLPHNPAVELLDIYPN 
ELKTHVCTKALYEAGRGGSRL 


10156 


24057 


A 


10232 


3 


402 


APRLPLGGW I SFSS PMGAMGLNRATGIM 
GSFFFFFLKRKSPLGPQGEGRGQWLG*G 
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SEQ ID 
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nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I~lsoleucine, 
K=Lysine, L^Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q— Gluts mine, R=Arginine, S == Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X— Unknown, *=Stop codon, /—possible 
nucleotide deletion, \=possible 
nucleotide insertion 














NPGPRG*TKFPG * TP * KAGNHGEGPPPR 
GNFGFLKXKGVPQKGQGGSKP/RDPRGT 
PRLGP?KGLE*RGGPPPPAPQK 


10157 


24058 


A 


10233 


48 


470 


PGRRFTTAGLGKQEPAASGPRAPRPAQT 
PLKREAARPSLRHIiPEGA*APRSSARSP 
LLI PAASLRS PRRRTCRRRLRHFGI ITR 
PFQPGVRLPDSGPGIoA/ RWACAAVPAAS 
RALPALRRPL* RRPGKGAPSSKSHGHSR 
SP 


10158 


24059 


A 


10234 


169 


446 


CPPGPAPSHLLEDRDDPQAGRQSSFYAG 
WKDRWREKTLFLFLRQSLTLSPTTPFFF 
\FEAESHSVTHSGMPGVISAH*NLRLPG 
SNNPPTSAS 


10159 


24060 


A 


10235 


264 


2 


KLtKPIFKMVFLKIMFMSYIMEVFFFIET 
GSRSVTQAGVRWHDLDSLQP*PP\GSSD 
PPTSVSQVGKt*7HKTWEAEIiAVSRDHATP 
RAKL 


10160 


24061 


A 


10236 


306 


3 


SQDMVKSPKKKFP*LV*GLTEI+KIRAL 
VFNKKFFFFFFFPETGSTGSCSVA\RLE 
CSGTI IAHCNLQLLGSSNPPASASRVIN 
SIKKLFANFIVSANPRPT 


10161 


24062 


A 


10237 


20 


469 


QCGA I P RRGDHLRS GV* DQPGQHGETPS 
LLKIQKLSRHDGH/CRRLSHKNGLNPGG 
RGCSELRSCHCTPAWATR\GNSVSKKKK 
S 


10162 


24063 


A 


10238 


313 


3 


TRPRPTQGPPHSGGRACLCIFSPHGPPQ 
TRLSTPVS P S VR PLGFQVLLS ARHGFLL 
CQHGMF\PSLFTMSNITLWPGAVAHTCN 
PNTLGGRGG W I T * GQE FQTS 


10163 


24064 


A 


10239 


274 


49 


KGNPFFFVETGSHSIAQDGVQWCNHSSL 
QP*PP\G\SSDPPTSASRVAGTTGVCPC 
EDKLSHILRRCYAQVFMLAG 


10164 


24065 


A 


10240 


144 


454 


VCK I K 1 1 YL I VLLED Y IN * F VKSS L * LK 
S*K/LIQQMWAIISLPECLNSWCLGKV 
G * KRCS QQGTVAHTCNPSTLGGRGGM I P 
R 


10165 


24066 


A 


10241 


289 


178 


FNYSS IKNNTDGWVWWCLLPVI PALWEV 
EAIESF/CSRDEIPLSCPGWSPVPKLKR 
F YCLNLP * CWDYRQEAP PHPAI S 1 1 FDA 
TVIELFS 


10166 


24067 


A 


10242 


352 


294 


PPRTTKTWV*DQPGQHVEAVTLQKMQKL 
VERGGKHPQSQLC/REDH*SMGGGGCSE 
PRP*PCTPAWVTERDHVSKIK*NKIK 


10167 


24068 


A 


10243 


293 


1 


VSLCRESSNSFIWQCPWSHILALCFQR 
EKKRLDLP/S *MVGK*EIPFLVPLLFLP 
HWPLLLMK/YM* IFFFGNRVSLCHPGWS 
AVAQSQLTAASVPRR 


10168 


24069 


A 


10244 


218 


3 


LERH I PVRP III CDYKWVWRF / HFLRPL 
HN* FLFSFFSFFFFFFETGSYSVT\RLE 
CSGTI S AH CNLCVHLLL 


10169 


24070 


A 


10245 


185 


1 


KKQPQQTKNPLNNVKKEGKGQHFFFFLE 
T*SRS VAQAGVQ WCDLGSLQPPPPG\ S S 
DPPPS 


10170 


24071 


A 


10246 


2 


265 


AHKKKPAKHIAPSTFLYGETEARRSMPK 
AAQLKCGRAG 1 1 LFVFKTES CS VAQDAV 
LWCNHGSLQP*PPG\SSDPADSACRVAR 
TTSI 


10171 


24072 


A 


10247 


2 


430 


EAGGREALDGCGLTGRESGVQRRRDSMG 
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NO: of 
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sequence 


SEQ ID 
NO: of 
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nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 
peptide 

sen nPTir p 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=JLysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














CQGTAGRGTRAGKEGPWRPEHLSRRPPA 
SS VLLS WAIHLA WEGS * TGSLCDLEAP 
GPTLS PWCSWSPELLQLHV /HHHTHTHT 
HTHS/HPNTHTDTQSLCHQPSFSVSPRA 
RTVL 


10172 


24073 


A 


10248 


201 


16 


TP PGGVFFLGFFFLFFF FETES CCVAQA 
GVQWCDLGSPHP PASRRPGNRSS I /AAA 
*GYHFK 


10173 


24074 


A 


10249 


167 


2 


GFDPFEGMLSYLEKEET/ WPGTRAHTCN 
PSSLEG*GGWIT*GREFKTSLTNMEKP 


10174 


24075 


A 


10250 


92 


389 


FQPGQHGEILSLYE IKKI * / ELAGCIDM 
CLWSY / LLRRLKQEDCS ST * VQGCS EP * 
CLSCI PS WTEQDPDSGRQ I * KfOCKI SL 
GRRGGIFLFLKIWHVFFL 


10175 


24076 


A 


10251 


221 


415 


KKKKRGGRFKGSNFTSPGWRGN I FFMG P 
PKLNSRAGV*QRRD/ WEKPGGPQFNCFG 
SNPPFPPGGK 


10176 


24077 


A 


10252 


94 


383 


GKLNRCG I WLLGRCQRSPAVSNALWLHR 
VAPEEEAGEAWAGTLSLQLS CKSKITPN 
KIVY* KKNKT / WLGTVAEVCNPSTLGGR 
GGWII*GREFETSL 


10177 


24078 


A 


10253 


249 


2 


YPTMIFKCSSKRKSPTSLTLNQKLEMMK 
LSEEGMSKAELS * KLGLLH/QVSQWNA 
KEKFLKEIKSATPANTGMIRKRNGLIA 


10178 


24079 


A 


10254 


132 


414 


NNT YFQGKI FI LNSQVKGML I CCWWECK 
* VEPLWKAVWRFLK\ DLEQNSKE YKL * T 
WNQ PR Y PAWDW I KKMW Y I Y FM E YHA\ A 
IKKNEIMFFAA 


10179 


24080 


A 


10255 


359 


3 


YSLRHNIEMRPVSNPTRASKCSHERRSV 
ISLTLNGKRE I I QEASKEGTLKANKPKA 
KLFVPVRQWNAKEKFLKEAKSAAPLN\ 
RMIRKQNSLiTADTEQV*VIWIEY*TSHS 
IPLSQSL 


10180 


24081 


A 


10256 


283 


2 


S LQ P W KL YVHVFL P K YAT Y P P FGTL P KT 
HFFHKILPSKAGSDLNS I YAMTYSHRHI 
TIFFETESHSVTRLECSGAI\SAH*NLH 
LPGSSNS PATK 


10181 


24082 


A 


10257 


216 


386 


PSKNPPPLINPFFFFETDSCSFA*AGVQ 
WRNLSSLQTPPP \GSNNSCAS AS /RI AG 
I 


10182 


24083 


A 


10258 


263 


1 


FSQWLI LPS PLS YS FLFLRQGFALLPRL 
ECSGEITA\PSSAS * VAEITGACHHAEF 
LNFFVEMEFHHWQAALKPLGSSEPPTP 
ATK 


10183 


24084 


A 


10259 


248 


47 


I PKSRVKRENRLNPGGKGCN * PRSPPCS 
PPWVAKQNSLSKK/NKVKIYQNLHTETQ 
TVHGD I RGQEK 


10184 


24085 


A 


10260 


159 


2 


KRDMSSLKLEY* KKE/RIWPGAVAHTYS 
LSTLGGREWIT*GQEFEISLTNMVK 


10185 


24086 


A 


10261 


190 


405 


ASLHMFMYFFFFFFPKQIFFFVPQAENQ 
WGNLGKRN PPPPGLRDF\ PAS PP KEAGF 
RGGPH* PGLFLVF* GE 


10186 


24087 


A 


10262 


39 


293 


PSSSTSRIYPEDTPLITQKYICKILFIV 
ALF I FPKHWKLCKCPHIG/E * I KLQCIY 
TWRYYVAVQKNEENLYQLMYWDYQEILV 
SD 


10187 


24088 


A 


10263 


282 


3 


AFYNRFFLKIFFKRGPRGKEKKGGKKEK 
KKWGKKKXXKFFFFS PGQ I FFLGGVFPF 
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G=GIytine, H=Histidine, 1=1 so leu cine, 
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X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














FLFFFEAESRSVT*AGVQWCDQ\SSSQP 
RPPRLKPFFQ 


10188 


24089 


A 


10264 


196 


376 


KKGGGHI INFPPHKGYKPGPEKKKFFFF 
FFEMESCSVA\RL* CSGMI /SQAHCNLC 
LPGSSN 


10189 


24090 


A 


10265 


514 


157 


GLTLSFRLE YSGMMSL*P * LPG / FR * FS 
RLSLPSS WVFPLMYVYHYAQQSFCI FFV 
ETGFHRVVQAGLKVLGLRQSAHLGLPEC 
W\DYRCEPL*LTKLICFTYGLR*RSKPP 
FFVDGIQLF 


10190 


24091 


A 


10266 


57 


378 


RSRIW*DSVILCS*LVSV/VDQSYFY*V 
TEILEQIKLLGSKIMCFCILQLHFLCFL 
CGS FVGS T \ I F I FS RNI WLGTVAHACN P 
STLGGRGRWIT*DQEFETCLASEWK 


10191 


24092 


A 


10267 


1 


212 


LIQKDTNIYYIKEAL*IPNKHI/KKISS 
I VSH Y KMQ I K I MR / YHTRMA 1 1 KRTDKC 
LYECRETRTLMHCW * GC 


10392 


24093 


A 


10268 


3 


399 


MANKHIKI FSASLIRMEI * IRATMRE/H 
* FI P I KWLVLKS KRESNKCWQGCGRTG I 
VIHFQRECKIVLLL*KTV*QFLKKLNIR 
LGAVAHSCNPSTLGCLRPRVQDQPGQHG 
KTLSLLE I QKLAWHGLHLS PS 


10193 


24094 


A 


10269 


190 


386 


FSPFLPLVTILECDKAICRRGKSFDFFL 
LGPSFYFFKLLL/FIFYF*ERVLLCGPG 
WSTVAQSWLTA 


10194 


24095 


A 


10270 


166 


327 


KTNS KTAE VCP LS V I TLN INGLNS P I KR 
QR*TDF*NPI\I CCLQETHFRFKGTN 


10195 


24096 


A 


10271 


180 


1 


I FFLFEVETCSVA* AGVQWNDLGSQQPP 
PP\GSSDFQLIFAFLVETGY*IMVPSSS 
CRT 


10196 


24097 


A 


10272 


273 


3 


FLGLF SGN PLKANK P VCHLTL P I LQR I F 
FFFESEFHS\VAQAGVQWCDLGSLKPPP 
PG\SSDFPGRRR* Q* AMITPLHSSRGDR 
VRLRLKKK 


10197 


24098 


A 


10273 


399 


27 


YHFVPTRMAI IRRKNN * CW* TCGE IGNL 
VYCWWECK/ LV* LLRKI VWQFLK/DVKQ 
NYHPVILLLSIYRI *FDCLFLCKSHVEM 
*LISGVRVGAWWEVIGSREWFLMSGFTI 
SSCHQDCEFSQDLIV 


10198 


24099 


A 


10274 


275 


462 


KINLVLQLDFFFVEMESHSVTQAGVQWC 
HLSSPQPPPP/GSSGDSRASDS*VAGIT 
SVHQHAGL 


10199 


24100 


A 


10275 


2 


379 


RGPGGRGLGSGTCFHPGRGGEPIVYQHL 
F*FFGP\PENYILILPGFGIISH/IVTY 
YSGKKEPFGYIGMA*AMISIGFLGFIV* 
AHHIFTVGIDVDTRAYFTPPYHTFEEPG 
YIISRRRRKTKGFLF 


10200 


24101 


A 


10276 


2 


407 


FPSPSPSPFASVSLSLSFHGLPLPEHTW 
FI DGS PTKPNRHS QAKAGYAI VPS I S II 
EATA/ PAPFTTC * QAELIALTRALTI AK 
GLCINI YTD/SHILHHRZWMG* K/GFLT 
TQGSS 1 1 D VS L I ETLLKAFLLP KE AGA 


10201 


24102 


A 


10277 


203 


545 


FLWRGQSLALLLRLECSGS ILL/ AHCNL 
GPHRFKRFSLPQPPE*LSNW\DYRRLPP 
CPANFCTFSRDGFTTLGQAGLQLLTSSD 
LPTLAS P KCW \ DYRHEPLVHGLTFNRS F 
NFFF 


10202 


24103 


A 


10278 


250 


3 


AKW FLEMKSTPGEDAANI I EM / TKDLDY 
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nucleotide deletion, \=possible 
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VSNS I TCYRELFHERKSRLMCQM* LS YF 


10203 


24104 


A 


10279 


333 


3 


SLILLLFQSMNHTHVRVTFIDQFCVCSN 
YSTD WLFPHLS PIS LRTPYS LRYNIKIR 
PTNIWPTMAMASTC'PS^RRF^PTSTiTTiNOK 
\ LET I KL / S EEGMS KAQI G * KLGLLCK 


10204 


24105 


A 


10280 


251 


3 


ITSKVIDSNPTTSIITLNLNGLNTSF*R 
QRLL/DWASSPK*DSI IC YLQEMN FKYK 
IFKKIGRRI YYANI SQKKVGVAVLTPDK 


10205 


24106 


A 


10281 


22 


226 


TKINSKGNKELNVRAKTIKLLEENIDRN 

LNVRAKT I KLLEEN I DRNLCDLGLGNCF 
LDMTPNAQTTKGR \ IDKLDFIKI *NFWL 
DT 


10206 


24107 


TV 
A 


10282 


238 


383 


PCPAEVE/AKPQKAAEKG* SSE 


10207 


24108 


A 


10283 


127 


377 


RIPGGSALGKEIRQCFGVKGQAEIGKIE 
MSQPTQEREWEGI GSQKAAQVDHLRSG 
i/nnnDnrivriFTD'T'T T.K'TVI*T.Zif^H^!/^P^ , 

V rijij) X \jd 1 ir J. i_ii_iiVJ.ri J_tH.vjxHj/ vjirv^ 

L 


10208 


24109 


A 


10284 


328 


370 


VGPGPGF*SSGRVSSCCPGWS\PSELK* 
SAHISLPKCWDYRCQPURPATFFFKTAS 
HYVAQAKVQWCNYS SLQPGTLRLKCC I F 

V If T <*Yl T\ Tf^i\ T\ T 

jviviy vvy vv 


10209 


24110 


A 


10285 


341 


1 


KWCVLTAPLSGRFSVCLPFLGPPYFLQ 
HSDIEIKPINYPTVASKCSNERRSHTSL 
TLN*KLETIKLIEEGGMSKAETGRKLSL 
L/QSVSQWNAKEKFLKEIRSITPVNT* 

MT 

1*1 X 


10210 


24111 


A 


10286 


199 


329 


LFFSFLNCLSPKRI ESSY* KEI KSAPPI 
TPGI PRK* TSLIAGKKNVFMDWKDQTKH 

IvXlr / XjovoivX X 1 JjJrivot 1 'lrvftiI» 1 Vi1.il, M_c, C xV 

L/ERGCFI/IRFKERTHLHNI KNKGPGWA 
HTCNPNTLGGQGRQ I T 


10211 


241 12 


Pl 


\\jZS1 




35/ 


VTNfl T.T .T .V\/*TinPfiHT .Mf^KFl *?T .FWTCWP * 

X r. '.''n 1 Ihll I V Xr VjiUJluOXvL/O J_>r LM I\.rV\_ 

KNEYSHAKKKQNH\ P * FSLYTKTNSKWS 
KDLNVRAK P I K I LE KK 


10212 


24113 


A 


10288 


111 


373 


SAQCQPDQHGKTPTLLRIQKLAGPDGAC 
L*SQLLR\ *GCGEPRSRHCTPAWAT/EG 
KTPSOKKKKK 


10213 


24114 


A 


10289 


197 


375 


I R IGLCKF 1 1 * FLETKLHTVAKVE CSSA 
I S AYGTT.S T.VDS GD S PAS AS / R WGTTG 


10214 


24115 


A 


10290 


118 


371 


QCLLMMS PG LKT * M TGRLRQENR LN PGG 
GGCGEPRS CNCT PAWATK\GNS AS KNKQ 
T\KNKQQQQQN 


10215 


O/l 1 1 c 


7\ 




1 1 A 

214 


•JOO 


r t riYwnKMVT<''T*KiPTrM c 5 TcrwnncfJF tcitt m 

X IIV^IUM JS.IV X IN C J\XV O i\.\~cY yvJv-VXC* J. \j X Lin 

HYWKDHKQV/Q/ PL*TIVQQFLKTRI 


10216 


24117 


A 


10292 


2 


376 


P F PIiMAQR PTHI/V P LS AS SGTS LLM PST 
LLWLPWLYKAHAVPTVPLSKLFDHTKPY 
AVSKNNLGALSI * S Y* VI PVPPRSRF/S 
FIRDKPTS 




1 lo 






t 
1 


J 17 


N I LHNIiDFFKNT TL E POS SR LORAV I AP 
LHFSLGNIVRRCFKKKSGWAQWLTPVIS 
AIj WEAD VAGS LE FKS ETrLGNMMRP CLIST 
FKKKKKKAFLFGAPGFFFFMGGKILFGI 
AAPF*SSGRVSSCCPGWS\PFELK*SGH 
IS LP KC ; DYRCQPLRPATFFFKTASHYV 
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X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














AQAKVQWCNYSSLQPGTLRIiKCCIFKKI 
QWQDI 


10218 


24119 


A 


10294 


201 


384 


GEERLSVQCSAFK*LR*HINISLQTFFF 
LRRESHSVA\RLECSGVISAHCNLCLPG 
SSDSPA 


10219 


24120 


A 


10295 


161 


400 


VCVI ESVKANHLVDEHALKSLS FHLSLL 
FGV*GVCVCVCVCVYMCF/CVCIYVC 


10220 


24121 


A 


10296 


] 


433 


GEGGGGCSGWSGVETAGSRPAPGAEQDL 
LPAPALGRWQGLNSVCYPAWPQPNRRQP 
LALLCAAGLGPGQRLECSGMIMAHHSLD 
LPGSGDPPMSAG\SIAGTTGTCHHAW\E 
FKHVAQGGLELLGPSDWPPTVLGLQM*A 
TTPGH 


10221 


24122 


A 


10297 


209 


1 


LTFELLYHHYTLYFF* ESGSCS VAQARM 
QWCDLGLLQPKALG\SGDPPASAS*DA* 
STGVD YYSWHTC I 


10222 


24123 


A 


10298 


225 


1 


RPGMPLSALPCTGHAPARQPHSAACSGP 
KCPWCQSPQPAVGWTV*VCPLPLSPCLC 
/FCVPQPWHGGLPLPCPPV 


10223 


24124 


A 


10299 


212 


367 


VTAGSGHMCYTWWECRLEQSL* KTVWRF 
LK*LKVELRFDPAVPLL/SIYPED 


10224 


24125 


A 


10300 


97 


406 


LHRCTPAWVTERGCLKTTNQNNGQHNVE 
GEACSSRPFTTI C/EEKIHLVCALTEED 
* RLAAETTANTTDI P IGSAYR I LI KKLK 
PSKLSTC/YVAKQLSPGQLQTK 




24126 


A 


10301 


3 


422 


YRSSSKTFLI IRRGFIPSSTFLLAHLST 
TLSTLSHFQKGWVKLFFFFFFKKGFPFW 
PPGGGGRAKP* F/HGNPPPPG* KNFPAL 
PRQGGGKKG\GPPPPGQFWFFKKKGVSP 
WGGGAKTPELGN PPPGPPKGGG I TGGPP 
PP 


10226 


24127 


A 


10302 


330 


1421 


RTKGRPLLLALPVLLLLFLHLAFFSAPP 
GPSSLSSTEDLKWTLLSPAQSRKLQPQS 
IiHRAGVRDLQWQGRAGAELTAAC/WPAP 
PRPETAVLLQGLPC/ QEGRLLLESLNGF 
ALWSAEGTI FYAS ATIVDYLGFHQTDV 
MHQNIYDYIHVDDRQDFCRQLHWAMDPP 
QWFGQPPPLETGDDAILGRLLRAQEWG 
TGT PTE YS AFLTRCF I CRVRCLLDSTSG 
FLTMQFQGKLKFLFGQKKKAPSGAMLPP 
RLSLFCIAAPVLLPSAAEMKMRSALLRA 
KPRADTAATAD / S KMRS * LTLS E S RLPN 
*FEIWDSAAPGTLPSLPFFGAFDGQHIF 
CIHTRALRGSTRTRQCIAAALEGASFRN 
HVEL 


10227 


24128 


A 


10303 


172 


383 


KIENKVWQGYEEIETLVYCWWECKMVQL 
LWKTTWVVLKILKLGPP* \DS 1 1 PLLGI 
NQGTEIRFCKRYSNT 


10228 


24129 


A 


10304 


205 


2 


NFYPA* KTDFFLVGSLFI Y FE * ASHP I T 
\RLECSGTXVTHCSLDLLGSCNPPASAS 
HVSGTAGAHHECI 


10229 


24130 


A 


10305 


423 


30 


LLEGKLTNRKD I RTKT PS VRRLHQS PKV 
DRTAKMGKKQSRKSGNS KNQS AYP PPKE 
HSSSPAMEQSWMENDELREEGFRQSS/M 
S KLKEE VRVNGK* VKTREKRLAKS LTR I 
TNAE KYLKDLME LKTS VLR P L 


10230 


24131 


A 


10306 


211 


387 


IFFFF* IKACSVAQAGGQGRHLGSLQPP 
PPGL-KQDS C/ S P S RGVGTTGMTYKAQF I 
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FFGF 


10231 


24132 


A 


10307 


229 


382 


ERAVVQPG* HGET/ PSLKKL* KLSRCGG 
MCLYSKLLRRLR*EDHLNPRSWGC 


10232 


24133 


A 


10308 


168 


417 


G S YC FFVRRNLL I VK YMRS VHLL F I MLL 
YRGD * WYVTTQYNFC I LQKQEKLI YR I A 
/HVRNPSSLRGGGG* IT*GREFKTSLAN 






j\ 




/jj 




PPPPTQKGFF/CFFFKKGTRPPGGKTGE 
RAKPNEIPAPGKKEFFLPKPPEKGEIKG 
/ PRPPGOTNF* FLKKTGFKKGGGGGSKP 

/ t X\ XT XT \J\^ JL X w X X XJXVXV X Vl IVLWJ W\J WVJW I\ t 

RPGGTPPPGPQRGGKNGGGPPPPPKKGF 
FGGKKKKKNPPR 


10234 


24135 


A 


10310 


182 


400 


LHVTM 1 1 FWLYL I KNMLLNFTYF/ C YI F 
SVATSTFKI VCV /WLGAVAHICCPHTLG 
G*DGWIT*GQEFKTRLANM 


10235 


24136 


A 


10311 


371 


400 


FIWGNI*ASNKHIKKT\STSLAFRKIQV 
K\TTMRYLSPPTPIRMT/I/I*KSYSSK 
CW*GCEEIRSIjIHC!KMIOPI * KT/WOFP 
KI INI QLLYNPAI ALLGNRTRG 


1UZJO 






inn? 


J ID 




ORCG K I GT FTHCWWEY KMVO PL W KTDWN 
LLQRLNI ELPCDLAI PLLNLYPREMKTS 
IYICT*MFIAALFII/AKWRQP/KCSSA 
DE/IGKMWHICIMK 


IUZJ / 








I HO 


1 
I 


QPPSP\GSSDSPASAF*VAG 


10238 


24139 


A 


10314 


157 


434 


F FFSE I FFFGPGVFS PPPFFI GGPGI FF 
FCPLKKKILPPPRGKKFFFFKKGP/HPF 
F F FF F F * EG VSL LL P RLECNGA I LAYC K 
IHLPGSSDLP 


10239 


24140 


A 


10315 


277 


2 


FFFTAFFLGVPGVFPPPRFFKPGPGFFF 
LGP*KKNFFPPPRGK/IKFFFLKGPPSF 

XJKJ U XVlVLI i i T XT XT I\\J XV / X XVX X X UiVwX XT «^ X 

FFFFFF*EGVSLLLPRLECNGAILAYCN 
LHLPGSSF 


10240 


24141 


A 


10316 


1 


388 


KTITHEVITECIWLKVLLLYFPNCMCIY 
VRMH I CVCVCVK I S I * MDHI VS YKCIH I 
\LCIY 


10241 


24142 


A 


10317 


134 


399 


LGLPGGDTRGGGT I ENP PFERALNSQNG 
GK\NPLLGPKTRPLGWGLCSTPKFWEKP 
RVRWDLN * TTPCLEKRAPWLKAKKTPA 
FEIRI 


10947 




A 


1011 R 
1 V/J 1 o 


911 


-i 
j 


E F LFS L I CA I CL * CALKOFS S CF FCLAL 

XJX XJX uUX w«xvu v*x» i JXV^X UU wX X wurxu 

IELLGSMGLLFH/SSLETFQPFFLQIFF 
/SFFPLFFGANYTYNKPFEYSLI 


10243 


24144 


A 


10319 


221 


8 


CDQPKS P PLSGVQMGD I LWLVFFLFLF 
AF*R*GSHCVAQAGLELPGSSDP/PQAT 
SQVAGTTGMHCTW LVF * F 


10244 


24145 


A 


10320 


1 


407 


RSTPDPVYLGFTSGGCKTBKRAGCFFLW 
KLHTKGAPARCQPELSGMRC\R* PLLGG 
V/SQSRYMAVKDSLEEAGCPLSELKRCA 
GRYTGIjFKGGRIiDHLNIjLRIjCPOTTLCP 
GALSQEDGS FI YKPLTGAAAFLS YM 


10245 


24146 


A 


10321 


115 


398 


ADEELLLMNKQIKYLLEMESTPGKDDVN 
TVEMTKESEYSI/NLKAVAGFERIDSNF 
ERLSAVGKML*NSITCYKEIFH/RKSQL 
MSQTSLLSYFKLP 


10246 


24147 


A 


10322 


137 


454 


FFSLYFQGFFFFFFLGEKFFFSPGGKGI 
CPKPPFGPPPPPGKGFSPPPPPGKAKKK 
GPPPPPP\GFLFF* KKPGGGFLGGFFFF 
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nucleotide deletion, \=possible 
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LGGKIFFSPAGGKGGPQKPPFP 


10247 


24148 


A 


10373 


246 


372 


IS I KSYQLW \LGGVAYACNS STLGGRGK 
WIT* GQKFKTSMANM 


10248 ' 


24149 


A 


10324 


415 


1 


IWGAPRVFPPPPF*NPPPEKNFWGPKKK 
KKFPPPPG*KIFFFKGPPPLFFFFFFFF 
SETESCSVT\RLGCSGGISAHCNLRLPA 
SSNSPPPSSLRMPSTPVLRPHGDLLSPQ 
LSAWVLSWPFLPSLLLPPDLDFIVQ 






& 


1fi19^ 


i 

j 




GDGITGLSPTISKFLFFVFESISIiSPRL. 
GSSGAIMAHCSLTSRREPPCRSINFFVS 
VITLNVNNLTNGKIGRKDFF*MM*YTIQ 
ICNLONTHCRP /KDTNCFKMKRRKEIIH 
ANSNQKRAGVAI L I PDKI DFKP KNVTER 
HRMLVRES / I SQEDVTI INTY 


10250 


24151 


A 


10326 


24 


387 


NQGLGARNQKSQRVFLAGFFFFESSSCS 
VAQAGVHW*DHSSV/PGSSAPPASATW 
VGTTGVHNHPWLI F/VFFVETGSH 


10251 


24152 


A 


10327 


145 


396 


CDYFGEMFSYLLLNIYLCDDPAIPLLPY 

VP YFT<rF"NFTTl<?T.TrnMTr)ZiHc; Q t?t HWC PITT . 

ETKCPGTDEWI *MTKKSDYCKHW*GYRV 
TGTLLYYEGECKI V* LLWGNV* LF\LLN 
I YLCDDPAI PLLPYVPYE\MKTRIHSKT 
+PMPIAALFI IAPNWK/PKCPGTDEWIN 
TL* HTHAVQYKVAVER 


10252 


24153 


A 


10328 


94 


639 


AS I S S S I KWDN I S FLVGLLGELNE IRN I 

CV JWnSMKril^J 'WK'WKT.TT T "iy\7TT?WMMVQa. 
^r^iAVl>ll T lJ\^/-\XjlNi\.VV JVJjJ-ijijl? V J-i!»l\itMl\i v o/\ 

S SNN F K I F Y AY Y I AKAVLL I ITEHITLS 
I VL I LKKQNNTRAVTKCW\RGGREKETL 
I WLVG * ECKLVQPLWRRVWRFLI KVKIE 
LPYDPT/ 1 LLG I YS KERKS VYRK\N I CT 
PMF \ VAALFTTAKT WKH 


10253 


24154 


A 


10329 


143 


3 


SAWFFFQDRVLIi* /SPRLECTVSISNHC 
NLRVPGSSKNYCLSLISIWDR 




941 






1 


IRQ 


RQGLTLFPGLVSNFWLPVIiS SHLGLPKC 
WGYQCYHAWPS* CF*NRSI /WSGTVTHA 
YDPSTLGGQGKPIT*GQEFETSIiA 




941^6 

1 JD 




IVjjI 


1X9 


49? 


RYKLDL* KLNTELPYDPTVPLLGNYPKY 

IV X IVUiVJJ 1,11 * X XJXJX X IS ± X V XT IJJJwIv X XT <L v X 

*T*AF\SKTCT*MFIAAYSQPPKGRSKC 
SSPNE*INKMWPIYIMR/YYSAIK 


10256 


24157 


A 


10332 


194 


1 


CKRQCNKEDRNI / CSK* *NPTI LF* TTK 
RKG / WLVAVAHACNLS TLGG * GGRAS *A 
OEFKTSRANM 


10257 


24158 


A 


10333 


348 


1 


NSGQSWIMYPLLKKGVGRVSPPDPDIVN 
SISMVWCGGRLFSNERFASLLTIKKKLA 
GHGGTCLQSLPLGRLRREDRLSQGGQG/ 
C*GYSE*LCCRCIPAWAAEGDLWSWSPN 
SVAY 


10258 


24159 


A 


10334 


388 


2 


I PGS PKRGPGGEKKKTFFHYLFLRAAPP 
LPRFI FFPPKNP PLKGGGGFFFPPPPKL 
KTPRLRKKKKKENFCPWKKFFKKKQTPQ 
RGRFFFFFFLRQSFAVAQAGVQWCDLGS 
LOPS P P \ G * S YHPARTRG 


10259 


24160 


A 


10335 


3 


396 


LLRP PCS LRHNDI E I M PHDNPM I ASKCS 
SKKKSCISLTLSQKLDVIKLSGKGMLKA 
KTGQNL/GLGHLYRTVRHAVNVKEKCLK 
EIKNAALIiNTQMTRK*NSLIADTEKM*V 
V* IQE * AS / HSMPLRQNLIQNK 


10260 


24161 


A 


10336 


165 


442 


GQME LL I LARNFG I LIiKFNLS I YYkRLF 
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peptide 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid* 
E=Glutamic Acid, F=Pbenylalanine, 
G=Glycine, H=Histidtne, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P-Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X^Unknown, *=Stop codon, /=possible 
nucleotide deletion, \-possible 
nucleotide insertion 














FS I PRYLFI FTYLFI YLFI YFESVAHSD 
AQVECSGTIMALT*KKKASLP/ASAPRA 
AGATG 


10261 


24162 


A 


10337 


33 


430 


TGSTHASANFCVSLVWTQLISRSSGQGL 
ARLQLRCLPCCILFFSFFSLVIYRLHSY 
LRLPWRRICLQVPSGC*QNSFPCNFRIQ 
GSLLL*/MQQW 


10262 


24163 


A 


10338 


284 


3 


SQHFGRPRWVDHLRSGVRDQPGQHGETIi 
SRETP\SLLKIQKLSGHGSACL*SQLFG 
RLRQENCLNPRGGGCSEWRQHQCTPV*V 
TEQDSVPKKK 


10263 


24164 


A 


10339 


414 


2 


ERAGPFSRKIFFSPPLAKKVKYPFPPTP 
RARAHKRPI FSRYVKKERGNSPDLFWPP 
FFPSLN*KQ*GPSLGRLFYPV/SWGIFP 
SKP*GFPGPRPKVFCFLG/LPPGPFGGP 
FFFFFFGDRVSLCHSRWSAVAQSQLTAR 


10264 


24165 


A 


10340 


21 1 


422 


AS LGNGGRP S PSLAAALQFDL * LRAG I I 
FALFKAARQGDLSL*RLLLSF\VCLCPA 
PRGGAYRGRQAFLSWGGLHPDRASRLFC 
LPKQAWAMAGAPPPALLLPCSLISYCRA 
SNQQDSGGIGPSEPSAGYKLLVPGFIiLP 
AEK\RTIYGGVTRFFRGP 


10265 


24166 


A 


10341 


116 


432 


SRQRMPSTYEVQGSGDTKTGEKPCANGN 
KGCGSSI LRS CLHRKRFS F * S LKALVWC 
GGACLEAQLLRRLRQEAHLSQGVPGCNE 
L* PGYCT PAWATE P \ DS VS KKK 


10266 


24167 


A 


10342 


103 


458 


SLNVCVLTTLPSSCSPISLPLRGPPYSL 
RHRNIEIRPINWPTMASKCSSKRKS*MS 
LTLNQKLE\MINLHEECMLKAKTG*NLG 
LLKQ I VDQWNAKKKFLKK I KS AI PVNT 
QMVRKQN 


10267 


24168 


A 


10343 


53 


408 


sfqffcsvfspslwfylllvfddgdvqm 
gfwcgypfcllvflltdktlscrsvgvp 
crvrcqca\llggasqlg\srgsgvrdp 
leeavcllsdlqlragrtttlf*avrqg 
hlslqrll 


10268 


24169 


A 


10344 


265 


421 


NNFPPSPPRGGGNPGPPPPPRFFFFFLK 
NPGI»P P F * VI YLL / YFTLTPS FCEjWC YT 
ISVYGCGFVFFFFSFFFFFFEKKIFFFP 
RVEPKNRIPVYPKILFPG*NNFPPSPPR 
GGGNPGPPPPPRFFFFFLKWPGL/HPF* 
KKGAKFPDLINPPPWP 


10269 


24170 


A 


10345 


207 


420 


WQKGCPVTFMELGWGPLYPCFLHLWIQ 
PTSHRKYIHL/WIiGMWHTCNPNTLGGQ 
GGWIN*GQEIETSLTD 


10270 


24171 


A 


10346 


115 


3 


APRGAVKKR/RPWMGAATKVCKPSTLGG 
RGG*IT*GQEY 


10271 


24172 


A 


10347 


41 


421 . 


FTFVSVCVHFKGIFLSFLSFFFETEVPG 
SPLGPKRGG3GGNSGPWNPPPPGLRGSP 
PPPPQRTGKGGPYPPPPGNFFFKFFFRK 
RGV/PPL*HPGGPQSVT*GTPPPGPPKF 
WGAQSGGPVWGKNPLF 


10272 


24173 


A 


10348 


228 


1 


SKQVKEKYDSFLHKIQKQAT*NRTFFRV 
TKKKVE I TKK/ WAGHWAHAYNAS TLGGQ 
GGWITLGQEFETSLANMVKP 


10273 


24174 


A 


10349 


405 


1 


GGFLKRGPPFFFFLRQGLSLLPRLECSG 
MTVAHCSPNI LDSSNLSLRRS WDYSHQA 
QIi I FL VS F* VLGSHS VAQAG VELLAS / P 
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residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=G)utamic Acid, F=PhenylaIanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q-=Glutainine, R-^Arginine, S-^-Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=-Tyrosine, 
X=Un known, *=Stop codon, /-^possible 
n ucleotide deletion, \=possible 
nucleotide insertion 














KCWDYRHEPL/WPSQSQNLNSFFLDGFS 
LLLSRLECNAAISAHCNLRLLGSS 


10274 


24175 


A 


10350 


301 


546 


RHI FMNLPFHNLVKI I FKCRGKTLFLDI 
SNP YLKKQVS S S KPNCK /HLKE I * LFLE 
AKDYWLGWPKAYNPSTLGGQGRS ITG 


10275 


24176 


A 


10351 


873 


1258 


FCYQCYFHLLKHLHHHRPSPRLEPPPPG 
AALPPVPEGRI IPLPCPSSGPRQTTGPG 
KGRRHPAPHRTPAASAASLPARRTLPAA 
PW* PLSFQNTTPGSA\PGGGNQPASGAG 
GNSRT 


10276 


24177 


A 


10352 


184 


576 


GLSPLSHLKMAASSSEISEMKGVEESPK 
VPGEGPGHSEAETGPPQVIiAGVPDQPEA 
PQPGPNTTAAPVDSGPKAGLAPETTETP 
AG\PQKQPRPQTSA*AQEGNQRPTAAPK 
THAQETVSKPEVSKEATAD 


10277 


24178 


A 


10353 


110 


384 


VPGSPRKTRSPQQMERVLAPKTAKKGCP 
RPGDHRSRLQNCRDTRAR I KWADS VERP 
PWAG \ PVAHACNPNTLGGRGG W I T*GQK 
FKTSLVNM 


10278 


24179 


A 


10354 


123 


422 


RKQIANDKSKLFIINKYLNVNELNSPIK 
/RERLAGWIRKEKHDPSICHL*KTHLT\ 
KDTNSLKMEG* KKI YHANNY /QKKAGVA 
ILISDKIDFKFKKKWREK 


10279 


24180 


A 


10355 


36 


426 


PPERPQYS ARVCV C VF YTECRGIiAPKPE 
CRGAI IAHCLLNVXiGSQ*SSNLRVWSS* 

dytql i fklfvqtgshciaqagfell / s 
skdpptla/ fpkhwdyrh\dhhsw 


10280 


24181 


A 


10356 


193 


3 


FQTRI FKISFFFKEFLKFKKFFFFILI * 
ESR£IAQAGVQWRDLGSLQPPPP\GSSS 

ppasasly 


10281 


24182 


A 


10357 


205 


377 


S P WFQP YYRLT IPYQKC* LATVCI * VY 
* FFKL I LYPAI LLNS 1 1 1 WINFTSDPL* 
FS SH I TG * LSL I KNVWVGPGMLAHA\ *N 
PSTLGG/RGGW IT*GQEFETSLANMVKP 


10282 


24183 


A 


10358 


51 


376 


LCRPFQLCHGDRTDMWCPFPEELQDRCS 
CRSTSMRVAKGTTPCFLKVLQNIYTV*M 
FCRTFRKHGWPLATRMIiVX I YKNSGPV 
TLQGRGTTYLSCHHGKAGRVCRATQHAA 
GWVTKQRACI 


10283 


24184 


A 


10359 


295 


376 


ASELTSQSPGSLFENQKKGRCIAFLPC* 
KCNJSEILVHCWWGCKM\WKIVW*FLKTL 
\ RLELVYDPAS PLLAI YPRC I 


10284 


24185 


A 


10360 


451 


i 
i 


CQDI KTI LRNEQE VLVI PSVTK YXQVGK 
SFVSRRYLWSGCVLCYAFVTEASKTWHH 
FP YRS PGFWGRSGGRQFDS * PKSQADFR 
APQGPPTPEPEGAERREPEIAPPPTADP 
SPKPT/HSPPRSGAPPSAPFPLLGRAER 
GVG\PGQGRPRP 


10285 


24186 


A 


10361 


2 


776 


FFFLRWSLALSPRLECSGAI SAHCKLRL 
AGSRHS PASASRVAGTTGAR YHARLI /L 
/ YLFLVETGFHHVSQDGLNLLTS * SACL 
GLPK\C\WIYRREPPRPAYFIFFLRRSL 
ALS PRLECVQ WCDLRS L.QAVP PG \ LHHS 
PAS AS LVAGTTGAHHHTRLVL I LL\ FYY 
FLRWS VTLS CRLE \ AGVQWHNLGS LQAP 
PPGFTPFSCLSLPSSWDYRCPPPCLANF 
LYF* *RRGFTMLARMVS I S* PRDPPPSA 
SQSAHDLFI VGQ 
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USSN 
09/515,1 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenylaIanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, ^Methionine, 
N=Asparagine, PHProIine, 
Q^GIutamine, R=Arginine, S^Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *—Stop codon, /=possible 
nucleotide deletion, Y= possible 
nucleotide insertion 


10286 


24187 


A 


10362 


188 


365 


CPQLHTQSNLINMDASQRYQFELKIQAL 
/WAGTVAHACNPSTLGSRGGWI T*GLEY 
ATSL 


10287 


24188 


A 


10363 


146 


1036 


GWAVYQTLQAAWLGPRTESGVS FHGRFS 
LPHHPPQPTLHAWPAPPLQPEPCVPLRE 
DPFMPESW*LPPS IPAEGVSF* ECDFSR 
RMMGKGS P VPTVAP PLGPAPE P FCVTAG 
CAPMHTPTHTQ\PPHCSIFLPKISFKSK 
HFY*LLFCKCLSSSVFSLPESILLLFIV 
ESVCQPGERCIiALNMGWDGKGSGRRWAT 
KSQGVMD I AATQTPA / VFS P V/ PATS CP 
AGVSGGK 


10288 


24189 


A 


10364 


229 


1 


PLENAKS I YWGKGRIjLKNNARLFTVA* S 
WNQPQFS LMVDW I KKK\MWRMYTTES CA 
AI *R/DEI VSCAGTWMELEAMY 


10289 


24190 


A 


10365 


186 


449 


RFFFFFPXFFWVFGLKSPFFFFG/RLGI 
*EKDGGFFFFFFFFPEMASGSVA\RLEC 
SGAVSAHCNLQFPGSGSLPASAPRVA 


10290 


24191 


A 


10366 


445 


3 


RVDNGVRGRYRGKRGSRGR* TRWGWGHP 
QRWDQDRSRERPRETGRERAAGRRRETE 
AAETGRRRRERGR S DRNRN / RG RP RE RM 
I EAGRDAGRRER * ARQESWTE K KTARQE 
EGDRD /RQGHQETRAHKG WGNN * D * GRQ 
AERSENRRMY 


10291 


24192 


A 


10367 


1 


491 


NTAP F PTS LESA F VWAVQD R VS / LL S P . 
GPECRGAI\LTHCC/LRTPG/FKPISHL 
S /LPS / SWDYRHVP P PLDNL I LCRDNVS 
QSGL KLLAS SNPLTS AFP ST/ YD YRRE T 
PGPRASP * TQMQPCFMP/RS * SQP* TQI 
A/R*QSNWGSQ*VLNNRELAFGTMFFGQ 
RPKAFF 


10292 


24193 


A 


10368 


572 


40 


LHSFRQHFSNTYPQCTPGTP*VTQSLAA 
RPAGLPEPTSASGGAGAVTP PPRTD \ P P 
S VRWSGKKGLDPVLRGGAPRPS PANPAP 
APQGPAPRKGTECVPTASKTPRPGGRWE 
PAAPPHAHPGKDLQPPSTLVKECPEARC 
PRALTGDLPSSGENRNGLSAFQGQRRPA 
QKSGENCEN 


10293 


24194 


A 


10370 


36 


517 


PFLFSDIDFLFSFYFLSFFPF* KRGLIL 
PP/GPEGQGGNKG*WPLHPRG*REPPNL 
TPQTRGNKGNRQTGPVI PGI LRKNGFP P 
FGQGGFKP PGLREP PGLALQRGGNNGQN 
PYP/P/PGKFF*RTKRGEKSPQPPYNKP 
NKKFNPAPMEE FS S FKEGRGP S LN 


10294 


24195 


A 


10371 


60 


403 


DVEQPKLRSYPTLKCKPRSAILTICPWA 
LQYRSHGLI FGFSWGFLS FETES CS VSQ 
AGMQWCDHGSLQT/PTDSTSQAQAIDPP 
ASAP \QAAGTTGACHHTCL IFVFL* RWG 
LAKP 


10295 


24196 


A 


10372 


358 


33 


DSLGVPGAPLLSPGVTPHKQWFFRPYTQ 
IKKLGFFPQIFFGFKV* KAPFFFWGF3N 
LKKKGGFFFFFFFSEMASGSVA\RIjECS 
GAISAHCNLQFPGSSSLPTSAPRVA 


10296 


24197 


A 


10373 


264 


3 


RQTSVSGSNDGKSWPPADIDTI \kyvig 
NRLITTQQ*LIKLTSKQV|ITIHNTXGRT 
*SLILVSLI IFIATTNLLGLLPHSFTPT 
THCI 


10297 


24198 


A 


10374 


298 


384 


TLVI * KDG VHTTPHHOQHAYPGPKE YML 
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nucleotide 
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correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=G lycine, H=Histidine, I=Isoleucine, 
K=Lysine, LHLeucine, M=Methionine, 
N=Asparagine, P=Proline, 
v^vpiuidmine, rv— /vrginine, o — serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
A-unKiiown, — oiop cooon, /— possiuie 
nucleotide deletion, \=possib!e 
nucleotide insertion 














HKAPLLHSVSRDPSPFAQSSNCYNRSIK 
QEPVDPLTQAEPVPRDAGKMGKTPLSEV 
SQNGGPS/HK*GQYSGDV 


i moo 






105 Id 


QA 

o4 




QLR/PMLGGNFIALN/AHIKKLGRTQIN 
NLTSQLKELEKQD /RNPKARGRQE I T\ K 
I*AELKETET*KKEIQKIYEFRV/WFLE 

MTMTfPN PaQPT.TOPMTFPW'TTPl\JnK'nyNrV 

TSAPPDV 


10299 


24200 


A 


10376 


70 


363 


FHQKNLVPGLWVQLLGRRTLQSQHPPAP 
PPSSSPEVNVPSEVQAAD*LCCIFYDVK 

/ CLC* KRGEKNLVGKT 


10300 


24201 


A 


10377 


299 


2 


Y P VAGRGN R* G CGAHALG VGGAAM PLGA 
TISSVSLTISSLCAFPRRGGLEPSLPAP 

HTPDPPPT? /P*7iQQPQPnT7WHT VPPrWH 
nLctrtrtrcCt/ tr x^.O O £r O r\.Ljil VY XI i_i V ir rr V, 1 i 

PGLPPLPPPRLSPLV 


10301 


24202 


A 


10378 


302 


3 


WPTVATTCSCERKSHTSLT*NQKLEGTQ 
LSEEGMSKAEIGQKPGLCHQT/VQWNA 

VVlfTJ.Tnr T VM2\T'D^7MT , PiTT , D1f'\ flWCT.T 71 
J\J\JNJjlJiV£j J. iMMril tr V1N 1^1 1KA \yWoiJ±A 

DT\EKV*VIQIEDQTSHV 


10302 


24203 


A 


10379 


125 


375 


NEYVFLEKSLKLAGRVGSCLLGRPRQVD 
HLSSGAQD* PGQRGKTPSLQNIQKLSRH 
GGMCP/RRLRWEDHLS/SRGG*GCSETR 
ST 


10303 


24204 


A 


10380 


458 


3 


TTTCTTC 7\ T 0 TITO A DD 7\ CO TiTi OrMSTiTt 7 

SQAGPAPLRGWEN*GTRRRDDWPEASLK 
D PRRS GE PEAPL / PLRLML * S RL P P PLR 
WRRRPPPPDSHTPSPGNGKSPRVPRTGS 

TmKT? PF AT ,PP * A AfSFPT .FZMVJVPP\70T-in 
XVL' IVTv tr 111 rA J_i ±r xr xr / Ario I-j ir l ir.HlM V ir tr V ^ilvj 

GGSVRSRGSYLTAT 


10304 


24205 


A 


10381 


327 


2 


LSAPKKFFFQKSLFLGANFRSPPFPGRP 
FTNRFSDLNVNPLTKKIYFSHKSSNPPT 
FFFFFFETGSNS VT* SGVEWRNHNSLRP 
ESPG\SGDRP*V*HTTGAPYHTWLV 


10305 


24206 


A 


10382 


754 


1 


NTALETDTTFSGGGGGHQLCVRSRVSGG 
G*LIMPDLHNHPLQQTSGRKQRLQGEVT 
DPSPGMSGGAQTGPKAPPRPPSALRHSG 
SLQGQP* FAHLYCGT I PYPCYQTAPLPR 
nr>PP r rPTnM*r*pri'P2i\ phitpt pvpnpQP 

WPRPPGQAQAGRNKH* FSGQRSSRDTTA 
CSPCSQSKQSPSRPPCVNDS/PGQGPGP 
KAPAGPPPALPEASLGSVHCSP+VGVMT 
PTRPGHFLVPGKVTLEQAPGGGASGGPV 
p 


10306 


24207 


A 


10383 


39 


433 


SHDQTYTEEELLFMDEQRKWFLLMESTP 
G E D A TO WE I Tr KN LE Y C I NL VDS TVA / 
GVSNFERRSTVGQMLSNSIVCYGKI F/R 
♦ TfP /QnQMWOTCJT.T.PVMK'T CfiPP+pcTP 

TILISQQPSMWRQDPPSEERL 


10307 


24208 


A 


10384 


283 


1 


NVFSCKPTS SCLIS PDS PTPDRRFS KAD 
TYRNFMFPLRRHTHMHTHTYMHTHVHVC 
TQLCLHLR PH / 1 THGLS L/TCAP PVL* R 
PSFKPLHPTPV 


10308 


24209 


A 


10385 


12 


471 


S P P WWTHGFRYWE PT PPRFSVPHDALGQ 
FHTALFRLTRRFQLLSS CIN I S PHTCLP 
PCLGLCVGGPLEPAVCWAR/CPP *RGMV 
SGWKAS PLG /CPPTLliPPC 
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SEQ ID 
NO: of 
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NO: of 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Pbenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q^GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


10309 


24210 


A 


10386 


287 


1 


SFCVRSRPAPTSGRTPFPAAPPFPAAPP 
FPAATTPSTAEDGVPG*G/ PAPAANCSP 
HSSQRGS AAMG I PLPQAS/PPAPTPAP? 
SP*SLPFPPAPMY 


10310 


24211 


A 


10387 


336 ■ 


3 


LIEGKLTNRKDIRSKTASVRHHHQSPKV 
DKPTKMGRNQSRKAENS \ KI KSAS S P PK 
ECSSSPATEESWMENDELREEGFR* LVI 
TNFS ELKEKTGKLNPAALVCGRS EDQTT 


1031 1 


242 J 2 


A 


10388 


15 


470 


APSPDAMGR\ FTEEDKATITSLRCKVNV 
EDAGGETLVRLQWYPWTQRYFDSFGNL 
SSASAIMGNPKVKVHGKKELTSLVDAIK 
HLDDLKGTFAQLSELHCDKLHVDPENFK 
LLVNVLVTVLAIHFGKEFTPEVQAYLHK 
MVTWANSLRYK 


10312 


24213 


A 


10389 


3 


438 


QTQRE PTMVLS PADKTNVKAAWGKVGAH 
AGEYGAEALERMFLAFPTTKTYFPKFDL 
SHGSAQVKGHGKKVADALNNAVAHVDDM 
PNTLSDLNDLHAHKLLVDPVNFKLLNHC 
LLETLDGHLSAEFTP\AVNASLEQFTAF 
EITWLT 


10313 


24214 


A 


10390 


2 


444 


TMS FNTLAI CLDCLCSTLQP TRS I PG YP 
SSPLPGNPTPPMTPSSSVPYMSPNQEVK 
SPFLPDLKPNLNSLHSSPSGSGPCDELR 
LTFPVRDGW / S GALP PAAQ PGC 


10314 


24215 


A 


10391 


405 


3 


KMILTCLALPAQHCPPSACPQTMAQPPL 
PLS I KGAASMS D KLPY KVAD I GLATWGH 
KALDI VEN/EMPGLMGMQELYSASRPLK 
GAC I ADCLQ I TVETAILI ETLFSLGVQE 
QWSSCSI FSTQEHAVAVFAEAGMP 


J03I5 


24216 


A 


10392 


248 


5 


RIiNKVGGHGGNYEHLYVNKLETLQEMYK 
FLD I YNL PNLS LEE IE\NLNRP I TSNKF 
ESVI KHF PTKKI SGLDG FTAE FLTNN . 


10316 


24217 


A 


10393 


262 


3 


LPRDRQAGRSQGP WPQAVGRARKMPP F 
I SYQVYS KNFFE I ESCSVTQAGVQWRN I 
SSMHPPPP\GSSDSPASASQAPGIPGAH 
HHA , 


10317 


24218 


A 


10394 


287 


410 


ALKGS SRI L I MLGAWLTP VI PA/ LWEAK 
AGRSSEVRS 


10318 


24219 


A 


10395 


175 


1 


GGREVDDEIPLVRESTIFHI FFFFFETE 
SHSVAQAGGQWRDLGSLPSS\PPG\SCH 
SPA 


10319 


24220 


A 


10396 


661 


963 


STLIAFIVISTLFPLLDMTEIYFSLLDE 
IVDTLGEGAFGKWECIDHKAGGRHVAV 
KIVKNVDRYCEAARSEIQVLEHLNTTDP 
NSTFRCVQ\MLEWFEHH 


10320 


24221 


A 


10397 


1 


268 


PQTHREATMGLSTADWTNVKAGWGKGGA 
HAGEDGAEALERMFLSFPTTKTYFPHFD 
LSHGS AQDKGHGKKVADALTN \ AVAHVD 
DMRNAL 


10321 


24222 


A 


10398 


33 


302 


S RGATL I YVDNENGE PGTRWAKDGLKL 
ESGPS IKALDGRSQVSTPRFGQTFDAPP 
ALPKATRKASGTVNRATEES V\ KTKGHL 
VQEHP 


10322 


24223 


A 


10399 


16 


233 


PTDYEFNTTTYRECGPRRPEFSTSLDLL 
SQPCRAVYM\VGEKNDIPFELRIVDLIT 
GKSILGFGEPKSQEGDR 


10323 


24224 


A 


10400 


177 


1 


KLLDLGQPQS PALAACLEYSGAI AAM \ C 
NIOiLGSSNPPTSASR J ARI tgvkhaep 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
uing io 
i.tsi amino 

acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, FHPhenylalanine, 
G=Glycine, H=Histidine, I=]$oleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q = Glutamine, R=Arginine, S^Serine, 
j — j oreuninc) v — valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unkno\vn, *~Stop codon, /^possible 
nucleotide deletion, \— possible 
nucleotide insertion 














YA 


10324 


24225 


A 


10401 


185 


419 


HVNKDTLESKLVWKLKSALLGRKSESVT 
GPSCHPBPSDKTVEL/WQGAVAHACHLS 
TLGGQGGWITLAQEFETSLDNIV 


10325 


24226 


A 


10402 


104 


3 


GIILIPKPGRDTTKKENF/RPIiIPIDAK 

X-LfiN Jx j. ixBIM 


10326 


24227 


A 


10403 


3 


344 


SQVMAVAGPAPGAGARPRLDLQFLQRFL 
QILKVLFPSWSSQNALMFLTLLC/LTLL 
EQFGNYQVGLI PNQY YGVLGNKDLEGFK 
TLTFLAVMLIVIiNSTVRSFPLSVSLVLS 

oJr V 


10327 


24228 


A 


10404 


3 


404 


ARAQRRGDLSATGRNWSPLPPAGLPATV 
VLRHSGS LMAATCE I SN I FSNYFSAM YR 

QTS LEGTEKANWMG \EQSQ FWPKTQGLD 
WISFQAEKNKYEASAIGFPRWDK 


10328 


24229 


A 


10405 


254 


443 


NQLSSIMVMFKKIRSFEEDFNDPEKVYG 
SGD KVAG\RL I VE VCEDTRVKAVRI LAC 
GVAKELRM 


10329 


24230 


A 


10406 


26 


461 


GE VARR S CCGAMAGTAL KRLMAE YKQL/T 
LNPPEGIVAGPMNEENFFEWEALIMGPE 
DTCFEFGVFPAILSFPLDYPLSPPKMRF 
TCEMFHPNIYPDGRVCISILHAPGD/DP 
HGLREQ\ PERWSPVQSVEKILLSWSML 
AEPNDES 


10330 


24231 


A 


10407 


1 


463 


QyAAKMAENSGRAGKbSG/ T / PARGRGR 
C/PAEQVIAGFNRLRQEQRGLAS KAAEL 
EME LNEHSL VI DTLKE VDETR KC YRMVG 
GVLVERTVKEVLPALENNKEQIQKIIET 

t TnnT r^nvpiruT "Krc i i?DC , vxnvTT'DT Mr'irrn? 
JLi 1 y \i Jjy/Uxo ivtSLiIV r* r iCE»I\riIN 1 Kljjyivjii U ill 

KPAAKENS EGAGAKAS S P 


10331 


24232 


A 


10408 


180 


446 


KKAAAS VSAASGSHLSNS FAEPSRSNGS 
MVRHSSSPYVRYPSDKPFLNSDLRRSPS 
KPTLGYPESNSRAIFSAL/KNLQD/KIR 


10332 


24233 


A 


10409 


1 


417 


EAGLVTMEEIGILGEKAQDEI PALSVSR 
PQTGLSFLGPEPEDLEDLYSRF\KKLQQ 

sLLitliE ijJiVWiljCiI ±JtUJl!»UlSIMJbJ\iVi!ir iirl/isjii 

EVKRIQS IPLVIGQFLEAVDQNTAIVGS 
TTGSNYYVRILSTIDRELLKPNASVALY 


10333 


24234 


A 


10410 


305 


53 


LEAQPHAPPLLGHHLPCWRI PSHLGTCR 
RPRAMRVQGSEEEGVA/GQAPPIiPWVHR 
G SL P LVRRRP AG KP TGGNAREE KGKVEG 


10334 


24235 


A 


1041 1 


419 


2 


KS E LLLTPQD KKHAPL PS AVPNP S A/ C P 
DLRPHTTPASPVS\PAKTVETRPSAPQG 

C LjtrKj tr VKKrKbKr o VK/v\oiry 1 oVjitfkj/i 

PRPPGPALREATDAPRAATPP I AALAGH 
S 


10335 


24236 


A 


10412 


74 


322 


MDEMATTQISKDELDELIEAFAKVDLSS 
NGFICDYELHELFMEANMPLPGYKARET 




Z'+ZJ / 


]± 


1 f\A\ ^ 
ILJ'f 1 J 


i 


*f 1 D 


POTHREPT^WVASADKT^^VMAAWGKVOA 
HAGEYGAEALERMFLTFPTTKTYFPHLD 
LSHGSAQVKGHGKMVADALTNAVAHVDD 
MPNALYALSDLNAHKLWLDP/VWFMLLS 
HCLL VTLVAHLPDE FTPAVHACLDKLL 


1033? 


24238 


A 


10416 


19 


454 


APS PNAMGHCTEEDKATITSLWGMVNVE 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino nrirl 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid. 
E^GIutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lso leu cine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 

O^C* In turn inp R— A roininp S=Sprinp 

VJ1UUIIIIIIIC, IV fll CIHIUCj <J JCI lilt, 

T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 

niiolPAttrlp incprtirm 














DAGRE?LGRLLWYPWTQRFFDSFGN\L 
SSDSAIMGNPKVKAHGKKVLTSLGYAIM 
HLJ3DLKGTFAQLSELLCDKLHVDPENFK 
LEjGKVLVTVLAIHFGI EFTPEVQASWPK 
KMNAVA 


10338 


24239 


A 


10417 


2 


395 


ALSSSQPSAPiyiHPFYTRAATMIGEIAAA 
VSFISKFLRTKGLTSERQLQTFSQSLQE 
LLAEHYKHHWFPEKPCKGSGYSMIGVNP 
\ KM I ST, I IjOAC I EGG I 


10339 


24240 


A 


10418 


2 


374 


lstphafgtgkwgtfisllflfssaysr 
gvfrrdahkseaahri kdlgeedftalv 
viafaqhlqqwpfedhWqlandvtefa 

TTCAADE S S EN CDQS LHTL FGDLL CSRA 
TLDESHCVQDGRP 


lfH4ft 

JUJtU 




A 


1 04.1 0 


48 


147 


DPKAOLPEPLRVLWOLTYAMAAGSR /TS 
LL/LAFALLCLPCFKAGPPNRPLSRLFD 
HAMLQAHRRTTAIDTYQEFENLYPKD/Q 
QYS/FLMTPTSSALDSIPTPS 


10141 


7494? 


A 


10420 


J oo 


1 
1 


NFLKKPGPFOKFFORGLPLILTFFFFFX 
FETKPHSVSL\LECGGTVSAHCNLCLPV 


10342 


24243 


A 


10421 


2 


251 


LGCTQHRSQELVAAATSHQTCIQASEDV 
KE I / FARARNGKYRPLKI S I ENGQLMI G 
SY/SSQPSDSWDNDYDSFVLPLLEDKQL 
CY 


10343 


24244 


A 


10422 


251 


1271 


KEDLS PRA PMSGTQST I TDR F PLKKPIR 
HGS ILNRESPTDKKQKVERIASHDFDPT 
DSSSKKTKSSSEESRSEIYGLV\QRCVI 
IQKDDNGFGLTVSGDNP\VFVQSVKEDG 
AAMRA\GVQTGDRI IKVNGTLVTHSNHL 
EWKLIKSGSYVALTVQGRP.PGSPQIPL 
ADSEVEPSVIGEMSPIMTSPHSPGASGN 

11 rui\. J. lor V J-t. Iv3 ri iztvi IN V V lTJ.NJVJ.rV VCil J-jtvIYl I 

LQKEQERLQLLQEDYNRTPAQRLLKEIQ 
EAKKHIPQLQEQLSKATGSAQDGAWTP 
SRPLGDTLTVSEAETDPGDVLGRTDCSS 
GDASRPSSDNADSPKSGPKERIYLEENP 
EEQKKG 


10344 


24245 


A 


10423 


198 


569 


QRNMVGQRLVEPRRLKPGFINVKSYNGD 
WEWHQGFFLVCFFFLRWSLALSPRLECS 
GAISTHCNLCLPGFKQFSCLSLPSSWDY 
RHAHHCTQIiIFVFLVEMG\FHHVGQAGQ 
GFFSLEKSLTII 


10345 


24246 


A 


10424 


2 


343 


PQTQR E P TM VL»S PADGTNVYAAWG KVGA 
HAGE YGAEALERMFLS FPTTKT YFPHFD 
LSHGSAQVKGHGNKVS \ DALTNAVAHVD 
DMPNALSAI^SDLKAHKLLVDPVNFKLLS 
HC 


10346 


24247 


A 


10425 


31 


342 


RAAVMPREDRATWKSNYFLKI IQLLDDY 
PKCFIVGADNEGSKOMOOIRMSLRGKAV 

VLMGKNTMMRKAI \rghlennpalekll 
phirgnvgfeitkedlteir 


10347 


24248 


A 


10426 


196 


2 


knliisqtkkvicpntlflvfpkrvfff 
ffffetesrs/vsprlecsa\sisahyn 
lclpglsdss 


10348 


24249 


A 


10427 

i 


1 


405 


rlecwlepphgaglqglgwvwscsvstg 
ptmqalv\lllcigallghmscqnpasp 
peegypdpdstraleddedpifidpenk 
l p sag snfg ydl yrxira i m 3 ftsn~ /llp 
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SEQ 10 
NO: of 
nucleotide 
sequence 


SEQ IB 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

icolUUc OJ 

peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 

I<1SI diUlDU 

acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P^ProIine, 
V^vyiuiamme, k— Arginine, oerme, 

T— ThrPAninp V— Vilinp 
ji — j hi cull inc, v — vuiihc, 

W=Tryptophan, Y=Tyrosine, 

1L *—I in 1^ n twmin & — Qlnn r* t~\ rk / — ri /\ r L' I rl If* 

j\ — uuKnovvn, — oiop couon, / — pussiuic 
nucleotide deletion, \=possible 
nucleotide insertion 














PLMVATDLWDLSLEAQQLTESI I 


10349 


24250 


A 


10428 


223 


361 


EGFTEENMAAKVFES/ IGKFGLALAVAG 


10350 


24251 


A 


10430 


3 


359 


LTQREPTMALSPGDMTNVKAAWGKVCAH 
AGD YGAEALERMFL, YFPTT \ KTY FPKFD 
L SHG FAQ VKGHG KK WDALTNAVAH VDD 

CLLVTLA 


10351 


24252 


A 


10431 


273 


1 


NHKDGKKQSGKTEKS KNQS AS P PPKERS 

YSELKEEIRTHGKEVKNLEKKLDDRLTR 
ITNAQK 


10352 


24253 


A 


10432 


211 


3 


SFLWKFCLRGI PSHVSCQSA/ LLIjGGAS 
TTLFNAVRQGKLSLQ 


10353 


24254 


A 


10433 


281 


2 


TNQEKKKGDPNKIRSGKGDTTTDNTVTI 

T Dnwr / UTVTMl'T VAT /t PCTfriYCT "rvTV 

XKJJ Jf lu/ Til 1 AaNAIjJsJN/ Lib hi VjJivr hul 1 
YLPRFNQREIENMNQPITSNEI ESVI KS 


10354 


24255 


A 


10434 


219 


1 


S FLWKLRLRGAPG CMRCQS A\ LLGGVSH 
TLFKAVREGCLSLQKF 


10355 


24256 


A 


10435 


133 


2 


FPKPPGNFFFFFETESHPVT \RLECSGA 

TCl\TJr , T\TT IIT DHCCflCDa 

X bnnL v* Lai J-iiroo o D o f /\ 


10356 


24257 


A 


10436 


248 


376 


KGGVFFFFFFFFETESCSITQTGAQWHT 
\YGSLQPRPP 


10357 


24258 


A 


10437 


250 


1 


KGTRKELTFIEQLLSAKYWAAGCMHYFR 
GSGKYSNGYPVFVFVFIFSETESCS\VA 


10358 


24259 


A 


10438 


354 


2 


KRGLKKNPFFNPRNMGEFLKLWGPNPKK 

GKLNPFVI FFFFFFFFFFFLETGS YS IA 
\RLKCSNSGTNTAHCSPELPGSRDLPDS 
TTQVAG 


10359 


24260 


A 


10439 


126 


1 


RERQREKER/ERERDKERKREERK 


10360 


24261 


A 


10440 


29 


347 


IQKPTADTKKFFKKKLKYTTRENHLHNK 
EDRKEGKKEEKTTKQHGSSKSLLINKTL 
NINGLKSPIKTHRVAE/WNKNKDPMLYC 


10361 


24262 


A 


10442 


405 


1 


IFFPHPFFFKKSFRLSPQVEIQGIFLGP 
WKLFPPGFNNFS/CPNLPNNLGFKGLPP 
RPSYFFRI FCKKKGF/H/RVYPEGFPFL 
at riPTypT.TrpPVcjwrsiJ'nrinppQW/^ipp'C'T?"!? 

FFFVLVAEGTLS EG CRLIRGRW KNLVP 


10362 


24263 


A 


10443 


122 


372 


L YND P I S I YQVTL YRRVFF WVL YSANWS 

T D r*TTf E 'MITT . J? ITQ IT "P W~M 7i Q rYDT 77i PiA O \7P1Ta7 <*"• 

DLDS LQ PQ P P \GL \ S DTPTLASQVS ETT 
G 


10363 


24264 


A 


10444 


171 


3 


TESCSVT/RECSGAISAHCYLSLPGSSN 

T.P 






A 


10445 


i / j 


3 


CMNWCFSIFIFLKINFILETESHSVSQ 
AGVQW\LITVHCSLKLMGSSDPPSSASQ 
V 


10365 


24266 


A 


10446 


187 


329 


S FLWKFCLRG VPGHVRCQS A \ LLGVASQ 
LGYSRVRNPLEEAVCPFSDL 


10366 


24267 


A 


10447 


27 


226 


YS YPVYLLLHI FLCWKCSLYFI YF VFEM 



1181 



WO 01/64835 



PCT/US01/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid' 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Pbenylalanine, 
G=Glycine, H=Htstidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, PHProline, 
Q=Glutamine, R=Arginine. S=Serine, 
T=Threonine t V=Valine, 
W=Tryptophan> Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














ESCSFAQAGMKWCDLQPLPPG\SSDSLA 
PASQVAGVDAA 


10367 


24268 


A 


10448 


119 


296 


SPIHYILVIIICFIHVILLIIIFSEVGS 
HSVAPAGV\SGAI IAHCNHKLLGSSSPP 

vsvs 


10368 


24269 


A 


10449 


111 


363 


S LKKLVFFFS I YTHNFFP FFDPTLKNST 
RHS LYkSLS LSLFLSHS V\ S RLECGGT I 
IAHCSLNLPGSSDPHTSMSQV\AGPTGS 
C 


10369 


24270 


A 


10450 


218 


1 


QRFWGPGPNKVFKNLGSAFFLGRGKLFF 
LEIWGGFFFFFLVFFLRQSHS IAQAGVQ 
WCD\LNSLQPLPPRFK 


10370 


24271 


A 


10451 


179 


3 


G P FKLTLFFMGKFAIiGRE KCH FFF FE TE 
SCTVAPAGLQWRDIjGSLQPPPP\GSSDS 
PAS 


10371 


24272 


A 


10452 


71 


232 


SSPLSDLLFNIVLSVLAKDIMQEVE/ IK 
LILIGKDEVKWSLIVDKIILYVKNSKH 


10372 


24273 


A 


10453 


153 


3 


ARGELYFLEIGGGFFFFFLVFFbRQSHS 
IAQAGVQWCD \ IMS LQPLP PRFK 


10373 


24274 


A 


10454 


338 


221 


MGFLHVGQAS LKLLTS I C P \LGLPKCWD 
YRHEPPRPVIPS 


10374 


24275 


A 


10455 


214 


366 


YNYKYLIHYYSYLFFIFIFF/NIIHYIF 
FILFFYLTSFIFLSYFFFFFITLF 


10375 


24276 


A 


10456 


155 


2 


DRVSLLLPRLECNGAILAHCNLCLLGSR 
HS / PRVAG I T VAHHHARL I FC I F 


10376 


24277 


A 


10457 


870 


1249 


EGPRWADHLRPGVQDHLG \QHGETPSIA 
QKIQKiiARRGGTC/LCQSQLLGRLSQEN 
CLNLGDGGCS E PRLHHCI PTWGDE\GDS 
PKKKKE 


10377 


24278 


A 


10458 


222 


404 


KQRPGRAQWLTPVI PKLWEAELGR\MLE 
ARTLRPAWAT 


10378 


24279 


A 


10459 


122 


362 


LSEDQLNCYPQRLHHFIFLPVI TRVQMS 
HYLCQQSCLFHFF/S FFFI/YFEMESS S V 
T\RLECTGMISAHSNLYLPGSSQSSG 


10379 


24280 


A 


10460 


3 


102 


AASTLALSPRLECNSAILAHCKLH/LPA 
FTPFSCL 


10380 


24281 


A 


10461 


133 


339 


KLSKFQGSKKPEIKANSLDLALFYHNYI 
TRQCFTS YSNFFGDRVSLLS PRLECNGV 
ILAHCNL/RTPGFK 


10381 


24282 


A 


10462 


3 


193 


ASNMMTELKSQIP ILTLNVNGLNAPLKR 
HRVAGWINDPTICYLQETYFT/CKNNHR 
LTAK 


10382 


24283 


A 


10463 


126 


314. 


RLGGSTIKVQRDLSSDRSKPGRFLYTSN 
SSYWKGKRNKACFTERVP/ PVNDI I 


10383 


24284 


A 


10464 


1 


332 


KKNLFLSKDCLSSLFRFSKSPPAILGPP 
RGWTPKTRFIGPKAFFFKKGSLSLGRKY 
HFGLKPVP/CPPFLSKTRFLPGVPFFFF 
FFFETESHSVA\RLECSG/AI s AHCKLR 
L 


10384 


24285 


A 


10465 


173 


365 


RFVCSTIKVLRDLSSDRSNPGRFLSLQI 
PP\LRKDKRNKAYFTX/RPS PVNDI 1ST 


10385 


24286 


A 


10466 


335 


3 


PFKKFYFKISTRALLEPVFFGKKIRVPP 
QRLS P I WAPLFKRG PPNSFQVNG VPLFF 
FFFFVRHGLALSSGVECSGTRLECSGAI 
/STAHCHLCHLDASHPPTSASWVARPTG 
T 


10386 


24287 


A 


10467 


818 


1008 


VHTDVLAD I LKS XNNAEKRQT/ RRCFLS 
\ WLQGHCP FLT VMM QHGLCGVELGG LPG 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue nf 

peptide 

lennenee 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
list amino 
acid 

residue of 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
0=Glycine, H=Histidine, l=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 

0={^lntaminp R— Aroininf* S=Seririf* 

T = Xhreonine, V = Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown *~Stoo codon A^nossible 
nucleotide deletion, \=possible 
nucleotide insertion 














PGDFCKRPR 


10387 


24288 


A 


10468 


111 


1 


MVS FGRPRQADHKV\GVRDQPGQHGETP 
SLLKNIRI 


10388 


24289 


A 


10469 


204 




LWGPILKIFCQILKTICEGNFRGPTFFI 
FFFEMESQSVTQARMQWCDLSSLQPLPP 
GS\SNSRASAS 




94990 


A 


10470 


100 


330 


ERIKKQDLS I CCLQVTHFTFKDSQRLKV 
KGW KK\ I FHTNKNQKR I WT 


10390 


24291 


A 


10471 


162 


2 


IKGPKRAPPANKGRAFFFFFFFFETESC 
SCLQAAVQWHDLSSPQPPTP\GSSDS 


107Q1 


949Q9 




10477 


170 


7 

J 


SWGFFFFFF \EVES YSVAQVGVQWCDIX3 
SLQAPHVMILVK 


10392 


24293 


A 


10473 


298 


1 


RRAL PCPAN FLY FYFLS LLLFFETE FHS 
\VAQAGVRWHELGSLQTPPPG\SSDSPA 
SGEYSANYTGEYSANYTGEYSANYTGEY 
<7 VN YTGE YS VNfVTGEY 


10393 


24294 


A 


10474 


1 


380 


AVTGRAGSMVAPR\ PLRRLVLFYQGKLR 
SMAGNFWQSSHYLQRILDKHDLLNERQK 
DLKVLSEEEYWKLQRFFTKVIQALCEHL 
KLRQPG IATATAYFKRF YARHS LKS I DP 
VLMAPTCV CLAS TAN 


10394 


24295 


A 


10475 


289 


2 


TQGFPPINNS FYALYPKQKIPFFSDKLF 
RVKGFKGPVFHYGVKDFKNKDGVFFI FF 

i\ W XVvJJ 1 1VVJ XT V i 11 iVJV XVJ-> i- l\lil\_L/V7 V 1. 1 -LI 1 

FEMASGS VAQAGVQWRNLGSLQPP I \ PG 
SSSLTTSAPRVAS 


10395 


24296 


A 


10476 


138 


402 


ITGRAPPKPFPLKRGYLSGGLTMIGFSF 
S FS FFETESRS VTQAGVQWRWLGSRS VJL 
EST/SPDSSNS/PAPASRVAAIT 


10396 


24297 


A 


10477 


2 


336 


ARETNDTTQTRLLPANKTMTRSDPYI SI 
LTLNVNGLNT P I KR \ KRVAS W I NKQGLM 
GCCLQETHLS SHETHNDTHKLKI KT WRK 
IYQANGKQKKARW/ILISDKTDFKP 








10475? 






RKVSLFKKWCWNTLFICKKALSLDTYRF 
AKI NFEWITDLNVREKT / 1 KLLKGNI KE 
NICDLGLGKDFF 


107QR 

J UJ76 








71 fi 


9 


GFTRKALSGFFKTGWFWVFPNFSKLFFK 
KNFSERFFFFFFFEMESHSVT\RLECSG 
PISAHCNLHLPGSSLV 


10399 


24300 


A 


10480 


191 


3 


FSTDGNVNCLVKYS IYIYI YFFFFLI FF 
FEMKSCS /VSPRLECNGALSAHCKLCLP 
GSCHSPARA 


10400 


24301 


A 


10481 


135 


768 


LGGQVLSCASQPVSLFPLFFFLI SFLGC 
LLAFLSACFASTFSCVLGS PETSFSPCL 
HPPPKVLSSHP\PPCSRP/CPQPPKG/P 
PLPKHACPP 


10401 






104R7 


176 


777 


MWGQRYCVMCFVI \VLCLFLLV/WWW 
FLW/CSVYIVYWYGYAWLFIFFFFFFS 
LLCFAVCLFVWLLDRF 


10402 


24303 


A 


10483 


240 


2 


ATEKNMGLGTSVRGLSGSNPSYGRVRER 
IGGGVCLWSQILLRGC\LRWEDHLGLGG 
GGCSEPRSSHCTPTWVTEQDPVLV 


IvHUj 


74704 




104R4 


17£ 
1 JO 


9 


NRPSYIFSFFFETESHSVAQAGVLWHDL 
SSLQPLPP\GSSDSPARA 


10404 


24305 


A 


10485 


161 


328 


EKNFKPYGSCRHFYLFIYFETESRSVA\ 
RLE CSGT I S AHCNLRLPGS SN / S PAS 


10405 


24306 


A 


10486 


193 


347 


I LFS FLLFI Y FFETESCSVAQAGVQWHD 
HGSLRSRAPG\SYWAPALDSQGTE 


10406 


24307 


A 


10457 


114 


2 


GAWQSEVQWRDSGS LQAP PPG \ SRHS P 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last ammo 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AKanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
GKJIycine, H=Histidine, Wsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
i — inreonine, v— vaune, 
W=Tryptophan, Y=Tyrosine, 
X=Unkno>vn, *=Stop codon^ /=possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














ASASQVAGLV 


10407 


24308 


A 


10488 . 


314 


49 


I RRSTERSE PRYSMVCRRQPRQGGPCL \ 
RSQLLGRLRQEDHLSPIAGDCSEPRLH\ 

t 7ir** r pr>7vM7v r PC/^'CDxrr vwtuiodtt T?r*j ttt t*t 
LiAUl rAWAItiyiSFvJbJxivMfaF 1 lir L-Lir L»W 

PLPCQ 


10408 


24309 


A 


10489 


291 


1 


KSPPPPNRRGLTLRSRGQIGNGVSFYYE 
LQLPPPG\SSDSPASASRVAPVSQGHAT 


10409 


24310 


A 


10490 


226 


3 


GSRCTMPRRRLAHAFPAARMPKRKVSST 
EWATEEEPKRRSARLSAEPAPAKVETK/ 
ptttt 7A 7AnTrnvc; QmTirvrvnf t.v 

C & JVrt/\ljZS»L>rLO OU I\J\. v v J- V 


10410 


24311 


A 


10491 


2 


153 


ARGLVFFVYLVEMTLCHVAQAAVKPLTS 
SEPP\LGLPKCWDYRREPPRPT 


10411 


24312 


A 


10492 


123 


342 


TH YLCLLR F FF JbDRG F JLaULi PRLD C / NAP 
I S PNLNLRLPAS S D S PASAS R VTG I TW 
NHYARLTLLFIFQNETLF 


10412 


24313 


A 


10493 


194 


2 


IPGWPFKNFSLFPLFFFFFEMESCSVAH 
AGV/CyfaSDr fAbAby VAIjI IfaMKiWAWJj 
IFVFFSRN 


10413 


24314 


A 


10494 


373 


1156 


KVQLKVYLFFFLRLSIiALLPRLECSGTI 
LAHCT^PPSRFEQFSCLiSIjPS S vvNYkRP 

pphharlifvl/ SVETGFHHVGQAGLEL 

Li I bLriJFFAbA/ r PKL-WUxKriri 


10414 


24315 


A 


10495 


181 


3 


SFLWKFSLRGVPGRVRCQSA\LLGGASQ 

T PVCPT DTVDT AD 7\t7Y"' , D CCHT ITT OAPDTT 

TLFK 


10415 


24316 


A 


10496 


156 


2 


AASTTRLGDRDQPGQHSETPSLLKIQKL 
G\ SGGGHIiRLRQENCLNP\GGRGC 


10416 


24317 


A 


10497 


157 


300 


JL^bWr N±Jji\fa±H V IHlibNKJjrv/ c*ViMnl V 
STDAEKAADKIQHPFLIKKQ 


10417 


24318 


A 


10498 


3 


289 


Li V KKM It i vj fa 1 lAjKfa x Kiib P r fa 1 W y vjHINA 
LRKAAG\PLPRKAGY\LQGFSPLRYGL\ 

TlU^lSAJLtl XiT\2rU 1 K£iVj5 VJjbKlolvKbKr 

L/CSCLPLGTECLS I 


1 f\A t O 

10418 


24319 


7\ 

A 




1 o o 

lio 


1 


LQPPPP\NSSNSPASSC 


10419 


24320 


A 


10500 


138 


I 


WEGLIYLLLEMESCSVAQAGVQWHDLGS 


JU42U 






11DU1 


zUU 


5<\5 


T.'R , 7AHQTrT,QPPTT?T T T f?PriTK^H ( ;\7T\ PT. 
jjfirtnorjjor r in x xr rui Aono vi \X\.ju 

ECSAAISAHCSLLLPGSSNS 


1U421 






JUjUZ 






QEDHVS PGGRGCSE PRSRHCT PAWAT\ E 
SNSVSKKS 


10422 


24323 


A 


10503 


141 


362 


EWNCCVCC/CCCCCCRCF 


10423 


24324 


A 


10504 


1 98 


320 


\RFLDVDNRVKLPV 


10424 


24325 


A 


10505 


30 


356 


GYPCNPILYYMSTVGFTTSLLDFDIPLE 
HP I LKVCPTHTRALRDLS SDRSNPGRIL 
STSNSSLiY/ r.KXJPRNKAYFTK/ RPbPVN 
DUST 


10425 


24326 


A 


10506 


16 


375 


FFGRGWPKKS PPKKKVFFKKFPGGFKKP 
PLKKKKI FFS PPVKMGP PKGFFKKGPPP 
FFFFFFFFFFFFFFFFFFFFL / LCLI PR 


10426 


24327 


A 


10507 


156 


2 


AASTTRLGDRE)QPGQHSETPSLLKIQKL 
. G\SGGGPLRLRQENRLNP\GGRGC 


10427 


24328 


A 


10508 


313 


2 


QSKWLLEEESTLGEDAVNIVEMTTKDLE 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

Amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
uing io 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Pheny!alanine, 
G=Glycine, H=Histidine, I^Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 

— *jfiu uniinc) lv — Argininc, o — oer me, 

T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X = Un known, *— Stop codon, /—possible 
nucleotide deletion, ^possible 
nucleotide insertion 














Y YRNLVD KAAAG FE R I DSN FE S S S TV AK 
T T iQNP T VP /V /PPTT.WRi^T^QPPMWnT^T. 
SYFKELPQTTQHSAI ITLTSPRA 


lU4zo 




7\ 
t\ 




32j 


l 


YEHYNKKENLEAMDKFLDTHNLPRLNHE 
EIENLNEPIMSNETETV/I/NSLPSKKN 
SGSEGFTAEFYQRYKTPLILKLFKK 


JU42y 


^Afyn 
243JU 


A 
i\ 


1 \)j 1 U 


J2o 


1 A 

14 


r>QQfinvnWT.TfDT TPV /T.WT7PPAP.PQPT7V 

RSSRPACPTW 


10430 


24331 


7\ 

rz. 


IUjI I 


A1 K 

435 


I by 


£ v IVr ±r irXikSrlriV-. uuuo Lair l_tvXJK.iJijV—ovj V X 

T\AHCKPQLPG/SQSDFPASTSQEZjGP/ 
TGACHYTRLICCREEVL\HVAQAGLELL 
P P KD P PT A 9nH A R Ti YPH \ GH P AO r » 


10431 


24332 


A 


10512 


147 


412 


RFVCS T I KVLRDLS S DRSNPGRFLSTSN 
SSLY/EKDKRNKAYFTK/RPSPVNE 


10432 


24333 


A 


10513 


251 


466 


QTCPTGSPASLLELFNSIATQWELVRSlx 

Xt\\jL\ f\±J l\Xv Hi IN XJ \o/\v/ ru ¥ lXiriOXi Xl'lO I IVrVrt. 

AAEDYKADCPPRNPGP 


10433 


24334 


A 


10514 


286 


1 


SFLRKFCLRGVPSRVKCQSA\LLGGASQ 

T .nY QP VP V DT .17 T? A VP P TTQWT .PT .T4 AfZP TT 

TLFKAVRQEHLSLQRI LLPLVGRVGRPN 
DSRCDRVIGH 


10434 


24335 


A 


10515 


28 


428 


RFVCS T I KVLRDLS SDRSNPGRFLSTSN 

OCT V /T?VTllrDMV7iVI?TV /D "DCDTrMTlT t c 

ooLiI / £J\JJ]\K]NI\Ai r 1 J\/KFoi'ViNL/X io 

T 


10435 


24336 


A 


10516 


263 


2 


I KKP ADS E S AGAL I LD FPAS QT VRNS FL 

T T TCUCVPVPPVC CDMPT DnnT lfTTTPP 
Lbicnb vcirv. i ooirlNvsijKvvxjJVXXXr r 

FFLRQSCSV\SRLECSGMILAHCNLCLP 

GSNN 


10436 


24337 




HOI 7 


62 


2y2 


xrJ-\XXj.r OV3 V X Ijir f I iNOX,lNfiX7 OvjoXJorvOi-iXl 

K/CEMLGTENFNFFTCHTHI IAEHSNST 
HTHTWTWTnTHTWTT-TTPT.T.FYF 


10437 


24338 


A 


10518 


172 


2 


RGKKI FFPLEKGKNLVWWITALFFFFFE 
DF 


10438 


24339 


A 


10519 


42 


194 


ILVETF\CLRFVSLENLVKMRFLKLKKK 
KKKKKKKKKKKKKKKXKRKKKKI 

Jv i- v IV lv .T\_X\_ Xx. J\_ rVAVJV IVlViv I X. I X. i\IV i. V J\ XV XV. JL 


10439 


24340 


A 


10520 


120 


1 


I FAKRWP I KFFPFFFFFFETVFHS \VAQ 
AGVOWCDLGS P 


10440 


24341 


A 


10521 


202 


375 


S FLWKLHLSGVPGHVRCQSALTGG\ASQ 
LSYSGVTDTIiEEAVCPFSDLKLRAGRTT 
TL 


10441 


24342 


A 


10522 


272 


402 


P KI F YVTKKAWNY YP YT I TE YTC S F/ 1 R 
KFFIHI ETKYEDNKGSN 




24343 


a 
rt 


11DZ3 


1 A1 

14/ 


43 y 


T . T T . WWV T <5 R V c; TW"n c; rtj p V YT . T PT? 
Xj X Xi X V1N IX OX\.X\kj XVI Uj t\Sitr ^ X X X i_j X Jtr Hi 

KNCYKYHCNFLCT/SSSIPFPTPSTPSR 
AS PKKKXKKKKKKKKKKKKKKKKARGG 


10443 


24344 


A 


10524 


7 


415 


RFVCST I KVLRDLSS DRSNPGRFLSTSN 
SSLY/EKDKRNKAYFTK/RPSPVNDIIS 
T 


10444 


24345 


A 


10525 


321 


442 


TS FSFL FF F / LETE FHS VA\RLE CRGT I 
LVHCNLCLPGSSNS 




Oyl 1AC 

243 4& 


ft 




All 




I EGKLPNRKDIHTKNPS VCRHHQRPKVD 
KTTKMGKKQSRKTGNSKNQSASPPPKER 
SSSPAMEQSWTEKDFDKLREEGFRGSIY 
SKL/REEIRTNGKEVKNFEKELDEWITR 
ITOAKKSLKDLKELKAEAQELRDECTSR 
AASANCVAVPQAFDRSHGVQ 



1185 



WO 01/64835 



PCTAJS01/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, 1=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop cod on, /=possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 


10446 


24347 


A 


10527 


265 


411 


RFVCS TI KVLRDLS SDRSNPGRCLSTSN 
S/SPVRKDKRNKAYFTK/RPSPRN 


10447 


24348 


A 


10528 


81 


390 


RFVCSTI KVLRDLS SDRSNPGRVLSTSN 
SSLY/EKDKRNKAYFTK/RPSPVNDII 


10448 


24349 


A 


10529 


253 


23 


R I FLF YFHS FLRQGLARAEGQ PHNHS S L 
QPPPPG\SGDPPASASRAARTTSAYHHI 
PLFFFRS YCVAQAKI INMLSLL 


10449 


24350 


A 


10530 


10 


398 


RFVCSTI KVLLDLSSNRSNPGRFLSTSN 
SSLY/EKDKRNKAYFTK/RPSP 


10450 


24351 


A 


10531 


289 


'423 


RFVCSTSKVLRDLSSDRSSPGRFLSTSN 
SSL/QRKDKRNKAYFTER 


10451 


24352 


A 


10532 


152 


401 


PHCSIHSQVRIQANVHKGHRQRTYGSVI 
PHILPLHVLKKTFSLRDFHFSVSLKKNL 
VLTCLHL FL / GVRT PRNDP F VSMMLL FT 


10452 


24353 


A 


10533 


2 


515 


LDRISLLLPRLKCSGPVSAHRNLYLP/G 
FKRFCSLGLPSSWDYRRASPRVAKFVFL 
VDTGFHHVGQAGLKLLTSSDPPSLTY/P 
QGWDYRREPHLAAF 


10453 


24354 


A 


10534 


469 


566 


PTKNTKI SREWWCVPV/ I WEAEARESLE 
PGKQRL 


10454 


24355 


A 


10535 


354 


47 


KPKNLPGGGGPPPYSSPSKGKKIPLPWK 
AKVPINQILPLPPHPGGGKKTPFPKKKK 
KKERKKKKMATVQKGM/PHKYYHGKTGS 
YNVI QHAVGNAVTNRTRGS 


10455 


24356 


A 


10536 


1 


409 


TPKKRFFPKKPKGFFFPPPKKKKKT FS P 
PPKFWPPQKNFKKAPPFFFFFFFFFFFF 
FFFFLKSNPFFFFF/SKNPILKPPLPPL 
FRVFPPPKKKKKKDP 


10456 


24357 


A 


10537 


204 


435 


TSGSLYKLLGMGDLGLGKTTI FKQGGHQ 
NSS \GWHPEATTG/VNVJAFKGLHWDPKA 
LAHLQLWNIAL/QER/FGKMTKACY 


10457 


24358 


A 


10538 


208 


382 


RFVCSTI KVLRDLS SDRSNPGRFLFTSN 
SSLFER/DQRNKAFFPK/RPSPVNDI IS 
T 


10458 


24359 


A 


10539 


63 


409 


GWRLTNFPVGKTPSTSGTGLLFIFLLLN 
LMGRLKPSS I SMKREFRKQNRGQKC / EF 
MKKKKKKKKKKKKKKKKKKKKK IGGGAF 
LKI FRGGPNFPGGEKIFFFFLGGGIKSR 
GGGFG 


10459 


24360 


A 


10540 


3 


1322 


RGYAWPNGALPASTVPCGFAACPGEFLC 
SVNGRQAPAGSSGFLPPVPSLCPHTVCR 
ATFQCKEDSTCISLRRFPWGSETHLLCP 
API PSVPGVPCGTFTFQCEDRSCVKTLS 
ASPLPLLLCPPHHLPPSPDCGLQGPSSR 
I VGGAVS SEGEV7PWQASLQVRGRHI CGG 
AL I ADR WV I TAAHCFQEDSMAS TVLWT V 
FLGKVWQNSRWPGEVSFKVSRLLLHPYH 
EEDSHDYDVALLQLDHPWRSAAVRPVC 
LPAR/ SENREQGLQ/ CSWITGWGALREG 
GPISNALQKVDVQLI PQDLCSEAYRYQV 
TPRMLCAG YRKGKKDAC/QGES PG 


10460 


24361 


A 


10541 


154 


3 


GVGNFFFFFLRWSLTLAQAGVQWRSLGS 
LQP/LTSGSLFRQFSCLSLPSTRP 


10461 


24362 


A 


10542 


j 


406 


LMVLWTAHLPALAPGS RTS LLLAF \ ALL 
CLPWLQE\AGAVQTVPLSRLFDHAMLQA 
HHAHQLAIDTYQEFEETYIPKDQKYSFL 
HDSHTSFCFSDS I PTPSNMEETQQKCNL 
ELLRISLLLIKSRLEPVQVLTSMF 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 

NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

ominn ariri 

<i 1 1 1 1 1 1 \j av.iu 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, LHLeucine, M=Methionine, 
N=Asparagine, P=Proline, 
0=Glutamine T^=Arpinine. S=Serine. 
T=Threonine, V=Valine, 
\V=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possib!e 

nnrlpftfirfp insertion 


10462 


24363 


A 


10543 


3 


414 


HSSAHAYGTYESSGKRKYLQRPSRTAEL 
KKALKEKKNRLLLQQRSKSVTSSSSSSR 
DSSARDSSSEREETSTSSSSEDSDTDES 

sssssssaystnssssfdsdsdsss\sr 
grststdsssadd/ stneeesele 


10463 


24364 


A 


10544 


240 


509 


TFTS FI IGTPNLGEFCKCNNFGI /GKFR 
MKVACQI ETLGI LS FFFETESRS VTRLK 
CGGAI LAHCKL 


10464 


24365 


A 


10545 


275 


2 


GCSRP FDTAI PLMGP YPEE KKLL FEKNK 
/DTCTCMFIAAHFPVAKMWIQPKCPSIN 
SWIKNPVTHIYIHTHTYMCIYIYTHICV 
CVIHIYI 


10465 


24366 


A 


10546 


233 


2 


WSWMAIQSVLPQTAFTELKNKYSCIQET 
PILFYFLKQSLPLSPSLECSGVIIAHCN 
/ LQ / RPGSTTFPAS AS KVERTTGA 


10466 


24367 


A 


10547 


2 


420 


KKKI PNRGGCLKRDTKAFLKTPPMLQRR 
TPPAFPKGPGGFLPDFNPLPPSKGNPPI 
YSWERFFPRFFKNIWAPVPKANLLSFFF 
FSIFWRTGSRYIAQAGLELLGSSYPPAS 
CL»PESWDSRR\DHRARPTI 


10467 


24368 


A 


10548 


190 


1 


AASTFFACVCFRLFFVFVFETES PSVAQ 
AGLQWHSH3SLQPPPNLG\SGNPPALAS 
PVAGNC 


10468 


24369 


A 


10549 


155 


1 


ARGEKYFLEIGGGFFFFLLVFFLRQSHS 
I AQAG VQ W CD\ LNS MQP LP PRVK 






^ 


1UJJU 






RFVCSTI KVLRDLSSDRRNPCRFIiSTLN 
/SLAVRKDERYKAYFTK/RPDPVYDIIS 
T 


10470 


24371 


A 


10551 


183 


3 


WQKLLFLFGTES CSVARVGVQWRHLSSP 
KPPPP\GSSNS / PASASRVA \TG AHLKA 
RLIFVFL 


10471 


24372 


A 


10552 


297 


10 


TDEELLCMEEKRKWFF \DMAS I PGEDTV 
N TDE MTKT / DLE YY I NLVDKV SAG FERT 
DTNF\ERSSTVSKMLSNSITCYRKI/F/ 
RSRKSQQMWQTS FLS F 


10472 


24373 


A 


10553 


509 


3 


LLTDRTLS CRSVGV P CS VRCQ CA \ LLEG 
ASOLiG\ SRGSGVRDPIiEEAVCPFPDIjOL 
RTGRTTALFKAVRQGHLSLQRLLLSF\V 
CLCPAPRGGAYRGTQASLSCGGLHPVRA 
SRLLCLPKQAWAMVGTPPP\TSLPPCSL 
ISDCCASSQRDS /VGVGPSKPGAGYNLV 
VRRF 


10473 


24374 


A 


10554 


262 


453 


GWGILTANLLVFGGGSLSISERI FFSPL. 
ATGFCS VAQAGVQVJ \ L 1 1 AECSLEFLGS 
SD 


10474 


24375 


A 


10555 


112 


372 


KKKGGGPPFFFFFIFFLIRQRKAKLWL 
AFNRRGLRKPD / 1 FY YAGVGKI VKKKYS 
GFFIELVTTCGKNYLMCALAVIDPRDSN 
I IRS 


10475 


24376 


A 


10556 


50 


358 


RFVCSTIKVLRDLSSDR5NPGRFLSTSN 
SSLY/EKDKRNKAYFTX/RPSPVNDI IS 
T 


10476 


24377 


A 


10557 


203 


350 


QLHADKVSRFATHAECRGVIMAYFNLNL 
P1I/FKRSSCLSLWSSWDYRHAP 


10477 


24378 


A 


10558 


98 


387 


I QDTDL I RGVYR YRD KKRAR EKDS PPER 
EGERERE I EKQRKKER/ERKKERKXEST 
HQKE 


,_10478 


24379 


A 


10559 


237 


3 


E V FFKNI FE I ES CS VTQARAQGQDHGSL 
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SEQ 10 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

pepuae 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=GIycine, H-Histidine, I=lsoleuctne, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 

— Vjriuidiiiine, tv- — Arginine, o — ocrine, 
T=Threonine, V=Valine, 
vv — i rypivpiMn, j — .tyrosine, 
X=Unknown, *=*Stop codon,/=possible 

mif*i0n^ifl^ nAlA^inti V — 'nACCinio 

IIUUCUUUC UcietiUIl, v — pusoiuic 

nucleotide insertion 














QPPTPG \ S SS PPTIiASQS A VI TGVNHHT 
QSGLTFQ S P KQ KALGKM CIS SGG 


10479 


24380 


A 


10560 


37 


359 


RL KCGG P V IAYCKLE I LGS SNT PTS TS W 
VAGTT/ TCHHAQ 


10480 


24381 


A 


10561 


170 


3 


RPFFLPWGGKKFQGVSFFFFFFLVFFEI 


10481 


24382 


A 


10562 


1 


388 


LPE PLMVLRTANLVAMAPGS RTTLL \LA 

LQAHRAHQLD I DTYHELEETY I PEDQKH 
SLLHDSQTSFCLSDSIATPSNMEETHQK 
SNTjEMVR I SI .TiTi T ETWLE 


10482 


24383 


A 


10563 


224 


392 


CVDPILKWFPLRTATRQGCPLS\LFNIV 


10483 


24384 


A 


10564 


185 


2 


RYFGGPDSPQNGKTVFNPLFFFFETRV\ 
HSCCPNESNGTISTHCNLCLPGSSNSPA 


10484 


24385 


A 


10565 


175 


324 


APG YGTKYRV I LCK I LLLRRLRHKNHLN 
PGGRGCSEPRSHHC/ITPAWVTE 


10485 


24386 


A 


10566 


3 


367 


HELLWSTIKVLPYLRSDRSNPGGVISTS 
ST 


10486 


24387 


A 


10567 


128 


391 


GETCEGVYGPQQGGKQTWKVFHSKKDEG 
irL-oi\I Wlv VoUjoorUr IrijivivVKrr-Pr r 
FETESHSVTRLECSGTPGT/HCNLCYPG 
TS 


JU4&/ 


OA ICQ 


7\ 




o< 1 
1j 1 


4&J 


LGYSGVRDPLEDAVCLLSDLKLCAGRTT 
ALFKAVRQGHLSLQRFLLPLFSY 


lA/jOO 

iU4oo 


*JA 1QQ 


7\ 




i /O 




r virr'XijJvi\i v ivjt'r ijAJviNjyaixx'r r r r r oEiPl 
ESRSVT\RLECSGA/TISAHCNLCLPGS 
SNS 


10489 


24390 


A 


10570 


212 


386 


RFVCSTIKVLRDLSSDRSNPGRFLFTSN 
SSLY/EKDKRNKAFFTK/RPSPVNDIIS 

1 


10490 


24391 


A 


10571 


134 


489 


SLFFLPSPRDRWSLTQSTEMPPSWKFPG 
REIRSPLLAMLESLLLQPTAFLCPSSTL 
LLSLKPDSLRTGSLFLSFFSFLYFEACS 
VA\RLKPSGTISAHCNLCLPSSSDSPAS 
AYIVSG 


1 A/1Q1 




T\ 
J\ 


1 ACT") 

1U!>/Z 


J 


4/U 


OSJoKvjXi-'ijijijI LiUijtltJXrr VoXjJLiIjO V liNLiX 

SKNSAQFSTILQTLSFPATFTPSPSIPL 
SSAYFFFFSDRVS /LLSPRQECSGLNLG 

HTTTPGLIFCIF/SVEA3FHHVAQTGLE 
LLSLSNPPASA/FPKCWDHRR 




O A*xcyi 


B 


■U_>/.> 




411) 


ITMr.PMnTYPQT.PNPTPPPPPTClQHPTaX 
v vi J_ji\.i Y J^ ± i r o i_iiiiN Jr JL I L rrLiuour \ 

RLECSGMIPAHCNIiCFPGSSNAPASASQ 

V 




7A1QA 




JlO/4 


ZJO 


1 


a PTFT? t h t t .K2\ c: mnwa t Tf;\rr .wwp atw 

>ixv i r r in i i-i iv/ao * — vjt ljin ,*-vx io u xjwrirCHtX vv 
CPSVLLDRDSLSERVCEILCIHAC/VMR 
I HEHTHT Y I YAHTHTHTHTHTHT 


lU4y4 


"lAICLG 

ZnSyD 


a 

ri 


i n^T<: 


O/l 1 


5j 


WAGTPG\SSDPPTSSSQ5AGI IGMSLNL 
FLQWLIIDQAGNTN 


10495 


24396 


A 


10576 


294 


72 


FKNFKS PLFS LFFFLI LGTR FCYVAQAG 
LK\SWPQGDPSAIASRVAGTRGVHRHTQ 
LQVSFNYYKVLAMHSGSQL, 


10496 


24397 


A 


10577 


1200 


1625 


PDGWSYSSGHSWLPEEASNGRKAWSSCW 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ JO 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

nprifi rl f* 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenylaIanine, 
G=Glycine, H=Histidine, 1-Isoleucine, 
K=Lysine, LHLeucine, M=JVIerhionine, 
N=Asparagine, PHProline, 

0=r^lntnminp T}= A roininp S=SpHnp 

T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 

nucleotide insertion 














VPPQLS PPQAP PS PATHLGPTPPLA/ S P 
PQAPPPPATHLGPAPPPPLSSAP\PPPD 
TLLGPAAPPPPSSGPDPFRYSP/EVPPQ 
LSPPQAPP 


10497 


24398 


A 


10578 


226 


325 


ERHTHTPHSHTRSHTRLTHTHA/YTHTH 
AHTHA/YTHTHAHTHFPS 


10498 


24399 


A 


10579 


108 


320 


tinvflkvnvmklsylrfntrncsfffl 
agvqgcdhgslqlqppg/sscdppssas 

RVAGTTGAYQHTQLI F 


10499 


24400 


A 


10580 


1 


288 


ARGERERERERERERERERERERERERE 
REREREREREREGLSFFFSRSRGGGKLW 
CVC /RPSARPRERERHAVFSA^PSLOKK 
KKVFGGREGTH S L F YPP CGNS LRR 


losno 


74401 


A 


J UJOI 


1 


476 


REWGLALSPRLECSWI IAHCSLGSLGL 
KRFSCLRLPSWKDYRCAPPCPSLNLFFV 
ET\GPHYVAPGWSLLLGSSD/PPPLASP 
KCRDYRHE 


10501 


24402 


A 


10582 


210 


346 


PHCS I HSQVR I QANVHKGHRQRTYGS V I 
PH I LPLHVLKKT / FSLRDF 


10502 


24403 


A 


10583 


316 


461 


LPN I LGWLFFVSETGFCSVTHAGVQWC 
DEHGSLQSQTFG\SSDLPTSD 


10503 


24404 


A 


10584 


172 


383 


NLDLYLTSYIKINSKWIIDLNVIAKTIK 
LLE \ KNVAGENI PDLGLGKEFLD\ KTQK 
ALS \ TNKKID KLNFTR 


10504 


24405 


A 


10585 


273 


374 


GPGTEAHTRNPSTLGGRGGAIT/ R/GQE 
FKTSLAKM 


10505 




A 


10586 


ISO 


^44 


KLKC K 1 1 1 KLG I K I S FAV I WVE I GFHF 
CYLGV/VQW\PIRTH?GSSDPPASASQSA 
GITSMSHHT 


J vJUU 


94407 


A 


1 UJO / 


1QX> 


•i 
j 


S FS Y I RVHLGHI KHTRHKAS CE VYANCK 
AKGI ITGIKKK/RFWPGTVAHACNPNTL 
GGRGRW ITRLV 


10507 


94408 


A 






1 

i 


PRGAPTCMRCQS AL \LGGVLQLGYTG \ V 
RDPLEEAVCP I S ELKRCAGRTTALFRAV 
RQGRLSLQKFLLPFLV 


10508 


24409 


A 


10589 


190 


3 


ISAQPWAAFEKRFIFIYRAPA/LKFMIF 
FSPFKGVPFFFTFFFFFEMESRSVARLE 
CSGVISAH 


10509 


24410 


A 


10590 


369 


40 


KTERNSININKNDVHTKTPSKGRQHQRP 
KADKSREMRKNQHKKAENSKNQNPSS PP 
RDHNSS PARKQNWMENEFDELTEVG/ FR 
KVGN S SELKEH I LTPCKETKNRTRGW 


10510 


24411 


A 


10591 


114 


398 


RFVCSTIKVLRDLSSNRSNPGRFLSTSN 
SSLC/EKDKRNKAYFTK/RPSPVNDIIS 
T 


10511 


24412 


A 


10592 


132 


1 


I LYLWCVC VCVCVCVCVYLLI L.V/ CKVL 
SCLGEPALSVGAFQMS 


10512 


24413 


A. 


10593 


3 


287 


IGFITLNVGLYLYLLCYIYVYFFQVNYI 
VFILVLAVFLLKNRLLRYNTALYNEEVT 
REKISHYFI/NSYIHTVGKINKND 


10513 


24414 


A 


10594 


2 


363 


RAEVGVMAPLYLLLLLSGALGLTDTWAG 
SHSLRYFSTAV^RPGRGEPRY IAVPYVD 
DTHFLRFDSDTAI PRMEPRKPWVEQEGP 
QNWE\WTTGY7AKANAQTDRVALRNLLRR 
YNQSEAGE 


10514 


24415 


A 


10595 


156 


3 


NGLELDS DDGC I ALLF FYFETE FCS \VA 
QAGVQWCDLGSLKPQPLGFKPTRP 


10515 


24416 


A 


10596 


80 


306 


MDEEKLFRDEQRKWFLEMESTGEVAVNI 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

(illllllU (IL.1U 

residue of 

nentirlp 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 

H ino irk 

last amino 
acid 

residue of 

npntiflp 

sequence 


Amino acid sequence (A-Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=GIycine, H=Histidine, I=lso leucine, 
K=Lysine, L=Leucine, M=IVlethionine, 
N^Asparagine, P=Proline, 

0=Gluf n m inp R=Aroininp S=^prinp 

\£ \Jt U Irtlll IIIC, XX r\t gill UIC, o OCI IIIC, 

T=Threonine, V=Valine, 

W— Trvntonhan Y=Xvi*o^ine 

X=Un known, *=Stop cod on, /=possible 

niirlpntidp rlplptinn \=nn«ihlp 

nucleotide insertion 














V/ BMP I KDLG/ Y YRNLVDKAVPGCERVD 
FNFE I SSTGWVW WLTPAI PVL 


j vj i u 


24417 


A 




7^9 




ILCVIVFCCCCCCCFETEFLVIQAGVHWC 
DLDSLQ/SPPPGSSDSPASAS 


10517 


24418 


A 


10598 


463 


154 


MGQGKPRFPLFPHPFYIFFFLKQGFSLS 
PRVOGRAKKVPGTLASKGOKI LLPOPPR 
/RVRQENCLNPGGGGFSEPKIPPSFPAW 
GEKGGSLLKNLKKKKTKHIC 


10S1R 

I UJ i O 


944 1Q 


A 


iosqq 


908 


JO 


LGPPQGFFKTAPPFFFFFETESRSLAQV 

gvqwwdlgslqlpppg\ssds PGLSKKF 

LL 


1 0S1Q 




7\ 


1 UCMJU 


700 


-x 

J 


CQPELSCMRC\RQSLLGGLSPSGGMEVR 
DPVEEAVCPLAELKRCAGRALLVRICCS 
LQSQQAGTFK 


10520 


24421 


A 


10601 


73 


427 


RWCSTIKVLRDLSSDRSNPGRFLSTSN 
SSLY/EKDKRNKAYFTK/RPSPGNDI I S 
T 


10521 


24422 


A 


10602 


193 


401 


GEVSLLSPRLECNGVISAHCNFR/LPGF 
KRFSFFFL 


10522 


24423 


A 


10603 


181 


395 


RFVCSTIKVLRDLSSDRSNPGRFLSTSN 
SSLY/EKDKRNKAYFTK/RPSPVNDIIS 
T 


10523 


24424 


A 


10604 


384 


10 


PRRPGGECVKCLGDSLVTDRRYSGLRSS 
DOTT . RnpMP A VP W <=l M c: T , c; C <3 H YR T .TfP A 

LJ\J JL JJOVjL/trriJjrtV tr TV ;dl>JO JJOV.OX1 X r\i_»I\r/\ 

SGITWGCKQAGFFSFFLFFKMESHSV\T 
RLECSGAISVPCNLCLPGPSNSPAPGGS 
LGPRSSRPPWTT 


10524 


24425 


A 


10605 


215 


379 


RFVCSTIKVLRDLSSDRSNPGRFLSTSN 
SSLC/EKDKRNKADFTK/RPSPVNDII 






A 


1UUUD 


i 


1 OJ 


QTHRE P AMVLS PAD KTNVKAD \ WGNVDA 
HAGEYGAEALERMFLSFPTTKTYFFR 


10526 


24427 


A 


10608 


283 


2 


IQKHGLI IRCPQETPFKNNDVGMQKVKG 
RGEKSIMOIL/YFFFFLNS /HLSPRLE/ 
CSGAISAYCNLRLSHSSDSPA/VDGTIG 
ACHHTQLIFLFLVE 


10527 


24428 


A 


10609 


144 


327 


CSWSWFCGCVRVSTLIKKKNFFFEMEFR 
S \VAQAGVQGRDFG I LKPPP PG \ SSDS P 
ASASRV 


10528 


24429 


A 


10610 


137 


1 


HKCPSTDEWDERV/WCMYTKEYCSALTK 
EEIMSFGKTWMELENIMI 


10529 


24430 


A 


10611 


201 


334 


sf;fwrfclrgvpsrvkcqca\llggasq 
lgysgvrdpleeavcpf 


lOSSO 

1 UjJU 


744^1 


A. 


10A1 7 


146 


7 


KTPLKKGLKRVPFFFFFETESCSVTQAG 
VQWHNIGSLQVPPP\GSRHSP 


1 UJJ J 


744^7 


A 




70S 


7SQ 


TTMPGLSFCTRTKSNKNLS FLTLDGSAR 
VDLFLYLLLFRTI FQNF/CGRDKVLLCC 
PGWSQTPELKQLICLDLPKYWDHRYEPL 
CLAYPFALGQNLTKI 


10532 


24433 


A 


10614 


153 


2 


GVLMVSDGVLSMLANLFFSFLFF/LFET 
QSHS I \SRMECSGVI SAHCNLCLP 


10533 


24434 


A 


10615 


213 


8 


KPPFFFPFFFLFFFLRWGSHCWSGY/C 
KGTIIVHCGLELLGSSCSPPAFQVSGIT 
GLCYCAS IFIFEE 


10534 


24435 


A 


10616 


146 


1 


ILTIRDYWVKITLFFFFEPKFHS/VLPR 
LECSGAISAHCNLCLPASSDS 


10535 


24436 


A 


10617 


208 


2 


NNFWFSSSGKYGQYFSSETESRSVTWGH 
DLYSLQPPPP\GSSD3PASASQVAGITG 
S CHHAQL I LVF I VE 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
fast amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=G lycine, H-Histidine, J=I so leu cine, 
K=3Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 

lu tarn ine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /-possible 
nucleotide deletion, \— possible 
nucleotide insertion 


10536 


24437 


A 


10618 


120 


2 


S FFF FFETE FR S VAQAGVQ WHD LS S LQ P 
PPPG\SSDSPSL 


10537 


24438 


A 


10619 


162 


1 


SFHWKLRLRGAPGCMRCHSA\LLGGVSQ 
LGYSGFWHPLEEAVCPFSDLTLCAG 


10538 


24439 


A 


10620 


193 


419 


TFFFFETESRSVAQAGVQWRDLGSLH/S 
QPGQQSETP 


10539 


24440 


A 


10621 


313 


460 


KPGLWRGTRSQKQVLFFEPESHSVA\KIi 
ECGGTISAHCNFCLPRSSDFSA 


10540 


24441 


A 


10622 


243 


505 


PTGCPKQEQVQAMLRPWFFFVFETQFN 
S/VPRVECSGTISAHCNLRFPGSRDSPA 
IiASRVAG I CRR / CATTAQLI FVFLVETG 
FCHLVG 


10541 


24442 


A 


10623 


67 


430 


LARRTWKNQHSSTHGCDLFFFFFFFFFF 
LIKKGVFFFPPEGGHGHRFFFINIKFLN 
YKRFFFCLFEDVLMI SV3PPLAFFFFFF 
WRGGFSYLLLRCV/LTFLALGAPLFAPM 
FFLFLFGEEL 


10542 


24443 


A 


10624 


3 


207 


PGGQLGSECSGVRMDEDVL\TLKSLI IG 
ESGVGKSSLLLRFTHDTFDPELAATTDP 
SSWNPARNAATP 


10543 


24444 


A 


10626 


151 


380 


KMLFGQVQWLTP I I P ALWEAE AGGS LE \ 
liRSSRPALKRSETSSPPSLPCQKKKKKK 
KFPRAGGQMLEVPFPGRVGAG 






A 


luUi / 


910 


j 


NFCQVKGFWKQKSSQIKGFGKNKKKKGG 
SFFFFFFETESHSVT\RLECSGTI IVHC 
SLDFPGSSDPPTSA 


10545 


24446 


A 


10628 


237 


2 


LCFLSGASCSPY I DVYTEVNGVNPTFI I 
LTNKTNTFGPLFLI I FFFGETESLS IA\ 
RLDCSGAI STHCKLCLPGSRHS P 


10546 


24447 


A 


10629 




352 


RGPLS IQDYISKALFFLLLFGFLLLLLF 
FEVRS CS LTQAGVQWCNHSHI SSLQS PT 
PG\RSPSDPPASAS 


10547 


24448 


A 


10630 


48 


200 


AI KPKNFLFFSETKSSS VTQAGVQWRDL 
SS PQPPPP \ S SSDS PAPAN 


10548 


24449 


A 


10631 


209 


376 


SFLWKFCLKGVPGCVRCQSA\LLWGASR 
LGYLGVRDPLYEAVCPFSDLKLCAGRTT 


10549 


24450 


A 


10632 


141 


1 


AGFFFFFLFFFLETESRFVA\R1AQCSG 
AI TAH CNLCL WGS SNYHAS 


10550 


24451 

- 


A 


10633 


18 


522 


PLYSLTKRHTPREA/KDNLKSTQLLSVI 
DAI SEGPI EGPVDGLKSVLLNSTPVLDT 
EGNTNISGVTWFRAGEQEQTPPEGFES 
SGSETVLGTEVKYDTPITRTITSAN IDR 
LRFTFGVQALVETTSKGDRNPSEVRLLV 
Q I QRNGG W VTEKD I T I KGKTTSQYLAS V 
V 


10551 


24452 


A 


10634 


127 


2 


NLSPLFFFFETGSRSVTQAGVQWSDLGS 
LQTPPP\GSRHSPTK 


10552 


24453 


A 


10635 


336 


452 


RI FFFLFETEALSVAPAVAQWFDLGSLQ 
SPPP\GSSESPA 


10553 


24454 


A 


10636 


259 


3 


GTRIFFPFNSSPLSFPPVQKKSVSPFFF 
S F/LFFFFETESRSVT\RLECSGAI S PH 
CKLHLPGSRHSLASATRNVLRPLWR IHA 
SCA 


10554 


24455 


A 


10637 


96 


413 


GDNTSQHSAILIN YF I FFFTFLFF /ETG 
SHS VAQAVRLE YSS VI TAHCNFRLRGS S 
NPPKKKKKGGAVLKDPWGGQSLPGLATY 
YFFPYRGANKNLLGDFWEGPLFV 


10555 


24456 




10638 


180 


473 


CYMGRKAWLVLSNLLTLKKKKKKKKKKK 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

dinino iiLiij 
residue of 
peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 

e\ i r> ft tn 

uing iu 

last amino 
acid 

residue of 
peptide 

aCljucui-C 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
KHLysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 

\£ — I IJ 1*1 Hi lilt:, iV — AT g 1111 lie, i>— OCi lllc, 

T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
A-unKDOwn, — otop coaon, /— possiDie 
nucleotide deletion, \=possible 

UUUCUllUC II lot: I HUM 














KKKKKKKKKKK/ EMPVKQKNRGRALYKK 
KKGGRP F P I FW P F F F F FFLGGGKKHPGG 
FFRKKFFFGGGKKKK 


10556 


24457 


A 


10639 


312 


480 


STVDSTSGEDDVNIAEMATKDSEDYINL 
V\GKAVAEFERTDSNFE\RGKMMSKSI 


10557 


24458 


A 


10640 


2 


516 


MKPLENLSQTASLARGATLLRPVLRRIiC 

L> Ij fir Lt\2 K r'.AAxl. M if Li KAKolJoAvjirJjx iarlil 

LPTSPLQKALLAAGSAAMALYNPYRH/E 
ERPR I STSTLDLGKLQSLPEGSLGRE YL 
RFLDWRVSPDTRAPTRFVDDEELAYVI 

y f\. I !%.£» V iZUL il^iLi X l.i L-iv3ll it X IN J. ±J\J& L V V Jrvrt 

FEAVQ 


10558 


24459 


A 


10641 


56 


455 


PLCSYIQFIYHSKFNSNQISLVHTIPLS 

TVFC I VFYWVFVHL I SVPMY I CVCLCTN 
VCVCV/CCLW 


10559 


24460 


A 


10642 


272 


33 


GRWFLFLGPAKYFLTGGRFPSDFFGPLK 

T MP P T? FTTTrTrPFnTP QVanan^fH WHTsIT . 
iiNrr r rrrrr rui Ciono vnsj^uj v nvvniNxj 

GYLQ/SPPPG\SSDSPAPAEFHHTV 


10560 


24461 


A 


10643 


285 


1 


MGNFLKRFFKNEKKFFFPILRAHPFNFF 
PGKSPFKTPRRALPLGVPPKHPFFFFFF 
FSETE SRS VARLECSGMI SAHCNL \NLH 

T Df 1 OPVODAT 

brbbbiorAu 


10561 


24462 


A 


10644 


100 


466 


FLLKFRLREVPSRVRCQSA\LLGPTSQI» 
blob v KU L Li&nJ\\ \~r r oUJji\JjV-/vKXli.N i 
LFKTVRQGHLSLQRFLLPFVQLCPVPRG 
GVYRGROAS LS CS GVHP VRAS P PLCLP K 
VPPPS 


10562 


24463 


A 


10645 


260 


460 


LKPHAERETIDKGRLPYYSFFFFETESC 
FVAOAGVO WHDR YS Tip P PP \ PGSEGSRA 


10563 


24464 


A 


10646 


360 


504 


QIGHI CAYVEKTELRLI FFFLFFFFRQS 
CSVT\RLECSGTILAHCNLCL 


10564 


24465 


A 


10647 


241 


45 


WEVEVPRVAPCHILKRDYYFI ILFLFLF 

PT7MPQVCPTV T?T T7P<3QaTC''TWr r NrT.PDPf3 

SRDSPGIPPH 


10565 


24466 


A 


10648 


76 


462 


FLWEYQEKAHILWSLFSKIilLSRSRFTV 
KEKVRQKPGILFIYLFIFEMESCSVAQA 
GVQWRDLGSLSPQKXKDS/DQSKAI TFV 
EGIN^KRGGWTGPHF /HCSLKMIFLI 


10566 


24467 


A 


10649 


3 


359 


QTQREPTMELTPADKTNVKAAWGKVGAH 
AGEYGAEALERMFLSFPTTKTYFPHFDL 
SHGSAQVKGHGKKVADS LTNAEAHEDYM 
LNATiTATiNDT.HAKKLWVDOVNFKlA LiSH 
CLLVTLA 


10567 


24468 


A 


10650 


266 


3 


TPEKKKKIGGLGAPFPGYPKKIWPPNFF 

EVESCSVAQAGVQWRDlxDSLQPSPP\GS 
RGSP 


10568 


24469 


A 


10651 


278 


461 


KKI ICGFFFDTGSCFVAQAGQQWRDLRS 
T.nc:PPPf;\ ^^n^T.T^a^RWRM^PMMRA 
VF 


10569 


24470 


A 


10652 


161 


456 


VFFFPGENIAFVENKTTLSYHFMFTRMT 
AAKKMENYKGCWGWETI GSFMPCWLES K 
RVQSLWXKVWQFFKWIxNVKLIiYGLAIPL 
TGICPKKLKKYVQGY 


10570 


24471 


A 


10653 


91 


486 


PCFNHGHTTWYMYFFSLSLILFFVFSLK 
PVILAYKTFSFPITIRVLLQRRHRQKFV 
LDGFinGVPQNNILGS I CVSLIKI FFLTE 



1192 



WO 01/64835 



PCT/US01/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

ftminfk ariri 

ilJJJIIJU ilLItl 

residue of 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 
peptide 

UtJ 1V.C 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E^GIutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, Wsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 

V<f • vjiu lit JiiiiiCj a — >» r gin Jiitr^ L^^rycriiitTj 

T=Threonine, V=Valine, 

W=Trvntftnh # in V=:T , vmQiriP 

X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possib!e 

II UtICUllUC IlloCl HUH 














SCFVAEAGMQWCNIGSLQA?RP\GSCHS 
PGISPPVLG 


10571 


24472 


A 


10654 


267 


470 


NLHMRCEKEQNFNKLS FKDLFGLGWLWF 
FIFCFWRRILTLSPRLEC/RG/MIFAHC 
NLCLPGSGHS PA ^ A 


10572 


24473 


A 


10655 


775 


1401 


TFPSQIRYGLRKRPNSLTLSPRLECSGA 
T c; AHM OR P P PfJ FT P P pT , c t , p o c wu \rp p 

J. ijruli I^IxiT CSr\3C 1 rr ijUDOUr OiDVin I Jvlv 

PPPRPAICVCVCVCVCVCLVETGFHRVN 
QDGLDLLTS /S/IPPASA/FPKCWDYRR 
E 


10573 


74474 


A 






d^d 


LPL I Y FL Y F YE TE CP P LAOAG VO WRDTjS 
SLQS?PP\GSNDSSASASPTVPTSAM 


10574 


24475 


A 


10657 


212 


64 


LFLSKFYFFEMESCSVAQAALQWCDLGS 
VQPLP \PGS EDS PAS AWGYLD 


10575 


24476 


A 


10658 


368 


473 


GFIDHTRRERPECP/LTDEWIKKMWHIH 
TTFYY^AT. 


10576 


24477 


A 


.10659 


358 


1 


ILAVFCFCGSAPGFEGGKLTPFFNRGRE 

TFLALVKTPPPGKARGPVFI S KNKKI PE 

FKOPPMPNPN^ F*RPFFPrTPQHT T ZXP AH 
l r\.\£ir iri» itlh r iior r r r r r d i c*orx x x.rtxvrtvj 

VQWHNLCSLQSPPPG\SSDSPGRWSLQR 
TEIAPL 


1 vJ / / 


9 4.47 ft 
z. e t t * Jo 




IUOOU 


L 




MHTETLIRTTMRYHLILIKVTIFKKTDN 
MTAVGVHI CNPNTbTGHGRG I A 


10578 


24479 


A 


10661 


9 


538 


CVTVRIPSRPTRPLSSDRSNPGRFLSTS 
NSSLY/EKDKRNKAYFTK/RPSPVNDII 
ST 


10579 


24480 


A 


10662 


374 


38 


S FL WKLHLRGAPGC I R CQ S A\ xxLGG VSQ 
LGYSGVRDPLEEAVCPFSDLKPRAGRTT 

TT.FK'AVRnnPT.QT.nKPTJ.PPVnT.r'PZiZlP 

GGVYRGRQASLSCSGLHPVRASRPLCSR 
R 


10580 


24481 


A 


10663 


268 


47 


ALIPLSSLTISAFHLLLLETVSTSSPKM 
ECSGAI TSHCS FNLPDSS PS PTSASR / V 
TGTTGARHOAOLMFTYFC 

x sj x x vj^vcvrj^j^^xji IE x x x 


10581 


24482 


A 


10664 


64 


451 


FSSERKSHMSLTLNQKLEMIRLSEEGLS 
KAKVGO KIjVR IjHOTVS OVVD AKEK T ,T »K T . 
IKSATPVNIGMIKQHNLIADIEKFGMIW 
TDCQTSHRTVLCQRLIQSK/ALTLFNSM 
KAERGKEAADEKLEVRRG 


10582 


24483 


A 


10665 


355 


474 


ILFYFI FFETESCSVAQAGVQWRDLGSL 
QAPPPG\SRDSP 


10583 




A. 


1 uooo 


Jl / 




nHTPPWOTFFFFT.FRDRVT.T.HHPf^WciAV 

<J1 1 X V— X r»\^X X X XI XJX £\XJX\ V XJXJiXIX ITV3 »V i^rt V 

TQSWLTAAiTXFGPKRSSCLNLLNDWD 


10584 


24485 


A 


10667 


57 


329 


VKN TQWG KDS LFNKRV \ FKNWAS I YRR I 
KLD \ LTS YAKINS KWI KDLNVRLE T VKV 
LQVE YPS FK I LGNGS VLDFVFFYSG I FA 
LHLMGEHP 








ll/OOo 


oO 




PWYK" T PT .Mf?f?KTtf FT .T .FP c: T PP YFFP T7CT? 
LGLTLLPRLKCSGDHCSLQPRPPGLKRS 
SCL\GFPKCWDYRNEP/CVPR 


IUjoO 

•* 








9^/1 


ZD 


rtQyirin/pas TYQMTr*rQQFWrt'nrjr , \r\7r: 

UOflxUAV r\rir}>x I OlM 1 L.!OiJ£inV3iA3tjV Vu 

GCSSGSTTHPSPVADSFFVWETVSL/S 
VTQAGVRWCDLSSLQPPPPGI 


10587 


24488 


A 


10670 


206 


3 


YGRPRKLCNIGLKSGFLDPTHWGQHRVI 
SFFFIMETDSRSVAQAGVQWHNLGSLHP 
PPPG\SSDSPAS 


10588 


24489 


21 


10671 


20 


355 


GFTSQSELLYSIDPITRPTDQVTIGISA 



1193 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/535,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Iso leucine, 
K= Lysine, b=Leucine, M=Methionine, 
N-Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S~Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possibie 
nucleotide deletion, \=possible 
nucleotide insertion 














QSYCRVHVHNRVYDLDVESGHPNGAPAI 
KGSVCSTIKVPGDVSSGRSNPGRFVSTS 
NSSLY/EKDKKNKACFTK/RPSPVNDI I 
ST 


10589 


24490 


A 


10673 


112 


359 


NKAQQCVHENHFKLKDANTLN I KVWRN I 
CHSSPNQKKYGLAILNLDKSGFRSRKDT 
GDEE/HFIKIKKSVIQEDFSIINIYA 


10590 


24491 


A 


10674 


317 


2 


TGPGFFPQIWKVFPPFPLKIFFTQKFLV 
SFWGFKTIKVGPFVFFPKGPQPRFFSPF 
LGVQAEKFFFFPFFFFFFFFCERGSHSV 
A\RLECSGAISAHCNLCPPGFX 


10591 


24492 


A 


10675 


240 . 


495 


DHRPEKTKSSTCQAGEVPLLGLFVPFRS 

stdwvrpafi/meg/hicptqstnsnin 
p vwkh phrhtqdnvwpnaw s phg pvklm 

HKLiI 


10592 


24493 


A 


10676 


271 


484 


npagqtcriksfffsffetescsvtqtg 
vqwkdhgsmqpqslgp\sdpptsanssa 


10593 


24494 


A 


10677 


364 


121 


kqewrslshnalcndqasplpgsghwk 
skqklskaapcagsskhkhl/hkecwkh 
ththththihthththihahtkdrfa 


10594 


24495 


A 


10678 


354 


539 


FFFFFFIVFEESHAIAQAGVQWCHLSSP 
HLR\SPPPGFN\DSPASASSAGITTLSS 
SVRL 


10595 


24496 


A 


10679 


123 


853 


RWSLCHPRLE\CSGTISAHCK/L/RAPG 
FTPFSCLSLPSSWDYSARHHARLIFFVF 
LVETGFHHVSQDGLDLLT/ sg /ippala 
/ FPKG WDYRRE 


10596 


24497 


A 


10680 


251 


30 


ASLGVSAPLCI HVS PCPPNGDLGKTWWV 
GGGFFFFEMEFHS/VLPRLECNGTISAQ 
CSLCLLSSSNSPASASRIPP 


10597 


24498 


A 


10681 


203 


1 


NFPAPGKLGPPRDSLKTAPPFFFFFETG 
SRSVA/EAAVP/W/CDLDSLQPPTLPDS 
SDSPNSASGVAGITG 


10598 


24499 


A 


10682 


237 


397 


DSLTLSPRLECNGS ISAHYNLC/RLGSS 
NSPASAS 


10599 


24500 


A 


10683 


8 


388 


LYMCVJFRPGFLAHNSHDKGYSLTLSCWG 
ASGLKKQP/ CRLSEKKKKKKKKKKKKKK 
KKKKKKKKKKDS 


10600 


24501 


A 


10684 


201 


463 


IYWEDIVQGIVADRLTSRKHNSKFLGEC 
LMFGQGFLFFEMDSHSAARAGVQWRSLG 
SLRPLPPG/SQVILLPSSDSPALASRVD 
EIAGV 


10601 


24502 


A 


10685 


151 


1 


PEKNRKTQKGAPPFFFFFETGSPSVAQA 
GLQWRSHSSIQNEPP\GSSDP 


10602 


24503 


A 


10686 


165 


3 


CVFVEDI I SNFFFFFMETESRSVAQAGV 
QWREHGS \ASRVAETTGVHHHAQLI FR 


10603 


24504 


A 


10687 


292 


384 


NTE I GWVWWLTTVI PALWEAEAG / E I TR 
SG 


10604 


24505 


A 


10688 


161 


365 


RPPCW I RTSRWCWR Y YRFDGST I KVLRD 
LSSDRSNPGRVLSTSNSSLY/EKDKRNK 
AYFTK/RPSPVNDII 


10605 


24506 


A 


10689 


286 


3 


QPSSVWSFRSVICAPAPFSGMGLSPELQ 
QCrVGNFASRYTMTKSSAVLFIIiIFSLI 
FKLEEL/REAPASLVSFLPPQMIiRIKWE 
SEQWLVLLCDRG 


10606 


24507 | 


A 


10690 


169 


2 


SDTPAWWPRKHVFPPGFFFFFETESRFV 
T\RLECSSTISAHCNLCLPGSSDSPASG 
S 



1194 



WO 01/64835 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQIB 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

5] m inn opJH 

residue of 

nentide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 

r] inn tn 

UJffg CIS 

last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=GIycine, H=Histidine, I=Iso)eucine, 
K=Lysine, L=Leucine, MMMethionine, 
N=Asparagine, P=Proline, 

0=f»lnfan> in** 12 — A roininp S=Sprinp 

\JtU 111 lit If JC^ f\ — /^f g Iff IIIC, xj^ijd IIIC, 

T=Threonine, V=Valine, 
W=TrvntODhan Y=Tvrosine 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, Impossible 
nucleotide insertion 


10607 


24508 


A 


10691 


240 


406 


DGIRRLSFFFFFFFETKFGFVA\RLEGR 
GTI VfVNVNLHLPGS RDSFAS ASQVAG I T 


10608 


24509 


A 


10692 


343 


3 


QVPIiNLSLHLKLGMLFPGCINGSLSQFL 
QFFAQISPYQTGFPSFYKTS I YHNSLHQ 
YSLSFLPYFIFLFFFSFFFFLKQSHSVS 
\RLECSGMVSAHCNLRIiTATSTTQVQVI 
LV 


10609 


24510 


A 


10693 


196 


3 


YDAGHTKKTLKFPTVYPFVLFCWWECKM 
VQPRWKIVRR/FLKKLNTKLPYDPAIPF 
LSVHQKELRT 


10610 


24511 


A 


10694 


245 


391 


KTDYQPGAVAKFGRPRREDHPRLGVQDK 
PDQLEKPVSTKNTKLVW/SWVJL 


10611 


24512 


A 


10695 


252 


389 


KTGGKVLFFFFETESRSVAQVGVQWRAL 
RSLQPP\QPGTSDCPASAS 


10612 


24513 


A 


10696 


385 


1 


P PNKAKM I S S KDNKNLHWG KDTLLMKW C 
WESWIVTCITMKLDSHLSPYTKINPKWI 
KD/LKTI KILGENI KKTVLDIGLHK\NM 
SKTSKAMTKILDLI KLKSFCPAKEI ISR 
VNRKSTEWKKVFASYLSN 


10613 


24514 


A 


10697 


296 


430 


KHIQARRGGS /CGN/ RQHSGRPRRADHL 
RSGVREQPEQPGEKPHL 


10614 


24515 


A 


10698 


280 


468 


DYLCL/SLSI YLS I YLSIYLS I IHQFIY 

I1JLJJL JL JL J X -UJ O iri i-l J kD 1J X L L ± IV VI ±Ji^s 


10615 


24516 


A 


10699 


15 


393 


RSVGVLGPVRCQCA\LLGGDSQLG\SQG 
SGVRDPLEEAVCRFPYLQLCTGRTTALF 
KAVROGHLSLORLIiLS F\ VWLCPAPIGG 
AYRGRQASWSCGGLHPVRA/SMLLCLPK 
EAWAMAGAPPPASLPPLS 


10616 


24517 


A 


10700 


176 


1 


DWTTNFTTFLYNFKPSSLMPYLSHLFKT 
LR/MW pgavaharn P S TLGGRGGW I MRS 
GDE 


10617 


24518 


A 


10701 


494 


80 


FNKKD IHS DTPS EGHQLQR PNVE TL K KM 
GRNQCKKGENPKNQNASSPKDHNSSTPR 
EQNWMKNESDELIEVGFRRWVITNSSEL 
YKG\DVLTQCKEAKNLENRLGKVLTRI T 
SLEKNGHGLMEVKNIAQELCEASAGWR 


10618 


24519 


A 


10702 


264 


410 


KKGPLFTPPGGGGG/ PQKKPPGPLNPGG 
QRDSSFFPPPGGGNTGETPPGG 


10619 


24520 


A 


10703 


125 


3 


NRGNKGQV/QWLMPVIPALWEAEAGRSP 
EIRSRDQPRQHGETL 


10620 




A 


10704 

IV// v*t 


184 


o 


RLRVLAPCRHLPRAPRTWPKRPFFFSFF 
LFFPQTQSHSVA\RLECSGAISAHSNLC 
FPPTRP 


10621 


24522 


A 


10705 


187 


406 


LFLWKFCLRGVPGHVRCQSALXLGGASQ 
LGSSGVRDPLEEAVCLF5DLQLRAGRTT 
TLFKAVRQGHLRLQRILL 


10622 


24523 


A 


10706 


82 


410 


ILRGKFGKHYFNRIDWREALRQSLSLFN 
FIIFSNFliASLHKPEMETELKGSFIELiR 
KALFQLNARDASLLSTVDSDFS FCRKFS 
R /CSKCGO 


10623 


24524 


A 


10707 


390 


2 


SKDCRTAKI AACS FLWKLRSRGAPARCK 
PELSCMRCLSA\ llggvsosggtg TRDP 
LEEAVFPLAELERCVGRSAALFRASRQE 
HLSLLKMHPQLPLPSGALSQADGSFIYK 
PLTGAS AFL S EMP CQERR 


10624 


24525 


A 


10708 


229 


392 


YSWWRQMHSVAHAGVQWCD/LVSLQPQP 
PG\SSDPPASASLAARTTGAGHHNQLIF 


10625 


24526 


A 


10709 


193 


2 


RYJjCYQDHVI llllllllflrwsltlsl 



1195 



WO 01/64835 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q^Glutaniine, R— Arginine, S = Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














RLECSGTIMAHCSLDS\SSNPPI SASQA 
AGTTGACYH 


10626 


24527 


A 


10710 


180 


11 


LLFFFETRS /HVSQTWQCGMISGHCNL 
RLSGSSDPPPLASRVAGTTGKHQNIWLS 
R 


10627 


24528 


A 


10711 


234 


408 


MKLIMLKVILFPQIHLQIQPNFYQNIFF 
FETELCSVAQSGVQWHNLSSLQHPPP\G 
SSD 


10628 


24529 


A 


10712 


312 


470 


TGPKC VT \ RLE CS GAI TAHCS LD FAGLS 
TSPTSVSQVSGTTGTWMKLETI ILS 


10629 


24530 


A 


10733 


64 


392 


PKMV IRIS SET SLY AShVLQMKGQRQK/ 
CEPQPKKKKKKKKKKKKKKKKKKKKKAR 
G 


10630 


24531 


A 


10714 


169 


427 


NNQKTNNKMVGVS FYLS III LNVNELNA 
P IQRHRVAEWI KKEKKKKK/DPGICGLQ 
QTLFFYEDP/HDPLRLKIGGW/RKYYPS 
RGTQIK 


10631 


24532 


A 


10715 


176 


410 


ARSSWGLHTAVPVAFFSLRFLLEIFLDR 
DTRCSPPAAFFFETGSCCVA\RLiEGRGA 
ITAQCSLNLLGSSNPPTSASRVA 


10632 


24533 


A 


10716 


234 


389 


NSGNMDR YKDVQN I I QNP I CW PGT VAHA 
CNPSTLGGRGGQIL/RGQEFETSLA 


J VVJ J 




A 


1 071 T 
i\j 1 1 1 




£. 


GPFPPHPGGFLRGFFVPNTI PPPTFFFF 
FFLRRSLSVAQAGVQWCDLSSLQPLPPG 
P\SNSPCQ 


10634 


24535 


A 


10718 


201 


] 


WPLFL PKM FFFFKRFPHMGPS PWVGAQA 
KKKKI FFFLTGSHS VT\RIECSGTISAH 
CSLNLPGPSH 


10635 


24536 


A 


10719 


195 


419 


EYHTSLVTCGNPCVYRSNNKLNQTSRRK 
VITKIRAELNEIETEK/LQGSGETKIWF 
FEKINKTGLELLGSSDPPVW 


10636 


24537 


A 


10720 


218 


407 


GKKNLAFKKKKRKEKKRKKSPSQSNMNS 
AKIEARTWI KLWKHGWKNCE 1 1 DA\LQ 
KAFGDNA 


10637 


24538 


A 


10721 


231 


409 


GTQLHLGGFFFSEKELSFCFWF/MFALF 
EMEACSVT\RMECSGTVLAHCNIiRLPGT 
SNSSA 


10638 


24539 


A 


10722 


40 


401 


PLCPSESSGNTLMASSDPSTPAVPPPNT 
THPPLCLSKSHLPLRPKQGLPSGNLLQL 
PLTLLI PLLGAPVACWQLPQQCTLSTFF 
FETKSHPVAQAGVQWCCLGSLQPLS PG \ 
SRDSPDSAC 


10639 


24540 


A 


10723 


210 


14 


HVMGLLLLFLNKLTVNNFGWLAGFWCLG 
F FWFFS F FETRSG S VT \ RLECSGMI S AH 
YKLCIPGSSH 


10640 


24541 


A 


10724 


347 


462 


TFFFFFFETXSRCDI QAGVQW CDLCSLP 
PSS\PDSSDCP 


10641 


24542 


A 


10725 


169 


386 


DKKQAKT I KWGKNS FSNKWCWNNC I ATG 
KRMKLDP\YLTPYKK/ INSKWIKDLSI 


10642 


24543 


A 


10726 


257 


54 


PLFFFFKRQGLTLS PRLECNGTVTAHYN 
LKLLGSRDLPTSASP\SAGITGISNPAR 
LFTYLNPTHLQT 


10643 


24544 


A 


10727 


2 


401 


NNYDRAETQIYQYMCLNPTFYCLQETHL 
TCNDIYRLKVKGRREIMQIENKRVGVAI 
L VS DKTD FK P TTVKKKLH Y I IKGSI QPE 
DLIILCTYSPNIRASRFIKRIPDLRKEI 
A/HTVKVGDFS I SLNRLSRQNT 


10644 


24545 


A 


10728 ! 154 


i 


PMCSLLGLSKGGII FFFFFEAEFCSVAQ 
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ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E-Glutamic Acid, F=Phenylalamne, 
G=G!ycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q == Glutamine, R— Arginine, S^Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














AGVQWRDLGSLQPPPPG\SSDS 


10645 


24546 


A 


30729 


34 


151 


PRP PVP SLLDRGRLQLWRQRGLRHRAHS 
NGF I GGKQQ I MKLLKNYVRRP VG /VAVA 
IMFDPDPRYPRSWIEDDFNYGGSVASAT 
VHIRMGSLVENNKS 


10646 


24547 


A 


10730 


2 


408 


VFLLTVRTLI CRSVGVWWRSTPDLVCLG 
ISSGGCRTANIGLYQMLLPDHSSGSFCL 
RGVPGHVRCQSA\LLGGPSQLGYSRVRD 
PLEEAVCPFSDLQLHAGRTTTLFKAVRQ 
VKLILQRFLLRFVWLCPAPRG/GVYR 


10647 


24548 


A 


10731 


274 


89 


EN I P IVFNPPVCS PLLWQPQE / CEYPKL 
CKLCLKLHKHRVFY\FLFFFFETEPHSV 
ARLECSG 


10648 


24549 


A 


10732 


251 
i 


1 


TFQMMQKCFSHRKIFHNLLDKASYKIVY 
KEDPFSPSSLSSSVSLKNNFFFLETESC 
SVTQAGVLWCHLSSLQ P PPPG\SRDS P 


10649 


24550 


A 


10733 


375 


1 


APFPPLWVRGSPPFSPLVFPHPKLKPPL 
AE I LGFFKERKWGS IRKPCLFKVKKLVS 
VWPGLELQIFKFIGEFPFSFPSLVGRKP 
NFFLGPVFFFFFEKESRSVA\RMECKGT 
ISAHCNLHLPGP 


10650 


24551 


A 


10734 


82 


398 


SFLWKLRLRGVPSHVRCQSA\LLGGASQ 
LGYLGVRDPLEEAVCPFSDLQLPAGRTT 
TLFKAVRQGHLNLQRFLLPFVQLCPASR 
GGVYRGRQAS LS CGGLHPVRAS 


10651 


24552 


A 


10735 


2 


341 


TFCAISWLESGVEDGPRSRISYRISTFF 
FFFKGAPEPKEVRGRPKPGLAPTSLPGA 
HGTTGLSGWTTPSIGGWQPPPPPRENPK 
GEH P PAP VAGDT FQ\ P KKP P P I LKVF FP 
K 


10652 


24553 


A 


10736 


171 


1 


RVFYLLAFALFVDTGS P F VS QAGVQ WCD 
HSSIQSQTSG\SNDPPASAYRVAGTTGV 


10653 


24554 


A 


10737 


190 


1 


EHKTTVRKI PKMEEMADSGSNMLRI 1 1 F 
LFLFFETKSCSVT\RLECN/GSISAHCN 
LHLPGSSNS 


10654 


24555 


A 


10738 


252 


413 


GLLGLQNFSYKVHLTKAHLKKCSWLDAV 
AHAYNPITLGGRGEW/IQEFQTSLTNV 


10655 


24556 


A 


10739 


101 


249 


AGSTSRCI /QELSDLLEHLEQENCLNPG 
GRGCSEPQSCHCTPAWVTETQKK 


10656 


24557 


A 


10740 


185 


403 


LGLPVHTMKSNYHYHLCHHHHHLQHHHH 
HHHCHCHYHS /HHHNSQHPPPPPPPPPP 
HHHHHHHHHLP 


10657 


24558 


A 


10741 


119 


425 


IKFHLHS MERTHLFLWNCWAHKQRNCFK 
ADFVI TDDI KQLCPPQSWWTRAGKLPLG 
AGRGGSHMLSHHFWRPRHADILYLGVLD 
QPGQQGETPSL/LKNTKSSW 


10658 


24559 


A 


10742 


112 


2 


GETFFFETGSSSVA\RLECSGAISAKHN 
LHLPGSSKSP 


10659 


24*560 


A 


10743 


188 


3 


PPPLFFFFFFFFFKTGSGSAT\RLECTA 
HCNCCLPGSSHHPTSAYQVARTTDVCNH 
AWLIFV 


10660 


94561 


A 


10744 


176 


3 


SLYSKNKNPSHLLFLPIPIKFFFFFFET 
EFRSVAQAGVQWHDLSSLQPPPPG\SSD 
SP 


10661 


24562 


A 


10745 


250 


1 


GPRRI FFLKEFYPRFVFGKNPAPGGFFS 
GGKKPGPFPFNPRPIKFFFFFFETEFRS 
VAQAGVQWHDLSSLQPPPPG\SSDSP 


10662 


24563 


A 


10746 


155 


3 


PPHPFPFNPLPIKFFFFFFETEFRSVAQ | 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoIeucine, 
K=Lysine, LHLeucine, M=Methionine, 
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O^rihitftminp R=A rorininf* S=Sprinp 

V£ V* • U trtlll 1 11 C, i\ All gilllllG, kJ Utl lilt*} 

T=Threonine V=Valine 

Jl M 1*1 t-Vll lilt j ▼ ▼ 4«I1IJV} 

W=Tryptophan, Y=Tyrosine, 
X— Unknown * :s =Stf>D endon* /=nossible 
nucleotide deletion, \=possible 
nucleotide insertion 














AGVQWHDLSSQQPPPPG\SSDSP 


10663 


24564 


A 


10747 


157 




TPRPFPFNPPPLKFFFFFFETEFRSVAQ 
AGVQWHDLSSLQPPPPG\SSDSP 


10664 


24565 


A 


10748 


157 




TPRPFPFNPH P I KFFFFFFETE FRSVAQ 
AGAQWHDLSSLQPPPPG\SSDSP 


1 UUUJ 




A 


10749 




2 


PPPLSWTPAPLRFFFFFFETEFRSVAQ 
AGVQWHDLSSLQPPPPG\SSDSP 


10666 


24567 


A 


10750 


157 


I 


PPPPFLFFPRPLKFFFFFFETE FRSVAQ 
AGVQWHDLSSLQPPPPG\SSDSP 


IUOO / 






lu/JJ 


1 S5, 
i j j 


-> 


NT F P FF FTPLT P KF FF F FFE TE FR S VAQ 
AGVQWHDFSSLQPPPPG\SSDSP 


J l/OOO 




A 


1 U / J 4. 


86 
oo 


H\J\J 


S FL WKLHLRRAPGCMR CRLA \ LL.GGVS Q 
LDYSGVRDPLEEAVCPFSGPKL.HAGRTT 
TLFKAVRQGHLS LQKFLLPFVQLCPAPS 
TGVOEGROASLSFGGLHPVOSSR 


10669 


24570 


A 


10753 


364 


469 


GQFLHSLD/ SHWKKSEDFCFLWFPHSEN 
VSAIHQDH 


10670 


24571 


A 


10754 


250 


61 


GEKKKKKPRVFFFFFFFETEPGSVT\RL 
ECSGVLSAHCNLRLPNPNDSPASASRVA 
ASAKLG 


10671 


24572 


A 


10755 


66 


446 


SFLWKFCLKGVPGCVRCQSA\LLGGASQ 
LGYSGVRYPLEETVCPFSDLKLRAGRTT 

GEAYRGRQAS LS CGGLH P VRASRPLCLP 
TQALAMVGAP P PGS L 


10672 


24573 


A 


10756 


252 


482 


RL PRQP VRKWVAGVRGCCVWGWS KSLQ 
/TRLEGNGEI S AHCDLCLPGSN 


10673 


24574 


A 


10757 


205 


488 


PLESLRSPLGPLSWRTASSVSPVICIYY 
TYIYIYTYTHTHY/HIHTHTHTHTHTHT 
QYI FVC WFLKDRVLLCHPRQSAGARS W 
PTRTSASQRQEM 


10674 


24575 


A 


10758 


272 


471 


YSYVLFFIFLGIESCSV/AFSAGVQWHN 
HSSLOLOTPGLKOSSHLSLP /ASASPVA 
GTTGMRYHARL I F 


10675 


24576 


A 


10759 


17 


342 


GTLSSGAQVLIGRIESIVWGLKPWALG 
GCPS PRAVHWLLASSDWRPSLQDGAEGW 
KKGEANGNHKR/GIAVISDQIDFKTKTI 
KGD\KKSHYVMIKGPIQQEAITIINI 


10676 


24577 


A 


10760 


42 


498 


EFRERGREKEREKERKTEERMEDRERKA 
EREREKEROEGREREREKGRORERERKR 

LJ 1 VJUl Xv JL_» IVlJIV^xJVJlVlJ 1 Vi-^lVJUJl VVJ A\y 1VU -.\. i_il\ AVIV 

ERERKRERE/RHEPGSL 


10677 


24578 


A 


10761 


1 


305 


AS WDD PAHNNNFH I PGGWAHFFFCRLL 
DCPPWAPRPPAPR/VLL/TPAAAAAAAA 
AASRPEKKRAEENVGAIPPRKRPPRWEE 
RRG PRKGS AP PGRGAGRRAR 


10678 


24579 


A 


10762 


364 


2 


FPQKQNGQYLPLFPKPTGLCGKGKTRLG 
FFLNYFGKKKKKRGREIiGFPPFCQIWGT 
S I YRGSMG.YFFFFFFWRQVSLALLPMLE 
C\RGMIRVHCSLDLVGPSNPSTSASQIA 
RTTGTCMCHH 


JUO/7 


•j a con 




1 U /Oj 




404- 


R FVCS T I KVLR DLS SDRSNPGR FLS TSN 

JTvL V fc-J JL JL IV V J»H\l/Uw lw J—* luJJl» Jt \JTJTv JL JL-JkJr JL Uli 

SSLY/EKDKRNKAYFTK/RPSPVNDIIS 
T 


10680 


24581 


A 


10764 


208 


420 


S FL WRFPLRGVPGHVR CQS A \ LLGGAS Q 
LGYSGVRDPLEDAVCLLSDLKLCAGRTT 
ALFKAVRQGHLS LQR 


1068! 


245S2 




10765 


319 


85 


GKLLNNNRCFGSKFQGIMFFFGETESPY 



1198 
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amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leu cine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possibie 
nucleotide insertion 














ITQSGVRWHDFGSLQPP\PPGSRDSPAS 
AS \ RG I TG ARKHTQ P I FLL VSNS 


10682 


24583 


A 


10766 


122 


3 


KFFFFFETEFRSVAQAGVQWHDLSSLQP 
PPPG\SSDSPSL 


10683 


24584 


A 


10767 


240 


86 


RAPPFFFFEMESCSIAQAGVQWLSLGLL 
QAPPP\GSCHSPASASCELMFSKI 


1 UUOH 




A 


10768 


244 


85 


RAPPFFFFEMESCSIAQAGVQWLSLGLL 
QAPPP\GSCHSPASASCELMFPKI 


10685 


24586 


A 


10769 


35 


296 


E VKS PSARQPPRLGSEEHLRPAAAPSGR 
EVGGQPPPGQPPCPGGEG/PPPPGSPDW 
E VRS P S ARQPP RLGGE PNS S LRTGHDDD 
GGFV 


10686 


24587 


A 


10770 


260 


484 


MDEELLLMNEQRTWFLEVESSPGEDAVS 
I IQLATRDLEYDLNLVEKGAAG/LERKH 
Y S FE RS S T VDK I LSHNTAC Y 


10687 


24588 


A 


10771 


248 


393 


TQEGKKLINWPGTVAHACNPSTLGARGG 
RTTRGQELETK\LANKIKPCL 


J 0688 


24589 


A 


10772 


239 


2 


S FLWKFCLRGVPGRVRCQSA\LLGGASQ 
LG\SQGSGVRDPLEEAVCPS SDLQLRAG 
RTSALFKAVRQGHLSLQRLLLSF 


10689 


24590 


A 


10773 


158 


455 


LFFTLCPSLLQHIAVMLELGLKGPKSIQ 
PIFWVFLQGTEP/HFLVTPVR/CCLPLL 
KLFCLLVFGMESPSVPHAGECSGVISAH 
CNLCLLGS TDS S AS PS RV 


10690 


24591 


A 


10774 


166 


403 


KKTFLGE PLFWGGAKKKKPGKKNPGFFP 
RG/IKPRVFFSRFFFFGPPPKKGFPQKS 
FFLKSLPGFFLFGGCPPPFFFFFFFFEM 
ESCSNTRLECSGVILAHCNLCLPGSSDS 
P 


10691 


24592 


A 


10775 


208 


1 


RLCFFYFRKALLGKAQIKNIAFFPRKGS 
FFFFSETESRSVAQAGL/LDCSGAISAH 
CKLRFPGSRQSPAS 


10692 


24593 


A 


10776 


192 


29 


I FRKEFPCLNFFLLFFETESRSLAQAGV 
QWRDLGSLKAPPPG\SRRSPASRDTGV 


10693 


24594 


A 


10777 


275 


490 


KPGFFLLQMAKVYM I FFFFEME FCS \VA 
QAGVQWHDLGSKQHRPPG\SGDSPSCLP 
SI/AGIPG 


10694 


24595 


A 


10778 


340 


3 


LKVPVGKPRSTLLGVKKVPFFGNKRKKF 
LALFFFPRPPPGEGFSTAFLAQKPTPRV 
VPALGFPKNQSPLSPFFFFFFETKSCSV 
TQAGVQWCGLRSPQPMPPG\SSDSPASC 
L 


10695 


24596 


A 


10779 


313 


1 


ANPFGGPSGGDPFSSRVFPPPGPKNETP 
FFKNKTTATKKTGNKGGWAPSPHRGGPK 
KG PALWDKXGKNLWPFFFFETESHS VT\ 
RLQCSNTILAHCNQCLPGS 


10696 


24597 


A 


10780 


116 


422 


ILEDTNIQTIETLLAIREVQIQTTLRNH 
FTLTGMAI I /RKTDNNKCWRECGKIETL 
I CCWRECKMLCNEVGTALEI VWQFLQSL 
NIELS YD LAMPGFIIFPRE 






A 


10781 


120 


41S 


TQTTGAPQLHLASRWLSRSGGLTSSPQE 
IPKLFWSIESPLGSSKHLSLQVFVCLFV 
CSFVFEMESCSVARLEC3IVISAH\CTL 
HLWGSSHFHASASRVA 


10698 


24599 


A 


107S2 


134 


1 


SS FFFLCQTESCFGVQWHDLSSLQPSTS 
RAQA\LSLPSSWDHRR 


10699 


24600 


A 


10783 


2 


435 


CSHRGDS SS YSQLSGIRAGDLGGGGKDI 
FRLLPTTLNI FAGKES YDWCVTHERMC 



1199 
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Predict- 
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nucle- 
otide 
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ding to 
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residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine y D=Aspa rtic Add, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Iso leucine, 
K=Lysine, L=Leucine, M-Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R = Arginine, S^Serine, 
T=Threonine, V=Valine, 
W— Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possib!e 
nucleotide deletion, \=possrb!e 
nucleotide insertion 














hC/S CLF I YMCLCMC VHV / H Y I HE ACVF 
MCLCL 


10700 


24601 


A 


10784 


432 


161 


FSRAG FH / RVSQDGFDLLPS \ CLPPLGL 
PKCWDYKREPPRPAWKLMCRQVHRKCTW 
LFIQIGSTLFKTNGGLSAVAHAYNPNTW 
GGRGERIA 


10701 


24602 


A 


10785 


270 


442 


NVRLRLGLALSPTLECRGTIMAHCRLDF 
PGLMQSSHLSHRVAGTTG/TCHHAWLIF 
KFL 


10702 


24603 


A 


10786 


12 


424 


LIQLPRLECSGALIARCNFTY/SLGSGD 
PPTS ASQ / VLKTTGVCHHAQL 


10703 


24604 


A 


10787 


350 


3 


DSSSVQTNKGIPGQVLSDILSCAVKASV 
LVI I PLHNALS S WLVPALFLWKS WQVGK 
SQQAHSSVSGLC/MHYRSHTHAHTHSPH 
RHRKT YTYARAHTHTHTQMLSAYLPS KQ 
PSGSLS 


10704 


24605 


A 


10788 


155 


3 


HAFFALC I RNRLECNGVILAHCNLC/RL 
LGSSDSPVSASQVNG I AGACHHAQL 


10705 


24606 


A 


10789 


188 


2 


RRDLSSLQPPRGGQ/MRGCIYTHTHTHT 
HTHTHTHTVHWGWGKRHHVPKGKMSANG 
ESGRAK 


10706 


24607 


A 


10790 


19 


405 


I RPT I SRVERG INSLVASEGQRL PWDG I 
ACSQGLWVQQTWGPLGPFPS LLGMPHR 
PTFRDLNS EPAPGVANVSGTLSTPLPGA 
SHGLLVFF FETESHF VAQAGVSWGDLRS 
LPPPPPG\SSNSPVSAS 


10707 


24608 


A 


10791 


157 


1 


KPGPFPFYPGPLKI FFFFFETEFRSVA\ 
RLECSGTI SAHCNLHLPGSSDSP 






A 


10792 


157 


1 


QPRPFPFNPRPPKFFFFFFETEFRSVAQ 
AGVQWHDLSSLQPPPPG\SSDSP 


10709 


24610 


A 


10793 


83 


387 


S FLW KLH PGGAPACMRCQLA \ LLGG VS Q 
LG YTG FRD LLE EAVCP FS ELKHHAGRTT 
AVFSAVRQGCLSLQKFLLPFVQLCPAPR 
GGVYRG/RQALLS CHRLHP V 


10710 

1 \J 1 1 V 


7461 1 


A 


10794 


204 


419 


KGVYGHSGS FSPPAPIiACFRDKTLFPVS 
LWKEFVHSPWCKCTLPQPLWKTVWRYLK 
NFKME \ I PYVPEI PLLG 


10711 


24612 


A 


10795 


207 


2 


RRGFTMLVGQNSLDPSTSRSAHLSLPKC 
WDYRCKP/ PAPS/LREGFSYLLHSLLHP 
TPAPGSHHiiWAALIC 


10712 


24613 


A 


10796 


2 


237 


FFFLRER I LLALS PRLECSDANMSHCS L 
NLPG/FSQSFCLSHP\SRWDHRHMPPYP 
VKFFGI FVGDRVLALFPKLI S YI I 


10713 


24614 


A 


10797 


342 


40 


DRVFFCS PRLECSGAI IARCS / LRTPGL 
KQSSKLSLPKCWNHRCKPPRPARCSLNE 
SHSAKKWPGQSTDSSACEEVQPTSPFPT 
VP YPTLLS PSAGHGRGR 


10714 


24615 


A 


10798 


209 


2 


CSVLAVILNPECIYWPWLELLAHDPIP 
RLVCPKS F FFETKFCS VAQAGVQWHDLG 
SLQPPPPG\SSNSP 


10715 


24616 


A 


10799 


142 


2 


I FFFFFEMESRSVARPGVQWSDLGSLPP 
PPP\GSSDSPASATPSPMQS 


10716 


24617 


A 


10800 


276 


2 


I LP 1 1 IRNTCCCFFLKE 3RSCS VTQARV 
QWHII/GSLQPQTPG\SSNPPASAFQVA 
TGAHKHTSLIRNTLIIINWEKNTHKLSG 
CTSMHFRFL 


10717 


24618 


A 


10801 


12 


364 


LHHYKTVSGIYKCLCVLNGPPTRFFPDF 
LPPLGLPySLRHNNIEISPIKNPPIASK 



1200 
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location 
correspon 
ding to 
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peptide 

sequence 


Amino acid sequence (A-Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, ]=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, Y=possibIe 
nucleotide insertion 














ysrermscisltlnqtlvmiklneesiil 
kansdhklcllr/ svyqwnayekfl 


10718 


24619 


A 


10802 


235 


3 


HRPSEDLDFPPHWAVMRAPQYPSWDTRE 
GGPRSFLFLFLFFEMESYSVAQL/EGTI 
S AHCNLCLPS S S DS PTSRALCS L 


10719 


24620 


A 


10803 


349 


1 


NQTPFFFFFFGGTETTSTTLCS\YGLLI 
LLKYPEVA/ESASQRDPEWEAAVWRWLE 
GPGSAQPPSAPAKGQELDPWGQRPVPS 
PDDHVQWPYTNAVLLEIQRFI S WKRTL 
TLDTLY 


10720 


24621 


A 


10804 


365 


3 


IDVCNVCVRKQYRGFCDQKYRAWISPVY 
PHKCG WHRVYNTPT PHCETEWL WVI LHA 
QEH/TFSLTGRHTHTHTKTHTHTHTSKL 
APPASRALFGVAHVEAQKALASPSSGRY 
LAITMFVQPCI 


10721 


24622 . 


A 


10805 


406 


96 


CPPE FS EESPRLLKFRVGG YLTPQVS KC 
GLGWRIFKVFFWSPPKVQTSLFFFSKT 
GSHSVT\RLECGDTILAHCDLCAPGSGD 
PPASATRVTVTVGLPPCPAR 


10777 








JO 




FFFFKGDRAQNNS /WGERCLLNKGYWDI 
WISTCKXMKSTPYLTLHTKISSKGLKDL 
IIRAKRIHLLKKYIGINLHDLGLK\DFL 
NMTPKTLATKEKIDTLDFIKIK 


10723 


24624 


A 


10807 


126 


1 


KEPFFFFFFETESCSVAQAGVQRCNLSS 
LQHQPTG\SSDFP 


10724 


24625 


A 


10808 


204 


415 


HLGFDSLTTCISLSDGLKYKATVFLVFF 
FERESNC\AVWAECNGPISVNCNLRLPG 
SGSS PAS PSRGVE I T 


10725 


24626 


A 


10809 


168 


2 


KISKRPFFFFFFETGSHYVA\KLECSGV 
ITAHCSLDLFGSSNPPTSASWVAGTTGT 


10726 


24627 


A 


10810 


97 


389 


LAVSPSLSLSLSLSLSLSYLGLPYSLRR 
SNIEITPINTPAEGSVCSSERKGHMSLS 
FNE KI E V I TLS AQDMSNTK I G / R KLDLL 
CH / TSQWNAE EKFL.K 


10727 


24628 


A 


10811 


24 


416 


LEYIARRYLGWWLFFFFFLNRQGEKSR 
GPFKFFFPRGFFSTRNGAPPGGPWGPLP 
WGGGP P VGFQKQGKGGAP \ P PKKNRFP K 
GGPLTQPNLPKTPIENPKGPPTRGFFPS 
GPPPKKGAGPPPIFRVGFG 


10728 


24629 


A 


10812 


60 


435 


KKRKNFPQKKISPYFYPLKWFKTPPLWV 
KNQTPPV/CCFFEAPFSLKKPPRGLKKG 
/WGKNFPPPLVPQKXKKKDTAGVAILVS 
DKKDFNPTKIKKDKGHYIMVKGSMQQEG 
LGCPDAWVPS 


10729 


24630 


A 


10813 


618 


1519 


FWFGVNCKSVCVPFLPVLCFVVCVWCER 
GVLSQRSMGQAQSKPTSLGTMLKHFKKG 
FKGD YS VTMTPGKLRTLCE I DW PALEVG 
WPSEGSMDRSLVSKVWHKVTCKPGCPDQ 
FPYIDTWLQLV/YRPPPY 


ifl7^n 
it// j\j 


i^UJ J 


A 


10814 


179 


15 


KVKRLKTP FFFFFFETRTCSVAQARVQW 
HNHSSLQPQPPGPKRS \PTSVSREAGT 


10731 




A 


10815 


61 


422 


NCFFLKGPPLFFFFFFFFFFLLQIIVFI 
YKFFAFLFQMEFRSLPRLDKCNGAISAH 
CNLSLPSSWDYRNLPPRIAKFFLFLVET 
RQPASA/ FLTCWD YR 


10732 


24633 


A 


10816 


170 


440 


RQGLSLLPRLECSSMTTAHCDLKLLSSS 
NPP I SAPH I ALG/ LTGLCHHTQL 


10733 


24634 


A 


10817 


322 


a A'y 


FIFI FYFFETRSHAVAQAGVQWRDLALL 
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Amino acid sequence (A=Alnnine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
0=Glycine, H=Histidine, Msoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 

C\— Clint omino U— * A i*irinin a CartriP 

— uiuiaiiiiijc, iv — /irgiiiiiic, kj—oci me, 

T=Thrpnnir»f» V=V^linp 

W=Tryptophan, Y=Tyrosine, 

y\ — UllfiJlLJVVIJ, — OlUp tUtiUfl, JJl/jjIi/JC 

nucleotide deletion, V=possible 
nucleotide insertion 














QFPPP\GSSDSPA 


10734 


24635 


A 


10818 


134 


3 


KKAHLPFFFFLEGTESHSVA\RLECSGT 
ISAHCNLCLPGSSVSV 










to 




AIRL\EKKIKDIQIGKEEAELSLFADMI 
IF 


10736 


24637 


A 


10820 


233 


440 


FFEFGALKKKKSMTHKKVERGKRTTLNG 
GG I KAAKTAS LKIOILOGWG KVMIjMANKL 
LRGERAW/ FPPRI RG 


10737 


24638 


A 


10821 


293 


402 


RNPFFFLETEPHSVAQGEVQWCNLSSLQ 
PPP\PGS 


10738 


24639 


A 


10822 


168 


1 


PLRGLSDSSLGI MKRMATDLSSLQPLPP 
G\S SDS PAS AS R VAGITDSHHHAWV I F 




Z4D4U 


7\ 
/\ 


1 AQO'J 


y f 


4Uj 


FEMES CSVARLECSGAISAHCNLL/LPP 
SFGSSDSPASVT 


10740 


z4o4l 


T\ 
J\ 


i noi/i 
1Uoz4 


J 60 


1 


FDPTTQT "\7ZiT>K'T.OK'WPP'D /QITWT PT.Z1A7T? 
Cifyj 1 ±oJjvrtJjJVJjV-i>>**'t KK/ O L>JlJ-tr< 1 V t\ 

DQPGQHDETPSLLKNTKISWAQWQAPVI 
PALEPV 


10741 


24642 


A 


10825 


199 


1 


KKTQIGGAPGGALFFFFFFETGFPFVA\ 
RLQCSGTNQAQCSLNLLSSSNPSFPALQ 
VTGTPSACP 


10742 


24643 


A 


10826 


252 


381 


GQQEFFFPCSVPQAGGQWCNLSSLQAPP 

r>/^\ C UUC'r)7\ C7> C? "D W 7\ 


10743 


24644 


A 


10827 


77 


427 


IFQVHCPMSFPVPMACIPRVSSFTSWVF 
HNLLPPSECPLGPLVPASSHPRPCVCCR 
PCTSWS\CPLWPRPPCSNSPV/TCVPCL 
PCLCISEIPSCVPWP/WTYSSLCPMSHV 
PDSPCPLP 


10744 


24645 


A 


10829 


20 


CIO 

Jib 


OCA TTCT T OUT TT?TCCaTMCrCVATT T PT. 

SVFL/RAQTLTAPCQTRGPRRGKDRGSG 
SSSPSGPKATKESSVERRKSFDSWGHRF 
AA\ QRLMDNQAERE SEAGVGLQRDEDDA 
PLCEDVELQDGDLS PEEKI FLREFPRLK 


10745 


24646 


A 


10830 


351 


3 


LHFS PFLQHRQNI KVWLMRASMQRHSRT 

SDKILSTLPAFTCANLSSRFCSSSPCSC 
SCLRA / CCTCHFDLSTLIHAHTRTHTHT 
HTHTMY 


1U/40 


Z4o4 / 


£\ 




o 
0 


J /O 


LEGDLFIGPKAE\EHRG 


10747 


24648 


A 


10832 


336 


38 


GVATEGVGEAAQGGE PRQ PEQ/ PPPQPY 
PPPPOOOHEEEMAAEAROA/AGAPMDDG 
FLS LDS P S YVL YSDRAEWAD IDLVLQNV 
HPNP WO T T YS DK V TLWK 


10748 


24649 


A 


10833 


206 


1 


TYFFPFPPGLFFIAGIFFFFFLETGSH/ 
SSLRLECSGI ITASCSLNLPDPNDPPAS 


10749 


24650 


A 


10834 


226 


376 


RISQAI ISFYFYFLFETESRSVAQAGAQ 
wQnT.QQT.nppcjpnvV ^dqpat.p*? 


10750 


24651 


A 


10835 


213 




DRVLLLSPRLGCSGMITAHCNLHLPWFK 
RFSCLCPPE/SSWDYRCPLP/PPRLTSV 
FLVETGFHHVGQAGLR 


10751 


24652 


A 


10836 


351 


447 


RENLWLTPVI PALREAKAG I \LEPRSSR 
PAWAT 


10752 


24653 


A 


10837 


71 


472 


SASTAPMAP VKKLWKGG/ K/ KKKQVLK 
FTLDCIHPIEDGIMDAA/NSTNYEQFLQ 
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N=Asparagine, P=ProIine, 
O^Glutamine R=Arpinine S=Serine 
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W=Tryptophan, Y=Tyrosine, 

X=llnknftwn p, r» nnrinn /— r» A^ci hi f» 

nucleotide deletion, ^possible 
nucleotide insertion 














ERIKTORKAVNLGGVVTIERSKSKLTVT 
PEVPFSRRYLKYLTKKYLKNWNLCDWLC 
IVANS KES YELRYFQITQDKEEEK 


10753 


24654 


A 


10838 


37 


382 


SRCIMAFYLYGIRSFPELWKSPYLGVGP 
GHSYVSLFVAGRCGVRNOORLCSVKTMS 
PQNTKATOTIAKARYLRKDEGSNKQVYS 
VPHFLIAGAGKERSQMNSQSEDHK/LA/ 
PVRNAV 


10754 


24655 


A 


10839 


313 


617 


AFFL I VFQYVARERERVSRS / VNCGLYQ 
HCYFRTL 


10755 


24656 


A 


10840 


309 


511 


WEQGSWEHVLQAPCSLIAPRITHAYTHT 
HTHTHRYTHTKLYPH/ APTS I L 


10756 


24657 


A 


10841 


269 


450 


TPWPLKKEFFFFLKWSLALVAQAGVQWR 
DLLGSSDSP\LQPSRFKQFSCISLPSSW 
DYR/PC 


lU/j / 


Z40JO 




IUo4Z 


/ion 
4ZU 


i 
I 


PLCHPPPLLVTMEEEITMLFIDIGSSMY 
KAGFAG/DDASRAMFPSIVRCPWHHGVM 
VGMSQKDS YVGDEAQTKHS I LTLKYP I K 
HDIITNE\WDNMEKIWHHTFYNKLHVAP 
CI 


i m<o 


Z4ojV 


7\ 


\f\QA1 

lUo4i 


A1 1 
4 j J 


i 
i 


CSGNLSFSGCVPGLPAGAVPPWVPVPFQ 
GGA/S WVWKGPS PWTLHLLLRTWGLAGG 
vfsrip t ,np wp zv w prtM prc cinnnf; a p p z\ wo 
ATGTPRS RTGS TGG I ETVT I LEGSHVSG 
MGIR 


10759 


24660 


A 


10844 


55 


297 


QRWPGTLLGVQPG/APPBSTSASGSGGA 
RGGPVPTLEGGNTGSRKWEDPCWGSQNS 
PSSDGSKPPPGPLTSKVCDDSRLSE 


10760 


24661 


A 


10845 


109 


427 


QTGPSAAGLLEFARGPLQTLFAWVPAAV 
AAEQQ I FVNRECCCLI VPLEFCLRGVPC 
CVRCQSA\LLGGASQLG\SRGSGVRDPL 
EGGSCPFSDLOLHAARTTALFTA 


10761 


24662 


A 


10846 


17 


416 


S FLWKFCLREVPGRARCLSA\ LLGGASQ 
LGYSGVRDPLEEAVCP FSDLQLRAGRTT 
TLFKAVRQGHLSLQR I LLPFVWQCPAP I 
GGVYRDRQASLSCSGLHPIRA/SRAAVP 
TQASAMAGAPPPDSLPPCSLSSN 


10762 


24663 


A 


10847 


265 


429 


LSGAS CCLFSLP I VLFALLLLS VLS FS F 
CAWCVPSWSLSPTLFLTFVLHFL* RFIS 
CF*VNSNPKNLTFMVNFQGKGMKIGLAL 
ERIC I CGCSLVFPEAI LETS CQNLFCTY 
ACAGVLS S VYDYLALLAVS FLFL * FFSL 
FFFFLYYLFLFALGASLPGLYLLLYFLL 
LCCISYSLF 


IK) /OJ 


1A6AA 
Z4004 


]± 


1 Aft 4 0 
1U54& 


1 o 




OTLGTKMNEGLFATFIAPTI LGLPDALL 
I ILFPPLLI PTS * YLINNRLI ITQH*LV 
KLTSKQM I T I HNTKGRT* SLI L IS LI 1 1 
I VTTNLLGLLPYS FTPTTQLS INLAMAI 
PL* AGAEVIGFRSKI KNALAHFLPQGTP 
TPLIPILVII 


10764 






i voty 


9 


462 


TTLHAFGTMKREAF I TLLCLFTS ANSRG 
VYARDAHKS E AAHR I KDLAEKDFLALVL 
I AYAQYLQQCPFEDHVKL*NE VTE FAKT 
CVADESAETCDKSLHTLFGDSLCTVATL 
RETYGEMVDCCAKHEPVRYECFLQHWDD 
CPNLP * WRPEVD 
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Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q = Glutamine, R=Arginine, S^Serine, 
T=Threonine, V=Vaiine, 
YV=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, ^possible 
nucleotide insertion 


10765 


24666 


A 


10850 


164 


1 


RGFQVRGE*NSGRETER*RERE*KLARE 
RAIGQEREREKSREAGRBRERERKKE 


10766 


24667 


A 


10851 


417 


100 


MML WGNLETRLWSQAQT I PCTKQRS S L 
FFPVLNAAIFRSKS KVLNCLEYFLGNTL 
FSWDLI PEILFPLSHYHHYHHHHHNHQY 
* KS * QWS ASNTRFQGILDKNS Y 


10767 


24668 


A 


10852 


339 


3 


ETTHSEARRGRSAAASCRGSALRRGRFP 
E SRRGREAAP VCPRHVLL * GAQS KQAAV 
AGKRSGTRHASRWPKSLFTPRRRRISLK 
RALHFWQQSADPSPSVSRIPPHGVGSS 


10768 


24669 


A 


10853 


1 


373 


LQQKGMRK^GQSEMAPAGVSLRATILC 
LLAWAGLAARDRMYLHPFHLVIHNEST* 
EQLTKANAGKP KDPTFI PAP I QAMTS PV 
D EEALQDQLVLVAAKLDT VD KLMAAMVT 
MLAI FLGFRI YG 






A 


10RS4 


i 




VS CS FL KL KTM KHGLLLLLCG FLLKS QG 
VNYTEEGFFRARGHRPLDKKREEAPSLR 
PAP P P I TGGGYRAR P AKAAATQ KKVER * 
APDAGGCLHADPDLWV*SPTGCQLQEAIi 
LiQLERPITNTVDELNNNVEADSQTSSSF 
It 


10770 


24671 


A 


10855 


343 


3 


RGCE AH PL PR S EG PAGSAIiAQ P VMY CT I 
FAGTLITAISSH*FFT*VGLEINMLAFI 
PVLTKKINPRSTEAAIKYFLTQATAS 1 1 
LVIAILFNNILSGQ*TLTNTTNQ*SSSI 
I 


10771 


24672 


A 


10856 


147 


1 


TRTPTGQCVSPKSMFLGAVAHSCNPITL 
GG*GRRIT*GPEFDPSLANMV 


10772 


24673 


A 


10859 


189 


422 


NHTMDDFERRREL I RHKRE EMRLEAER I 
AYQRNDDDEEEAAR * RRRRARQERLRQK 
QEEESLGQVTDQVEVNAHNSVP 


10773 


24674 


A 


10860 


90 


273 


S HQE I EQNSAMAPRKRGGRG I S FMFYCL 
RNNDQR YMT* RL * SG I GWMLLS SGRKGY 
ALPG 


10774 


24675 


A 


10861 


25 


411 


AP CAKPCGD WRS RGS LVWAMSG CN7ARKG 
DCCSRRCGSHL*N* I PTDWPLN* FFLTS 
AKVKECFPKKESIYSQTVY+SPG*KM*T 
* DKEHPR YLI P * LC I QFYHLCWVTGTGG 
GIILKHGDEIYIAPSGV 


10775 


24676 


A 


10862 


33 


302 


SRRATLIYVDMENGEPGTRVVAKDGLKL 
GSGPSMLALNGRSQVSAPRFGRTFNAPP 
SLPIATIRALGTVNRATEKFVKTNGPLR 
Q*QPRS 


10776 


24677 


A 


1 \JO\JJ 


lU/J 


1591 


YCHTGKGEQLGERFCEGVSRRGPAERGS 
DSQTPWPWPLCAAAGTSAGTSHSGSSSG 
AFSSWPCWTAAEPAARKRGRPAGSWSSP 
ATGAPGRCRHRI LSRGAGGSAGFVCSGL 
AESGL*ESSSPGRSQG*PQGQRHPQPNG 
LPAPPSTSV 


10777 


24678 


A 


10864 


245 


418 


TS KLAFP I S I P V I YANKVCP * FS KKKKK 
KKKKKKKKKKKKKKKK KKKKKKK KKKVF 
FFEGPGFFF 


10778 


24679 


A 


10865 


132 


398 


LNMGKGDPKKPRGKMAS YAFFVQT* REE 
HKKKHPDAS VNFS E FS KKCS * RWKTMS A 
KEKGKFEDMAKADKARYEREMKT YI PPK 
GETF 


10779 


24680 


A 


10866 


115 


455 


LLTRNMDRLLILAGGMPGLGQGPPTDAP 
AVDTAEKVY I S YIiALLKMLKKGRAGVPM 
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N=Asparagine, P=Proline, 

— vjriuidm me, iv — Arginine, o — ocrine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X— Unknown, **— Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














E VMGLMLGE FDDD YT VRV I D VI AMP QS * 
TGVSAEAVDPVIQAKMSDMLKQTGKPEM 
D 


10780 


24681 


A 


10867 


64 


483 


QGERPAAA^IK I * S LTLLS FLLLAAHVLL 
VEGKKKVKKGLY I KVDSE * KDTLGNTH I 

EG I S LKVE CTQLDHE FSWA YAGN PTLCL 
KLTDEIVYWKPVARNLRLQNDIMIYSPQ 


10781 


24682 


A 


10869 


229 


1 


DTTILKLGRLI I LQWLLS EGKSHMS FTL 
NQK+EI I KLSKECMSKAELSQKLGCFCQ 

V uoy V l v lxli-\ i\_Cj 1 v J JJ JL> I\J2> JL x\0>i. 1 


10782 


24683 


A 


10870 


934 


515 


KKVLLCLPGLGCGGVILAHGSLALPGSS 

QT.HT c;t nQQWnT.THM* Vn7APT.TPA7VT?FP 

O Linjjo Jjl^DO WlJiJ ± on X UiHJX JLt ± C v If r n. 

EMGSCH I CQAGLNS YNSS I P PTLVSQNV 
GTTGVSHRAQLAP VFCFC FVTLVDVKAL 
HFGKQENR IFFFFI LFPGGSRDEDRVA 


10783 


24684 


A 


10871 


373 


424 


EFKTNLANMG*CLKSQHLGRPRCANHMR 
LGVQNQLGQHGE I P FLLK 


10784 


24685 


A 


10872 


1 


469 


DCDcrricT n7vnnDncp7\c)\T 7s tpcpt t? o 
KbKSijlJSljPACDKPSljAbAljtAlCiLi lr i> 

AISCFWDLPAPSLRLTPSCQPTMSSQIR 

QNYSTDTEAAVNS LDNWYLQAS YTYLCL 

vjr i v JJKJJDVAijJiVj VoytrKiiJjA^ 

YDRFLKMQNQRGGRALFQDIMTPAEDD* 
GKTHNAMTAAMGLETK 


10785 


24686 


A 


10873 


244 


2 


NIYSCKETFSVPLLAIHVYHFLVGRGGS 
QGTEMLWHRVDLRYREQAGHSGSHL* SQ 


10786 


24687 


A 


10874 


83 


427 


I SLNM I R I AALNAS ST IDDDHE + S FTRH 

E ESAKAYHSHLEAS LLREAVS S GDENEG 
LKHPGLILKYSTYKNIAQLASQREDLET 
AME 


10787 


24688 


A 


10875 


94 


1 


KSQKACNPSTLGG*GGWIT*AQEFTTSL 
ANT 


10788 


24689 


A 


10876 


2 


413 


GVTRGFNMRIEKCYFCSGPIYPGHGMMF 
VRNDCKVFRFCKSKCHKNFKMKRNPCKV 
RWTKAFRKAAGKEIjTVDNS FEFEKRRNE 
P I KYQLEL WNKT I DAMNRVEE I KQKRQA 
~rC RTMNPT.*K'T*FT.P'K17nnTnFVnT.W 

ZvT" -L I'llM 1\. J_i A. i. Hlj£r J\.v l^ri V ^l_iLV 


10789 


24690 


A 


10877 


65 


417 


R FAGAGAI PEARAWPTDVHAAEEEKEMD 
LPDliASRVFCGRILSMVNTDDVNAI ILV 
QKNMLDRFEKTLEMLLNFNNLASARLEQ 
MSERLLRHTRTL * DMKPDLGSLFRP IRT 
LSWKL 


IU/9U 


z4oyi 


7\ 


1 f\QHQ 
1U5/S 


4 


A A O 


NAGRETLGRLLVDYPWTHRFFDS FGNLT 
SGSVIMGNPKVKAHGKNVLTSLGDAIKH 
LDDLKGTFA* LTELHCDKLDVDPENFKL 
LGNELETDMAIHFGQDFTPEVYAYLQNM 

V 1 V V>\1M 


10791 


24692 


A 


10880 


1 


419 


GKHIRQYHEEKETGQRINIHEYYLGNML 
AKNLLFEKEREAEEKE KS YE I PTKNIQG 
QMTP Y YP VGMGNG T PCS * KQNR PRS S T V 
MYICHPESKHEILSVAEVTTCEYEAVIL 
TPLLCSHPKYRFRASAVNDIFCQSLPG 


10792 


24693 


A 


10881 


54 


335 


RE I FTMSGALDVLQMNEEDVLKNLAV I T 
HLCCTKTDSPMEQI I YGS*TEYIYSIKL 
NRT* QNLMLAVF * KYDI *NHDE VTVMTF 
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beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=JLysine, L=Leucine, M=Methionine, 
N^Asparagine, P^Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














NNTLQMFEP 


10793 


24694 


A 


10882 


216 


3 


MS IKS MS I CCLRKKGKHGFQIRGK*NSG 
RETERERERERKRARERARGQERSRQKT 
SEGGRERERERKKE 


10794 


24695 


A 


10883 


3 


390 


GELKCS KEKCQS MS AKSRTI W I IGAPFL 
KGQPRGGVEEGPTVLIKAGLLEKLKEHE 
CDVNDYGDLPFADI PNDSAFRI VKNPRS 
VGKASEQLAGKVAEVKRSG* ISLVLCVN 
LGGGLECLCGHAKVYSR 


10795 


24696 


A 


10884 


3 


375 


STMRAWIFFLLCRAGRALAVPQQSSLPD 
ETEWEETVAEVTDVSVGANPVQGEVGE 
FDDGA*ETE*DWAENPCQNHHCKHGKV 
CELDENNTLMCVCQDPTSCPALIGEVEK 
VCSYDTLTFYSS 


10796 


24697 


A 


108S5 


95 


2 


KCTQGPSAVAHACNPNTRGGRGGW I T * G 
QEF 


10797 


24698 


A 


10886 


1 


114 


MGFHHVSQDGLDLVTL*GACFSLPKCWD 
YRREPTHLAH 


10798 


24699 


A 


10887 


1 


368 


E PTME LTTAE KTNG KAA WRKVGAHAGE Y 
GAEALERMFLTFPTT*TYFPHFDLSHGS 
AHVKGHCKKVADALTNAVAHVDDMPNAL 
SALNYLHAHKLRLDPVNFKLLSH* LLWT 
LSAHLHVEFT 


10799 


24700 


A 


10888 


336 


9 


FRGQVIPAIFFPFQTTSLSYFQTGQPN* 
TPFFQKILKLTRGGGKLLDFQTPGRLRQ 
KREDPLFPGV*NCN*P*SHPWWGTKQNP 
VS KKKKKKLKTS S I FVHRVSMS P I 


10800 


24701 


A 


10889 


3 


362 


GFLLAPLEMQELGTPMNRILQLTIAEQE 
TFLTPALLLPIPHQTYSTASAVPLAKPD 
T*PKDVGIIiALQVHFPAQYEDQTDLENY 
NNVKALKYTEDLGQTLMGFCSVQEDINS 
LCLTVEQP 


10801 


24702 


A 


10890 


66 


333 


TLPGNIGISFVERVMEVLRPQLIRIDGR 
NYRKNPVQEQTYQHEED* *DFYQGSMEC 
ADEFCDAY*VEKTPQGFRTTLRAPS*LY 
TPIVG 


10802 


24703 


A 


10891 


405 


49 


I PWMGTERTRASAQRDVAGPARNPACPL 
PEQCFPGPAEWTGPLTPPGSC*LRPGER 
ERHGPGCWGPRERLSSGQTHPPSPQGCA 
NTCPCHKHQQELLAGGVAVFYKLFYRYG 
NSPKWDL 


10803 


24704 


A 


10892 


2 


197 


PQPLRVLWTAHLAAMAPSSRTSLLLAFA 
LI CLP WLQKAGAVQTVPLCTLFYHAMLQ 
THRAHQLVIDTYQELEETYI PKDHKYSF 
LHDSQTCL* FSDS I PTP SNMEETQQTSN 
L*LPDIPAPGFCPDLPALASKGWCRPNR 
SAMHAFLPRYAPNASRAPTGH 


10804 


24705 


A 


10893 


3 


334 


DQLPEPLKVLWTAHLLAMAPGSRTSLLP 
PYALL+LPWLQEAGAVQTLPVSRLFDHA 
MLQAHRAHQLAIDTYHEFDETYIP*DHK 
* S FLHDS QTS FCLPDS I PTPSNMEET 


10805 


24706 


A 


10894 


3 


341 


LLTPGVSDAICYILDSGYI IMSDTFTAY 
V I G * R FE VNG * HATVR FAAWP P VALP W 
LGV*WDNPERGTYDGTHEWTVYFKCRHS 
TGGFFIRSNKVNLGTDFVTADKNLYVLD 
Y 


10806 


24707 


A 


10895 


25 


351 


AMIQTRDLQGGRAFGLLKAQQD* RLDEI 
CTQLIiDDLKYSKDEDLPSRLEGFKEKYM 
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ing to first 
amino aciu 
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sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
mug tu 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=G lycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 

T=Threonine, V=Valine, 

TT Jl I jr |JIU|S1I«I If, I *— * A jt VolllCf 

X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














E FDT, T GNGHI D T M YLKLMT .YKIiAVPOTD 

XJ X X«/XJ 1 vi« \J+ X«L> JlJ * JL J-J-IV UttU X XVXXJT1 V XT X JL/ 

LQLPRS IGEVYSGSGETISY PDFLM 


JUoU / 






1 \JoyO 


J 04 


JO 


FSOFK I AOFLKKI PFPGFSPFFONGGNF 
LKFSKFFPNKRGFFPHPFF*KRGPFFCP 
KWGPQGH I GAHGPPAPQGQKI P PFQVPK 
KMGIKGPTPPPGFFFFFFFFFFSW 


JUoUo 


JJ\ l\Jy 


A 


JUoy / 


1 Jo 


• 1 
j 


CTT ,KKP * R PGMV AH API\] P<^TT ,GGR ^GR T 
T*GQQFKTSLTNMVKP 




£S*f l \J 




IU070 




Z07 


GNKYCLT I LLLLLLLSFKROSLTLLPTL 
ECSG*TRTPGPKGSSLLNPS *VAGHGGV 
CL*SQLIRRLRWEDHLSPGV*GHSDPRL 
HHCTLTWATE*DSVS * KTTTTTTTK 


10810 


247H 


A 


10899 


316 


374 


CCCCCCCCCC*CCCCCCCCCCYKETNFE 
QQ 


10811 


24712 


A 


10900 


38 


206 


VYCVLVFTICTLLCNTSLGLFHPEFFFF 

PTU QrrVAH ZXGT ,OWH HP * c*T .P T J . PH R T T 


10812 


24713 


A 


10901 


364 


2 


DYQHISPEKHCRPEGSGMVYLMCRKKKK 
P TTKTCNT .^TK T T iNPOP ^ FT T .* KPT? W ^ ARK 
S*QKEITKIGADSLIIENRKRIEKIYIK 
ETMSWFI E I LNKI DO PLARLTM I ERKDS 
TKFRNERG 






A 








ATSS P WG Y ^ A YR VA I LiLFIiY FSNKIiAFT 

JT* JL UkjJT V? VJ X t^JTLX XV VAX XJXJX7 XJ JL JL uiV 1> t 1 ill X 

VLYGFV*NYFLQEI 


10814 


24715 


A 


10903 


234 


352 


NFCFFETGSCSVT*AGVIMAHCSLDLPG 
SSNSLTSVSEE 


i r\Qi ^ 


24 /lo 


7\ 
C\ 




2jU 




TWIRMMT.DKKnT * A T FT»F*PKTnDK A AFT 

TLNINNAL 




24/1 / 






Ton 


3Z.J. 


PTTQLSINijAMAIPL*AGAWIGFRSKI 
KNALVLCTSTPLPTSTMLPVEMMDTSSD 
IHPKI * S IR * QTCI QHRLLLWLKC 


10817 


24718 


A 


10906 


271 


365 


KWRPG WAHT YNPS I LGSRGGWI T * GQE 
FKT 


10818 


24719 


A 


10907 


135 


358 


IiRYKLTNPKLTLLTCGLI FLKGNIVNIG 
QCNGVHM * YQHFGRPGQDCLS PG I QGQH 
GOHKESLSIi*KSLKISWS 


10819 


24720 


A 


10908 


107 


212 


I KNEGMGQVR WLM P V I PALWEA * VGGTP 
EVSHSWL 


10820 


24721 


A 


10909 


165 


383 


PKNRP I I PLKRSS I SNPGDFQKSLRPGL 
TPMGPHFKRGNYSYDENLENLCPVGGNK 
GPGSH*GIiLTCETCRGF 


10821 


24722 


A 


10910 


119 


291 


GGYRFYVKDTF* K* GFWPLTLFI I LKPL 
AGHGGACLWS WRLMRLK* EDHLSLGGRG 
CSELWLCTLA* ATE * DPVS * KVRLLALN 
FVY^TKTFSPJ^WWCMPWLATMEAEVGG 

r v l xv x xv x x o rvxiV » v« v^i i x v v uri x n J— *Jr\ j_j v w\j 

SL5SGRSRLQ 


10822 


24723 


A 


1091 1 


283 


2 


SDNTTDTFLPFIYKYIFPITWEQGETWK 
KNPPJNSRLKKTLRSETIAQI PLKCNLWP 
GMVAHACNPYTLGD * GG WVT *GQE FETS 


10823 


24724 


A 


10912 


1 


360 


PHAFGTMKWVTFMSLLFFLSWANSRGGF 
P nisi A P K c> F V AHR T .KD WGKKTtf FK A MG * M A 
LAQNLKQGPFENHGKLGKEVPEFAKPGV 
ADDAAENGDKWLNTLLGNYLAPVAAVRE 
TYGERAEC 


10824 


24725 


A 


10913 


270 


361 


S KTWPGTVAQTCNPRTLGSQGGW I T * GQ 
EF 


1Q825 


24726 


A 


10914 


157 


i 


VFFQLSRLAWKGFSRFFFVFETESHSVA 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 
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USSN 
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26 
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nucleotide 
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correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=G!utamic Acid, F=Pnenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L^Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop cod on, /=possib!e 
nucleotide deletion, \=possible 
nucleotide insertion 














*AAVQWRDLGSLQAPPPGFTPFSS 


10826 


24727 


A 


10915 


1 


357 


LEEWGPEREFAEEEKEKKNTKKKKIKNS 
TPPKKPARKGEEKPGPFKRAITTIFVAR 
IPLFCLKGFLWPSC*IIGRSSPTPGSKG 
HTGAPNCP P C P LAAF P KDVPNFNP KMVE 
AVKNPIL 


10827 


24728 


A 


10916 


102 


471 


PSTPIILTSSYPHYVKSSVTSTFIISLF 
PTTI FMCL YQEF I I SN * HLPTTQTTHLS 
LSFKLDYFLIIFILLTFLVTWSIIEFSL 
*YINSNPNINQFFKYLLIFLITILILLT 
ANNLFQLFIG 


10828 


24729 


A 


10917 


2 


138 


REPTMVLSPADKTNVKAVWGKVGAHAG* 
YGAWLFTWLTLNVISD 


10829 


24730 


A 


10918 


178 


393 


LVL CT VLLM WRFH FHDS LS I LKAQD VTE 
NLSNPTVLRGETGPSAVAHTSNPSTLGG 
*GKV7ITQGQEFETSI±A 


10830 


24731 


A 


10919 


278 


124 


TAW ADWGSNETP FLLKL P KKLT * GGGVC 
LEFQVLGRVRPKNPFNLGNQGFN*PKFR 
PWTSTWGAKQNFV*KKKKKNKNKKP*NS 
RHTPPP 


10831 


24732 


A 


10920 


221 


243 


IIIGSPLIDIKISKCFLKIVAFGQAQWP 
VIPALWEAEAGRSPEVGSPRPA*PGSLK 
VHVDNNWES IN 


10832 


24733 


A 


10921 


33 


292 


GTM WLE I RHAVE VQ CVLVS ELV I PT S G D 
KPEQC*DHYLIT*YLILGKWGI**ISGA 
LEKKKKKKKKKKKKKKKKKKKNKIKTGG 
FI 


10833 


24734 


A 


10922 


108 


3 


KLAVYGSVCL*SQLLGRLR*EDHLiNLGH 
GGCSEP 


10834 


24735 


A 


10923 


106 


2 


KLAVYGSVCL+SQLLGRLR+EDHLNLGH 
GGCSEP 


10835 


24736 


A 


10924 


195 


2 


ASLFKSQLNRAFIYLFIYVFIFETGSL* 
PMLVCSGVITAHCNLHVLGSGDPPTSAS 
QVAETMVL 


10836 


24737 


A 


10925 


127 


2 


YVCILKNLKGRPGMVAYACNPSTLGGQG 
GWIT*GQEFKTSL 


10837 


24738 


A 


10926 


639 


282 


FFLS I KG WVQQFMP I I PTP WGLKQEDHL 
RPGLRDQPVQNSKTPSLLKI * KLARRGG 
ACL*SQLFRKPR*ENCLSTGELKPKGGI 
FIRLVQNKIPTTRGEKRKQSHTGSYQCP 
KIKKKG 


10838 


24739 


A 


10927 


1 


363 


ALLTQALTCRQAGAEAPHAGATPSLMPP 
SLPQGFRDCSPSDAAYTMEMTIDKKQNQ 
VIILFKFKMGHKAAQTTRNINWAFGPEI 
ANKGTVQWRFKNFCKRDESREDDE*YAQ 
PSKVATDQL 


10839 


24740 


A 


10928 


53 


379 


TEAELLTLYLLPNALLNHFTSPPLMFAD 
RRLFCTNHID IGTLYLLFGA* AWLGTA 
LSLLIRAELCQPGNLLCNDHIYNVIVTA 
HAFVI I FFIGLP I IIGGFGN* LFPL 


10840 


24741 


A 


10929 


180 


350 


EPMAKGKTESPGPKRCGP*I*WVISQRG 
TLRFRGAGLFFMGEFLRLGENLLEIPRG 
A 


10841 


24742 


A 


10930 


361 


379 


RRYWWLG * VQWLMPVI PALWEAKAGRS P 
KARNL 


10842 


24743 


A 


10931 


121 


543 


H RNTGS THASAHAYH I VHTNP * PL.TGAL 
SALLMTSSLiAMGIHFHSlTLLILGLLTN 
TLX I YK * WRD VTREST YQGHHTPP S/QKG 
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SEQU) 
NO: of 
nucleotide 
sequence 
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SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQD) 
NO; in 
USSN 
09/^15,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanme, 
G=Glycine, H-Histidine, I=lsoleucine, 
K= Lysine, L=Leucine, MHMethionine, 
N=Asparagine, P=Proline, 
Q=Glutaniine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LQYGIILYIT* *VFFFAGFF*AFYHSIL 
SPTPQLGGHWSPTGITPINPLKDPLLNT 
S 


10843 


24744 


A 


10932 


227 


339 


VGGVKSVQLLLNCHFSR*MKKKKKKKKK 
KKKKKKKKKKKKX 


10844 


24745 


A 


10933 


64 


409 


DQRNKSAHLRAHLKKKKKKKKKKKKKKK 
KKKKKKKKKKKKKKKKKKKRGSL* KKIN 
WAFFLPIiLKRKFVGENFLKKNFFWGGNI 
GGQ I S S KKYRGGGK 


10845 


24746 


A 


10934 


2 


362 


NKLSGP YPEKVG YTLPVLMNPLAQLVI Y 
ST I FAGTL I TALSSH* FFT * VGLEI DML 
AFI PVLTKKINPRSTEAAI KYFLTQATA 
SIILLIAILFNNILSGQ*TITSTTSQYS 
SLIILRAM 


10846 


24747 


A 


10935 


154 


3 


GCPLSPLLFNIWKVLVRAVMQEKEIKS 
IQIGLEVK* SLLTDNMI LYLGN 


10847 


24748 


A 


10936 


250 


363 


TAWPGYLYS FS FLY* ETTK I WLGAVAHA 
CDPSTLGGRG 


10848 


24749 


A 


10937 


199 


339 


NVLG I hhVFR 1 1 VE I YKCGRLWLGT VAH 
TCNPSTLGCQRGQTT*GRE 


10849 


24750 


A 


10938 


130 


240 


KNEQDPRDL * DNDKWPN IHVI GVPEEDK 
DNGTERVFD 


10850 


24751 


A 


10939 


35 


235 


FILVENTKKMCICPLl^NMIYAILYLFVP 
S VFL * E ENKQ * G I TE K KKKKKKKKKKKK 
KKKKKKKKIKPGGL 


10851 


24752 


A 


10940 


179 


12 


DQPGQHNKTLSLQENKNKSSWVQCCV* P 
QLLGRLRWEDCLSPGGRSCEEPCSHSG 


10852 


24753 


A 


10941 


297 


278 


INQDNELTLINQSSKQIKHKNINQTLRT 
KMNENLFA*IIAATILGLSATVQIILFP 
PILIPTSKYLINNRLITTQQ*LIKLTSK 
QMITIHNTKGRT*SLIiGGD 


10853 


24754 


A 


10942 


3 


318 


FGGGRG VRN Y PDAF VLS VLLP SR LL FPH 
LFPHLFPLPPFLGLAPYCFLTL* YF* K* 
*LYYSQSIIFFFYREMKNKLLVFWAINL 
FFLYYYNYMEMWTLICYWKN 


10854 


24755 


A 


10943 


190 


3 


KLSHKQS VYFKTP F I * KNHVLFK*KNLW 
VYPNDKFFFFFFESLALSPRLQCNGTI L 
AHCNLR 


10855 


24756 


A 


10944 


281 


1 


KKWLFS S YSS L YGNGLFL I PP FLRVSG F 
GKGFWKKFFFLRARGALFWGSPLKGFFL 
GFFWVFFFF*DGVSLLLPRLECNGIISA 
HCNLRFPGS 


10856 


24757 


A 


10945 


169 


319 


NGVEDAFKNMVLGWAQWVTSVI PALWEV 
KVGG S P E VRS FRPTRPI* KYG FRLGTVG 
YICNPSTLGGQGRRIT*GQEFQTNPANM 
VKPC 


10857 


24758 


A 


10946 


312 


1 


FARPGLLKSWDFQP * PLDPVWGFFFKRF 
PENWLEFEI FPFLI IMAKRKKLSKSLFP 
FPFLG*FKD*GQKTPFFFFETVSLLLPK 
LECNGAISAHCNIiCLPGSSD 


10858 


24759 


A 


10947 


208 


329 


IFTGDRSRNNRIG* ARWLTPVI PSLWEA 
EAGGSLEPKSLR 


10859 


24760 


A 


10948 


116 


374 


F YFGYLL FFFC FFEMAPCS RG * S WS ARG 
QSFCNLI FPGSSDLYVSAFRVARITGAR 
DHACI I IVFWDGGCVDTITSGLAGWLR 
RG 


10860 


24761 


A 


10949 


264 


434 


QLAFCTDTLTSVREQCEQL* KCVKARKR 
I ELCDEQ * S CRSHTD * DCTDELFDFLHA 
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correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F-Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, LHLeucihe, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














T 


10861 


24762 


A 


10950 


226 


429 


NFGFRLDKTVFFS KI RINAWAQW FMPVM 
PELWEAKAGGIT*AQELETSLSNIVRPW 
LYKKFKN*WWC 


10862 


24763 


A 


10951 


188 


433 


YSAEWKIDLGIEVFWVGKMTYKQ*FFWF 
VFKFLFF*RQGRTLSSKLEWMSVITGHC 
NLDLLGSRETFASAFQVTWTIGS CHS 


10863 


24764 


A 


10952 


298 


445 


LFS KCS SKS I KEEG PGWAHACKPGTLG 
GRGGWIT* GSEFESSLANMVK 


10864 


24765 


A 


10953 


244 


1 


KTPQGLWFCLFKKS PLKG I KGVSMGVL 
ICFFLKSRINLFNPRGLGQRKNFFFFFF 
ETDSYSVAQTGVQ*HNLGSLQPPSC 


10865 


24766 


A 


10954 


199 


3 


QETKKEQNKENKQIK*RSTRKKHRQGTN 
KTKERGERQTPPVGNRQTPTLGIHARPR 
" RRATTSPRA 


10866 


24767 


A 


10955 


157 


3 


YSYTFSFI ITTS ILI IQTLSGHGGACL* 
SQLLGRLRQDNHLNLEHGGCTRA 


10867 


24768 


A 


10956 


263 


357 


GLS LKVLTQWLGAVAHACNP S T LGGRGG 
*IT 


10868 


24769 


A 


10957 


333 


2 


KWKADML I PLV I KS F FLG I FF KMFTN S K 
RYIFLLQLLAQHSSFFSFIRCMEL*WPY 
P I PLCYGQSNQP S V * LAYCRDL FNEL I F 
FFFFETEFHSCCPGGISAYCNLCLSC 


10869 


24770 


A 


10958 


130 


228 


GSSFLGGS PSVAQAGVHLPDHGSl4Q*DK 
SPSVQKMS KWIGCSGACL* SQIjIjRRLRC 
QGRLS P* GQACSEP * SGRCTPAWATEGD 
P PKKLLP YTTDS 


10870 


24771 


A 


10959 


250 


3 


VQACS I TGALYQRRNAEDE PQTARP I S G 
FTTSIAMRLMLLTCSGHIWPVADLAFSG 
IMP*GLFFPSRQNIiALSPRLECSARA 


10871 


24772 


A 


10960 


199 


326 


VLGRMWSNQKNYTLIARM* KI K* *NNTR 
CWGGCGATRRI IHC*QECRI * KCQPQSL 
WRTV* QFLNTLNI R 


10872 


24773 


A 


10961 


323 


3 


LLF FHLPDNWQH* YVFCHYNLS FLQF Y I 
N*IIQLWFCVWLLSMSIILKSHPCHFS 
K *LS TVTFFFRQS RS VSRLBWSGAI SAH 
CNLCLPDS SDSCAS AS * VAGS C 


10873 


24774 


A 


10963 


120 


3 


PFFFFFF*EMRSCFVAQARVQ*CDQSSL 
♦PSTPGLKQSSC 


10874 


24775 


A 


10964 


168 


441 


LTPVI PALWEAEAGRS PEVRSSR* S PP 


10875 


24776 


A 


10965 


19 


207 


APLKLNVAMELSLGQWDVSRSNLWEIPL 
KKGDTGRVQWLTPVI P ALWE AEVDRTP E 
VTS VTRC * PLKLNVAM E LSLGQWDVSRS 
NLWE I PLKKGDTGRVQWLTPVI PALWEA 
EVDRTPEVTSVTRC 


10876 


24777 


A 


10966 


209 


1 


TKSALSNMWTTHIGIjCKNFCIVL*DRV 
SVTQIGVQWHDLSSLQSLSPRLKDPPTS 
AS RVAG I TGMHLV 


10877 


24778 


A 


10967 


109 


1 


DYLRSGV*DQPGQHGKTSSLLKIQKLAR 
LVGPIISC 


10878 


24779 


A 


10968 


141 


2 


GIjAMLPRLENI I FR PGTVAHACNPNTLG 
GRGGQIT*GQEFKTSLSC 


10879 


24780 


A 


10969 


175 


348 


LVYECKHILCVFNTSLFFPH*TCYK*RG 
VLWLGEVAHACSPGTLGGRGGWIMASGD 
RD 


10880 


24781 


A 


10970 


108 


2 


KTKKLHMRSG WAHTCNPSTLGG +GG WI 
T*GQDSC 


, 108S1 


24782 




10971 


184 


2 


S FLWKLHLRGVPGRVRCQSAPTRGASQL 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenylaIanine, 
G=Glycine, H=Histidine, i=Iso!eucine, 
K=Lysine, L=Leucine,M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutam ine, R = Arginine, S^Serine, 
T=Threonine, V=Valine, 
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X=Un known, *=Stop codon,/=possible 
nucleotide deletion, \— possible 
nucleotide insertion 














GYLGVRDPLE EAV* LFSDLKLCAGKTTT 
LFLV 


10882 


24783 


A 


10972 


245 


339 


SQISGPGTVAHACNPSILGGRGEWIT*G 
QEF 


10883 


24784 


A 


10973 


295 


340 


KRVFF*ARKKNPFLSPLWGLTLENPLGF 
RLFLNPFLGLKISQFFFPRQI * PLAFFF 
WGTLPRPLNLKFGPFFFFFFFFFFF* DG 
VSFCCPGWSQARRYIEINISLSTCPRA 


10884 


24785 


A 


10974 


186 


340 


RPYLTIRFKGHPPNEPIQ*SFFVLFCYC 
FDTESRSVTRLECSGSNSAHCNL 


10885 


24786 


A 


10975 


253 


491 


FLECTSLLKRQRVHPGQHSKAPPTKELQ 
TTGRGGALLQSHLLRRLRQEHCLGPGVG 
SYSEP*LHHCTPAWVIEGDSSKK 


10886 


24787 


A 


10976 


141 


1 


SFSFFKFSPTGDMIGFF*HFFFFLRRSH 
SVAQAGVQWCHLSSLKPRA 


10887 


24788 


A 


10977 


3 


138 


HEETGFHLVSQDGLNLLTS * STHLGLPE 
CWDYRREPPPGREGDF 


10888 


24789 


A 


10978 


186 


323 


YESRSATQAGVQWCELGSL* PSTSRFQ* 
FFCLNLSSTWDYSGLAPS 


10889 


24790 


A 


10979 


183 


3 


IKILFPFFFLRLSLALLPRLECSGTFSA 
HCNLCLLGSSDSAS * VAGITGTTTPSPM 
QLV 


10890 


24791 


A 


10980 


309 


55 


KNFGPNWKFLGGKGGEMAFLGKFFFPF 
FFFFEKESWFVFQG*MQWGDFRSLQGPP 
PGVTQISRLGKIGGPLFPKKKKKRERGN 
M 


10891 


24792 


A 


10981 


637 


838 


SQHLGRPMRVDQLRPGV*DQPGQHGEMP 
S LLR I QR * AGHGGTHL* SQ VLRRVRQDN 
CLNSGGGGCSE 


10892 


24793 


A 


10982 


275 


2 


RNRTLKME FLS W F WGFWNWLLNM IRPKN 
VKDSTSKSMENTDS PWHELFKELGKINA 
FDTPDSLLVRGKFSDSIHNTFDHM*RTK 
EYNEARA 


10893 


24794 


A 


10983 


98 


346 


GHGHATLRGLCVLTFS FH I TALS VSGTN 
DAEDCCLCETQKPICGYIERNLLYLLIK 
D VCR VPA W* WERVYSLI SR YS LWRD 


10894 


24795 


A 


10984 


30 


410 


LPEFTGRPKRTRTRGFSTNHTDIGTLYL 
IiFGA * AG WGTAVS LLI RAELGQ PGNIiL 
GNDHMYNAIVTAHAFEIIFFIVLPIIIG 
GFGN*LVPLIIGAPDMAFPRINNICF*L 
LPPSLLLLLASAIAE 


10895 


24796 


A 


10985 


141 


360 


QTLRTKMNENLFASFIAPTILGLPAAVL 
IILFPPLLIPTSKYLINNRLITTQQ*LI 
KLTSKQMITIHNTKGRT 


10896 


24797 


A 


10986 


3 


347 


HELRTKVNEHLIASFMGPTSLGLPAALL 
IIL*PPLLIPTSKYLISNRLITTQQ*LI 
KLrSNQMITIHNTKGRT*SLILESLIII 
I ATTNLLGLLPYS FTPTTQLS INLAMAI 
PL 


10897 


24798 


A 


10987 


46 


317 


KSMTP I RKINPL I KL I NHSL IDLPTPSN 
ISA* *NFGSLLGACLILQITTGLFLAMH 
YS PDA* TAFS S I AH I TRDVKYG * 1 1 RYL 
HANGAS 


10898 


24799 


A 


10988 


245 


1 


VS CLLEGKLTNRKD I HTKNPS VHHHHQR 
PKVDKTTKMGKKQNRKTGNSKKQTASPP 
PKK*SSSPATEQSWMENDFDELREE 


10899 


24800 


A 


10989 


255 


1 


S CLP WAN W I S VQNQHKKTFLV I RTFFP I 
GS FKVT* LLGPPLILX S PPRI FFFFETG 
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nucleotide 
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sequence 


Predict- 
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nucle- 
otide 
location 
correspon 
ding to 
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acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-AIanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIaIanine, 
G=Glycine, H=Histidine, i=Isoleucine, 
K=Lysine, L=Leucine, M=Metbionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














SFTVARAGVHSGAI SARCNLRIjLGSSNS 
A 


10900 


24801 


A 


10990 


166 


307 


GEKPWGPLIWGGLVGRSSWASCQVLELL 
* EMVFRHVAQAGLKLLSS SNP PASAS QS 
AGITGVSH 


10901 


24802 


A 


10991 


109 


330 


TNQFKTKKEREAGKKKVKELERERERGK 
KRKEQRKNEKERR* P*NMD*RRIRERNH 
SFDVYEFLI CGLLNLLHV 


10902 


24803 


A 


10992 


244 


326 


GRYLKGWLDGPA*AWLGTALSLLIRPE 
LGHPAILLGIDLISSVIVTAHAWSILLV 
RVI P 1 1 1 AG FGK * LVPL 1 1 GAPDMAF PR 
INSISF*LLPPSLL3 J LLPPPI*QMRAGV 
GEREVRVR 


10903 


24804 


A 


10993 


101 


2 


KTFWARFVGTCL*SQLLRRPRQEDHLRL 
GGRGC 


10904 


24805 


A 


10994 


177 


1 


TPSLLFFVN I I CFCLN PGGGG YSEQKLC 
LCTPAWVTE * NS I SRPSQNNSRQSRQVN 
HLF 


10905 


24806 


A 


10995 


214 


326 


KEERKNPRAIRW* PWGFFFPLFKEISL 
FLFVFRFWHGYPFPGAFFSTQKWLVF* T 
MVLFFFFFS * VSLLPRLECSGVISAHCN 
TFLPGSSDC 


10906 


24807 


A 


10996 


184 


379 


LCGMVYHFPS ILLLYL*CKLFFSPKLEF 
SSCCPLECNGTVLAHRSLRLPASGDSPT 
SASRVAGHGGTCL 


10907 


24808 


A 


10997 


136 


3 


GRVDGQHIMTHQSHAYHIVKPSP*PLTG 
ALS ALLMTSGIiAM * FH 


10908 


24809 


A 


10998 


274 


363 


PWA YAE P LTLHDAT YS CT * S L I RAS FTL 
FIA * TTRLRLL P * s FTP ttql cmnlama 
IPL*AVAPDIGYSSMITNALSHLLPQCT 
PTPLI S I LGI IETVSLLIQP I TLGVRRT 
ANIT* CHLLMHLIGSATLSISTI 


10909 


24810 


A 


10999 


305 


1 


NLHTTPSQIPLT* PLPNINIHNIiHLTTT 
Q * I K YL I L I P SNL FHNK PNL KTAYS VP P 
SPNLHLPS S SNS PASASQVAGNAGARHY 
AWLI FVFLVETGYSLLV 


10910 


24811 


A 


11000 


154 


2 


IWGILQSRRCGSRL*SQHFGRPRQADC 
FTPGVPDQEYFTLLPEWFFLV 


10911 


24812 


A 


11001 


142 


367 


GVCI>YQR I S I Y I KKTKKLSQGLF YFLLL 
LLLF* EGGSHSVTKLECSGGVSAHCNLC 
LLASSHPPTSSSQVAGTTG 


10912 


24813 


A 


11002 


60 


385 


GNTLLSHTLRAHL I ENLHAE F I DRT I LG 
LPAA*LIILFPPLLIPTF*YIiINIRLMA 
NRH*LIKLTSERMITIRNSIGRT*SIIL 
VSLI I T I ATANLLGLLPYS ITRTT 


10913 


24814 


A 


11003 


214 


83 


SKKSASPLTFNIVFQVLDNSVREEEIRY 
TEMEKEEI KLS * FVD 


10914 


24815 


A 


11004 


241 


368 


SYLL*PSP*PLTGALSALLMTSGLPMRF 
HFHS ITLL I LGLLTNTLT I YQ * WRDVTR 
ESTYQGHHTPPVQKGLRYGI ILFITLEV 
FFFARFF*AFYHYSIiAPTPQL*GHWPLT 
GINPLNA 


10915 . 


24816 


A 


11005 


3 


396 


HEPHALGMPLTADLPSMASCSQTSLLLL 
LHLLHLPWI *EARAYQAAGCSKLFDHAM 
LQAHRAHQLT IDT YQEVEET Y I PEDHKY 
SFLKDFQTSFCFSDSIPTPSNTEETYQK 
SNLELLRISLLLIESWLEP 


J0916 


24817 


A 


11006 


156 


335 


FGCCCCFLFF* L VY CCYVMMLY Y I YVLI 
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26 


Predicted 

beginning 

nucleotide 
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Predict- 
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nucle- 
otide 
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correspon 
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last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Pheny Alanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, MNMethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W^Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














SFGFYF*YWIM*YFYRLVYNIYLVLFYL 
LYMS 


10917 


24818 


A 


11007 


107 


2 


KCSSSKHFTKEDSQITNKHIEKCSS*LL 
VREMQII 


10918 


24819 


A 


11008 


129 


412 


KTRSGFKKTPGQKGKTPLLKKTPKITGP 
GGGALKPPFSGGFRPENPLNLRDKGSSG 
PKTPTFFS PWG KK* NFFQKKKKNLL Y PG 
KIFFFWMKNI 


10919 


24820 


A 


11009 


63 


268 


CTFKFKNHCSTIQSTT*LDRSEKKKKKK 
KKKKKKKKXKKKKGGPL* KNFSLKGG 


10920 


24821 


A 


11010 


154 


422 


NEFSFFFFFWKPKPPFFAQVGGQYRNWG 
*LKPRPPWLKNFSGFASGEAGITGGVPH 
PG* ILVF* VKAGFHIENLVWNSETGNQP 
ACASQ 


10921 


24822 


A 


11011 


23 


375 


TRYLTKIKKKKKKKKKKKKKKRGAPFLK 
TPWGAPFFPGSAIFIFFFFGGFF* TPLG 
FFWETLFFWGGKILGHLSPKILPFWGKK 
NFFWVKGGKNLLNS PFLKI FFLGVFFKK 
FFPPG 


10922 


24823 


A 


11012 


151 


361 


ACTTTPSSFFSNIDTVSLCCPG*VPGHN 
NSSLINFPSFLRSWYYRHAPPHPAHFFP 
ILIRSRYVAQAGLEFLALRDTPISDCPC 
TVKYPPSLAN*SVFFFFLTQSFVSLPRL 
KRK 


10923 


24824 


A 


11013 


146 


378 


NFWFSIEFLHICY* IYSFICLIKVFI** 
* KKKKKK KKKKKKKKKKKKKG ENLKGAG 
GGKKFLGGGKNNS FFRNGGCF 


10924 


24825 


A 


11014 


82 


264 


TELFLYT I CNW SAI LYILC* RNKVFLS F 
INTEKKKKKKKKKKKKKKKKKKKKKKKK 
KKKKKKKKI ERK I 


10925 


24826 


A 


11015 


381 


2 


GRTALFTLLKGPIHRPQMFFTPLRVFPG 
SYTGIS I RQSG FFF I KRVI FFRL* * KAL 
FLLPFLbPFFS FFLKIGT YS VAQECNGM 
I IDN*NFELLGRRNP STSASQ I AGNTS A 
Y*HTQLVFLCEDGI 


10926 


24827 


A 


11016 


1 


251 


PNCIVHSLRTGTLNFIILLLLCFTFE * F 
LNKNLIFESQKKKKKKKKKKKKKKKKKR 
GAP LKKT PGGPKF YPASKKK I S PQKGG 


10927 


24828 


A 


11017 


234 


1 


DINIKGSWVKGI PKPSVLSLQLFCNSKI 
FPN* KLIKKKNPNEPGVMVHVCNPSTSG 
GRKGQMA* GQEFKASLANMAKP 


10928 


24829 


A 


11018 


107 


342 


FQLCIGAIVHCFLFMKQHSEYKSTDHRA 
SSKCHTVQGSSQLSTSAFKQK* THKKKK 
KKKKKKKKKKKKKK * KKKGGAS F 


10929 


24830 


A 


11019 


264 


1 


CFKPLLFKYPGI IKAVNLPVSTASATFH 
KF* YVTFS FS * KKFLCVGGKTGSHS ITQ 
AECSGTIKAHCSLELPGPGDTPTPTSRV 
AARA 


10930 


24831 


A 


11020 


1 


288 


RTRGTSSRSRAATLFFFFPFKKGGFFNG 
NLPTTFGKLGNKKFPLNLFPGGPPLPHG 
GLKKGPGAPQVPPLFLKKPQTSPP*GLR 
EGPPPLKKFYLF 


10931 


24832 


A 


11021 


115 


287 


STCE I S FG CSRGCCWDYLMVW EFSHFYF 
* FLETECCS VA* AGVQWSNLSSLQSLHP 
E 


10932 


24833 


A 


11022 


186 


1 


RPKVDKTAKMGRDQSRKA*NFKNQKASS 
PPKEYNSSLAREQNWMENEFELT3VGFR 
KSVITH 
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C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threontne, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


10933 


24834 


A 


11023 


180 


353 


IFCFCIFSRDGVSVC*PGWSQSPELMIH 
* NTKI S * AWQRVPK IPATQEAEAGESLE 
PG 


10934 


24835 


A 


11024 


155 


1 


PQARRGGTCLQS * LLRRLRLRQEDRLS P 
G V* GYS E L * S C P CT P AWETE RLV 


10935 


24836 


A 


11025 


22 


325 


KLILLYDPQLTFIFLNTRAWAHCAFCC 
WPIHFVFGNYGFVFSCLFTFIFVYSGKA 
DLFFKPKKKKKKKKKKNKKKKKK* KKGG 
RGLKNSWGGPIFWGFPK 


10936 


24837 


A 


11027 


127 


259 


GQVQWLTPVIPTLWEAMVGGSTKLRSSR 
PFFSTP*PCIVMILYG 


10937 


24838 . 


A 


11028 


37 


408 


IASGALFFFFKAAGKRGDFKTEGAYQRQ 
RTI FKNKKRARREKTGRENLRGNYKNMG 
RGLKTPRGALGGPYLDKKGPLRGKGPFQ 
GGTLLGGGPKIKIQGTL*IRRNYWPYIR 
RYNRFKKRQKNM 


10938 


24839 


A 


11029 


146 


3 


LLVLKDLPRIMVRSAPTYIYIY7YLYIY 
IYIYTHTHTHTFIT*VLFC 


10939 


24840 


A 


11030 


268 


435 


LMCLRNETN* F * KVE * WLGAGAHTCNPK 
TLGG*GGRIA*GQEFETRLRNIVRP 


10940 


24841 


A 


1 103 J 


322 


2 


KLPHLQQQC VKVN C YKP FLKNGK I GHGG 
VCL*SQPLRRLRQENHSNPGQCTSAWVT 
QRDSVSKNKKKRKMEKYFSSVRQNKPVK 
HTKRMS PTKADTKSTHCGGGRV 


10941 


24842 


A 


U032 


251 


380 


HKRLHTI YYFVPLKKKMGPGLVGHTYNP 
STLGGQGGQIT*GRE 


10942 


24843 


A 


11033 


64 


324 


FWFFFFFFFFFLNPPGGGKKKKKPPPFL 
GKKKKRGKKGGEKFFSPPPRGGGGGPKK 
ISLKRVLKFF*NPPGGTKKGGGPFLIPP 
GSP 


10943 


24844 


A 


11034 


420 


534 


CPGWAHAYNPSTLGS*GRQITIDREIE 
TILANMAKPH 


10944 


24845 


A 


11035 


196 


1 


KNGHSLPPGPQKGTPFPKKKKTPKINLS 
ELGKSKAYSWPGTWHACNPSTLGG*GG 
WIN*GQEFE 


10945 


24846 


A 


11036 


10 


248 


PSDR * LFS TNHKD IGTL YLL FGA* AG VL 
GTALSLLI RAELGQPGNLLGNDH I YNVI 
VTAHAFVIIF 


10946 


24847 


A 


11037 


32 


405 


DYVS KRKE KREKKRNV I LETS I SSHLVE 
WMLCSRYHPLIKKRSSWILIALLCPQK 
VRDMS *GIGSKKQCWDSWLSIWIKMESD 
PFLIPYPKINA*RLKDLCERINLKIIIK 
REYLHDYRVKKV 


10947 


24848 


A 


11038 


337 


448 


KNPRGFFGKNPFFWGGPFGGPPPPKKMG 
FGEKKKF* FKRSKKKKKKKKKKKKKKKK 
KKKTG 


10948 


24849 


A 


11039 


307 


415 


RFVCSTI KVLRDLSSDRSNPGRVLSTSN 
SSLK*KKK 


10949 


24850 


A 


11040 


213 


403 


VHRGI KYYFLNKLPDFPRQTFKK* G* RL 
GAVAHACDPSTLGG*GGWITLGQTFETS 
LTRMGKS 


10950 


24851 | 


A 


11041 


104 


2 


SAFFF* ETESCSVAQAGVQWCDLSSLQL 
PPPGFK 


10951 


24852 


A 


11042 


177 


3 


AYTAYICVCIHIYLYKWICINVHVCK 
ITYLYIYKYLKYNAIYLHIGFFSFLHTI 
S 


10952 


24853 


A 


11043 


314 


395 


TVYILNLSINSVQSVLLSVFHRLRHQPR 
6RVQWLTPVI PALwEAQAGGS P * VR I SQ 
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nucleotide deletion, \=possible 
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IKTPA 


10953 


24854 


A 


11044 


310 


403 


RGKGGKKPRNPGGGKS *ALTKKKKKKKK 
KKKKKKKKPGGALKKKPRGGQKKTGGGK 
KKFLPKRGAKKKPPGDFGKKNQIGGGEK 
RGKTPPKKKKP*GKKKNLKGKGGKKTPK 
SGGGKKFPPRW* KKKLPPGG 


J 0954 


24855 


A 


}}046 


181 


3 


EGPLSIiSLLFFFF*RQISSLLLRLKYSG 
TI IAKCSFKFPSSRNPPTSASQAASTTG 
VHH 


10955 


24856 


A 


11047 


392 


41 


KE SR S LS QGGRE KG P F KFLAP PAP KFKR 
IFRPHPLKKLGPKGPPP3PS*ILSLKKK 
GGPPILARWFWNSCPQGFPPPGPPKRLG 
FKGGPPPPPLF*KKKPPFLGGENQKVKN 
FFFFF 


10956 


24857 


A 


11048 


147 


307 


TVIFVFLLRKGLTLLPRLESGMIMAHLK 
S * PPGLKQSSCHSLLSS WDYRRRRG 


10957 


24858 


A 


11049 


118 


1 


HGKIPSPLQ*HK* IAGYGSACLQSQLLR 
RLRWESRSISI 


10958 


24859 


A 


11050 


54 


284 


RKIRRGGLHLWSNLLGRLRWEDLLGRLR 
WEDRLSPRRGGCSEPRLCHCPPAWATE* 
DPVSKKKKKSLFVEKPQGGG 


10959 


24860 


A 


H051 


130 


340 


HNMHFAAHGSRINFDFFFFFFFFERGFP 
FCPPVGRAGTHFGLLEPFPSRV*KQYYW 
LG T VAHVCNP STLGGHVGES * GQE FKIS 
LANIVK 


10960 


24861 


A 


11052 


140 


337 


NI IMFFFFFFETVSFLLLGLEGKGTIWV 
N*NLCLPG*GDSPLGCS 


10961 


24862 


A 


11053 


197 


428 


IRSINNPTVASQYSSEWKSHIPLILNQN 
LDMIKFSEEGMLKAKIGLLRQTVGQWN 
AKEKFLKKNQSATPLS I* IRK 


10962 


24863 


A 


11055 


180 


1 


AI I ALLY S RLGD YARR FHLRRKKE Q WQG 
MVTHVYNPSTLGGQGGRTA*TQEFKSSL . 
GNIA 


10963 


24864 


A 


11056 


158 


426 


FFPLPLPLPTVSLFPRSPSDAEPKLDCT 
AAISAHCNLPA* FSCLSLPSACNCRRAP 
PRLTASASRGAGIADGVS FTQCSMVPRL 
ECSGV 


10964 


24865 


A 


11057 


220 


421 


YALHLNM KNNS YF * MR KK KKKKKKKKKK 
KKKKKKKKQNKKDPGGAVYKKI P 


10965 


24866 


A 


11058 


162 


464 


SCSVGLKLFSMKTSLALSYYCLLLLLTL 
H Y Y YY YY Y YY * SLVFM 


10966 


24867 


A 


11059 


1 ! 


134 


APENRVDPRVRKTLVPLILPIITTLANP 
CKKD* YP YYVKIS IAC 


10967 


24868 


A 


11060 


382 


273 


SSCL*PQLLRRLRQEDYLS*GA*GCNEL 
**OTCSPAWVTEQDPVSKNIHTYIHSQS 
INYGSM I GY I HEQKG 1 1 ADHKP 1 1 AE PE 
DSTYPRDHT 


10968 


24869 


A 


11061 


298 


378 


SCSASPCCSSRHGWSPCG*QTCLIiLAIA 
LRCLPWLREAGALHTGPGCRLFDHAMLQ 
AHKAHQLVIDTYQEIGENYIPQDQKNSF 
LLESHTSFCFSDSITTPSNMEETQQKSN 
LKLLRISLLLIETWLEPVRFLTRIVANN 


10969 


24870 


A 


11062 


39 


486 


RPTRPDAYHI VKPSP* PLTGALSALLMT 
S GLGMR FHFHS ITLL ILGLLTKTLTI YQ 
* WRDVTRES TYQGHHTP PGQKGLQ YR I M 
LFITSEVFFFAGFF*AFYHSSLAPTPQL 
GGHWPFTGITPLNPLEVPLLNTFGILAI 
GGSIT*AKK 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

etb 

od 


SEQ ID 
NO: in 
USSN 
09/535 7 l 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide 

r Mr 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenylaIanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N^Asparagine, P=Proline, 
Q—GIutamine, R—Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


10970 


24871 


A 


11063 


269 


429 


ATWQKLPLQIQKNYPGLGLVAHACNPST 
LGGQGGRI T*GQEFLNQPANMVNTH 


10971 


24872 


A 


11064 


248 


65 


I IEGQAQWLMPVI PALWKAEAGGSPEVG 
SSKPA* PCDPPASASQSAGITGVSHRTQ 
LILNS 


10972 


24873 


A 


11065 


176 


1 


LFP PPKJCVPPWGNTQFKS I CLYTTNFF F 
FETESRSVARLECRGTISPHCKF*LPGS 
RE 


10973 


24874 


A 


11066 


2 


397 


M VLWTAHLRAMAPG S RTWGLLT FALL CL 
P WLQEAGAVQTVP I S RLCDHAMLQAHRA 
HQLAI DTYQE FKETY I PKDHKYSLLNDS 
QTS FCDSDS I PTP YNMEETQQKTNLEVL 
RMSLLE IDS *LRPAQSNKR 


10974 


24875 


A 


11067 


172 


399 


Q I LYL * KYAYTVTMQHLSHP I L * KKKKK 
KKKKKKKKKKKKKKKKKKKKTGG 


10975 


24876 


A 


11068 


192 


3 


PKLTPMLKLIVPHI I LLPLT * LS KKH 1 1 
* INTTTHS I I I S I I PLLFFNQ I TSNLFS 
CSPTFC 


10976 . 


24877 


A 


11069 


204 


345 


DKSGEHSKTSSIQKNLRLGAVTHTCNPR 
TLGGQGRWTT* SQEFQTSP 


10977 


24878 


A 


11070 


3 


416 


HELPQPLRVLWTAHLGATAAGSRTSLLL 
DFALLCLPWVQEDGAGQT VP * S RLFDHA 
MLQAHRSHRLGIDTYQEWETYIPEDRK 
LSFPDDCHTYFCF*HSIPTPSHLGETLL 
TSNLE LLR I CLVL I DS WLE PAR I LTS 


10978 


24879 


A 


1 1071 


108 


2 


PSPFFFETESCSVT*AGVLWRDLCSLQA 
PLPGSSC 


10979 


24880 


A 


11072 


400 


278 


KVK * LRQENQWNS RGRDCS E P R S RHGTL 
PWTTEQDS I PKNKTKPYKI S 


10980 


24881 


A 


11073 


139 


3 


KLAGCGRAHLS FQLLRRLRREDHLSPGG 
*GYSEPWSRYCTPASRA j 


10981 


24882 


A 


11074 


17 


421 


DHV IGQPRWLMPVI LALWEAE PGG LLE P 
KVQDQPGQNCKSLSLLTIKKKRRENFPG 
RGGTQLCSQLLRRLRVGCCIDPKLHNCL 
PAWMTERDPTFKRE* KKNDVLEAKNFFT 
ILTFMNTTKEKMVAQRIGSLFHG 


10982 


24883 


A 


11075 


412 


1 


QFFFLGVCCPTQ I FGVGKKGE KKKKKRG 
FPPRVFFFFNPQIFSPFFFPGPFFFLGK 
FPPFFGERFLFFSKSKFFFPGVFKRSFP 
FPPKNFFFF*RIF*KFWFSPPKPFFFFF 
FFF*DRASLCHPGWSTVAQS*LTSC 


10983 


24884 


A 


11076 


279 


410 


VFCCVAGRDGAPGVKAHFPAAQKPWDGG 
FPGP*SAGVGPPPIPR 


10984 


24885 


A 


11077 


12 


386 


I AHLLLFS FYHKDTGTL YLLFGAGAGVL 
STALS LL I RAELGQ PG I LVGNDH I YNVI 
VTAHAFG I I FFI I I P I I IGCYGN* LATL 
1 1 GAPDMAFPR INNI S FGLLPTSLLLLL 
AYAIVDAGARTGW 


10985 


24886 


A 


11078 


412 


3 


LAILSLSLSLGFPYSLRYKNIKIRPINN 
PTMASKCSSKRKTRTSFTLNQNLEMLKL 
I EKSTAKAKRLKVRPLVPVSQWNAEEK 
FLKE I KSA I PLNI QM I R * QTS L I ADMEK 
VSVL* IKDPTSYNI PLSQS * IHSK 


10986 


24887 


A . 


11079 


78 


443 


RHRLPSDTCLTAAVP * PCKGSKKKKKKK 
KKKKKKKKKKKKKKKKKKKGGGG FKKN S 
GGGQNFAGGKKI FFPFFGGVFKNPLWI F 
WKKTFFWGGKISGQLSQKKLSLWGEKKI 
FWGCGGET3F 



1216 



WO 01/64835 



PCT/US01/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 
C=Cysteine, D=Aspartic Acid, 
E=G)utamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Iso leucine, 
K=JLysine, L=Leucine, M=jVlet bio nine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine > 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


10987 


24888 


A 


11080 


395 


2 


KRVPKKXPKKNSPPGGRKGGSPP*SPPF 
GGPDQPIF*VGGFDPPKLTPENPFFFKK 
KNN* PGKGGMPLILKSLGGWGKKKIFTP 
GGGLPKN*KETPPPPPGGKKKI PFPKKK 
KKKKKRAAARDLELVPTRP 


10988 


24889 


A 


1 1081 


24 


182 


TVS TTKCFShh I P YYGRKDME *KI*IIQ 
KKKKKKKKKKKKKKKKKKKKKKKKKRGG 
PP 


10989 


24890 


A 


11082 


173 


3 


NNVI CYILYI PS VPKYKTTLNMRGSAFF 
FFFFFIF*DGVSLCHLGWSAVARSWLTA 


10990 


24891 


A 


11083 


284 


391 


S VAHACNPGTLGG * GEQ I TRG WEFETSL 
TNMEKARL 


10991 


24892 


A 


11084 


257 


3 


IKKLREVNMLGWLYYMREEARPVDYVHW 
RGPEYSSLTKTIRNVIVKGHQHY* KASA 
GRGGSRL*SQHFGRPRRADYLKSGVRDQ 


10992 


24893 


A 


11085 


42 


171 


ARRMRAG I HRVSQEGLDLLTS * S AHLGL 
PKCWEDRREPLRLA 


10993 


24894 


A 


11086 


176 


325 


FKRS KKKKKKKKKKKKKKKKKKKRGGAL 
LKKF* FKRSKKKKKKKKKKKKKKKKKKK 
RGGALLKKFLGGPQISGGGKRFFFFF 


10994 


24895 


A 


11087 


67 


379 


KKKKKRKKPKTKKRTGFVRQNVRWGRSL 
SPGRAPAEGPQGGSTESPPLLVGRCLIL 
GLREPGVPPA&YGRFLRDYMNSIQGKLE 
KQRPQ*GLPSLTGAAELIKH 


10995 


24896 


A 


11088 


128 


326 


KKKSGGGVFFFI IGGGGHFFGGGRFFFF 
FFLGGFFFKLFFFF*EKFFFLGGNLVKH 
SSKKSSWVGW 


10996 


24897 


A 


11089 


309 


2 


KTHSRPGTVAHTCNLSTLGGQGEWIT* G 
QEFETSL 


10997 


24898 


A 


11090 


52 


347 


DHSLLELQMKLYASAQAAITKCHRMGGL 
NNTHFLFLFLSLFFFFLKQSLAIiPPRLK 
GRGPILVN*KLPLRGKRDSPASPPQCVG 
ITGLAHPPRLILFF 


10998 


24899 


A 


11091 


386 


48 


IHLQPPLQICTLKFFTOKNPHLFLTGCP 
KYALI FLCFGALVKTGFHHVGQAGLQLV 
PLCPDRLGLPKCSD*RREPPHPAKKQFF 
IDRVMLCYPGWCPALGINPSSPVGLPKR 


10999 


24900 


A 


11092 


164 


3 


VYSLLLRCPFS *ALTAKFINRCTYIHVC 
MYMCEPYAYTCLYTSTHIHVCMYIY 


11000 


24901 


A 


11093 


148 


325 


CLKLQPFGLGWAHACNPS T YF I SYQ * L 
SPVIPALWEAKTGGSPEVRSSRPARP 


11001 


24902 


A 


11094 


162 


3 


LLGLKRP I FFFWGGKFFFFFFFFFE I ES 
PSVAQAGVQW* CNLGSLQPASGRV 


11002 


24903 


A 


11095 


259 


358 


LRPGTVADTCN* GQARWL I PVI PALWEA 
KAGGSLEVGSLRLAWP 


11003 


24904 


A 


11096 


254 


2 


S WG F I PMEERERENR YWDRQQGHAI VSN 
IVINAFLYRYTEREKPGVSLSPRVECSG 
MI VAHCNLSLVGSSHS PAAAS * VAGTTG 


11004 


24905 


A 


11097 


132 


3 


KKKKRQPRWAHACNPSTLGGCGGW IT* 
GQKFRTSLVNMVKP 


11005 


24906 


A 


1 1098 


2 


455 


LNLEQL I EPLEVLWTAHLEALDPSSRTF 
VLLAFALL*LPWLQEAGAVQTVPLSRLF 
DHAMLQAHRAHQLAI DTYQEFEETYI P * 
DQKYSFLHDSQTYFCFSDS ITTP YSMEE 
TQQKSNLELLRIYLLLIESWLEPVQFLM 
SMFGNNLEYDT 


11006 


24907 


A 


11099 


2 


375 


QLPEPLMDLWTGDLVAMAPGSRTSLLLA 
FVLLWLP * LQEAGAGQTvPVSRi>FDRAM 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
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sequence 


M 

eth 

od 


SEQ ID 
NO: in 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AJanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIa!anine, 
G=Glycine, H=Histidine, l=lsoleucine, 
K=Lysine, I>=Leucine, M=Methionine, 
N=Asparagine, T=ProUne t 
Q=Glutamine, R=Arginine, S=Serine } 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














IjQAHRAHHLAI DTYQEFEET Y I PKDQKY 
SLLHDSQTSF* FSDYI PTPCSMEETQQK 
SNVELLR ICLhV 


11007 


24908 


A 


11100 


1 


359 


LQLPKPLTVLGTAHLPAMCLAYATALIL 
GCALLCLA*VE*ARAAQTVPQSNF*DHA 
MLQAHRAHQLAIDTYHEFEETYI PQDQK 
YSFLHDYQTSFCLSYSIATPSNMEETQQ 
KSKLELL 


11008 


24909 


A 


11101 


I 


370 


LPEPLTDLWTAHLVPMAPGSRTST..T.LAF 
ALLCLPWLQQAGADQTVP + S WLFDRAML 
QAHRAHQLAIRTYQE FEETY I PKDQKDS 
FLHDSQTSFCF * DS I ATPSNMGETPQKS 
NLELXjP I SLLL 


11009 


24910 


A 


11102 


2 


361 


AHLLAMAPGYRT YLLLAFALLCL PWLQE 
AGAVQTVP* SRLFDHAMLQAHRAHQLAI 
DTYQEFEETYIPKDQKYSFLHDSQTSFC 
FSDSI7TPCNMEETQQKSNL*LIRICLL 
LNESWMEP 


11010 


24911 


A 


11103 


8 


366 


PLRALVJTAHLAARAPGSRTSLLLAIALL 
CLPWLQEAGA* QTV? I SRLLDHAMLQAH 
RAHQLPIDTYHEFEKTYIPKDHKHSFLH 
DSQTSFCF*DSIATPSNMDETQQKSNLE 
LLRIFLL 


11011 


24912 


A 


11104 


3 


341 


EPLRDLRTAHLVAMAPGSRTYMIiLAFAL 
LCLP WL * EAGAAQTAPLFRLFDHAMLQA 
HRAKQLYIDTYQEFEETYIPQDQKYSFL 
HDS QTS FCFSDS I STPCNMEETHQRSNL 
E 


11012 


24913 


A 


11105 


240 


329 


RNWPGAEAHTYYPSTLGGRGRQIT*GRG 
FE 


11013 


24914 


A 


11106 


2 


316 


HEERERERERERESAIVRQIHTQGEGLT 
ELVFPYRRPAPSGLKFTFVYVQEHILGV 
GGNGLSPLQIGVILRDGRGLAHIRCGTR 
NKI LR I LKSQGRAPDLS * DLY 


11014 


24915 


A 


11107 


169 


1 


EDGHHSGLHSKQQKQPAWPSMVTHTCNP 
NTT/3GRGGWIT*GQEFETSLTNMEKPCL 


11015 


24916 


A 


11108 


192 


355 


GMSLTGHSNY I ILNLNFLWPCVVAHTCN 
PMTLGGQGGRFT * GQE FENNLAYRGK 


11016 


24917 


A 


11109 


145 


383 


SVNKDKGKYHVSQGIQSFSYVAYVYFRK 
LI YTMMT I VNSALLYTELV I CQGRRVLV 
FDPGF* FWFFETESCS VTQAGEQ* CDLG 


11017 


24918 


A 


11110 


499 


400 


TRAGFFPPFKKFLKGPGPGAPPFNLNFL 
GGRGGGFF* IPKKKI YPLLLPFLKVQKT 
PSKPPFFFFFFFLRQVLLCCPGWSAMAR 
SRLTRA 


11018 


24919 


A 


11111 


373 


177 


QFETSLGTMSGPSFYLKN+KSARCGHTC 
MRFQLLQRQSREDHLS SGVHRCSNP * LH 
RCTTTWVTK 


11019 


24920 


A 


11112 


204 


399 


NHFWDWYHFSSPFFFFFFEKNLCSVPR 
AEDQGMNLS *LNPLPPGLKQFFA* PSQE 
AGIIGPGHH v • 


11020 


24921 


A 


11113 


192 


16 


KKTENNKR * RGYGETETPAHCGRACKLG 
* PLWKPV*QFLKKLKMKLPYDPTIPLPA 
IFS 


11021 


24922 


A 


11114 


113 


369 


AGGQGPNLFLQKPGPPGLKGPPPLPLPG 
GGNYGGGPPSRG IF* FFEKKGFGG VPKV 
GPNLRG*RDLPAPPSKGAHPLGNGFFSR 
W 
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nucleotide 
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peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
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last amino 
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residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine } 
G=Glycine, H=Histidine, Mso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X^Unknown, *=Stop codon, /=possible 
nucleotide deletion, Y=possible 
nucleotide insertion 


11022 


24923 


A 


11115 


250 


387 


ILCDVEHSLWPGVEVHAYNPSTLGGQDG 
RIA*AQEPETSLNNIARL 


11023 


24924 


A 


11116 


192 


1 


VCD L FW KP S PHALTRRRTS V FHLKR * FS 
WLGAVAHTCNPSTLGG*GFWRIT*AQEP 
ETSLGNMV 


11024 


24925 


A 


11117 


289 


202 


VFYCQ* KFIRKKKKKKKKKKKKKKKKKK 
KKLSKKIKKMDQQKLRSPTI K 


11025 


24926 


A 


11118 


412 


421 


RA*PTEQVTLGITEQSYSRGHINNRVHD 
LDVGS *HPDGAAAIKGSFVQRLKGY 


11026 


24927 


A 


11119 


16 


472 


DRLKPSATHAFADAYHIGKANP* PLTGA 
LLTLLMTSGLTM*VHLHS1TL1 J ILGLLT 
NTLTIYQRWREVTRESTYQGQHTPPVQK 
GLRYGIILFITSEALFFAGFF*AFYKSS 
LAPTPQLGGHWPPTGITPLNPLEDPLLN 
TSVLLASRVSIT 


11027 


24928 


A 


11120 


304 


346 


GLWHATG I TP * PLTGS F* LLLMTSSLAN 
* FHYHS I TLL I LS LLSSTLT I YH* WRE V 
TRES TYQGHHTP PVQKGLR YG I ILFITS 
EAVFLARFC*AFYHSSLAPTPQL*GLWH 
ATGITPLNPL 


11028 


24929 


A 


11121 


174 


420 


FISALLKSAIQRNNAMFSTYYALTCTVG 
TAVS WSSVS * PQRNMMMS * VHAPAMAGR 
LVCHYJRVYRRFEHVCKRAVHGQYP 


11029 


24930 


A 


11122 


299 


466 . 


I FI LHE I CITFFFETRHFCH * AEVHWCS 
LGLLKPPPPGLRCDPSTSPSRVAGTRPS 


11030 


24931 


A 


11123 


138 


1 


NPFNEKNPPPKGPGPKNAFFFFFFEMES 
CSVAQARVQ*HDLGSLQP 


11031 


24932 


A 


11124 


241 


499 


NQRERS *HLYLLYNTVCLTHGI>PILI I Y 
SAYQLAIYDLLLRRDALARIiQADLADRR 
RGLQPKKRFAFKTRRKDAASS TKVDAAP 
GI 


11032 


24933 


A 


11125 


252 


391 


PLSLDLQSNASLSHFTSPPLMFADR*LF 
STNHKNI *TLYLLFGAWA 


11033 


24934 


A 


11126 


45 


459 


AFLKNHPLET*SNHSFIDLPTLSNISA* 
GNFG * LLGACLI LQ I TTGLFLAMHYS PD 
ASTAFSS IAHITRDVNYG* I IRYLHANG 
AS IFF I CLFLHSGRGLSYGAFLYSQT*N 
IGIILLLATIATAFIGYVLP*GQISF 


11034 


24935 


A 


11127 


109 


301 


QDSLMRKKSKNRSPSHER*RSKCLERKR 
SGDKKKKKGGPPKKKIGGFPFGGQMELK 
PFSILGGF 


11035 


24936 


A 


11128 


322 


2 


S CHH VS S LGDRAR P CLQKE K KKER KKVG 
DFNTPLS IMDGYQHWYQARKI I IMDGTT 
K* KITRK* KIRNQ * DLTDVYRTLHPR VK 
YTFLSCAYGTFSRTHRIRGQAI 


11036 


24937 


A 


11129 


182 


2 


FLPRGFQPPPWVPGPLGFLFPPPSSRD 
PGFPSMPFFFFFFFETESCSVS * AGEQW 
RNLG 


11037 


24938 


A 


11130 


138 


1 


GWVFFLWFSFSFLFFFLR*GLSLSPTL 
ECSGAITAHCSLKLPGLS 


11038 


24939 


A 


11131 


304 


83 


WKKTDFPRKGRNSASRLPSDSNCNSSLS 
LQGLS LL YR F WTYHAVTGKGSQS R PQEQ 
VLGYHARKNS R * VHRVK 


11039 


24940 


A 


11132 


247 


225 


KNFWGGPPLRGGKKNFFFFFGGGYKK*L 
LLCXVNMYI*CV*YI*HKRIISDPKKKK 
KKKKKKKKKKKKKKKKKKKKKKKKKKKK 
K 


11040 


24941 


A 


11133 


242 


3 


KKKKI FFPPRYFWAPPYFFFLGPPPFFF 
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E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Pro1ine, 
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W^Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














FFFFFFFFFFFFFFFFFFFVF* TQFHLS 
PRLECSGTISAHYNLHLPGPSDC 


11041 


24942 


A 


11134 


361 


2 


ASQS AG I TGVSHRTRPTHGFKDT* RKPA 
EPEVTIH* 1*1 TLMSCNIESPEKVCADL 
IRGAKEKNLKVKRPAGTPIKASRITTRK 
TPCEGSNTRDRFQMRIHKWLTDLHSPFD 
GRVGGRV 


11042 


24943 


A 


11135 


1 


370 


IRQTFFQKREIVGIILCYVJNVGYYSQLH 
* SS RMLW I FH F YRVY I SVKV I LVI LVPH 
EKKKKKKKKKKKKKKKKKKD 


11043 


24944 


A 


11136 


273 


2 


GGRAGQQILFCRGIWLGPPRSCQCTFRS 
PPGLAL* PWECFSLPPALKIRFPSFLPF 
F FEME FSRLSP RLE CNGVTVAHCNLRLP 
GSSNFP 


11044 


24945 


A 


11137 


130 


354 


GQLNKLSGPYPENVGYTLPVLINPLAQP 
VIYS T I FAGTLI TALSSK * FFT * VGLE I 
NMLAFI PVLTKKKKKGGPL 


1 1045 


24946 


A 


11138 


366 


2 


LVEMGFLRVRQNGLYLLTS* SARLGLSK 
CWDYRREPPCPASDWVFILTSPLIHALD 
GKEHTHTHTHTHTHTGLG I CQS S LGKQS 
GGWGWLS ANRGQFS P FAVCLWS FLPEV 
PWTSALFT 


11046 


24947 


A 


11139 


228 


1 


CRAGRGL PAAVL 1 ILFPPLLIPTSKYLI 
NNRL I TTQQ * L I KLTS KQM I A I PNTKGR 
T* S L I LVS L 1 1 FKATTNLLG 


1 \ C\A~1 




A 


1 1 1 Art 


1 ^7 


j j 1 


UTPTPPHTTfPFR PQ TRHTtfClT YYJ .KRYf 
nxr 1 tr Jrn 1 lV-rljivrol rvriiv O -I X i J_jrvx\ X V-. • 

LLR 1 1 LS LLPLQTS ? * PLTGALSLLLMT 
SGLSM * S H LH S I T LLAIjRLLT Y TLT I YQ 
*WRDVSRHSTYQRHHTPFVQKGLRYGI I 
VFTTS KD I VFSGL F * AFYHSRLAPTTQL 
♦CHWPPSGITPLN 


J Jirto 


94Q4Q 


A 




1 ftft 
1 oo 


j 


IPTPGGPPPLPPQKVLFKPGNPRAPPGV 
FFFFFETDSRSWRLECSGVI * AH*NLH 
FRGSR 


11049 


24950 


A 


11142 


224 


3 


SVSSPCISOTKKVDVRSSALLQESTSSR 
EVMTGHFHYL 1 1 YET* LRSLAQSARLAC 
SGTISAHCNLRLPGSSN 


1 1050 


24951 


A 


11143 


262 


371 


S YCG PG WAHAYNPSTLGGRGRR I T* GQ 
EFETSLTN 


11051 


24952 


A 


11144 


244 


379 


TIGKQRCHAGIEEVSDSSQAPWLKPVIP 
AL*EAEAGGSLEPRSLR 






A 




1 J7 / 


1 JU J 


DRLESLLEMHI PGVYPNQWNTNFYLF I Y 
FEAESHSVAQTGLQ*RHLGSLQLPPPQV 


11053 


24954 


A 


11146 


284 


1 


GRGRQGGDSSGKGEGNRAMDI TGGLVGK 
GQGAGEAFVEVWEKELRGFKVEGEGDMD 
KERWD*GL*GGRERERERERERERERE 
RERERERESLV 


11054 


24955 


A 


11147 


64 


286 


THALAWTRNQRIGSGHGAMCLYSHLLRK 
LRQNDHLTPGV* KCLEL* LHH * TPTWTT 
XPTAKAIFFNLKKKDSWD 


1 1 UJJ 




A 


1 1 148 






VSVSLIIFIATTNLLGLLPYSFTPTTQL 
S INLSMAI PL* AGAV 


31056 


24957 


A 


11349 


81 


367 


QTLRTKMNKNWASL I DPTI LGLPAAEL 
1 1 LFP PLL I PS CKDVINNRLMTTQQ* L I 
ELTWEQMITIHKTKGRT*CLILECLIMV 
IATTKLLGLLP 


11057 


24958 


A 


31150 


257 


2 


SVCAVITKYLRLGCFADAADTQRPLPPA 
7VKLTMFFN I ATHGE PLGCI F*I>F VDKV 
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PKTAENFHALSTGEK* FSYKGSCFHRIL 
P 


11058 


24959 


A 


11151 


257 


1 


WRCGEKETLIRCWWECKLVQSLGRTVWR 
FVRELKLELPFNPAI PLLGIHPKEKK* L 
DPCTCMFIVALVTIAKMWNQPKCPSMDL 
V 


11059 


24960 


A 


11152 


17 


367 


NHKDGKKTEEKKQSTS PPQKDCSSSPAM 
EQSWTENDFDELREEGFRRSNYELQEEI 
QTKGKEVKNFEKN*DECITRITNTENCL 
KELMELKAKARELCEECRSLSSRCDQLE 
ERVSA 


11060 


24961 


A 


11153 


213 


2 


TFCHFFFFFETESGSVAQPGVRWCDLGS 
LQAPLPGFTPFSWRQRLQ* AE I VPLHSR 
LKLHFKKKAGVFSC 


11061 


24962 


A 


11154 


1 


347 


GTSNHFPFLYTYQVHNSTDYPRNRCCLN 
PSLRFHTSSKPLPARPHIMTHQSHAYHI 
VTCPS P * PLTGALLALLMTSGLAM * FHFH 
SITLLILCLLTNTLS I YQ* WRDVTREST 
YQG 


11062 


24963 


A 


11155 


223 


3 


YSTNLKILRNHFILFF*DRVLLCLPGWS 
AVASSRLTANSASCLSLPSSWDYRRAPP 
HPANFCTDRFHKVGQVG 


11063 


24964 


A 


11156 


143 


1 


RGFLNQCPSQNFPFFFFETESCSVTQAG 
MKWYDLS *MQPPPPRFKLV 


11064 


24965 


A 


11157 


149 


355 


LQPLPPGFKRFSCLRLLTSWDYRKDVDS 
ALLSNYVT*KYISKTLCRHQSPEIVREF 
L T AM KS H KLTKVG 


11065 


24966 


A 


11158 


156 


370 


RSDGCQKIPRNMLGGWSVLHHTVDEQ*G 
PNRYI KKCSASLI IREMHI INTVRRCLT 
PATMATVRR KTTS VG 


11066 


24967 


A 


11159 


298 


4 


HFQWIMWRQLVKGRLQAQKNAAFTDSH 
THLCYLKCVYFCVCVCMCICVFVCV*SR 
AL* * S LYLFLC I KAVKAGHS VSGLESQH 
YGRLRQADHLRSGV 


11067 


24968 


A 


11160 


364 


285 


NSWGKIKILKGK* PPPPKKIFFPKYPQS 
FFLSPP*KKKTNFPHPRKYLSPPGIFLK 
APPPLIFFFFFFFFFFFFFFFFFWG 


11068 


24969 


A 


11161 


198 


406 


KS VYMTLKKKRGS FHERGDVPAI CNFFF 
* IHIYMCVYIHICVYIYI YTKI YI YTHT 
QIYIYIYIYIYIY 


11069 


24970 


A 


11162 


95 


376 


KI ITKHNIARTNP YTFCIMN*LE I TLQO 
EPKLR PP KPDKKKKKKKKKKKKKKXKKK 
KKKKKKKKKKKKKKRGGGGFKKKPGGGP 
KIRGGEKIFFFFLGGD 


11070 


24971 


A 


11163 


343 


391 


WR*LSSTNHNDILTLYIjLFRARAGVLRS 
AVSLL I IiAEQGQPGNLLGDDHI YS AI DP 
AHAFAIMFFIVIPMIIGGFDN*LIaAVRN 
GAPEMAIPHINNISC*RLPPSLLLLLAS 
AI VEAGAGTG* TAYPPLE 


11071 


24972 


A 


11164 


1 


158 


QPMTPNRGPLSP* PLTGALSALLKTSGL 
AM* FHFHS I TLLILGLLTNTLT I YQ* WR 
DVTRES 


11072 


24973 


A 


11165 


265 


466 


KLLTSLSFFNLLRETNQVGLLELSQAKY 
KKTPRLGTVAHVCNPSTLGGQGGWIT*A 
QEFETSLANMA 


11073 


24974 


A 


11166 


204 


458 


KRCLLPLPPDSNTQSSQCRREQIGRAIR 
KI LTFETQNI KRLKTEKQLIiKLLE *SGK 
I TAHCNLDL PGSS S PPTS ASQS AG I TAG 
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s 


11074 


24975 


A 


11167 


256 


3 


KILFFKPGVLSPPWGGVNFFQLMPFQGG 
PPKKKRGPLSPPKKGPFFFFFFFF*ESR 
SLLPRLECSGAISTHCNLCRLGSSRTRG 


11075 


24976 


A 


11168 


1 


457. 


HRTPSVRTPNCRGDPRVRSTNHKDIGTL 
YLLFGA*AGVLSTALTLLIRAELGQPGN 
IjLGNDHI YNVIGTAHAFVI I FFI VIPI I 
IGGFGN*LVPLIIGAPDMAFPRINNISC 
*LLPPSLLLLRASAIVEAGAGTG*TVYP 
PLAGNYSHPGAS 


11076 


24977 


A 


11169 


343 


428 


GR*LFSTNHKDIGLLYLLFGA*AGGLST 
ALSLLIRAELGQPGNLLGNDHIYNVIGT 
GHAFVI I FFIEI PI I IGGFGN*LVPLI I 
GAPDMAFPRINNI S F * LLPTSLLLLLAS 
GIGEAGAGTG*TVYPPIAGNYSHPGAYV 
DLT 


11077 


24978 


A 


11170 


91 


2 


PRQDPSFFFFFF*TESGSVAQAGVQWCD 
L 


11078 


24979 


A 


11171 


219 


I 


IFKSKILCLPPPEPNFFFFFFFFFFFFF 
FFMRHGGACCSPSYLGG*GERMT*ASRV 
AGTTGVCHYAWLI FVFL 


11079 


24980 


A 


11172 


292 


365 


RI *SLGQAQWLTPVI PTLWEAKVG 


11080 


24981 


A 


11173 


251 


1 


I CDLQI LTHGITTCVKMGLTTLFVSGI T 
FLIHHSLAI KFGV* KNPS *LGTWHTCN 
LSS *GGRGGRIA* AREFETCLGNI VRP 


11081 


24982 


A 


11174 . 


200 


407 


KI FETDVYVKCSNIXrCLWAS ISFSNYFG 
GPFYLNL* * FLSYFS FLETESHI VAPLE 
CSGM I LAHCNLCL 


1 1082 


24983 


A 


11175 


11 


155 


STNFFLFSETGSGSVLQAGVQWPGLDLL 
ASSHSPASAS*VSGTTVDAA 


11083 


24984 


A 


11176 


140 


1 


PSTFPDWAMRRVSCLSKGDKGQV*WLT 
PVIPAIWEAKAGGSPESI 


11084 


24985 


A 


11177 


210 


1 


TDEELLLMDEQRK* FIEMESTPDEDAVN 
IVEMPMKDIiEYYINWD*WAGLKRTDC 
SFBRSSTVGKMLSN 


11085 


24986 


A 


11178 


133 


148 


STSTR* LFSTSRKDIGALHLLFGA*AGV 
LGTAVSLL I RAWLAQLGSNKEA 


11086 


24987 


A 


11179 


236 


] 


KFWKKDPLTKGGCPLKKKGGGGI FSKKK 
KNVFFPKKKFKGGKKKNFFFFKKKKKKK 
KKKKKKNKE * LF I S FAHIAVHL 


11087 


24988 


A 


11180 


210 


1 


QSLGQENLQVFFPLFGP* PNYLDKGEKW 
IRVGKIWGHLFFFFETESRSVAQSPRLE 
CSGAIAAHCNLCLP 


11088 


24989 


A 


11181 


292 


348 


KKGALFFSPGG*KKNFPPGGKKKGPLFS 
KKKKNPP * KKPGEKEKTRFPPFLGGIiGG 
GFPLAPGGKPPRNRVLAPPPNPGGKKKK 
KTPFSQKKKKKKKKP 


11089 


24990 


A 


11182 


168 


3 


KPEASMLRMDFFFF* eagshsvahagvq 
WYYCSSLQPPTPELSDPLAPSSKVGGIT 


11090 


24991 


A 


11183 


87 


351 


KGIKWSLNIQRGVQPHLRSYNYSFFSFF 
S *DWVLLRLECSGTTTAHRSLNIiPGSSD 
PPTS ASQAAGTTGMHCHTRLI F * ETGFH 
HAVQ ! 


11091 


24992 


A 


11184 


55 


344 


TSLNLCFFFFFWGEKSPFLPPGGDEGPL 
FGQLDTRPWGIEKIFFAPPP*KGKRGVP 
PGGEEFKNPPAPKGEPPLS*KK*NQPGQ 
GAP PCYP P FLGGGAKKI FS I PQGRVSN * 
P KRGPS S PPGGKKGDFS PQKKKKKQRFR 
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EV**PNSPSGAP 


11092 


24993 


A 


11185 


344 


190 


LVETRFHRVSQDGLNHLLTL* STRLGLP 
KCWDYRCEPPRPANTANSLNFPM 


11093 


24994 


A 


11186 


135 


3 


IFPKVFGGPFFFFFFF*ETESRSVAQVG 
VQWCNLGSLQAPPPGFT 


11094 


24995 


A 


11187 


305 


14 


IMLITSIRRSRLSSTAEKGILSRLRICG 
RVAKYLINNRL I TTRE * LI KLTSKQM IT 
ILNTKGRT*SLILVSLSVVYGYHLF*G* 
FD*SLLLY 


1 1095 


24996 


A 


11188 


198 


1 


LEQTFNMMFPPNKS I IS I PSFGFLFKI S 
FPFLLFFF*DEVFLLLSRLECNGTILAH 
CNLCLPGSSD 


11096 


24997 


A 


11189 


352 


3 


QLCNTPLPKTNHAMLTS SNANTLI YRRS 
AIAAASSPPILFPASKILLNSRLITPHQ 
*LI KLASKQMIAI PNTKGRT* SLILVSL 
IICFEVSLISHCWWISRLLMRYLEVGI 
NRGG 


11097 


24998 


A ' 


11190 


270 


137 


CIINRLWCVYIYIYIYIYI*IYFYMYV 
YNILCILGHRRESGVI 


11098 


24999 


A 


11191 


38 


342 


ACRKRKMSSKFALRATGNCYYSCTFCSI 
SFPHPK*MPCYQKKKKKKKKKKKKKKKK 
KITV* K* SKTKKNI KGLGAR* ELS LA* L 
LGEP P I FAKGHPLMLKWVEI CKRP 


11099 


25000 


A 


11192 


169 


3 


ERSG I IRVYQD *NQCLAWSLA* KI IHPW 
LAAVAHTCNPSPSGGRHGWIT*GQEFK 


11100 


25001 


A 


1 1 193 


407 


1 


KRACFR WLTLL I STLWEAKQENCLNAG V 
HDQPGQHGETPS PQKI KKSAWHGGPRL * 
EVRWED * LSQGG * G YSEPCSHHCTPAWA 
TERDPV*KQKRS IPWKELASERKDSGRC 
RWLVREMWPSCQASVIQVRRI SF 


11101 


25002 


A 


11194 


110 


412 


VCGFRLLHSPAI LMPLLMPS Y FRRHSNT. 
E I KP I SNPTVAS KRASERKS CTSLTLNQ 
KLEM I KLSEEAT * KAEVGQKLGLLHQKV 
SQWNAKEEFLKE I ESA 


11102 


25003 


A 


11195 


34 


155 


AWHEGMHHVSQDGLNLLTL* DACLGLPK 
SWDYRRKPPCPA 


11103 


25004 


A 


11196 


289 


453 


NQPGKS FM I SST FHFMME FTLVHAGV* W 
LDLGLLQPPPPRFKGFSCLHLPSGWDR 


1 1 104 


25005 


A 


11197 


177 


432 


KWEKQAQTLGGGNFPKGPKTRVGKNFEK 
LKGNPPFLCL*KRGRKGDTGPWKKPAPS 
GERG I PGKKGGKTGPPGFKGPMEKIKRG 

K 


13105 


25006 


A 


11198 


139 


3 


AMVSLSPRLECSSMIMAHCSLGSLGSSD 
P PTS AP * VAR I TGMHHP 


11106 


25007 


A 


11199 


150 


492 


GDPL PGHH P V * E VRR P FARQL P RLRS E E 
PLRPVATPSGK*GASLSGSHPIQEGGGG 
GQPLPGHLPHPGWRSGVKPPPGQSARPG 
GEGHLLPATPTGK*GGPQSGQPHLSGGR 
RE 


11107 


25008 


A 


11200 


239 


64 


YQKYYYYYFF*ETESRSVARVKSSGLIS 
AHCNLHLPGSRLQLCLFPTSAVFGQESH 
IT 


31108 


25009 


A 


11201 


31 


396 


ILTMREIVHIQAGQCGNQIGAKFWEVIT 
DEHGMDPTGTYKGDSDLQLDRISVYYNE 
ATGGKYVPRAILVDL* PGTMDS VRSGP F 
GQI FR PDNF VFGQS GAGNNWAKCH YTKG 
AELGDSVLDV 


11109 


25010 


A 


11202 


85 


407 


VLLPKS EALEGADTMPHS YPALSAEQKK 
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BLSDIALQIVSPGKGILAADECVGSMAK 
RLNQIGVENTQENRRLYRQVLFNADDRA 
KKRIGGVIFFHETL+QKDDDGVT 


11110 


25011 


A 


11203 


302 


115 


THLDHVAI YLS I YLS I YLS I YHLS I YHL 
S I YLCR F* LID* DRVSLYFS GWLQMPGL 
KPSSCF 


11111 


25012 


A 


11204 


130 


348 


GNAYGGPCS VLVQRT* * I PALTFPSTCL 
DS KFQRDLVTLRTD FSNFTSNTVAE I QA 
LTSQGESGLDGALGWGQ 


11112 


25013 


A 


11205 


232 


62 


ISARNEGFNTLN*RTFFFFFFSSHCLSR 
LECGGMILAHCNLQLPGESLEPGRQSLQ 


11113 


25014 


A 


11206 


161 


2 


KKKGFLLPL I E PFCIQE I QLRLGAVAHT 
YNPSTLGGRGGWIT*GQEFKTSLAN [ 


11114 


25015 


A 


H207 


251 


3 


PNHLPKAPSPNTTLRIRFQHLNFRGAQQ 
TFGLQHKLTSKQMITIHNTKGRT*SLIL 
VSLIIF I ATTNLLGLLPHS FTPTTQL 


11115 


25016 


A 


11208 


145 


287 


GFRLKI>* INQKYSFLKKKKKKKKKKKKK 
KKKKiCKKKKKKKKKKKKKKKKKKKKKKK 
KKKKK 


11116 


25017 


A 


II 209 


436 


110 


ISGRGEQKFRYFSPAPFFRGINPPPLGG 
E *NQNLKPP * KKI FLGS PKVPKKKGGPQ 
KILKKG PGGGG P PLFPP PLGGQGGGF PF 
SQGFKTPPGKJCKKPPPKKKKKKKK 


11117 


25018 


A 


11210 


217 


2 


DKWS VLHI N YN I LS SSLVHVLFKYLEN * 
K* KKNLWPGMVAHACNTNTLGGQGGR IV 
*AQEFETSMGNTVKC 


11118 


25019 


A 


11211 


146 


3 


QKVSHAQYRCLSWGYFFFILFLETRSRS 
IAQAGVQWRDLGSL* PPPA 


11119 


25020 


A 


11212 


187 


384 


GLICFYLTCFYFIFYFILFYFSLLCYVM 
LCYVLCYVMLC YVMVCYVMLCYVI *DRV 
SFCHPDWSAA 


11120 


25021 


A 


11213 


373 


482 


MI KLRHRLGTVAHTCNFS TLGGRGQWI A 
*GQEYKTS 


11121 


25022 


A 


11214 


64 


362 


ITGVSHFTQPLPFIGGLALSPKLECRGM 
I IAYCNFELFGSGDSCHLSLPSSWNYRY 
MPCMPS *LKKNF* * RQGLTVLSRLEKDY 
LL*DFSALRSFPWGR 


11122 


25023 


A 


11215 


181 


368 


CDSAVPLLDIYPTEFKADLKKIRIPMFI 
IALFAVAKR*KQPTYPSIDKWIXKLWAG 
CGGSRL 


11123 


25024 


A 


11216 


268 


369 


TTRPDMWHTYNSSTSGGQGGTIT*AQQ 
FKTSLT 


11124 


25025 


A 


11217 


90 


3 


I GQ AQ * LTP 1 1 PALWEPKSGGS PEVRS L 


1 1125 


25026 


A 


11219 


277 


272 


SLTAPGL*SQLLRRLRWEDHLSLGQSSF 
S EP * S C P CTPAWVTQ * DS VSKKKEKKKE 
KKRKEKEKVNQTPKATESPGQALIGSHS 
LQVQKRVYAHP I L 


11126 


25027 


A 


11220 


19 


343 


LELEVEPEDVSELLQSLDKNYLSCFLKR 
FLEMKYT PGEDWNT I KMTTEDLKYNTD 
LVDKAAAGIiERINFNFEGSSSVGKMLSN 
SIAWYSEIFYERQSQFMWQT*LLA 


11127 


25028 


A 


11221 


266 


356 


FFEMESHTVA*AGVQWCDLGLLQSLPPG 
FK 


11128 


25029 


A 


11222 


105 


2 


RFVCSTI KVLRDLSSARSNPGRFI *TSN 
SRPRKS 


11129 


25030 


A 


11223 


145 


350 


RKGRQLliD* DLGWV * WLTPI I P I L WEAE 
LGGS LEARSSRPAW 


31130 


25032 


A 


3 2225 


355 


348 


PLFF FCFHRNTFY * K5 VCRFPKKKKKKK 
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Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Pheny)alanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KKKKKKKKKKKKKKMEGGGGGANTKFIiY 
GENRKPPLFFFF 


1 J 133 


25032 


A 


11226 


271 


3 


NFLS VFNFTR I CLGMCFLS I FLL I FGGS 
LKYQLFLAQGFFFFLHYFFTSSIHSSLL 
Y*MLDFLDLFFLYLFFCNFNFSVFLFCI 
LSKFF 


11132 


25033 


A 


11227 


3 


413 


S PAE PHRS YTMWNVKVGVNGFGR I ERLV 
TRAAFNSGKVD I VA INDP F I DLNYMVYM 
FQYDSTHGKFHGTVKAENAKLVINGNP I 
TIFHERDSCKIKRGDAGAEYW* STGVF 
TTMDNAGAHLHG * ATRGI I SAPS AD 


11133 


25034 


A 


11228 


48 


413 


VFVLVSCGFFFFFFWEKKICFFWPGGKK 
RGK I GGTGTFG P KG KGNFRG P P P KNKEK 
GGPPPPPNNFGFLKKKKFYQGGQGG*KT 
RNFGKPPAPNP P KGGDLGGG P PTQGQNG 
FLKKKKKQKK 


11134 


25035 


A 


11229 


172 


404 


SFEKSCSPGWQRRGVMIPGVTVEDMNQQ 
EFIRALVAtiLKKSRKLKVPEWVDTTKLA 
KRKELTL * DEKWF YT * AAS TT 


11135 


25036 


A 


11230 


247 


2 


QPKKKSVS KKKKKELPCDPAI PLPGIYP 
XEKKS VYKKDNLHLRVYCRAVFT I AKI W 
NQPKC* SGDK* IKKI * YIYTVDLRR 


11136 


25037 


A 


11231 


1 


372 


TETTVEVAWCELQDRKLTKSERQRFKEE 
AEMLKGLQH PN I VR F YDS WE S T VKGKKC 
I VLVTELMTSGTLKT* VHQYYKS * PRSM 
REFDFLNFRSTSFTLWQPKKGTVLPTV 
TVLAYFLAHKVL 


11137 


25038 


A 


11232 


311 


464 


KKATPRNLCKWPGTVAHACNPSTLEGRD 
GWIT*GQEFETSLTNMVPP 


11138 


25039 


A 


11233 


64 


418 


VPVASRRRRRCGRVGGGKAMADLDKLN I 
DSI IQRLLEA*GAKPGKKVQLHENEITG 
LCLEPRE I FLRQP I LL * LVAPLKI CGD I 
HGPYYDLLRLFEYRGLPPKSNYLFLEDY 
VDRRQH 


11139 


25040 


A 


11234 


233 


421 


PTTSNMRSRLLRGAS S * KKTAG PQQRNL 
EPALPRRWGGRSADNPPSGSLRKSGKNM 
QKTPGTA 


11140 


25041 


A 


11235 


343 


1 


NNTHGLSHSSVGRNPGAVGSGSCMSGIK 
LSEGLRSLLEAVCISRAAVIKCHSWVAS 
NNRHFFSPSSGG*KSEAKVWAGPCSL*R 
RVPSLPLPASGGSRHCLVCGCVTPFSAF 
TW 


11141 


25042 


A 


11236 


122 


419 


RTPRGPKPNPGLQRKTFFLIGGRIKTYL 
GIFGKDLYFWGGKNWDTLLPKI*RSREN 
KNF*REKWVNLPPYPGCLKNGARQQHFA 
HLLILRDTKTYMLAF 


11142 


25043 


A 


11237 


1 


379 


AFNHLHAGHGLSGAAMKSLVLLLCLAQL 
WGCHS S PHG P WL I YRQPNCDDP * TEEAS 
LEAIDYINQNLPWGYKHTLNHIDEVKVW 
PQ*PSGELFDIEIDTLETTCHVLDPTPV 
A* CNVRQLNEHAVQ 


11143 


25044 


A 


11238 


1 


397 


QTMTPNRGPLCPSKDLRSSHVI SLPLHS 
ATHTRPTNQHTNHI PMMARRSTRKHIRR 
APHTTCPKRPSIRDNPIYYLRNFFLRRI 
FLSLLPLQPSPYPPIRRALAPNRHHPAK 
S PRRPTP * PLTGALSALLKTSGLAM* FH 
FHS I ALL I LGLLTN TLT I YQ * WRDVALE 
STYEGHHTPPVQKGLRYGI ILFITSEI F 
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G=Glycine, H=Histidine, I-Isoleucine, 
K=Lysine, LHLeucine, M=Metbionine, 
N=Asparagine, PHProIine, 

f}^fZ In turn mp Ti~Araininp S=.^Arinp 

1111*1 III HFC, -IV f\ I gllJIlIC, vj OCI IIIC, 

i — i nreonme, v— valine, 
W=Trvntnnh*in Y=Tvro<;ine 
X=Unkno\vn, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














FFAGFF * AFYKSSLAPTPQLGGHWPPTG 














ITPLNPLEDPLLNTSVLLASGVSIT 


31144 


25045 


A 


11239 


267 


3 


FVS VR I TP I YASAATWLS YVWVWASS PN 
VCP PGDRVALS PRV3CSGT I S AVRNLRL 
PGSSDYPASA*PADGWAPPQYRCPPPRL 
ANp 


11145 


25046 


A 


11240 


391 


31 


VFFFFFFFFFFFFFFFFFFFFFFFPPPF 
LKNPFEPPPNFFFFCPKF* 1LFLN* I * F 
*IKF*I*KIPGFFLNPPPFFPPFFFFPP 
GALFFFFFF*EGVLPCSPGWSAMSHLLS 
3KRKLTPRMSYN 


11146 


25047 


A 


11241 


371 


491 


KI FLCWARHDAHACNPS I FRGQGGH IT * 
GQEFKTSLANMV" 


11147 


25048 


A 


11242 


1 


221 


HRIDPLCRNSARAFFFLSTGCHRVSQDG 
LDLLTS* SAHLGLPXCWDYRCKPPRPAL 
NCFFRTI SQLLNLCIKD 


1 1 14R 
i 1 l**o 






1 1 *)A.1 






TYKTKDFKTF*RWFHPNTSRVFAFKTiFT. 
x i i\. j. ivL/r Mr jcvv* r n it xoi\ v finQiujr u 

SRGQRGDFLARPSESSPGGFTLSVR 


11149 


25050 


A 


11244 


169 


Ml 


TQKDEKESTTTTKNAENSKGQSAYSSPS 
DCNTSLARAQN*EEAEMDELTEVGFRRR 
V I MN FVKTj KGF I L.TO CREAENHDfCTLO K 
LLTS ITSLKWNINDLMELK 


11150 


25051 


A 


11245 


200 


462 


SLIYNEI.CTIHILYQYYVA*QRS*CFQQ 
LMSFY * RTLKNLKESLCKVALLYPS FLK 

WKRHWT ^TVADT * TC P ^TT .flGOGfVR T A* A 

I » rvcs.VJ »» JJVj A vflL/l ivrij i-iJV,HJyVJ\jI\.J.ri 

QEF 


11151 


25052 


A 


11246 


484 


301 


I FCE KKG L * FQLLRR VRWE G P FKTPGNQG 
CS*LLSSPCPPGWITE*DPVSQKKKERK 
RNQVMELVSAQENKNKPRSVGLQACARS 
MLAFSTTPR 


11152 


25053 


A 


11247 


226 


458 


NTVAVDWG PRC FFS LLLSHAENCDM WFT 
KIELFI FTNHLLLKQDLEMLW * GQAHAC 
NPSTLGGRGGWIT*GQEFETS 


11153 


25054 


A 


11248 


304 


436 


SPSRCRK*KIG*AQWLTPVISTLWEAKV 
GRSLDPRNSRSAWTMY 




/.JvJD 


A. 




Z. 1 o 


4- J 1 


THKYLiF^YLGYNPMLYIERETGSCSIA* 
AGVQWYSHGLLEP* PSGLKRSSQVAGVI 
CLAN F FFFFLKK WS LWLPRLG 


11155 


25056 


A 


11250 


260 


474 


DKYFLLYPLKALNWLGAHINI *KVSKI 
MC*K*DKNMYNFFFSF*DGVSLLLPRLE 
CNGTILAHCNLCLPC 


11156 


25057 


A 


11252 


193 


496 


MTSCYSFYL* I KDAS MV KK KKKKKKKK K 
KKKKKKKKKKKKKKKKKASSSQKAQDYS 
KASDYK 


11157 


25058 


A 


11253 


348 


478 ; 


DLGWLKLGTVAHTYNSSTSGGQGRRIT* 
GQKFKTSLANWKPC 


11158 


25059 


A 


11254 


174 


475 


MHHWVLILQDIDDN?*NCNLHFSLCSQL 
KSIQKEKKKKKKKKKKKKKKKKKKKST 


11159 


25060 


A 


11255 


292 


48 


L PLT CHRPAVSHTH LA FSN P TAG I T I F * 
TR3SSWNAGTFIIFKIVTFFLFFFF*ER 
VSLKLTRLERSGAI PAHCNLKLPEF 


11160 


25061 


A 


11256 


347 


16 


TQSADWCIYNPPARHKSSPSPHPTQ*PS 
WLHLVDPAPGPWVELPTSPAPSACTPQP 
LGSRWDQPPOSKGQCPSGRLGPQGSPPQ 
VGLGKAGWRSQALPVGLTRSIESHHW 


11161 


25062 


A 


11257 


238 


590 


APGAYIFPQQHNPTDSQVIFSLTASHLL 
ICSPLCISVLHFNGMl^IDDENRNAHVHA 
RACAHAHTHTOTHIHTHTLYMNYRETL 
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ing to first 
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Predict- 
ed end 
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ding to 
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residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Ty rosin e, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














MLLKTRGRATGSHARC I FS * SLLQACP 


11)62 


25063 


A 


11258 


310 


461 


LSFGDVLQLESDRPGAVARTCGPSTLGG 
*GGWXT*GREFETSLANMLKPR 


H163 


25064 


A 


11259 


461 


174 


MGFHRVSQDGLDLLT*SSAHLGLPKCWD 
Y*CEPLRLAPVGKLFKCVGYKLVLCHGS 
GYMSMVI KIRMPGLIWSLEGGSAAWHR 
YWSIWEISFGC 


11164 


25065 


A 


11260 


343 


472 


LLRRLRQKDCLSSGGQGCSEPRLDYCTP 
AWVTK* DPVSHKQE 


11165 


25066 


A 


11261 


229 


3 


VKMKLQ * VP VKVKI * KYTP LLGI YLKEL 
KAGFGRNTCTLMFIAALFTIANRWMQPR 
LP* IGECVNNM* YIHTMES 


11166 


25067 


A 


11263 


163 


399 


KLLPGTRNLHAMEHPIFGWLRNPHATAQ 
G*HPLSQSSLALHGRADHICYPELCTSS 
SS C 1 1 AG YPN * EGMFASQHHRVH 


11167 


25068 


A 


11264 


316 


427 


TNKKHI 1*1 INTTTHSL 1 1 S 1 1 PLLFFNQ 
INNNLFS CS 


11168 


25069 


A 


11265 


184 


418 


KTPPGFFEKNPFFWGGKKGPPPPPKNLP 
LGGKKKFLRGKGGKTFFFFFF*KIGSSS 
SPPPPPGGEQAPPPPPKSPPRR 


11169 


25070 


A 


11266 


219 


13 


KFFFSLEHQNFGGGGVPLFPPPKEGFLP 
KKPKQGFITPPLKQKIKPPPPGGYSGPP 
RVL*KGPPHIFYK 


11170 


25071 


A 


11267 


3 


173 


SHHARPETGFHRVSQDGLDLLTS * S ACL 
GLPKCWDYRREPPRQARKLYCLGMRTNE 


11171 


25072 


A 


11268 


157 


2 


PVDI THVKTTVGLGMVAKACNPSTLGGR 
GGRIT*ARGSRFLLPPHWTSELR 


11172 


25073 


A 


11269 


291 


468 


ISEGKRSLFVILHFILFFETGSCSVAQA 
GVQWCDLDSL* PHI PGPK* SSHLGLHTG 
ACH 


11 173 


25074 


A 


11270 


29 


478 


GWN P YNQQD I GTL Y L * S GARAGVLGTAL 
CLLI RASLGQPGDLLGNDHI YNVIVTAH 
AFAIIFFIVTPIIIGGSGN*LVPLIIGA 
PDMAVPRINNISF*LLPTSLLLLLASAI 
AEAGA* TG* TVYPPLAGNYSHPGACVDL 
TIFSLHLAGV 


11174 


25075 


A 


11271 


61 


474 


PYNQGGNDAHAAWPHLTKWRSQGTTEIC 
REPEGAWQRPSQPRLHKGRSPGQLRALS 
QNLPSYSGGSHLLSAYYVPDTISKHFIN 
TIGWLGTVAHACNPSTL*GEDRWIY*AQ 
EFKTSLANVL 


11175 


25076 


A 


11272 


277 


478 


KIYQPVKIHGPRPGAVAHTCNPSTLGDR 
DGWIT*GQEFETSLANMNSPSL 


11176 


25077 


A 


11273 


350 


11 


KEKTAGQAR WLT PV I PAL WE AEAGAS PE 
VRSSRPA*PPP 


11177 


25078 


A 


11274 


138 


3 


NY INL YAWLGAVAHACNPS TLGGRGGW I 
A*AQEFKTSLGNMAKP 


11178 


25079 


A 


11275 


359 


219 


FVIKMGFPQVGRDGFNFLAS * SAPLGLP 
X WWDYRH * PPRLALFFFF 


11179 


25080 


A 


11276 


262 


3 


FCMSLVHLFSHSQDNYFTSFSL*IFNIP 
SGPFLFAFQDTLQ * P * DF I Y FYFETES P 
SVTRLECSGAISAHCNLRLLVSSNSPAS 
AS 


11180 


25081 


A 


11277 


285 


1 


VKKGDS VSKI FRKKKKRKDKIRLHFMIT 
TVNNLG I EGNLLNM I KEI YKN PTANI I L 
NGERLKAFSL* SGTRMSFLTPTFF* ILV 
DTQQVYIFMSH 


11181 


25082 




11278 


160 




TDKEVPL XNAQRKW FLEMES I PGEDAMN 
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Amino acid sequence (A=AJanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenyla la nine, 
G=Glycine, HNHtstidine, I-lsoleucine, 
K=Lysine, L=Leucine, MNMethionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S^Serine, 
T^Threonine, V=Valine, - 
\V=Tryptophan, Y=Tyrosine, ' 
X=Unknown, *=Stop codon, /-possible 
nucleotide deletion, \=possible 
nucleotide insertion 














IV*MTTKELQYYINLVEKVLARPE I 


11182 


25083 


A 


11279 


334 


488 


PLNTCGY*NLN* *NFQPGSVAPACNPST 
LGGQDRW I T * GQE FE TS LANMVK 


11183 


25084 


A 


11280 


527 


356 


FFVFLVETAFHHVAQDSLDLLTS* SSRL 
NLPKCWDYRHEPQRQAKNVGEFSETRS I 
Q 


11184 


25085 


A 


H2S1 


486 


248 


VRQLFSLLLPRLECNGVISAHCNLRLPG 
S CDS SASAS * VAR I TGASGS Q A WLQVQ 
CLQPVQPGELLRVDLFQLWLQR 


11185 


25086 


A 


11282 


38 


157 


STGLHHVSQDGVDLLTS *STLPKCWDYR 
HDPLRPASQRL 


11186 


25087 


A 


11283 


474 


339 


FVFLVQMGFHHVGQAGLELLT S * SACLG 
LPKCWD*RHEPTDPAK 


11187 


25088 


A 


11284 


2 


462 


LETTRTARWNSVIALRHHNSCPYKMANL 
IDKCVCSDYSKDWPFPIFLPLLRPPYCL 
RHNNMETRS INNPPWPS KCSSEKTNEEG 
MLKVEIG*KLCLLCQTFSKDMNAEKKFL 
KE I KSATPMNTSMIRKQNRLIHDMEKVL 
VI R I E YQASHN I L 


11188 


25089 


A 


11285 


65 


280 


SRAVE FNLLTT* S ACLGLPKCWDYRCE P 
PHPASP*FSKLLSSNLKTYLSSL*DSHS 
GFYSFVCSLGLI I LI 


11189 


25090 


A 


11286 


244 


488 


MRLGRVPSVLFCYVRMQREGI ILEAENK 
P S PD I E S VGAL I LDFTAS RT I RNK I LL F 
IIFPG*GILDSPNVP 


11190 


25091 


A 


11287 


345 


445 


KRPGTVAHACNPHTLGG* GRW IT* GQE F 
WTILA 


11191 


25092 


A 


11288 


223 


500 


S PGTRPKVLE CQS TLGRQAI LLLGEG WL 
WGQSHRLGVQSQLCLCVLWDLTCSLSCS 
FQI IKLrR*RPGVMVHTCNPSTLG3CGGR 
IAQGQEFK 


11192 


25093 


A 


11289 


157 


22 


LSCQE VKGE I GEAEI CWAQWLPPI I PAL 
WGAKVG * S PE VRS SKPA 


11193 


25094 


A 


11290 


305 


497 


KLVWC I YFKGVS FMVREL YCNKI V * WLG 
TVAHIYNPSALGGACGWIT*GQEFETSL 
TNMAKPRL 


11194 


25095 


A 


11291 


478 


1145 


QHVQACPERPQMMGTLERSRAVASKIGH 
SYSLDSQPARAVGKPWPQQACTRVTELT 
EATGKliIRTSHIGKPHPQSFQPAAATQK 
LRP AS QQG VQMKTQGGASNPALQ IGSH P 
MCKSSQFKSDQSNPSTVKHSQPKPFHSV 
PSQPKSSQTKSCQSQPSQTKPSPCKSTQ 
PKPSQPWPPQSKPSQPRPPQPKSSSTNP 
SQAKAHHSKAGQKRGGKH * RANSRDL 


11195 


25096 


A 


11292 


352 


468 


TF PGQHG KIPSLLKIQ KLAGHG D ACL * S 
RLSPR 


11196 


25097 


A 


11293 


295 


24 


ENIQGSKKNERQHMHNGVGTWNSNPGFV 
STWVYRNCLACLD I S FSFI KWVKSHKDT 
SMGMFTAALFTIAKT*NQSTCPSMVDWN 
STTRTS 


11197 


25098 


A 


11294 


274 


429 


ECEPVQTLWKIRKFLKKLKIELPSDPAI 
PLKG INLKEVKS VP * SDTYTPMC I 


11198 


25099 


A 


11295 


168 


469 


PSLGNKSETPSPKKKKNLFLGFPYFREK 
LVGILLKTVLQKGAGGVFFGPKNPPEKN 
PDPPLDSPPKTFRRLGAFVKTFPXGLKA 
AWFP ALV* PQGQMGG 


1H99 


25100 


A 


11296 


37 


348 


IKIFNKTLKYFKKKKTIKKKKKKGGGPL 
KRPRGG PR I Y FE5GARKN LPK I GTAKKH W 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenylaIanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, LHLeucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Giutamine t R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














GGYWEKNPHGGGGNTGPTPPQKTRPLGG 
KNNLKRVQGEKPRPRLGGGKTRSQLFSC 
HA*P*VQGEKPRPRLGGGK 


11200 


25101 


A 


11297 


93 


388 


DEYASDNCCYFPGSSHDSQVKQIGSQFT 
TQIQAHLNLAKPRRAL*NHS * PMEMRSK 
CLLLAYKKKKKKKKKKKKKKKKKKKKKK 
PKYLSSRGKDHF 


11201 


25102 


A 


1129S 


61 


243 


YYIHRTTVFMCMN*GLKDNVDK*TIDIiC 
LCKKKKKKKKKKKKKKKKKKKKKKKKKK 
KKKKKKKKPQGGGA 


11202 


25103 


A 


11299 


250 


341 


IYMSI>TLGLQNL*EIHLTHKDSHKLQVK 
GWKNI FHANVNQKQTGVAILISDNIDI K 
ATTVKKKKKRENPT I LNI YVPNTGAPKF 
MKHLLLNLKNERDGTTVLG 


11203 


25104 


A 


11300 


3 


285 


DAWERKKRSQMNQKTTNKVAGVS PYLS I 
MAWNVNELNFL I KRHRVA* C I KKTKSM I 
C YLQETNFIKDTHRLKIRGWKKI FHPNR 
N*KLAGWY 


1 1204 


25105 


A 


3 1301 


175 


407 


AENNPDVLQLKA* TSRGTFMP * ILLRVK 
RGSNS RNNRC WQG WRGTGMLLHC * QECK 
LVQPLWKTVWRILPDLESEIL 


11205 


25106 


A 


11302 


75 . 


280 


YHLNLVLYPHPPKNRVC*KKKKKKKKKK 
KKKKKKKKKKKKKKKRGGPLKKNLLGGP 
FFPGGKKKNFFF 


11206 


25107 


A 


11303 


299 


3 


IHASFLFRGCITLAFTVRCSHRS ITSSE 
TSDLTH Y Y YYYY Y YF * LASQSVALAGVQ 
WDQLGSNH PPTS AS * VAETAG AS HHAR L 
IFNFFVEMEVSLYC 


11207 


25108 


A 


11304 


137 


323 


FFTVNGRHRKEKKSNKDQLNLGLHDFLD 
NPLLLF*KYLGREKKKKKKKKKKKKKKK 
KQKKKK 


11208 


25109 


A 


11305 


64 


481 


RPTRPHVYHI VKP IP* PLTGALS ALLMT 
CGLSM* FHFHS I TLLILGLLTNTLTI YQ 
* WRDETREST YQGHHTPP VQKGLRYGI X 
LFI TSEDIjFFAGFF * AFYHS SLTPTPQL 
GGHWPPTGITPLNPLEVPLLNTSVLLA 


11209 


25110 


A 


11306 


142 


330 


FFTVNGRHRKEKKSNKDQLNLGLHDFLD 
NPLLLF* KYLGREKKKKKKKKKNKKKKK 
KKKKKKN 


11210 


25111 


A 


11307 


259 


440 


RP W I RDNP I Y YLKS C FLRT I FLS LLPLQ 
P RP YP PTRRALS PNRHHPAKS PRS P * PL 
TGALSALLMTSDLTMGLHFHYITLLILG 
LLTNTLT I YQ*WRDATRESTYQGHHTPP 
VHKGLGYGIILFITSKWFFARFF*AFY 
HSSLDPTPQLGGHWPPTGITPLNPLEVP 
LLNTVRL 


] 1211 


25112 


A 


11308 


433 


3 


FPPPDIRHSRSLGFPPSFGRAGFLNFAF 
S KGQE PALPGGF Y YP S PKKKNLAFCS PP 
LFYWGKFGSKFSLGVGPLFFP**SPKSF 
CQFLENKGFFPEKSFFKFFPPAGGCVFF 
FS EGGFP FCLFLFFFDRVWLCHPGW I S V 
ARS 


11212 


25133 


A 


11309 


2 


423 


GRVGDR* LFYTNHMD I GTLYLLFGA* SG 
VLGTALS LL I RAELGQPGNLLGNDH I YN 
GIVTDHAFDIIFFIAIPIIIGGFGN*LV 
PLIIGAPDMAVARINNI3V*LLPPSLLL 
LLAYAI VEAGAGTG * TVYPALAGNYSHP 


11213 


25114 


A 


11310 


234 




TFFGXFPFCLKFKPAQRGGENTPGPFPF | 
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nucleotide 
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Predict- 
ed end 
nucle- 
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location 
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ding to 
last amino 
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residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine,M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine* 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LVFFFFEMESRSVAQTGVQWRDLLGSSN 
S PT S AS * VAGTTGACHHARL I F 


11214 


25115 


A 


11311 


406 


211 


LSLFLHSLISTHGARPTDPVGLVPTQDG 
LDLLTS * SAHIGLPKCWDYRRAPPCPAC 
GFDLHLPND 


11215 


25116 


A 


11312 


379 


398 


KT*NFTVRGSIPLLNNIPMASMTFSKKK 
KKKKKKKRGGPLKKNPGGAKFNGGRKRK 
IFSLKGGEKKNPGGILEKKPFFGGGKKG 
ENPPKKNKGLREKKKF*RGKGGKKPQKG 
GGKKMGS S S AKTPT WG 


11216 


25117 


A 


11313 


390 


489 


RYRLGWAHACNPSTLGSRGRWI I *GQE 
FETSV 


11217 


25118 


A 


11314 


286 


434 


I LWYMNY I S I KLLKI I I YKLGAVAHTCN 
P S T LGGRGGW IP* G W E FETS L 


11218 


25119 


A 


11315 


322 


3 


FTFYGCRQFQVF* IVTKNDRI IFSSYTK 
NLLKI YLGLY* FYYVPLLRTKSLKKTGS 
FESNIiSFK*VFLWSGTVAHTCNPSTLGG 
RRGWIA*GQELETSIjTNMERPR 


11219 


25120 


A 


IJ3I6 


251 


389 


TFFFFEREFHFVPQVGVQGGSLGSLQAP 
PPGSHSLA* AS PRCWDGG 


11220 


25121 


A 


11317 


313 


80 


EYICPRCLSPGFRD* PGQHGETPSLQKL 
QKSAGCGGAHLRSQLLRRLGDRVRHCLK 
KR I YLSLFNS L VLSNNRFQ I I 


11221 


25122 


A 


11318 


145 


410 


APKTLKVAL.TSAAAARESI CQLGVLDEL 
VI FLE LNRS TDQRPGAVAHTCNPRTLGG 
HGGWI T* AQEFKTNLANMVKTC 


11222 


25123 


A 


11319 


393 


3 


NFFKIIFTLSPPLPLIFTPPFFFFFKKI 
PPKPLFFGVFHSQTCPQISFFKNFYPFL 
FWFSNPFFLLFS PSRKGPPPFSRFFFFK 
VGNPLFFLFFFFFFFFFG*EGVSLLLPR 
LECNGTISAHYYLRPRV 


11223 


25124 


A 


11320 


135 


1 


HRDVKLEYAIRKMEVRPGVAAHACNPST 
LGSRGGWIT*GQEFETS 


11224 


25125 


A 


11321 


282 


2 


NSESVLIKEKGDBEEKVLSFLLFSKRKV 
LESKTFPTLHSGCKKRSYFLKITTSSFL 
♦PRVBCSGVIIDHCKLKLLGSGDPPTSA 
FPVARTTGM 


11225 


25126 


A 


11322 


149 


30 


WGSGGQVRWLTPI IPALWKAEAGGSPEV 
RSS*PWPCLY 


11226 


25127 


A 


11323 


316 


399 


STLGGRGAQIT*GREFETSIANMPNPIS 


11227 


25128 


A 


11324 


223 


397 


1 1 KAYQ * DVNREQ I KT 1 1 1 YFQNKLKC I 
KKMIWPGAWHVYNPSTLGG*GGWIT*G 
QE 


11228 


.25129 


A 


11325 


3 


396 


NYSTAALFLRGH I WGWRWGTLYYG S S P I 
YCRGLNSTLGLYPLNAS 3NPHPRKLCKP 
KI YTH I AKCP IGHKI I YPYANTP I H*HK 
THTHTHTHTL*DSLLWPGWAHTCNHNT 
LGSWGRQIT*VQEFKTSLA 


11229 


25130 


A 


1 1326 


113 


442 


KR Y I Q I RAHS E VLCGHE L WGTLHN P VQT 
IQTWEPFPYLPLCSRVTSGKSHHRFVPL 
LFHL*NRNNSHHHHHHHHHHHHHRNPC* 
ECCRAHSDSYAVWSDKVSQEPVAAM 


11230 


25131 


A 


11327 


145 


3 


ELDIKPPNPFFCHTGVQ*CDLGSPQLLP 
PGFK* FCLSFPSSWDYRCA 


11231 


25132 


A 


11328 


189 


1 


KTNIiLL I H S KLRLHLQS S VFQC * S KS EK 
SNVCT * CAFSCLPKLFWLGKVAHACNPS 
TLGGRGG 


11232 


25133 


A 


11329 


193 


405 


ID* Q-1'I.TKKKKKKKKKKKKKKKKKKKKK 
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ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 
peptide 

r r u 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E^GIutaroic Acid, F=Pbenylalanine t 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, PHProline, 
Q^GIutamine, R=Arginine. S^Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \~possible 
nucleotide insertion 














AYTSKKKT 


11233 


25134 


A 


11330 


209 


11 


GGKRGTPPTINNIFFNFFF*EKGSCCSS 
RAC* CNGEI * AHCPLRLPGSSNSPAAAS 
RVQETATTPS 


11234 


25135 


A 


11331 


38 


390 


VDQSTVCKMGQSAGCGWGSDKGIKAGCL 
K*QRQKKKKKKKKKKKKKKKKKKKGGGG 
LKKKLFLGPGGEKFFFFFWGPFFFFGGR 
FLKRGGGKKPG YKKNKI SGHPPPFLGGG 
KKKKR 


11235 


25136 


A 


11332 


38 


326 


VDQSTVCKMGQSAGCGWGSDKGIKAGCL 
K*QRPKKKKKKKKKKKKKKKKKKNRGGG 
FIKKIFFGPGGEKIFFFFWGGFFFFGGG 
FKKRGGEEKNLF 


11236 


25137 


A 


11333 


1 


234 


RTRG KNWMAF I KEG WVPTGRMGGS FIKH 
SFLESKNTWLGKKKKKKKKKKKJOCKKKK 
PHDKI KDLFDPV*NKKQNI SIP 


11237 


25138 


A 


11334 


244 


398 


INHLYSNKNDLIFLLVFYLFFFP*RRPG 
EPPLIKGWLPYLGGVLNLRKDPL 


11238 


25139 


A 


11335 


174 


1 


HQVYP YVKQEHYKQDSCQC Y I KYMNVW P 
DAVAHGCNPGTLGAQGGWI I *GQKFERT 
RG 


11239 


25140 


A 


11336 


148 


411 


LFFFPHCYLLNAPS ILYLNEPKWYNLH 
LCVSAVCLLVPVHS PHRTCLY* KKKKKK 
KKKKKKKKKKKKKKKKKKKKKKGGAFFK 
KLLF 


11240 


25141 


A 


11337 


237 


453 


GQRPLLSFREKGSFFFLLETEFCFAPQA 
EGQGHNLG*LQLPLWG*SNSLVSSSQEV 
GITGAGHHPRLFFVF 


11241 


25142 


A 


11338 


65 


347 


VCRVDDFVPEARTRFFKS I * EAWNNKNI 
KPLLSTFSQVPGSENEKKCTLDQAFIGI 
LEEE I INHSS CENVLAI I SLAIGGVTEG 
KYGSVLFCLK 


11242 


25143 


A 


11339 


158 


393 


VPCTHSEGIRKGRKCLPKTRENAREERD 
KKIPPGLC*AGKLFPQEKSSTWPGKVVH 
AWNPSTIiEGRGGWIT*GLEFKTT 


11243 


25144 


A 


11340 


117 


3 


G IRPGAVAHVCNPS PLGG * GRW I SRD WE 
L*TSLANIR 


11244 


25145 


A 


11341 


292 


60 


SGQGLRLRGAFRGKVQAKEWMP I PKLGC 
LVKDMKI KSLEE I Y FFSP P I K* FEI IDF 
FLGAS FKDEVFFLFVCLFVCF 


11245 


25146 


A 


11342 


149 


30 


TQI FPPPPPPGKKKKFFFPKKKKKKKKK 
ERKKDRKNKQK* KGPGVGVS PLFPPPLG 
GQGGGFPQGLGFKTPLiAPKGKPWFP*KT 
QKFPRRGGGPPKI PLFGGGGEKKPFS PG 
GQGSNKPKFFPPPPPRGKKKNFFSPKKK 
KKKKR KKERKTEKTNKK 


U246 


25147 


A 


11343 


120 


504 


GVHDRCLWDDMTRRPPAAAATACPAGRP 
P PRREEEHSQLLLI SFQGFRWD* DQDVN 
TPNLDHliAREGVKAKYLMPPLVTMTSPS 
HFTAI TG KRHS AHFTRCPSNPQR P SFP V 
IRSKSSVSSREVEVA 


11247 


25148 


A 


11344 


213 


399 


ALVIHCLTTTIQVSGLSGSKYLMRQGMV 
AHGCHPNTLGGRGG W I T* SQESETSLAN 
TVKPCL 


11248 


25149 


A 


11345 


308 


3 


NPTPRSGDLWVEEGSPGAGNWGCLSEQT 
LiRAl I KATTSYS FYCYFFETGSHSVAWA 
RVQW YNPGCS I CGS SHLPTS AS * GAGT I 
G MKKKAWL I FL F F VEn E 
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SEQ ID 
NO: of 
nucleotide 
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SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E>=Glutamic Acid, F=PnenyIalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T-Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


11249 


25150 


A 


11346 


189 


433 


GHCRAGTSLTLTHLGGS AMEL I IHLGLC 
LGRMKFIHTTCI ILFLFFLSATQAGGQG 
GDLS*LQPPPLGFKQFSCFSLPRSWD 


11250 


25151 


A 


11347 


257 


3 


KWGELCFLTIKEERMGFPLAKKRKIKGV 
F*FGLFFFFETESHPVAQAHCNLSLLDS 
SYS PASAS * VAGITGACHHNQLIFI FYE 


11251 


25152 


A 


11348 


159 


434 


LSPS CRDVGTYSARAHPPHTHTRVRAHT 
HACTHVRAHTHTHTTAKGSHLTKEQPGL 
CTLPGSSW* PGLGGRAIATAQAWVKCAG 
IITAAHPP 


11252 


25153 


A 


11349 


240 


406 


AFKKLAKNNSMGVNRSLSRPGWAHAWN 
PSTLGGRGRQIT*GQEFETSLANMVKP 


11253 


25154 


A 


11350 


163 


360 


PQLIiQLMKFLQN* liLERLRPQNGLNLEG 
LACSESRSRPCAPVWRTBRDSVSSPQPR 
PGAGGRPSFF 


11254 


25155 


A 


11351 


145 


406 


PLNSGAKPTLPPWSPRRPELNAPPPPPG 
EKEFFFFLGRGFLFAPQEEAGGENIV*G 
KPPPPG*REFFAPPPGGDGNKGAPPPPP 
LFL 


11255 


25156 


A 


11352 


122 


337 


ILWVLLLVRESLKFFFFLLLLFFiiFFET 
DVA*AAVQ*SDHSSRQP* PPGLRGSYTS 
AFR I AGA I G I CHH L Y 


11256 


25157 


A 


11353 


325 


416 


KCVY I MGWA* WLTP VI PALWEAGAGRSL 
EV 


11257 


25158 


A 


11354 


260 


415 


LEELQNTACQKLEPFLSRTETKQGCLLS 
PRLFK I VLEVIA* AVRQE KE I KGI 


11258 


25159 


A 


11355 


160 


2 


ISE ISTKRNISIYFLKNWLGWAHACNPST 
* RG * GGQTTRGQDFQTSLANMVKP 


11259 


25160 


A 


11356 


160 


3 


ENLKLYNGSPFLKI IWLGMWHTCNPST 
LGGQGGR I P * AQE FKTSLGNVRR P 


11260 


25161 


A 


11357 


235 


408 


GQFLPAALFFYQWHLHFKELLAHPWGKT 
VWRFL* PLKKEG PHD PAI PLLD I YPKNK 
KW 


11261 


25162 


A 


11358 


11 


393 


QTERTPTPKPKLYVSFPPNSQDPQLQGG 
KLTNRKNTHTKTPSVCHHKQRPKVDKTT 
KIGKKQSRKTGNSKNQSASPPPKECSSS 
PATEQSWTENDFDELREEGFR*SNYTLL 
KEEVRTHGKEVKNLE 


11262 


25163 


A 


11359 


283 


475 


FM VS V S I KVY F F I F I FLF FE ME S CS VTQ 
AGVQWHDCLSFLC*VLASSPGNSSPVR 


11263 


25164 


A 


11360 


266 


407 


LEIHHS KTL* FWLGIVARACNPSTLGG * 
GRW I T * GQEAE I I LANI VK 


11264 


25165 


A 


11361 


93 


1 


DVLS ILLPRLECNGAI SAK * WFRLPGSS 
NSS 


11265 


25166 


A 


11362 


131 


377 


MEEM I KGRGLSKP PALSS YLTLKS S * LK 
RVRARRRLFLKSRFSLDLQVAGFFVCLF 
VFWDRVS LCHPGWS WVQSRLTAALT 


11266 


25167 


A 


11363 


212 


392 


QAEECAPLFIGVKEPPKNLTHWLGTVAH 
TSNPSTLGG*GGQITWGQEFETSLANMA 
KPCV 


11267 


25168 


A 


11364 


141 


3 


NELWLNHQNVLE W LGT VAYT CN P S T VGR 
WIT*V*EFSTSLGNMVK 


11268 


25169 


A 


11365 


215 


384 


ANFNIRPE ISLASKCPSERKSRTLLPLN 
QKLEMI*LSEEATSKAKTG*KLGLLHQI 


11269 


25170 


A 


11366 


48 


373 


EPWTRKEVLKISSSLKIHCYLSILCLIL 
MPTLECTAAIWAYSNLELPGTSCPPTCA 
S * VAGTTGACHRAPTVLKIFLRDRIFFF 
Fr TIwT YPWCHTPADPAI FKANAR 



1232 



WO 01/64835 



PCT/US01/04927 



SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 

f\ inn t(\ 
U 1 II g IU 

last amino 
acid 

residue of 

jjcpnuc 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M==Methionine, 
N=Asparagine, P=Pro!ine, 

(~) — (~7 Info m inp U=A roininp Q — Q a r* i n p 

T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 

ml To f\t tflD /lolof iftn l^nAccinla 

IIULIcUuUc UclctlUfl, \ pVobiUIc 

nucleotide insertion 


11270 


25171 


A 


11367 


310 


2 


GLICLNRENAYFFVIRQISMSCLiEKEKP 
APYSSPYAKSVTRGEKRRORFFLSRAPP 
IPFLFFI * IV* AESRSVAQAGGQWLHFS 
SLQTPP*RRTSDSPASAS 


11271 


25172 


A 


11368 


303 


50 


MFALTNRGVQGS PV I FLGVFLI SKMGFT 
C FTCG F 9P F KVTiVF FT.G VF P <5 F F F FT iR AR 
VLLCCPGWSPTLGIK*SSCLGLPQCVLT 


1 1 Z /Z 


j. j l 




i 1 Joy 


i no 




IGALYLLFGA*AAVLCTAISLFILDWLG 
QPCNLLRNDH I Y YV I VT AHAFAR I F F I G 
X PI I IGGFGN* LVPLI IGA 


1 1071 
I 1Z/J 


ZD 1 /'I 


A 


J 1 j /u 


1 1Z 


Zjo 


GGR FK"G 9NFT 9 AGMORNT FFMG PPTt * N 9 
RAGV* QRGEGKNPGVTKLNRL 


11274 


25175 


A 


11371 


823 


1078 


SQHFGMPRWADHLRSGVQDQPGQHGETL 
QT.T.TfTD'K'T.AfTRGGR PT.*c;r>TJ,PRT.PnFN 
CLSLGGRGCSEPRSHHCTPAWATQ*DSV 
S 


I 1Z/J 


Z3 1 /O 




llj/Z 


154 


zr>n 

4ZU 


T TCnDTTWT .FT.AT.VKFT iVDT.T.WKTVWDFT.K 
KLNIELLCDPAILIiLGIYPRELKMYVHT 
KTCT*MFIVELFIKSKK*KQPTC 


11276 


25177 


A 


11373 


234 


44 


LAT I TDRLLALS DS AI PPLG I Y AKEKKS 
VYQKDNCTYIFVAALFTVAQI*NQPKCP 


11277 


25178 


A 


11374 


73 


456 


P PIGS PLVPYI PS P AliSS PQAPRMGSK P 
CLIPSASCHPPLETDFLSLJjPTHFCLAV 
FTKERFSPPPPSPYPRF*KFLRS*KFSF 
FLPPFLFFGGTVLLCLSGWSAVAQSWLT 
A A CI T Q T «G <? DP PT T ^ 


11278 


25179 


A 


11375 


43 


456 


EFFHHVGQDGLDLLTS*SAHLGLLKCWD 

* I RTKPS QAPAS WGLWNLAHH P PR SH P S 
CPMANLICSTLSSFDGGSPGTGPGGWCP 
LGLSGSPARAVFKDSSCSLHPLATGI 


11279 


25180 


A 


11376 


288 


484 


EWVLSAVGGSQYGVCLPFLHCFI FFETE 
SRSVAQDGVQ*CDLGSGSSLPPG 




ZDlol 


A 


1 1 177 


1 70 

izy 


i 
l 


WFNAT.NT.TRAGAVAHAYNPCi'HT.GGFriGR 
IA*GQEFETSLSNTA 


11281 


25182 


A 


11378 


307 


1 


DSEI PSLGLAKFWNHRREPPCLALFLTL 
NOKLEMIKLSEEGI SKAJCTR*KAGFLCO 
RVSQDVNAKEKFLKEI KRATLVKTQMMR 
HQNS F I ADAEKVWMA WMK 


11282 


25183 


A 


11379 


200 


397 


CI SLFSHYYKDLPETGSFIKKRGLIGSH 
WLGAVAHACNPS I LGGQ3RR IT*GQEFE 
TSLASMVKPC 


11283 


25184 


A 


11380 


167 


1 


SNGL YSREARMVQHTH I RPGWAHTCNP 
STLGGHGGRIT*AOELKTSLGNIVRPC 


11284 


25185 


A 


11382 


12 


395 


AHSSFLSLDLSVFATCPHPRAETQTGHR 
FSTLLPLSALYPK* D YFKKKKKKKKKKK 
KKKKKKKKKKKKSGGGA 


11285 


25186 


A 


11383 


160 


2 


TMEMMLD I K* I * E I FLFEFKMGLKAAET 
TQNINNTFGPGTANEQTAQRWFNK 


11286 


25187 


A 


11384 


274 


389 


GQAQWLTPVIPALWETKAGGSLE* GQAQ 
WLTPVI PALWETKAGGSLELKNSRPSLG 
NMVKPP 


11287 


25188 


A 


11385 


389 


37 


WSYESSWSFI FKS FLF S FM I S AYDM * YA 
SQILFTPKLYRTHPYFLLVLFLILSILV 
ILETGSRFVSRLECGGGISAYGS FK*LG 
SGNPPAAASL»SSRI P YVHQS CWSDLAFN 
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NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
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M 
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od 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q^GIutamine, R=Arginine, S^Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














PEFEM 


1 1288 


25189 


A 


11386 


214 


374 


KGGGRFKGSKFTSPGGGEKNFFIGAPK* 
NSRAGV* KRREGKNPGVTQLKKGVG 


11289 


25190 


A 


11387 


142 


410 


RFVCSTIKVLRDLSSDRSNPGRFLSTSN 
* KKKIKKKKRGAPFKKKNFFYRGGGGKY 
FFFGAPKINFLGRVFQPGGGENPGRSPF 
KGAWG 


11290 


25191 


A 


11388 


144 


1 


CLSNEDSIKTPKGLGTVARTCNPSTWGG 
RNGWIT*GQEFKTSLANTVK 


11291 


25192 


A 


11389 


167 


2 


RNISSHLLNMAITKQTNNNNCW*GYREI 
ITHIHC* WE FKMMQLLQKLVPQKVKHT 


11292 


25193 


A 


11390 


2 


412 


SDQVQDAGRLISSEASUjGLPTATFLLC 
LPTGFALCTRVPGVFLLLRTSVILREGP 
TLVTS FY LNYLLKDLVS IQSHWG LGLQ P 
MNFGETQFSL*QKLV* KEKESMPGMVAH 
ACNPRNLGGQGGLIT*GQEFKTILA 


11293 


25194 


A 


11391 


1 


206 


TGRLAGHRVSSLP IRGGRAEAPLTSRTG 
RLNQDGLNLLTS * SARLGLPKCWE YRRQ 
PLRLAVISGFLT 


11294 


25195 


A 


11392 


262 


388 


VS K* KKKKKKKKKKKKKKKKKQ 


1 1295 


25196 


A 


11393 


78 


414 


ICFFLRAPPFFFFFFFF*TKIFFFFPAL 
NSMANFFFFWTPPFLGGIFLPPPFFLEG 
GGPGLASPPGGAFFF 


11296 


25197 


A 


11394 


238 


11 


IKWETAGRSSACL* S * LFERLRRENPLN 
PGVHG * DE I * *HHCTPAWVT VR PRLS KY 
INKNKK*ARLAARCGGSRL 


11297 


25198 


A 


11395 


340 


1 


KHILVPPFINVSBIQRYSLV*HLLFVGV 
*NLFFLFLSFKKPVLCSM*IICHEIISP 
PSKPKCKASHLS FV* KKNSNLWLGAVAH 
AOTPGTLGARGGRIT*GWEIETSLTNME 
K 


11298 j 25199 

| 


A 


11396 


363 


245 


QMDLNRHLSKDDK*MANQCMNRCLTSLA 
IRQMQIKTIMI 


11299 125200 

! 
1 
| 


A 


11397 


292 


2 


DPKKEAGRGGAQL* FQLLRRFKHETHLN 
PGG * GCS E SRSHS S P PAGATK* NGVS KK 
K* Y* APKEERHPRPVLVEPRVPQVPSPG 
SQTYRQDHSPRV 


11300 


25201 


A 


11398 


121 


321 


KKL YTHKS KSDVM I FLKIETVS WLiYCPG 
WRVMAHCDLKLLGSSKPPTSAS * VAETS 
GTCHHARL I FL 


11301 


25202 


A 


11399 


118 


394 


QALPQPIPVLILLQFFFFFF*GTGFCFV 
AQIGGQGGHYG*LEPPIRGLKPSSRRTL 
RGGWD *GGHHNAQLI LGFFEKTGS CFW 
QAGLKAIiA 


11302 


25203 


A 


11400 


273 


379 


AE PG WAHS YNPGTLGCRGGG I T *AQE F 
YTSLGNM 


11303 


25204 


A 


11401 


360 


38 


KI FGQARWLTP VI PAFWETEVDGS PEVG 
RSRPA*PS 


11304 


25205 


A 


11402 


254 


404 


G PAGHD FHM I TPS KS TETGLGAVAHVCN 
PSTLGGRGGWI I * RQDLETSLA 


11305 


25206 


A 


11403 


161 


3 


I LTPTLQYI KINLRGQAWWFTHVI PALW 
KAK*SGGLLELRSLRPAWVIEGDL 


11306 


25207 


A 


11404 


I 


170 


MGVSLLS PRLECNGV I SAHCNCHLL* FT 
FK* FSCLSLPSSWYRCVPPCLATTFLIY 


11307 


25208 


A 


11405 


249 


3 


NAKSR IQVI S I WVFTVKFFHLY YLCNFS 
Q*NEMAGGQSIFRKKHLDFSLEHSGMMT 
THCSLDLPGSSDPPNSASSVGGTRH 


11308 


25209 


A 


11406 


70 


401 


CGKHGVNLI IHSFLNHLKYLLLLQMANL 



1234 



WO 01/64835 



PCT/US01/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 
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Predict- 
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peptide 
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Amino acid sequence (A=Alanme 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, LHLeucine, M=Metbionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possibie 
nucleotide deletion, Y=possible 
nucleotide insertion 














FISQFLLTCVSEPLLF*YCHQIIKVGRL 
HEVKQMRGTKKQEKMFPYVJNNDKNVCPG 
VPAHACNPTTVGG * GGW IT* GQE FNT 


11309 


25210 


A 


11407 


257 


389 


GGSLLQKYLRPLVAFLFSFFFFS FETES 
* SWRLE CNGM I SAHC 


11310 

s 


25211 


A 


11408 


364 


420 


SQPFGRPRQADCLSSG*AQWLTPWPAL 
WEAKAGRLLELRSLR 


11313 


25232 


A 


11409 


312 


413 


TSGMGPG WAHACNPITLGGQGGWI T * G 
REFETS 


11312 


25213 


A 


11410 


283 


425 


HGHKDYVHWLGAVARACGPS ILGGRGMW 
IS*GQEFETSLANMVKFCI» 


11313 


25214 


A 


11411 


389 


3 


CRIDGLFFNLVFLRLVWKF*VIFFLSY 
FSFLVIILQFLTYMLNCTFF*QILiNLFP 
ILVLLTKILNTLLSTENYCHFSLLYCLE 
FWFHPIIKHKMYNLFLI*FILIFILRQS 
LARSPRLKYSGTISIH 


11314 


25215 


A 


11412 


267 


3 


QEFQQEDTSELDLEIRSLCKRKGGLKQG 
DTKDNLS KR VWETDKGKAAW WPGAVTH 
ACNPSTLGGQGGWIT*GQEFETTLTNRC 
PRV 


11315 


25216 


A 


11413 


3 


281 


GAWSCRIPGEGSLSASQAIQPFSLYPVN 
PQ* ISLSARKKKKKKKKKKKKKKKKKKK 
KKKGGGGVLKKI YFPPRGEKKFFFFWGP 
PKKTGGGGF 


11316 


25237 


A 


11414 


275 


2 


KTAHFQLFNSDFKFPGSRNPGQSWSPRA 
KQSFCTQALLKYFLK*MNEMKLNKSRSG 
AVARTCNPSTLGGQGGWIT*GQEFETSL 
ANMAKPR 


11317 


25218 


A 


11415 


319 


1 


RNIiG VQVYEYWLVYKYCS VS SNNKMKN 
KRDLLVI * VPGI VIHLIYRLI KENPELY 
ISLFIYSMRSLKKLTYRPGAVAHACNPN 
TLGGRGGWIVCSQEFETSLGNM 


11318 


25219 


A 


11416 


121 


428 


LSCDKWGNRDMTKMLSKELKKKKKKKKK 
KKKKKKKKKKKKKKLGDEEEDKE*QSSW 
GGGGALIKNILFPSRAASGFFLSLLKYK 
KLGAAVYIILGEKTILWL 


11319 


25220 


A 


11417 


79 


253 


KIFNFQFSNMLENNSHTTRINNYFPGRE 
RWLTPVI PPLWEAKAGRSLEVRS * RPS C 
P 


13320 


25221 


A 


11418 


180 


4 


EVGFMKPGLCGVFLEAGSHSVALTIVQW 
CDLGSPQHPPLGSRDPPP*ALQIVGNTR 
RVP 


11321 


25222 


A 


11419 


154 


3 


STLKDKENQKLS IQPDTMAKVCNPSTLG 
GRGGW IT* VQE FE I S LNNMVKP 


11322 


25223 


A 


11420 


436 


478 


AI PLPH* RKKILHAALFFFFLKGS FVLS 
PRLECSTKIWAHCSLPPPNISNPPASSS 
QEGGTT 


11323 


25224 


A 


11421 


143 


398 


AAFGFWFVLIRFLRQRLALSPRLDCFFA 
KSASCQPRPP*FKRVCWLSLPKSWDYRF 
DLLCAAGLVWVYLSVSRLoHINEKNQPPP 
P 


11324 


25225 


A 


11422 


1 


138 


MVFHRVGQDGLDLLTS * SAHLGLPKCWD 
YRHE P PRP AAHS YFLLHK 


11325 


25226 


A 


11423 


2 


390 


LELPILLLNTLILFLFLILYIYLVFSPN 
AFRILFITIVLESFSFILLVPSKPWKYS 
LILSNAFVLCMMISSPPFPLFSLSTTRI 
NKKLDFWLGAVAHAYNPSTLGSQGGWI I 
' GQEFETSLTNMAKPHI* 
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26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
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r **r * ***** 
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Predict- 
ed end 
nucle- 
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ding to 
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acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenylaIanine, 
G=Glycine, H=Histidine, I-Isoleucine, 
K=Lysine, lHLeucine, M=Methionine, 
N=Asparagine, P=Proline, 
0 = Glutamine R=Arpinine S=Serine 

V£ ■ I-l IdfffJUJ L-y AX /*l gllllll^f KJ k^W w lilt*) 

T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, V= possible 
nucleotide insertion 


11326 


25227 


A 


11424 


267 


414 


VFQ I T FC I KNFWPG WAHACS PSTLGG * 
GGRIT*AQEFETSLGNMVKPC 


11327 


25228 


A 


11425 


189 


73 


HQDGLDLLTS * S TRLGLPKCWDYRHKP P 
RSARPNAWVAP 


11328 


25229 


A 


11426 


318 


33 


KKT I S S HARGRKLGS FKTAAPFFFFF* T 
GSCSLAQAGVKWCSHSSMQPRPPGLK* F 
SASGSAGI IGMSHHTRPIAGFLLAKLKK 
KSNSTGYGGAHV 


11329 


25230 


A 


11427 


291 


393 


DRVSLCLPGWSAVLPS*LTAAQISWAQV 
IFLPQP 


11330 


25231 


A 


11428 


51 


399 


KRVARQNPPNGQGFGWCSCSP ILYGQSR 
AQGENI YKLLGSGEWPGQLI IDVEREQL 
EDWRQGDLGRGMWMTKARVHKGPGLVAG 
TCNPSTLGGRGRWIT*GQEFENSLANMV 
KPCF 


11331 


25232 


A 


11429 


242 


420 


VKKKKRGGRFKGSKFTSACLQGISFFKG 
APKLNSRAGF * QR WEGKNLGVPQFNRFG 
AHP 


11332 


25233 


A 


11430 


77 


405 


TKRGDSS FCL YRI W 1 1 DSVMKWNQSKAA 
AAAAFFFFLRGTLLFLFRLKGKVGASVI 
*NFCLRGWGGSPASPSKEVGMEGACHNS 
GLFFVFLEKRRFPLVGQEFKTTRAN 


11333 


25234 


A 


11431 


121 


439 


LPLPTSGPEDPLALSFKPTPFPSTLSGM 
I YTGH PS PEHTSHLMP APP VNNNKACR F 
FKT * KKKKKKKKKKKKKKKKKKKKGGPIi 
KKKKI FAPPGGE I FFFFWGPKK 


11334 


25235 


A 


11432 


109 


283 


LTPWNLDSRCEI KGNGKPQENHLNLGGG 
GCSEPKSHHCTPAREKRQNSVS KK* INK 
IK 


11335 


25236 


A 


11433 


296 


] 


FUCKPSKGFSKLPHLPQNLNLFFPPK* K 
CPRAKKD I PPP S F* KI SSLFGEG * GFTG 
SRPPSLNLENPPKRIFFFFFFLDRVSHC 
RPGWSAMASSCPRV 


11336 


25237 


A 


11434 


385 


418 


QSETLSQKQKTK*E * SWLGAVAHACNPS 
TSGGRGGWIT *GWE FENSLTNMVKPCL 


11337 


25238 


A 


11435 


3 


405 


LLHQGRLMLGQWGNGRTKTLLMRENVGK 
FPAE PSCSALGVWLQALAC YPGLVRRQ * 
ANHWE KKKKKKKKKKKKKKKKKKKKSGG 
G 


11338 


25239 


A 


11436 


221 


3 


KHEFQTIQMESMK*NTSIKICIYAS*IQ 
DCLRYQWQRPAAVAQAITEH FGRPRQAD 
HLRS GVS DQPGQHGET 


11339 


25240 


A 


11437 


288 


366 


KMFFSWA*WLMPIIPELWEAKAGGSL 


11340 


25241 


A 


11438 


160 


370 


LW YH YFPVRDGLTI LVT YQLKAI RRLLT 
LKSRSGAVAHTCNLNTLGGRGGW I T*GR 
E FETS LANMEKPHL 


11341 


25242 


A 


11439 


230 


404 


MQHGQVGI Y SQGGGWRPVDKKNSQLGTV 
AHTCNLITLGGRGGWIT*GQEFKTSLTN 
MV 


11342 


25243 


A 


11440 


302 


54 


LSGVWFLTPPPKGGGFPTP*NGGAPGFF 
PPPPFKNPPPEGKLGGPKKKKNFPPPGG 
EKLVFLKGAPPFFFFFFFFFFFFFFL 


11343 


25244 


A 


11441 


271 


416 


KRTLSTELQYMCCLLI KSKS I FK* KKKK 
KKKKKKKKKKKKKKKKKNPPSLP 


11344 


25245 


A 


11442 


254 


392 


SVTGMTIMTSSCCYKIQKCTKIQKLARH 
GGTCL*SQLLRRLRQENS 


11345 


25246 


A 


11443 


390 


150 


LLKNQKTLGGPARNPPPLGGPNGGVPKA 
GNWGPPGPPGGTPFFFKNQKITPGGGGG 
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E=Glutamic Acid, F=PhenylaIanine, 


sequence 


sequence 




09/515,1 
26 


location 
correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, l^=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GJutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














P * FPPLRGGGPGNPLYPGGQGCH 


11346 


25247 


A 


11444 


122 




PAPLFFFFFFETQSCSVA*AGVQWRNLG 
SLQPPPDDGTRL 


11347 


25248 


A 


11445 


272 


403 


RPKVDKTTKMGRNQSRKAENSKNQSASS 
P PKDHNS S PAMB * S WM 


11348 


25249 


A 


11446 


70 


397 


LCHCAPAWATERDPVLGEKKRTMMSYCH 
PPMRMVEKKIVGDIKC**GYGBTGFLTH 
C*WECKIWQIL*NIV*QCLKNLNVHSPY 
ELAIVLIiGIYPREMKTYIFTKKTSM 


11349 


25250 


A 


11447 


3 


401 


HASAHASAHASAYYRLCAYPKS S FLYPS 
SFLLWSNFPLNILDTTIGPKATETLFI 
KHVMHDVISMQLKSKTFYNYFFLLSLNV 
FYFVLSCCKRRKRSQLGMVSHICNPSTL 
GGRGRWIT*GQEFETSLANMA 


11350 


25251 


A 


11448 


333 


1 


KSGCIPPKGQWLFFLQKTAKIGSPKPV 
SPPRGKGQFFPQGLRGFY*FLPPPPGVP 
E I FFF FF * DSSHS VTLS PRLEYSGMNT A 
HCSLHLPGSSNPASDSHIAGSTGTCHH 


11351 


25252 


A 


11449 


104 


402 


WCRPLLGGVSQLG YMGVMDP FKEAVCL 
FS ELKHCAGRTTALFRAVRQGCLSLKNF 
LLPFFQLCPAHRGGVL*RQ*ALLSCGGL 
HPVRASRPLCLPTQA 


11352 


25253 


A 


11450 


198 


2 


KYKFMAGLGGSGL * SHHFARPRQQDGLS 
PGV*NQPGQHGETPSLQKKVSWAWWHAS 
VEAEAGEQP 


11353 


25254 


A 


11451 


171 


3 


TPPPRATSLFFFF*TESHSVSQECNGTI 
S AHCNLCLPGSS DS PAS ASRVAG I TGA 


11354 


25255 


A 


11452 


177 


461 


PMTGGKYI SNSTCSKPNSLSPHLPLKTQ 
TKNMEHFGRLTRGDCLSPGV*DQPGQRG 
KTLLLQKI SQALWDVTAHAWWPGLPTR 
ERRRSLSFPTG 


11355 


25256 


A 


11453 


247 


3 


EGECCGIESR* ILHLKGRICWHINYLKI 
TKIKN* VI LKKMYKQKLRPGAVAHAYNP 
SALGG*GGRlARAQEIiETRLGNIVR 


11356 


25257 


A 


11454 


328 


57 


EVHSNICATLSTHQMKILTKGTRHQHIS 
*GFF*EGEVLTLSTKLECSGVIMAHCNL 
NLGGSSDPIiALVSQVAETRGASYLLLAI 
CPKVNK 


11357 


25258 


A 


11455 


70 


435 


ATRAKLHLKKKKEGRKKEFFPHWQGQPT 
PALVFPQPRTHFIFKVSIFSLKKNREQL 
P YTVQTQS LLPLNSHWGQVA* AGVQWRN 
LRFSCLSLPSSWDY 


11358 


25259 


A 


M56 


112 


401 


KCWDYRRERPCPAPNICIF*LTFFFLGK 
KLNQS FFFFFFLKGSLVLPPRLKGGGAF 
LFN*NLTLQGKNNS PAS PS *RAETLGLG 
PHAKLWFVFLKK 


11359 


25260 


A 


11457 


198 


3 


EKTSVKQSGDSNKKAWLEMKTNKKSLKR 
+GWAHACNPSTLGGRGGWIS*GREFET 
SWTNMEKP 


11360 


25261 


A 


11458 


179 


1 


FFHMYNHIVCEYTI7LSVNINSQGPGPR 
AHAYKPSTSEDRWI P * AQEFKTSLANMV 
KPH 


11361 


25262 


A 


11459 


210 


5 


RLIHYFFKNNLPSGGGSTA*GQEFEIRL 
DNTVRPHFLKK* KI SQVWWCMWVSATQ 
EVEVGNGLS PGV 


11362 


25263 


A 


11460 


139 


2 


RILiALSKLiFFFFFKQESHFFLPMLECSG 
V I SAHCNL* LLGS SDLS 


11363 


25264 


A 


1146! 


3S8 


272 


S * KKKKRGGRFKGSMFLS PAGQGwP FF'M 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=PhenylaIanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, D=Leucine, M=Methionine, 
N=Asparagine, P^Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, ^possible 
nucleotide insertion 














GPLKSISLAW*QRREGKNLGAPHITRL 
GAHPLFAGWHKTQGAEF 


11364 


25265 


A 


11462 


2 


387 


CWKSNNKKAPPCVAALRFRWKDDMDEM 
APGKHWGGLTSKVQSGGLMGRYTGPLTK 
TENS ILLKNPYFEAGHSGSRL*HQHLGR 
PRWEDYLRPGGQGCSEL+SSHCTFT*VT 
EGDPVSKKLKRTIFGG 


11365 


25266 


A 


11463 


325 


398 


SLPTWAVCAPRPSG*GSGIPELKTMLAG 
VILEDYLDIKNFGAKWGLSCTLATGST 
LFLGKVGPFVHLSVMIAAYLGRVRTTTI 
GEPENKSKQNEM 


11366 


25267 


A 


11464 


174 


2 


YLE I LFFFETGSRS ITQAG VQWSHPGSL 
QTPPPCPSNSPASA*VAAHASADAWADA 
W 


11367 


25268 


A 


11465 


105 


411 


TDVELLLMDE*RKWFLBLETTPGEDDVN 
IVKMTTKDFGYS INLVDKAVSEFKRIHS 
YFERGSAVGKVLSHSIVC*GEIFHERKS 
PLMQQISLLTFILFYFSE 


11368 


25269 


A 


11466 


291 


404 


SQEGVKIKRQGTVAHTCDPSTLGG*GGR 
IT*GQEFKTS 


11369 


25270 


A 


11467 


64 


236 


THASGLDLLTS*SSCLGLPKCWDYRHEP 
PRPARFSSFYSGSLLN YLAKI IKRDAFC 
I 


11370 


25271 


A 


11468 


278 


416 


NPQIS CLKQHVWGAVAH I CNPSTSGG * G 
GRI T * GQELKTI LVNMVK 


11371 


25272 


A 


11469 


178 


325 


RPEVKDQGASCRDQPHRVGGLLPVCSDE 
R VP QKKK KKKKKKKKKKKKKKS KKKKKK 
KKASSGGARF*KKKKKKKKKKKKKKKKK 


11372 


25273 


A 


1 1470 


264 


60 


DINHQSSWLSGRWTPLSWVSISQREDRR 
GFSASIYIYIYIYIYIYIYPPIHIYIHT 
YT* S YHFNQNT 


11373 


25274 


A 


11471 


95 


399 


DFPHLLLFLPLSQFCTACCPVPENRCFM 
YFFLIFSALCWEMNSRISHSNTARVEQC 
CKVFKILQMKSVLGQVQWLTPVIPALWE 
AEAS*LPEIRSRNQPGQ 


11374 


25275 


A 


11472 


139 


2 


NHVSTEKLSLHTMAHTCNPSTAGGQGGQ 
IT*GQEFKTSLATMAKP 


11375 


25276 


A 


11473 


179 


3 


RARWNPQVFC*GFLPVFPKILPIWPLRK 
GVPFPPNFFFFFLRVSLCHPHWS AVARS 
RL 


11376 


25277 


A 


11474 


282 


12 


KYIRNE* IVLFVIYFCRNRISWCCPGWS 
* TPELKRSSCLCLPSCWD YRHE PLCPAE 
MNKLDRLNKMDTTKWI QQKKKNHRTQRQ 
DNKII 


11377 


25278 


A 


11475 


145 


3 


VCMQ LRTD FQ PGAVAHACN P S TLRC * GG 
WIT* GREFKTGLNNMEQPC 


11378 


25279 


A 


11476 


61 


397 


PQTPRLKQSYHVSLPSSWDYS W I YHERL 
VNPGRETDWATCYSGGNIQLQENKLNTP 
TDSTLWVSLCCPRLECSSAIPADCSLNL 
PG* SDPPTSTSHVAETTGACHYAQLIFG 


11379 


25280 


A 


11477 


156 


3 


FIFIiKSSQNSYI * KCWPGAVAHACNLDT 
LGG*GGWIT*GHRFKPTRRPRV 


11380 


25281 


A 


11478 


224 


383 


CWVCLIHRAFLAPRPVLGCQ*QKKKKKK 
KNKKKKKKKKKKDSRGGG 


11381 


25282 


A 


11479 


339 


58 


NFKNKLCFCASGYLDSKDSQHYI IFFIR 
WSLALLPRLRWEDCLNPGGGGCSEPRSY 
HRTP AVJV IE* DPVS KK * I K* NKI K* KQQ 
DWLRLKDKIF 
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Predict- 
ed end 
nucle- 
otide 
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correspon 
ding to 
last amino 
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peptide 

sequence 


Amino acid sequence (A-Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leu cine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P^Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


11382 


25283 


A 


11480 


244 


407 


SFGKKIGFLFGPKKFNGGSYGNYYKINR 
RFFFPILGEGFSPHVYF*NLETRTTGGG 
LLETTKTTFNLLGRKLGS FLGPRSLTAA 
PTGTTI KLTGGFFFQFKGKVFPRMYI FW 
KDEWKKGF 


11383 


25284 


A 


11482 


308 


30 


SPQKNPPTYSGFKKNLSASPGQKAKPRF 
FLKNQFLPGIMGGGRQFPLLRRVRPPNC 
FNFGGKRCN* QKLGPCPPGGPKETVSKK 
KKKRERQQ 


11384 


25285 


A 


11483 


226 


407 


NHWNKK* FWPKVAHTYNLSALGGQGGRI 
T*GQEFETSPSNMRPHAS 


11385 . 


25286 


A 


11484 


10 


411 


QPP I YFLSLRI SLFWVFHI NG I IQYVAI 
CVWLVSCHNFSRFICVAACDQDFIVFLW 
I^IPLCGYNILCIHSICPNWRTTGLFP 
LLAI I IKL*TFVKKKKKKKKKKKKKKKK 
KR 


11386 


25287 


A 


11485 


184 


1 


TDEELLPMDEQRKWFLEKKPTSGEDAVN 
I FEMTRKDI ECYVH * VEKAAAMYER IDS 
NFLSN 


11387 


25288 


A 


11486 


435 


222 


AKNLNRHFSKKVI * MASRH I KRCS I SLT 
IRET*IKSIIR*HVIPGQMTFIQKTGDM 
FGSVSPLKSKHLKL 


11388 


25289 


A 


11487 


317 


408 


FRPGTVAHACNPSTLGGQGGWIT*GKEF 
KT 


11389 


25290 


A 


11488 


340 


469 


GKGDTKCHLWLGTWHACDPNTLGGQGR 
RI A* GQEFKTSIiSNM 


11390 


25291 


A 


11489 


38 


389 


KRPTSLKKSNFCSLQGFFFFGIiSQKPGK 
SNRS PGKKKNP FFSGGKAP * KGF* KI FL 
SPGLGGSPQKPHLLGGLGEKNYLTLEKG 
GCRDPK*CPCFPPWAGEGNPFSKKKKQK 
FPFFG 


11391 


25292 


A 


11490 


235 


2 


FVLMWLMPKLHI PVTEFRRLFAFPEFL* 
FLFCFVLSYFVLLETGSHYVTYAECSGA 
ISVHCSLNLPGSSDPPTSVLV 


11392 


25293 


A 


11491 


25 


417 


GTLCLRIGFINISHHCMILKENSLFFFF 
WETKFYFAPQADGRG PNLG * WNP PSPG * 
RGS PG P TS RKRGNGGGGP PGP V I FG FLR 
KNGVP PGGLKGPKS LAPG I GP PGPPKDR 
GLRGGPPAPGLLKKNPKW 


11393 


25294 


A 


11492 


234 


438 


MWI I AQQCDCTECHY*LKMAKMINFML 
FI F YHTHTHTKTHTHTHTHTRERGPS YI 
YGKVS FKLTHL 


11394 


25295 


A 


11493 


344 


477 


CFTMFIS* IKT*KFNISQLVAHTCNPSI 
LGGQGGRIT*GQEFKT 


11395 


25296 


A 


11494 


35 


482 


GIPGFHHVGQNGLDLLTS* S IHIiGLPKC 
WHYRPGPPRPAISITPST*AYLLGNQKI 
GVTKFLAILALFFSGTKAMISPKYVCI I 
FYLLVEEHLGCFYFLAINTATCVCVYKL 
FDTYMYQVESHATHEFISYYFFKILCSL 
SIRLYAVLL 


11396 


25297 


A 


11495 


307 


484 


LS PS ERHELKCVLW PG AVAKA YNQS TVG 
GRGRQ I T * GPE F 


11397 


25298 


A 


11496 


487 


349 


FLIKTEFHPIGQEGFDFLTL*FARLGLP 
KGWDYKGEPLRLAFWHPF 


11398 


25299 


A 


11497 


224 


417 


ASGASALKVELCSFSRYKIYPRHKRRYA 
RTDGKVF* FLSAKCEGITFFLSFFFFFL 
EVLFCSPG 


11399 


25300 


A 


11498 


17 


411 


KRLVFGGRGRTDRF I S QxV A AE YVS INI* 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, PHProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LIFLTKKKKKKKKKKKKKKRGGALKKKP 
WGGQNKTGEKKKNFFFLRGGKKKPWGDF 
KKKNFFWGGENWGKPPPKKKTPWGKKKI 
FKGKGGKTPPYFCFLKKMG 


11400 


25301 


A 


11499 


117 


370 


VSESADLAWSLGICFFIEFPSDADVAGL 
ETAF FLGGWTGS CS WQPGV* WCNHS SL 
QPRPSGPRQSSCLSLPSSWDYKYMPPCL 


11401 


25302 


A 


11500 


244 


1 


EELFECYSCSPTFSSDPLTTPLLILTT* 
LLPLTIMASQRHLSSEPLSRKKLYLSIL 
ISLQISLIITFTATELIIFYIFFET 


11402 


25303 


A 


11501 


130 


55 


RFVCSTIKVLRDLSSDRSNPGRFLSTSN 
SSLKKKKKKGRPF*RDNSAIIiF 


11403 


25304 


A 


11502 


186 


421 


SFRLKI AGI IGVHHCTWLWLILNLVLL 
K*ITFALKIMMFSKYSDIGQAQGLMPVI 
PALWEADAGGSLQPRSLSLGNK 


11404 


25305 


A 


11503 


1 


213 


GELLFLYI FA* WLFLLPKLLKFVI VCLC 
ELQFNERFIMVLCI I I ILGVGGFFLFCF 
LL* FCFDFFFFFWRG 


11405 


25306 


A 


1 1504 


82 


6 


AGVQW PRS WLR L P * L FFTQVLM I F F P P F 
YHQNFFFP*KGFFFLGGLSHFFPPFNKG 
FFFNYPQGFFFSPSLKKKFFFFSSPIFL 
APPGIFL*GFPRFFFFFFFFFF*EGVSL 
CHLGWSAVAAILAPPAGFTPSSCLSLPS 
S 


11406 


25307 


A 


11505 


245 


440 


WAP CI G * QLLKDPQVLFAG YKVPHPLEH 
Kill RVQTTPD YS PQEAFTNAI TDLI S E 
LSLLEERFR 


11407 


25308 


A 


11506 


256 


370 


GKLM * GHHVGQAGLKLLTSSDPPALASR 
SAGITGVNRHAPPRLNVTLYTKGQTI IL 
YHNQYNQ 


11408 


25309 


A 


11507 


1 


149 


GOVIWPRTVLFAFQHKFVNFLKTLGDFGQ 
VQWLMRVI PALWEAEAGKLPEARRWRPP 
* PRTVL FAFQHKFVNFLKTLGDFGQVQW 
LMRVI PALWEAEAGKLPE 


11409 


25310 


A 


11508 


316 


420 


YT Y I YE PG * VQ WLT P VI PAL WEAKAGGS 
LEVRSLR 


11410 


25311 


A 


11509 


136 


2 


LDNRLWPGAVAHACNSRTLGDQSGWIT* 
GQEFETSLANMMKPHL 


11411 


25312 


A 


11510 


167 


420 


PTPRSGNTALQPPHHTHTHTHTHTHTHR 
HTHTHTHTHTRAC I * KVRVRALFS PKKK 
PLFCLS VSQI YVGRCLLKRLLFI FLYRH 


1 1412 ^ 


25313 


A 


11511 


124 


415 


IYISVANTIILIFIAIHVLAJSFFFFKK 
EVLFLPPRREGGGAF*VN*NLCFWG*GN 
FPA* PSLKKGITGALYTPGLFFVFLKKT 
GFRHVGQAWLDFL 


11413 


25314 


A 


11512 


121 


1 


PPVKFWAPFFFFF* IGSPSVAQAGVQWN 
NLSSLQSPPSCL 


11414 


25315 


A 


11513 


245 


13 


VLFILSGFFGPINYPTSPPPQLPFPASG 
NHPSTLY LHEFNCFYF YFLF IYFF* DRV 
SLCRPGWSAVARSQLLVPSDS 


11415 


25316 


A 


11514 


434 


2 


RPPFFPVWGSPPPFFFFFGEGIWGQEER 
KGASSPPWSRGILRGGGPFGGPGAPPT 
PLKKPPLFFFLAPPFLGVFPRFFS FPTV 
FFLPFFVCPKKREALLGVPKGPRPFFFR 
SFLFFFSGD*VSLCRPGWSAVARSQLTE 
SLAS 


11416 


25317 


A 


11515 


247 


390 


QGLRIYSQDFFFFLETK3RFVAQAGEQG 
GNLG ^LEPPPPS - SDFMAS P 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unkno\vn, *=Stop codon, /=possible 
nucleotide deletion, ^possible 
nucleotide insertion 


11417 


25318 


A 


11516 


269 


368 


FFFBRESCSVTRAGVQWHRLSSL* PLPP 
AFRRF 


11418 


25319 


A 


11517 


256 


404 


NILKI FYLRI * KKKNQLGWAHACNLRT 
LGGRGGWI TRGQ* FKTSKEKMVK 


11419 


25320 


A 


11518 


211 


404 


YLCFTRCSVHSHLK*VFLHKCVI*MYRP 
HT*TGL* CMQVHTHTHTHTHAHTHTHTP 
LSQGGDCI 


11420 


25321 


A 


11519 


41 


232 


ESVEPRRRRVPRAQITPLHCSLGDRARP 
CL*SQLLKRLRHENLLNPGGAGCREPRS 
HHCTAAWATEQDPV* KKKKS LN FKTVRA 
INLKMLGALISFHLFLWGIWF 


11421 


25322 


A 


11520 


146 


2 


LIDQQGVNQLPRMECSGAITAHCCLELP 
GSSDPLTSAS*VAGRPPTRP 


11422 


25323 


A 


11521 


297 


437 


TDHFWNR I hSS CGDTFRBTWLG WPHAY 
NPSTLGGRARRIT*AQEFE 


11423 


25324 


A 


11522 


244 


460 


NCATI TTNSRI FLQLQKKHCTKWLSLS I 
SWQSLISF* * I CFVLFFETNLAVSLRLE 
CSGAILAHCNLCLLSS 


11424 


25325 


A 


11523 


222 


1 


ESKDLDIFRCQLAPRNSSCFVLFCFVFP 
LEIGSCSVAQAGAQWCNHGSLQPPPPIA 
S AS * VAGTTNVRHYVQLF 


11425 


25326 


A 


11524 


308 


424 


ITFLTMFKIYLQFDPAIPLLGIYPKDY* 
SCCYKDTCTRM 


11426 


25327 


A 


11525 


98 


2 


NIGGRPGAVAHAYNP S TLGGRGGQ IT * G 
QEFE 


11427 


25328 


A 


11526 


113 


399 


LDRFLTLLPRLRREDCLSPGGQGCSEP * 
SHRCVPAWATE * DLLS KKKKKGVLEKP F 
LPPPSVGKPPFSPPERLFFFI LRPPLGG 
VFPPPSKKIIS 


11428 


25329 


A 


11527 


280 


3 


PKYCQI*VLQCIAQLFKLSIYNTYTHTH 
THSLSLSLSLPPPKVNKWLMSSEFTLTF 
KM YTFGPGRVTHTCN PSTLGG * GGWIT * 
GQEFETNL 


11429 


25330 


A 


11528 


284 


2 


FWGPQKKNFSFPPPGLKIGSFKRAPPFF 
FFFFETKSCSVTQAGVRWCDLGSLQPLA 
WATERD* LNK* I KGMS KKLHT * VCMFRT 
RGRTRGRTRG 


11430 


25331 


A 


11529 


134 


2 


DTLLPRLECNGTITAHCSLKLPGSGDPP 
AS AS* IAGTTGMCSHA 


11431 


25332 


A 


11530 


104 


1 


GRGFFFFFFFFF* EVE S CS VAQ AGVQWL 
NLGSIQ 


11432 


25333 


A 


11531 


157 


373 


CVFYTYSTSQLRCYI FIGNTRSVFRCHE 
IYT* KNRFWASTVAHTCNFST LG S RGKW 
1 1 * G * EFMTS LANWR 


11433 


25334 


A 


11532 


237 


389 


ICKCSKTGFG*DFQEFETSLGNIVRPCF 
YRKIKASQV*WHAPIVLATWEAE 


11434 


25335 


A 


11533 


259 


122 


HISPS VQRHFFLF YF * TKPHL VAQAG VQ 
WCNLSSLQSPPPGFKRF 


11435 


25336 


A 


11534 


164 


1 


S S WSQALI LLFLKKTCFWPGAVAHACN 
PCTLGGRGERIT*GQKFSTSLASINL 


11436 


25337 


A 


11535 


106 


310 


GGDKGTVWDTLHFFDGQKRLDHNKDHWM 
TIHAELPHELSA*CYAFKKECISGIGSI 
* AEKE YKI QFDG 


11437 


25338 


A 


11536 


167 


378 


ASFFGTTITYKTILVYVYMFEKFIIYIH 
MYTF*RQGLVLSKLECSNVI IGHCNLKH 
LGSSNLPTSASRVA 


11438 


25339 


A 


11537 


101 


396 


VNNLKQRIGLSKRGEKKIiLVLISGFGGG 
FPGAGWI KKGRPPPGFPQGLGP * KRGPP | 
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Amino acid sequence (A=Alanine 
C-Cy ste'me, D=Aspartic Acid, 
B=Glutamic Acid, F=Phenylalanine, 
G^GIycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine ) R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine t 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \~possible 
nucleotide insertion 














GKKGNFNKMGFNPKTLVFKNQNFGPGGA 
KPGT PGVRGP LGDL 


11439 


25340 


A 


11538 


18 


361 


QIGPRLY*DQNGETPSLLKIQKKKIPGF 
GGGPLKSQLLGGLGWENH*NPGGGGCRD 
PK*PQGLPPWGTNQNFVKKKEGGREKGK 
EGRKEGGGEKKKTLPKGIGALNLGGLLT 
SP 


11440 


25341 


A 


11539 


134 


365 


PGFQNKKTGGGGPFRPKTKTNFKVF * KY 
LKICFGFWPKRPPPPGFFILETRSLFFI 
FFLGFFF*ETKSRSVTRLECSGVILAHC 
NPRLVGSSHYP 


11441 


25342 


A 


11540 


2 


318 


TTERFIYRITFGPKETEVNDLPGAPSEP 
VAALRLEARSFILKL*SFSLVFDFFFLP 
F*KTGSCLSPRLECRGEIITVHCSRELP 
S*SHPPTSASQSSGSGGFHLG 


11442 


25343 


A 


11541 


294 


409 


GT I S F FRT CRLGMVAHTWN PS TLGGQGG 
*II*GQEFQT 


11443 


25344 


A 


11542 


1 


123 


GKQRQE PALS YDHATVLLTS * SAHLGLP 
KCWDYRSDPPCLA 


11444 


25345 


A 


11543 


69 


350 


VGAKHW IHMDTKI ETADPGNYYSGERGK 
R PRAE KL PVC Y YAL YLGNG I ICNPKPPC 
C WKGHD F I Y I YVCGH I Y IH I Y I CTH I Y I 
YFF*DGGLLC 


11445 


25346 


A 


11544 


103 


340 


LFCAFFLSPSKLDFYHLSFYV*TLFLKD 
LISLAFFFLFWEAGS CSVTQAGVQGPNS 
THCNLCLPGSNNPSTSASQAAGT 


11446 


25347 


A 


11545 


256 


354 


PYPFGKPKRGDHWGLGV*NQPGQQGETP 
SLLKI 


11447 


25348 


A 


11546 


196 


335 


EIVARLKRSPVGRAQWLMPVIPALWKAK 
ESGSPEVRSS*VKKITCRPGTVAHACNP 
STLEGQGKWIT*GQEFKTSLANMVK 


11448 


25349 


A 


11547 


202 


377 


TIHTKGVMVGNFILWFYQNFFKTFF*D 
KILGWARWPMPLI PAFWESEMGGLLEPK 
SL 


11449 


25350 


A 


11548 


396 


26 


PPRIKKEKRSPNGAFYPFPCWNF*KKKN 
LSLAKRFPRDSVFFFVSW*GSYLPTTFL 
QLLRW YVLTSH I AVS F KHLCE PHKNFTL 
QHS I P YLY I I E I E VYS YFINRNT I YKIN 
MDFS I VGFVFL 


11450 


25351 


A 


11549 


232 


371 


RE VP FEN I KIGQVQWLTLI I PALWEVKA 
GGSLEARSL*KYQNWPGAVAHAYNPSTL 
GGQGGR I A* GQKFE I S PANMAKP 


11451 


25352 


A 


11550 


174 


381 


NRDEGFRYADRAEVQRLLTGTILEHCGL 
QRPGL* RS SCFTLPCGWDYRH* SPSHVA 
GTTGISHHTRLIKKAFFSQTGVSSC 


11452 


25353 


A 


11551 


262 


359 


DYSNLGLVWWLTTVI PALW E AKAGRSS E 
VRSS*PALWEAKAGRSSEVRSS 


11453 


25354 


A 


11552 


70 


380 


LHKCS F I Y INHCNNP I YGL I SLYPKQLF 
SYENFKDFCNKI * P * I PNFELKTSTP* L 
EKL*NWAMARHGGSRL*SQHFGRPRRAD 
HMRSGVPGQPGQRGESPST 


11454 


25355 


A 


11553 


319 


56 


TFFLGPPGFF* KI FFS VGPPLI FFLPKK 
FIGEQKKNAPGKNFFFGFLFSLFFFFLL 
TLSSALECSGAMSAHDNLCLPGSSNSPA 
SAF 


11455 


25356 


A 


11554 


230 


399 


MMGE I INQAN 1 1 S VCQ 1 1 RRLRHE SHL I 
PGGRGYSELRS YHCTSTWVTE * DS CLKK 


11456 


25357 


A 


1 1555 


376 


396 


F * I FWRDGVS PCCPGWSQILRLQQS TCL 
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E=Glutamic Acid, F=Phenylalanine, 
G=Glycine } H=Histidine, I=Isoleucine, 
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nucleotide deletion, \=possible 
nucleotide insertion 














GLPKCWDYRPPHPATAASCPYFYHYYLL 
KIWRYSFDLSCLWNLVVWGLAFG 


11457 


25358 


A 


11556 


176 


384 


KAWPNVRNNYR * NFFFQ CCLNNFRKRLK 
CFRCGADKFGKTGFSCLQYALKPTNWGR 
EQWLTVNPTNFGK 


11458 


25359 


A 


11557 


295 


391 


PYKLLITHTHTHTHTHSRTHTI *HTTLS 
LYTF 


11459 


25360 


A 


11558 


1210 


1753 


NTWCMERLAVAFGSFNRNSPLRGWALV7L 
TPI ILALWEAEAGGPLEPQEFETCLGNM 
VKHCLYKNLKNVSGV*DLPGQHGETLSL 
QKFKKCVRRGGTCL*SQRLRRLSWEDCL 
SQGGQGCSEP*SHQCVPAWVTELDPDSK 
KTKTKKFFGLPSPFLFVCFLTHIYVNKE 
Y AF VLAEE AS GKTTS KLTM VTS RNGLGK 
TKKFFGL 


11460 


25361 


A 


11559 


172 


3 


KQKFFFPPPSHQIKIFGGGAPEMPFFFF 
F FE KE S R FVA * AG VQ WRNLS LLHPP PH P 


11461 


25362 


A 


11560 


143 


1 


GQAQWFMPVI PVLWEAKAG* SLEARSSR 
PVCATATPPQLIPK*YEGL 


11462 


25363 


A 


11561 


84 


2 


KPLDTLGGQGGCIT*GQEFETSLANMV 


11463 


25364 


A 


31562 


12 


387 


QGILLPCFSMSSEERGRIYSNTFSFFFF 
FGKGGQINPQGGGQATEPRLREPSPSGL 
TLQGPGNGGPPPPPGQNFFFKKKGGYPG 
GAGGVLNPGPKGNTPPGPPKARKKRGGP 
PGPGQKFL*NPKG 


11464 


25365 


A 


11563 


230 


375 


RAKG KLLSES LLS I QS F KKP S VHTCNP S 
TLGG * GRWI T * GLE FTTS LA 


11465 


25366 


A 


11564 


287 


55 


PPDGRVQWLTPWPGLWEAKEGGSLKSR 
SS* SQINPVGSVPYSLRVLR 


11466 


25367 


A 


11565 


115 


411 


FFFFFFFLGENSFFSFRGGPRGGGHIFW 
NFPPPG+GVFPPPPPPGGGNKGGAPPPG 
FFFFFP * KTGFPPLGPGGPPPPLFGPPP 
FPPPKGYNFRRDPPP 


11467 


25368 


A 


11566 


139 


356 


CLLIGWYNTGCITQGSTRKIGTTFFLRQ 
S SS VAQAGVKCSGM I LCYRNLRLPGS SN 
LPDSAS * VAGTTATYP 


11468 


25369 


A 


11567 


101 


410 


FLFFFFFFFLKKI FFFPPRGGGGGGN* K 
KQTPPLGGKKNFPPPPPKRGGGGTPPPP 
PLLFLVF *GEGGFPGGAGGGKKTPKGNP 
PPPPPKGGEMGRGPPPRGG 


11469 


25370 


A 


11568 


417 


56 


EDCLRTGVRDLLPNPHPNLQIAGRDGAR 
L*SQLQEHHLSPGFRGYSEL*SCHCSPA 
WATE * AYRKQNKTKQNYRSSALLHLYTP 
MATAWIKIAFLRAGDI FLSVLSFLPNS Y 
PLFSHPSQ 


11470 


25371 


A 


11569 


301 


443 


EDTT F VC FEVE S CS VAQATDS PAS AS * I 
AG I I GVCHHARL I FVFLVE 


11471 


25372 


A 


11570 


44 


408 


RGRGHG PHS PWRRRRRLRWEGRKATBS 
KRRAYKYPLPALLCSHQWKPNEWLDSQI 
KIPTGL**LK*PSFIFIFIYLFWKQGLS 
LAPMLE CRGAVLAH * HLRLPRS S LSHAS 
ASQVAVTTG 


11472 


25373 


A 


11571 


201 


1 


AERPVTGPPVYAEPGLQGRQG* KNMFKN 
MQHPRWADHLRSGVQDQPGQYGETPVST 
KNTKITQAQCI 


11473 


25374 


A 


11572 


343 


1 


FGLL FAFT LEY KLY ES I NHFAHCC VCRF 
SSGTLHWVCNRHPNKLDEWNSGGGGRQ 
DLTLT PRLGCSGY I MAKGHLVLLGLSN P 
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peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














VSAS * VAGTIGAHSHAWIDKLFFVKTGP 
LY 


11474 


25375 


A 


11573 


209 


3 


VGPPPKSLPGGFPFFFCPKCFPRLKKKK 
KKPGLVAYTCNLNTSGVRGKKI T* AQEF 
KKSPSNIVKPWY 


11475 


25376 


A 


11574 


312 


430 


QRYARPGPWHCYNSSTVGGRSRGIV*A 
QEFATSLCNMT 


11476 


25377 


A 


11575 


125 


3 


NSLGPVVHACNPRSLGG*GGRITSAQEF 
ETSLGKIRRPLY 


11477 


25378 


A 


11576 


190 


362 


CVNNY ID* EKKKKKKKRERE KKKKKKKK 
KKKKKK*GGGAPKKKKKFTPGGGKNFFF 
h 


11478 


25379 


A 


11577 


231 


340 


KR I TRVNQDGLDLLTS * SAPLGLP ECWD 
YTMKPCLY 


11479 


25380 


A 


11578 


60 


3 


RMQVSV* I FTPMS LQAYL IYSS SVAAGA 
QSGI EECKYQFAWDRWNCPERALQ 


11480 


25381 


A 


11579 


285 


406 


KKGNWPGAMAHTCNPSTLRGRGGWIA* G 
QEFETNVGDMAK 


11481 


25382 


A 


11580 


121 . 


1 


NLNCPGPVAHASNPNTIGG*CRPIV*GQ 
EFYTRIANTVKP 


11482 


25383 


A 


11581 


199 


384 


KKTPPGGIFFCPKEAGPAPPL*TPFC*T 
PLFFSPAPFKPKRALKFFLKPKRGFFQI 
PFFFF*DGVSLCHPGWSTVARSCPRV 


1 1483 


25384 


A 


11582 


677 


934 


YGSRHC I CFLQAI SE I LFLKNP ARHGGS 
QVQDQPGQQSKTLTLRKKSK*ARHNGSC 
LSSQNFGRLRQEGHLSPGG*GCSEL*PR 
HC 


11484 


25385 


A 


11583 


104 


2 


KKIGGGPPLFFFFETESCSVARLE*SGA 
ISAHCN 


11485 


25386 


A 


11584 


349 


9 


GGRSLGPRKTRVQWGIPEFPPLHLGGKA 
KPCFPKKKKKKVKGLNKHYSKEDIQMVN 
RYMKKCSTSLPRERKIKTIMRYHFPSIR 
MAIIRKIKDPKDQ*GCIEKGGSWRPMY 
L 


11486 


25387 


A 


11585 


163 


1 


TAVRIKHNSYTLTPCLRHSRYLINVTCW 
PGPVAHACNPSTLGGRGRWIT*GQEF 


11487 


25388 


A 


11586 


192 


41 


S KS KYHI YGDSALQKSTVYKW ITHFKKG 
*DDVKDKAHSTRLSMSICEEKE 


11488 


25389 


A 


11587 


30 


374 


GWSPEDLFPGSLPPALKGFSGEGG* PFL 
P FQKKRGGGGGSS PPFPVLKRVRPENRL 
YPGGGGFR* PKQGPCPSAWGAEPDSLSK 
KKKGFGQKKKKKKVLKEKT 


11489 


25390 


A 


11588 


66 


245 


SLCFFVFFVFF*TESCSVTQVGV*WYNL 
GSIjQPPPPKKKKNIKKQKKKINWGRGFK 
RRL 


11490 


25391 


A 


11589 


243 


402 


FLHLKKKKKKKKKKKKKKKKKN+ KDAG 


11491 


25392 


A 


11590 


205 


3 


I RIiLG 1 1 KNPTPAHCQ * K* KTGE PLW* V 
VW* FFRRLHTELS YDPAIPLLCINSTEV 
WLAPGIPPRFR 


11492 


25393 


A 


11591 


404 


152 


MG FHKVS QDGLDLLTS * STLLSLPKCWD 
* CWDYTREPLCPGHFAI SFNKWLNTTT 
WLYFYLI ILHCI YCPYKLLFLFPFYIY 


11493 


25394 


A 


11592 


311 


386 


TVYPPLAGNIAHAGA* AGI VGTALS ILI 
RAELGQPGALLGDDQIYNVIVTAHAFVI 
I FFIAI P I I IGGFGN*LVPLI IGAPDIA 
FPRINNISF*LLPPSFLLLLASSIVEAG 
AGTG * TVYP PLAGNLAHAGAS VDLT I FF 
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peptide 
sequence 


Predict- 
ed end 
nucle- 
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correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A^AJanine 
G=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glyeine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine,M=Methionine, 
N=Asparagine, P=ProIine, 
Q^GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 


11494 


25395 


A 


11593 


39 


397 


EGCPLFFFVFFFFFFFGGKKI FFFPPRG 
GGGAPHKIPGPPPPGGNFFFPPPPPLRG 
GKRGPPPPPGFFFFF* KKGGVSLFPRGG 
VFFFSFWRGGRGGFPPPPPQKKIFLKGP 
PPFYKKK ' 


11495 


25396 


A 


11594 


243 


46 


GILSCCFHDSLSLNSL*CD*VWISLCLS 
YLVSI ELLGQCGLIFFIKRES FWLLLLK 
MYLQPLSFLS 


11496 


25397 


A 


11595 


327 


2 


PRAILRIQFLLWPMRPYVMWSQPCSLCF 
CHTAKT V PTSR PLLLG FFS L PG I AH I S M 
WLVLS YGLGFS S KVTLWPG VVAHVCNPS 
TLGG * GGW IT* G WE F KTSLANMVK 


11497 


25398 




11596 


292 


384 


LGTVAHTCNPSTLGGQDGW IT* IQEFET 
SLR 


11498 


25399 


A 


11598 


125 


2 


PFQGQRSGGTRGPGAVAHTYNPSTLEG* 
GGRIT*GQEFETS 


11499 


25400 


A 


11599 


302 


408 


TGGWLGWAHACNPSILGGRGRQIT*GQ 
ELETRLA 


11500 


25401 


A 


11600 


409 


131 


GRSGGVPKSGVLKPPGPPGGTPFFFKKP 
KITRGGGQPPLFPLLQRVRPENCLSLGS 
GGCH* TKLPPCPSTWGAQ*KLVSKKKKC 
EGGGRIYS 


11501 


25402 


A 


11601 


396 


330 


HHCMLAWVTGRPYLNIYICIYADIYIHI 
YICIYIYTHTRVYIYLLAI* IYIYVYMR 
IYIYIYIYVYIYTHTHGYIYIY 


11502 


25403 


A 


11602 


41 


252 


TPGRARGQRLGHSSTGLDLLTS * SACLS 
LPKCWDYRREFPRPAEDIKFMF*AILSE 
T I LVRGVRS RATP S 


11503 


25404 


A 


11603 


271 


440 


KKKKRGGRFKESKFTSAGLQGNI FFIGP 
PKLIPRAAV*QRGDWXNLGVTQLNRLCH 


11504 


25405 


A 


11604 


429 


2 


fssffaekgchcvsqagfkllssrdl.pt 
saphlgirllgrlrqedppspgv*gcse 
lqsrlctpawitkrdpvs*kfkkqktkt 
hth*rkptkgqdhqyhclppphvmplkv 
frgsnvhgavrtrgsrgsipelshavph 

AP 


11505 


25406 


A 


11605 


113 


2 


TEATHLVCTYKN* KLGWAWW IM P VI PAL 
WEAEVGRLL 


11506 


25407 


A 


11606 


301 


3 


WVMVC S VKN I ALGHACG KFH YEGRG KP A 
EHPSWVLLVLESLQKKLFLSYFLFTFLL 
QPPSPRLT* PVFRFFFFRDRIMLCCPGW 
SAWPERVGRAGNSE 


11507 


25408 


A 


11607 


117 


431 


SCDFCLSFSLSLFFFFFRKGI FFLGPRG 
ENKGGNKI KWTLGPGGKGNSRPSPPKGG 
GKRGGPPPGANLDF*KKTGFPQGGRGGE 
KPPPPGGRAGGAPQRGGKPAG 


11508 


25409 


A 


11608 


277 


88 


RRTTS S WLARNC I QGPAGYHTGS * TPDL 
KGSACLSFPKCWDYRWRLDL 


11509 


25410 


A 


11609 


200 


27 


VRLKPGDSRDPDMQWYYI*ENRPARCGG 
SLL*S *HFGRKRKADHLRLGVQDTRSSA 
R 


11510 


25411 


A 


11610 


17 


414 


TVWLVMQD PENE W I VSGWGGKEE PGR YF 
MCEARSSSMDRVFFFF*KKTLFFPPGGG 
GGAKIWVNG I PPPQGSPLS VAYPLGVWE 
KKGPPIHPHKKRVPPV* PGGVLFSEPGN 
PPPGPPKGVGKRSSPPSPVL 


11511 


25412 


A 


11611 


296 


428 


R I YENWRGTVAHACN PS TLGGQGGW IT* 
GQSLKTSTGTVAKP 
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residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=JLysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R^Arginine, S=Serine, 
T=Threonine, V=Valine, 
W^Tryptophan, Y=Tyrosine, 
X=Unkno\vn, *=Stop codon,/=possible 
nucleotide deletion, ^possible 
nucleotide insertion 


11512 


25413 


A 


11612 


20 


362 


NFVRGVARGSRGGATRCEWVGLRPVGMA 
YGRSPSWLL*VKPSAASHRPPLPRAADT 
PGTAPAPTPTPAPAAAPALTPSSGPGSA 
ALTLEEELQEAIRRAQRWGLTMLPRLAS 
NS 


11513 


25414 


A 


\\6\3 


158 


2 


LMLSRTPLQQL*YTLLELELPRLLAPDL 
LSNGSSLKDLKWTHSNYRASKETK 


11514 


25415 


A 


11614 


3 


410 


RSEVYGLVLTTWYQVNRVPWTHKAARFT 
RVSRWARPRSKISESCYWLPQAHRTKS* 
PLVRTLSGLCS LLG FHPLLFS LLATDLV 
PLPSFALLLLLELFFLPGMPKLAP*SPT 
KRSFSFYFFRDRVLLCCPGWSQTP 


11515 


25416 


A 


11615 


189 


397 


YKVLFLVEKNGFFFLFFETKPCCVAPTG 
VQGPNFG* LQAPPPGLFLFSALiAAPEVG 
ITNGTPPPPRFFF 


11516 


25417 


A 


11616 


158 


1 


KGKPTPKTFFCQKQI FFLVFFF* TESHS 
VARAGVQWRDLDSLQAP P PGFT PR 


11517 


25418 


A 


11617 


319 


425 


KWLYSQAV* KVWL YSQAVAHT YNPNTL 
GGQGWR I A* A YEFKTSL 


11518 


25419 


A 


11618 


94 


2 


CLT I S WLGTVAHVCNPSTLGG * GGH I TR 
SR 


11519 


25420 


A 


11619 


381 


9 


FFPLPLPLPTVSLFPRSPSDAEPKLDCT 
AAI SAHCNLPA* FSCLSLPSACNCRRAP 
PRLTAS AS RGAG I ADGVS FTQ CS MVPRL 
ECSGVISAHYNLHLPATSLGLPKCRDCS 
LCPDEIVDPRI 


1J520 


25421 


A 


11620 


144 


431 


IVNVIVGKGEKGRKIPE*LIRCNGWN*S 
WDRWVAEDHVLRE PS ENRRLARVRCLTP 
VSQH FGRRRRLDHE VRSLRPACATWRNS 
VSTYNTKSGWAC 


11521 


25422 


A 


11621 


174 


449 


VFSLPLLGGASQLGYLEVRDPLEEAVCP 
FSDLKPHAGRTTTLFKAIRQGHLSLQRF 
LLP FVWLCPAPRGGVYRGRQASLS * GGL 
HPVRASRP 


11522 


25423 


A 


11622 


316 


412 


GGPSAVAHACNPSALGGRGGRIS*GQEF 
KTSL 


11523 


25424 


A 


11623 


183 


2 


PKQVYLRNGLLSHSKCNKIHHINRLKRE 
KIQPGWAHAYNANTLGGQGRWIT*GQE 
FENE 


11524 


25425 


A 


11624 


355 


168 


DLL I VCYDPNAINL* SLLLRRLKWEDCL 
SPGG*GYSEPRSCHCSPAWAAEPDPVSK 
KDRNVEFIYCC 


11525 


25426 


A 


11625 


154 


3 


TADVLFLGQAGRGNFWMKKNCRPGTVA 
HACNPSTLGGRGG* ITRGLEFK 


11526 


25427 


A 


11626 


412 


3 


I FFPPPKKDKRTRGGQKKNI PPAPPRNF 
LGRGGNFPPPQKPFFPPLF* I FPRPP* C 
KGSIPPFLFPLCFQKKKTPSFFFLPNGS 
PPICEKI FLFPKKGFLKFKKPRFLKKKG 
PFFFFFGNAVLLCQPGWSAVAQSQ 


11527 


25428 


A 


11627 


290 


423 


VFEYFFKFFKKNNSWPGWAHTYNPSSL 
GGQGGWI T* SQVLRPP 


11528 


25429 


A 


11628 


213 


3 


LSGIYSGNRRLVRYSKSVSVIHHINRLK 
KKNHI I ISIDGEKPLEQIQNSFMT*KRN 
KLKWEDHLRPADA 


11529 


25430 


A 


11629 


152 


2 


FQKNRDLEPSKAGWPGMVAHACNPSTLG 
ARGGQIT*GQEFETSMVNMVKP 


11530 


25431 


A 


11630 


422 


189 


plsgfcpstwgakqralskkkkrseagh 
sgscl*sqylgrlkq:<dhpspggrgfse 
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nucleotide 

location 

correspond 
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peptide 
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Predict- 
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nucle- 
otide 
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peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=lso leu cine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, Y=possible 
nucleotide insertion 














LLNCT P AWVAKQD PVPKKRLK 


11531 


25432 


A 


11631 


239 


424 


KEDKNRVFVIGPLGGLKELAHDKAFKTQ 
NHHWLGVGAHACWPSTLGGQGRWTT*GQ 
EFTTSL 


11532 


25433 


A 


11632 


343 


401 


AREREHKSAACQ * GMERS V I KHCHVATE 
*EREQKRERQQERASEKERQQESERDSK 
RERARESERERESTRAQLVRKIVNQIS 


11533 


25434 


A 


11633 


125 


371 


FQGCAKICLTNTLLPYAHGYLIWRKDGK 
GCHEKMTAKI IHSCKLS PDAL* SQLGVP 
L*THTHTHTHTHTHTHTFSEKKGLVP 


11534 


25435 


A 


11634 


235 


433 


AALTS FSGP FS FCS YL INTLGKNFPNHE 
I STT* FTLQRL* KI IFWPGMVAHACNPS 
TFGGQGRWIT 


11535 


25436 


A 


11635 


93 


385 


FTLKSERRKHSVPQGKKQNKI KLLS PDC 
KLENKR FGWARWLTP V I PAL * EAKAGES 
PEGNGEREEATEPFGSIILGFFFCTQIA 
HLHRSDYEPNEDS 


11536 


25437 


A 


11636 


190 


398 


NTKKSQKAFFKTQGKPFPLDPKKKASKK 
TKFFLPKKISTRKFRTGPWRGTNFSLKK 
AKKNLGKTRKKIFP*RPPFPSISSLFWP 
FFFFLFF*KBKVPLCHRGWSAWQSQLT 
ATSISQMY 


11537 


25438 


A 


11637 


190 


380 


TPKKWPKAFFKTKGKPSPFHPRKKALRK 
TKFFLPQKFP*NPLSPLIPALFWPFFFF 
FFF* KDKVPLCHRGWSAWQSQLTATS I 
SQMY 


11538 


25439 


A 


11638 


261 


91 


EKTLWAGHGGTCL*SQLLGRLRWENHLN 
LGGRG*RSRHCTPAWVTERDSVSNK* IN 
K 


11539 


25440 


A 


11639 


1 


334 


FRANRTVKDAHSIHGTNFQYLVEKIIRT 
R I YES KY WKE E C FGLTAEL WD KAMELR 
F VGG VYGGN IKPTPFLCLTL KMLQ I Q P E 
KDIIVEFIKNEDFK*VQCSLANIRGMY 


11540 


25441 


A 


11640 


193 


457 


DGILLCGPGWSAVAPSQLCSGTISAYCN 
LCLPGSS ES PVSAP * YK3LQACAHAPLM 
FVFLVYTGTCMLPLSAHHSRAANSSHFS 
YPLL 


11541 


25442 


A 


11641 


257 


373 


NT I S CFRSLRPVAHTCNPSTLGG * DGQ I 
T*AQEFETCLG 


11542 


25443 


A 


11642 


211 


367 


DTTSHPFEWL * FKKSRK*RVLDR I WGN* 
MLVHCWWEHKMVQLL* KTVWWLFK 


13543 


25444 


A 


11643 


112 


367 


LQIKTIRYHHILTRMAKIQNTDTNKCYQ 
ECGATGTLIHCWWEFKI IPVI PALWEAE 
VGGLLEVRSSRPAWPTRENPI FTKI IKN 
Y*KYKNTPYIIR*LQIKTIRYHHILTRM 
AKI QNTDTNKCYQECGATGTLIHCWWE F 
KI IPVIPALWEAEVGGLLEVRSSRPAWP 
TRENP I FTKI I KNY 


11544 


25445 


A 


11644 


310 


335 


YHLIAGSLGDIERLSML* IGKAVCHNKY 
* *GCGEPGFLLHCWWEWKMVQSL* KIVC 
QILQT 


11545 


25446 


A 


11645 


143 


455 


KRKGPSLWLLGKSMGGIRFF* KKVFFSG 
FKSLWFFFWPLWKPIiERLN * IKISFYPP 
AKKKGGQRKAQFFFFFFPRSFSLAGCSG 
S I S AHCNLCLLGLS DS PTS 


11546 


25447 


A 


11646 


182 


458 


SRGGIRFF*KRVFFSVIKSIWFFFWPLL 
KPLGRVN * I KI S FNPPAKKRGWERKAQF 
F FF FLPRS FSLAG CSGS I SAKCNLCLiLG 
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Amino acid sequence (A-Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
OGIycine, H=Histidine, I=Iso!eucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V^Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 














LSDSPTS 


11547 


25448 


A 


11647 


319 


3 < 


RSTKYSPRGWEPSMWFLFLKVFSLPFRS 
ILGPPLLGTPLRGSPPPPPPFEGKGFIY 
GLTFFFP*EPPCEIFFFFFLDRVLLYCP 
GWSAWQSAHCSLELKDKVCI 


11548 


25449 


A 


11648 


165 


2 


DQSGRPRKILCAHTHTHAHMHTSLGQT* 
QDPV*THTHTHTHTHTPLYAAAQRIN 


11549 


25450 


A 


11649 


290 


3 


SKQNTTHWATLKSY I YPSQFWRLGS PS S 
RWQLDGFLVRALFLVCRQPPS LR I IiTWW 
EKTEERKREREK* DRERERNE I ERNERE 
RERERETQRMY 


11550 


25451 


A 


11650 


220 


2 


TPTLPKGTGLTALDRI PGRDPGPRRAGR 
P F* AGVAAENCPDPAEGGSTRPLGAPGR 
GS AP P * RHPGPAGGRR 


11551 


25452 


A 


11651 


29 


387 


FLIFDVTIVIDLACHEPHSYKMAHLINV 
VCVLTAPPTSYFPISFLGPPYSLRHNDI 
EIRPINNPTRTSQCSSERKSHTSLTLNQ 
KLEMTTLSKDGMSKAKTG*EPGL*C*TV 
SQAVNAK 


11552 


25453 


A 


11652 


267 


2 


KFFFPVFSTSKLNFYPGTPFPLFLFSFF 
F FF F F FFMREG FAVT WV PGAVAPGAVLA 
HCSLELVGSGKPPISAS*GAGTTGMHYH 
TRMY 


11553 


25454 


A 


11653 


50 


411 


EAVAGGMEKSMMNLPKGPDTLCFDKDEF 
MNEDFDVDHFVSDCRNRGQLEELRDNPQ 
LYYKLLKTAMVELINKDYADFVNLSTDL 
VGMEQALNQLSVPL * QLREEVLS L * S FV 
IERISPSS 


11554 


25455 


A 


11654 


303 


403 


VRNMVE WLGVAAHACN P S T LGGQGEQ I T 
*GQEF 


11555 


25456 


A 


11655 


356 


653 


SVSPCPFPGPASTLSFLVADFRRRGVDV 
SQVAWQSKGDTPSSCCI INNSNGNRTIV 
LHDT*GPRASPCYNPPISWLNQFPHSPP 
WASQLPPSVASVYQK 


13556 


25457 


A 


11656 


442 


2 


GRPILMRPAVL*SEPRTPACVDIQQQIM 
TIIDEWKACAKVQTLSAPINSASRMQS 
I RHWYI LKDSS ARPAGKGAI IGF I KVG 
YKKLFVLDDREAHNEVEPLCILDFYIHE 
SVQRHGHGRELFQYMLQKERVEPHQLAI 
DRPSLV 


11557 


25458 


A 


11657 


109 


474 


DINMCGRWLVAERM* KRKRDKYNYEET 
E*VL*KKVRHSQLQKENVAQNAPWQNA 
AYIDQPSPAHVGQQGLSKLPSRPGDQGV 
E PQNLRTLQGRSV I RS ATNTTLPHMLMS 
QRVIAPNPAQLQGQRGPPKHGICRTTTP 
NMNPANV 


11558 


25459 


A 


11658 


342 


3 


ENQLKNGEHSKSQS ASS PPNDHNTSLAR 
A* KWAGAEAEIALLTEAGFGRWVIMNFA 
ELKEHVVTQCKEAKVHDKTTQELIAGFE 
RNITYLM* LKI TTRE PHI AITS INS RMA 


11559 


25460 


A 


11659 


204 


17 


STCTPLPTA*SQSRPHPPPTPTPPAPWR 
SSNGPLSRGSSSRNSTSSWSIALATGPW 
THRSEL 


11560 


25461 


A 


11660 


505 


514 


GS*GNHFNPDGASHGGPQDSDRTGPRGP 
CLPSPWGAQI LRDWYPQEEMGTLGKQ 


11561 


25462 


A 


11661 


80 


374 


FLFDI YNRCYLI FINI I * YLSSNFD* I F 
I I I YLVAI YRLLCLGLFRAPRQCKHPRP 
OFS FNK IGTKR *AWPGAMAHTCN PNTLG 
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SEQ ID 
NO: of 
nucleotide 
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SEQ ID 
NO: of 
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sequence 
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od 


SEQ ID 
NO: in 
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beginning 

nucleotide 
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correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 

T 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamtc Acid, ^Phenylalanine, 
G=Glycine, H-Histidine, l=Iso leu cine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q^GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














GLGAQIM*GQEFET 


11562 


25463 


A 


11662 


142 


437 


GQRVQI FFFFWKRFPFFAQGGTEGAQFR 
LIKPSPSGLKEIPPPNPPRKGE*RDGPP 
FRGDFVFLRKNGVPPCGQGGSKPPAPGE 
PFPFTPPRGENNRG 


11563 


25464 


A 


11663 


120 


353 


QALQVMLKHVNFFFFLFFLFWKQSFVPL 
PRLECNGATSVHCNLCLPG*RNFPASAQ 
EFRI AGS YDHAI CPPASGKGE 


11564 


25465 


A 


11664 


1 


404 


NTWSDL I IS YFHPRPSAFFLRQGLALS P 
RLEYRGTITGHYSLDLWGSSDSPASISR 
VAGTTYRCM* FQLLGRLRRENHLSPRGR 
GCS DP * LCPYTPAWVTERDPVS KKKLRG 
GGENSLLSSQTMY 


11565 


25466 


A 


11665 


493 


3 


NI WI PDVLPGHLARQ* LGS PGKLSAGF * 
PLQFPQTLSSARAPPFCLPRLQVRNTVG 
FLPSPHTSRPSHCLPACPLPSRTQPWVP 
VKPGPTACRGFLQHPPTPGSPSP*SFQG 
S S WWDTD LRALE CLLR I LRGRNCS VFVT 
TQGQLSHEGMNKEFLLTCAKKKK 


11566 


25467 


A 


11666 


3 


401 


CMERAVTVL.LPGSATQSPPVYAPRALAR 
LWLTAAMMISGFIADYEA*SSRCSSACP 
AGDSLSYYHSPADS FSIMGSPVGAQDFC 
ADLGVS RANF I PTDTDMSTS PDLQWLVQ 
PALVYSVGPSETIAPHPLGVP 


11567 


25468 


A 


11667 


1 


446 


LSCAKPPQRPLRHG 1 1 KIFNVFEETRAN 
KHLLGVS KLKPRGFPALFLS VS S FGQKK 
HVPQWLQHSHSHILVISLDFFFFFFFEK 
KFP FFPPGGRERDQTPFN* TPPRGG* RE 
SPPFPSRGGGTQGWAPHCPI IFRFFQKK 
GVFPLRAR 


IJ568 


25469 


A 


11668 


18 


403 


DPLQRPQYRGGLRSVGCDLQSPTPRGPS 
GRWPNPAQARGPGEAQAWAWPGGGPREA 
FSGQGRPPPLGLHPHRRKGAGPPGPMGP 
*HEGQGSC*GASCRGGLACTAFPAASSP 
P I FGGLTPPNLNFPEE 


11569 


25470 


A 


11669 


157 


1 


PRPQGFVFVFFFFEMQFGRLECSGAISA 
H * NLHFLGS SNS PAS ASHVYAPL Y 


11570 


25471 


A 


11670 


396 


7 


EKVGQARDKSACEGHSCADARSCLHLRV 
DQGGQHGSHGACPGRRLRGPDTHRQSCY 
QCRCNRSPDRLQPGRAGTP* CAEAAAEP 
DAAVHPGQPAPARLPPALPGRGLLPLPH 
PGGSQRVYCVRCIGSKLR 


11571 


25472 


A 


11671 


209 


.3 


DDLGTHRPGKSQVRVTPREAGENPSAVL 
PCYLHLVPFV*EDEVKPEDSIPDMPGNE 
YAREFLAHAPLY 


11572 


25473 


A 


11672 


270 


101 


LSLNRWILGAYI I FFETESHS VAQAG VR 
♦HNLGLLQSLPPGSRRPLAFKVAFLFIG 


11573 


25474 


A 


11673 


421 


592 


SSCTPGVI IKIPCVRSNHMKLPGPGMVA 
HTCNPSTLGGQGGRITWSE*FKTRLGNM 
A 


11574 


25475 


A 


11674 


176 


6 


LTFVYMPCFCALSSLIMTYGLAM* FHFH 
SMTLFI IGVLSVALAI YQ*WLDVSRVSV 


11575 


25476 


A 


11675 


1 


418 


NTVCVCVCVC VCVC VERKAGHRLTDS FP 
IQAGPGAQARTLCLRESTGGALSSGLPA 
SRYQDPGRGALHPHCVWTPRGWNWRLNY 
P PARS PSREAVC YFRQTLKKWRS PRGQG 
GR I LRVI RTASTFRDTS *' TESACLGARP 
CLDGK * VCES VPC FS LHTHTHTHTHTOT 
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NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 
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SEQ ID 
NO: in 
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26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyiaIanine, 
G=GIycine, H=Histidine, I=IsoIeucine, 
K=Lysine, LHLeucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R—Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














HCI 


11576 


25477 


A 


11676 


305 


698 


S FMGRS PRK I DQFCNSRNMVHGS VTFRD 
VAIDFSQEEWECLQPDQRTLYRDVMLEN 
YSHL ISLGSSIS KPDVITLLEQEKEPWV 
WSKETSRWYPGK*E* SRQGEAI IVPDS 
PCAQRGVTPLRCGLENFLQ 


11577 


25478 


A 


11677 


107 


397 


GGDGRETLRAPADDGCI *QERAETPLAI 
SCPRSSSTCRGRARRYDGCRTRDGHQLI 
QRGQQQESSRRTAQPGSSKLCRGPGGHR 
PSLGTVGVHGCSL 


11578 


25479 


A 


11678 


514 


554 


RAKPTPP*GGGEGQNFGLLKPPVSGLPP 
LACPTPPKRWKPKPPPPAPPHFLFF* *K 
RGLI F* PGEVFNLRNWKLPPS PPQRGGN 
KGRNQPPPRGTFMF 


11579 


25480 


A 


11679 


266 


3 


GPPLFLGLKN * TFFS PTKV* I FFSQVFS 
SFPQGLKGGVRE I FFFFFF* DRLECNGT 
I SAHCNLRLPGS SDSQASAGTTGVCHYT 
RMY 


U580 


25481 


A 


11680 


1 


408 


NTCALRHRDHFSFWSDGKTKEGEERIVQ 
ENWLISWNNFSVPETLLTLSCPLGKEGM 
PGEDGTAGAGKYPGEDKI PG* DGTAGED 
GTE DENGTTGED ETAG VGKT PGAGGTTG 
EDDTESEDGTTGEDETAGKGGTAG 


11581 


25482 


A 


11681 


244 


420 


KDSESTKAYIRDNISSTKKKKSWPGAGA 
HPCNPSTLEGKGG* IT*SQEFETSIiANM 
VKL 


11582 


25483 


A 


11682 


294 


443 


QMNTCTH I Q P AYVH * E KY FR P S AMAHAL 
NPSTLGGRGGWIT* SQE FQTRP 


11583 


25484 


A 


11683 


346 


1 


YNTNQFTLRGTQASVYTCLTALIVLKLL 
NQPYTLSAVLLNAGNQPFHAFPSPPSLH 
PDAFHSKIMSHI I ILLLGFI PFLPASQS 
LTLS PSLERSGTI SAQCNLCLPGSSNS P 
ISA 


1 1584 


25485 


A 


11684 


140 


423 


SSHQASSPPTTASHSMKISVAAIPFFLL 
ITITLGTKTESSSRGPYKPSECCFTYTT 
YKIPAQRIMDYYETNSQCFKPGIVFITK 
RGHSRWTNPK 


11585 


25486 


A 


11685 


163 


3 


IKIPCNRLLKFGPPSCCGFFFPliLPPFF 
FFETLSRSVAQAGLQWRDLGSLLAL 


11586 


25487 


A 


11686 


179 


1 


PIVFLGMLFVAFAFRGPFESVTKPPNFF 
FFFEMESCSVAQAGVQWRDLGS LQARP P 
RST 


1 1587 


25488 


A 


11687 


192 


1 


EQGFDGHLASWISLKDTAKWKNGDELCS 
YYQSCCTDYTAECKPQVTRGDVFTMPED 
EYTVYDDG 


11588 


25489 


A 


11688 


89 


3 


SGGYCCCLCCRCSRYSCCCCRLREGPTK 


11589 


25490 


A 


11689 


368 


2 


QDRLQPHESKRKLGPAPQLRRSESDTPS 
WNFQSTESQLMSKGDEDTKDDSKETV 


11590 


25491 


A 


11690 


301 


397 


HFVFS LVTGMNPLS P YLNVDPRYLVQVR 
LRFY 


11591 


25492 


A 


11691 


18 


421 


TKALQ I TCYLHS TMS EESDGKL I EDSL I 
QLRCHFTWKLLIEAPEIPDLENRIWEEI 
Q FLDTKYNVGI HNLLAYVKHLKGQNEEA 
LVSLKKAEDLIQKEHANQADIRSLETWG 
NFAWVYYHMGRLAEAQTYLDKV 


11592 


25493 


A 


11692 


188 


3 


PLQG KG FG AKERGQKKGFLRGGVRASRP 
RPPFGPSLiFFFFFFETDSHSVAQDGVQW 
CDLGS 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,3 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenylaIanine, 
G=GJycine, H=Histidine, I-Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, V=possib)e 
nucleotide insertion 


11593 


25494 


A 


11693 


24 


391 


APRADAMGHFTEEDMATITSLWGQGNVE 
DAGGETLGRLLDVYPWTQR FFDS FGNLS 
SASAIMGNPKVKAHGKKGLTSLGDAIQH 
LDDLKGTFSQLSELHCDKLHVDPENFKP 
LGNGLQTALQ 


11594 


25495 


A 


11694 


138 


425 


NSGVKAAFQLNPGNPNLEGKPGPGFKTH 
PGQLGETHFLLKIHYLSGLVGKPWESQL 
LGNLNRENHWTPERGGPRDLRHCHGIQP 
WATNGNFVLKKK 


11595 


25496 


A 


11695 


158 


838 


CGTVHSCDAGQRQRATAP SHPCDHGNQQ 
PILYRVLCCQLFWILCFVFSHSIMSTKC 
PSVSPVSGEMKKRKAITLEMKLKI IAQH 
EGGKPVMAIARELxGLWQSTISTILRDKK 
QISDAAKSSASVKSTVITKKRAGPIDDM 
EJGjLVMWMEDQIQKRIPLSLLMIQAKAR 
SLFNMLKDRASDPTYTQMFKASKGWFQR 
FKRRHNFKNVKITGEAARAGNEGAIAFK 
EQL 


11596 


25497 


A 


11696 


834 


1431 


SSACQGGSQGWPPRQPWWMWGWSRRAPS 
VTHRTGERCSTSGSAPLPLLLSSLCSSL 
PRSWEHSTLRQPPHPPLLPPGCLPPGRE 
AQRIiS GhGHGPG PR PAAAS Y I PDTQADR 
KQQQHHPGPDQHVGHREGLALEDAATHL 
GLVAALSCWLLGRAEAGYQVPHGHDHPK 
DQHPQADGGQRIVRAIGLGLGHHVSGRR 
ARP 


11597 


25498 


A 


11697 


143 


1 


ISKERGASRFSGPWVFFFFFLESRSVAQ 
AGVQWCNLG S LQAPPPGFT 


11598 


25499 


A 


11698 


58 


459 


KGKEEKVKRKEAEQNFSPYAQDKQERI K 
GNENDEKTKQGKETI IDIELFKGLDETG 
ENMDS TLTRT P FE PLENNKQI LVLGLDG 
AGKTSVLHSLASNRVQHSVAPTLGFHAV 
C INTE YSHME FLE I GGS KPFRS 


11599 


25500 


A 


11699 


215 


1 


GTKKALWAGGGGFFPYLPPMGPIiGPICG 
GVGKGSPGLGCPIFFFFFFFETESCSVA 
RLDAQWPDLGSLQSP 


11600 


25501 


A 


11700 


300 


420 


KPKILFGNVFAAPHMEKLKCRGETVAKE 
ISEAMKVKAMC 


11601 


25502 


A 


11701 


351 


466 


R I KNADLSQAQW FTPVI PALWSAKVGRS 
LEVRSSRPVW 


11602 


25503 


A 


11702 


172 


400 


SNRLRNKIQG VFLNDSS I S PFILRKQS I 
GQAWWLTPVIPALWKAEASGSPEMDSTT 
LLPS SSQVPS LVKMEKLNYS 


11603 


25504 


A 


11703 


284 


408 


AS VFSS FFFVCLFVFEMESRS FAQAGVL 
WRDLGSLQAPPPG 


11604 


25505 


A 


11704 


20 


447 


LPGADYGGGKLSLRLFHLLLASAAWVPD 
ESQVTLNSAI CVLSTVLIMEFPDLGKHC 
SEKTCKQLDFLPVKCDACKQDFCKDHFP 
YAAHKC P FAFQKDVHVP VCPLCNTP I PV 
KKGQ I PDWVGDH I DRDCD SH PGKKKE K 
I FA 


11605 


25506 


A 


11705 


1 


455 


HSCSLQFTPTAWDCTGSVSSEQGERPAA 
AMKI CS LTLLS FLLLAAQVLLVEGKKKV 
KNGLHSKVRSEQKDTLGNTQIKQKSRPG 
NKGKFVTKDQANCRWAATEQEEG ISLKD 
BCTQLDHE FS CVFAGNPTS CPKAQDERV 
YWKQVARKLRS 


11606 


25507 


A 


11706 


1 


42$ 


D»lLADKF^ajL»JiVRGGRSGGSGG GSGAGG 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine, 
G=Grycine, H=Histidine, l=lso!eucine, 
K=0Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 














ASNCGTGSGRSGLLDKWKIDDKPVKIDK 
WDGSAVKN S LDDS AKKVLLE KYK YVEN F 
GLIEGRLTICTISCFFAIVALIWDYMHP 
FPESKPVLALCVI SYFVMMG ILTI YTS Y 
RE 


11607 


25508 


A 


11707 


1 


422 


RSQARSSAAAAARASVPLRGSPGP3AIM 
PMFIVNTNVPRASVPDGFLSELTQQLAQ 
ATGKPPQYIAEHVVPDQLMAFGGSSEPS 
ALCSLHS IGKIGGAQNR3 YSKLLSGLLA 
ERLRI SPDRVYINYYDMNAANVGWNNST 


11608 


25509 


A 


11709 


88 


423 


AADAMKQASEAMGAFMRLAYQELQIDRL 
KEDKMLLNLEGNKREHAERLGMGL.VSRS 
SVSHSVLSSMLVIEHETPVSAKSSRSQL 
DLFDDVGTFCSGPSKYTDNPFSLWESIG 


11609 


25510 


A 


11710 


393 


130 


NPSATAQS FHLAARKLSTLDRPGSQAPQ 
KPMPSPKPGVVLPASTNAVPTPLAEATP 
SKAHPAISLLSTEEGIFKAVPSPA3SCS 
FLHV 


11610 


25511 


A 


11711 


331 


440 


K I FFLN FL I S RAW W CAP L VLATWEAEVG 
ESLYPRRS 


11611 


25512 


A 


11712 


122 


478 


SGLCPQQPPRANSCPPSSMASCAEPSEP 
SAPLPAGVPPLEDFEVLDGVEDAQGEEE 
EDEEEEEEDDLSELPPLEDMGQPPAEEA 
EQPGALAREFLAAMEPEPGPSPVPKEWL 
DILGNGL 


11612 


25513 


A 


11713 


102 


2 


TNLGNPRRPPPFFFFETVSLLAQAGVQW 
CDLGS 


11613 


25514 


A 


11714 


126 


2 


FFAPFLKI FFFFFFSEMESCSVLQAGVQ 
WHDLGS LQNEGSQ 


11614 


25515 


A 


11715 


178 


449 


LSGGNLFG 1 1 ILLYGAI IGGKLLGLI KL 
PTLPPLPSLLGMLLAGILITNIPVINDN 
VQ IMKRWS YS LRS IALAI I IjVRAGLVLE 
SKALEK 


11615 


25516 


A 


11716 


48 


417 


GSGGNHSVCCDTMEGGGGSGDKTTGVLA 
GFFGAGEAGYSHADLAGVPLTGMNPLSP 
YLNVDPRYLVQDTDEFILPTGANKTWGR 
FELAFFTIGGCCMTGAAFGAMNGLRLGL 
KETQNMAWSKP 


11616 


25517 


A 


11717 


103 


2 


PXSPTQWLMAVIPALWEAKAGGSRESRS 
SRPAL 


11617 


25518 


A 


11718 


1 


413 


WPSGQVLVGCLSFSLYCWKRSLKRKNPG 
FEVLLKI FLKNHPRCHTNRiVlIQLTATPV 
SALADEPAHIRATGLIPFQMVSFQASLE 
DENGDMFYSQAHYRANEFAEADLNHAAS 
LGGDYLGDLALRLLCTL3PVFPQSY 


11618 


25519 


A 


11720 


116 


462 


AGMLPAVGSVDEEEDPA5EDCPELVPIE 
TTQSEEEEKSGLGAKI P VTI ITGYLGAG 
KTTLLN Y I LTEQHS KRVAVI LNES GEGS 
ALEKSLAVSQGGELYEEWLELRNGCLCC 
SVK 


11619 


25520 


A 


11721 


167 


407 


E I YSLTRFI EVKMSKKI SGGTWEMLGD 
EMTR I I RETLKEKLI FP YVESHLH S YDL 
GIENRDATNDQATKDALEAFNKPY 


11620 


25521 


A 


11722 


254 


423 


NQLSSIMAMFKKIKSFEWFNDPEKVYG 
SGE KVAG R V I VEVCE VTR VTCA VR I LACG 


11621 


25522 


A 


11723 


3 


424 


VSCDTMEGGGGSGDKTTGGLAGFFGAGG 
AGYSHADLAGVPLTGMNPLCPYLNVDPR 
YLEQDTDEFI LPTGAKKTRGKFELALirT 
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NO: of 
nucleotide 
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nucleotide 
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ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, 1=1 so leu cine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














IGRCCMTGAAFGAMNGLRVGLKETQNMA 
WS KPRNVP I LDrWTRQGAL W ANTLGALA 


11622 


25523 


A 


11724 


2 


343 


AFGTMKWVTLVS VLFLFS S AYSRGVFRR 
DAHKSEVAHRFKDLGEENFKALVMIAFA 
QDLQQCPFEDDAFSTSEVPEFATTCQDD 
DHFEDRCRRVHTLVWQLCTPATLLETD 
Y 


11623 


25524 


A 


11725 


1 * 


359 


HAFGTMKWVTVI SLVFLFNSAYSRGVFR 
RDAHKSEVAHRFKDLGEENFKALiVliIAF 
. AHYLHQCPFEBKVKLVNDVTEFAKTCDA 
DESAENCDKSLKTLFGDKLCTVATLQET 
YGEMADC 


11624 


25525 


A 


11726 


1 


349 


GAMSDRKFSAPRHGSLGFIiPRKRSIRHR 
GKAKSFPKDDPSKPGHLTGFLGYKAGMT 
HIVREVYRPGSKANTNEVAEAVTIVETP 
PMEVADIAGYMETPRGLRTFNTAFAEHM 
SDEC 


11625 


25526 


A 


11727 


81 


349 


TKGSWSCCVCLCVCLCTWSHLCRLVTW 
LPDMPDDVLWLQWVTSQVFTRVLMCLLP 
ASRSQMPVSSQQASPCTPEQDWPCWTPC 
SPEGC 


11626 


25527 


A 


11728 


264 


388 


QADPKDIMKFPGPLENQRLSFLLEKAIT 
REAQMWKVNVRKM 


11627 


25528 


A 


11729 


2 


471 


PGCSASWSKRGSGPDMLSSMAAAGSVKA 
ALQVAEVLEAIVSCCVGPEGRQVLCTKP 
TG EVLLSRNGGRLLEALHLEH P I ARK I V 
DCVSSHLKKTGDGAKTFI I FLCHLLRGL 
HAITDREKDPLtfCENIQTHGRHWKNCSR 
WKFI SQALLTFQTQ I L 


11628 


25529 


A 


11730 


160 


377 


LQG FGR P S VYHAA I V I FLE F FAWG LLTT 
PMLTVS IAELGLCFVRERDKFLGTYHC V 
CLDTCLGVALDSDLKQ 


11629 


25530 


A 


11731 


45 


438 


KLGQRKEATVTKES CKSCSRKVGS PDRF 
RSPQKRSKGRQDCFTLFCFWKVLDKNME 
LISPTVI 1 1 LGCLALFLLLQR KNLRR P P 
CIKGWIPWIGVGFEFGKAPLEFIEKARI 
KVCGRGRRGLQRRQCFLF 


11630 


25531 


A 


11732 


169 


292 


DS I LLVNLACS AV I S AHCNLRL PG S SDS 
PAS AS RAAGGAHL 


11631 


25532 


A 


11733 


473 


600 


KFWEKRTEGRGKSKSKNSETGEIVSISA 
LSTTEVAMHTSTSS 


11632 


25533 


A 


11734 


19 


349 


APSPDAMGHFTEEDKATITSLWGTVNVE 
DAGGETLGRLLGDYPWTQRFFDS FGNLT 
SASAI MGNPKVKAHGTKVLTSLGDAI KH 
LDDLKGTFAQLSELHCDKLLVDPENF 


11633 


25534 


A 


11735 


234 


359 


FYHLKSFTVSS VQSRWLTPVI PALWEAE 
AGGSPEVKSSRPAS 


. 11634 


25535 


A 


11736 


15 


372 


KLPLKALTGBEKTH INI DI IGHVHSVKS 
TTTGHLI YKSRGIDERTIEI FEKEAAEM 
GKGS FKYAC I LDKL KAER ERG I T I D I SM 
RKFETSKYYVTI IDAPGHRDFI KDMTTG 
TSHADCA 


11635 


25536 


A 


11737 


1 


357 


IWKAAMASPAGSWARPPRPMREPQTLAM 
PTNAAEDQKLKLERLMKS PDLAVT I PEK 
MSE WS PG P P PEFDRD VMG SRAGAASGEF 
HVYRLLRRRE YQRQDYMDAMAE KR I LDA 
EFQRRLE 


11636 


25537 


A 


11738 


24 


335 


APNANAMGHYTEEDKATI TS PWGKGNVE 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leu cine, 
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nucleotide deletion, ^possible 
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DAGGKTLGRLLDVYPWTHRFFDRFGNLS 
SDSAIMGNPTDKAHGQKVLTSLGDATKH 
LDDLKGTFAQLSELHCDKLH 


11637 


25538 


A 


11739 


141 


335 


MQFLPCIPILKSLEKSVASHSQTVHSDI 
ISTVEFNHTGELLSTGDKGGRWIFQRB 
QESKNQVHR 


11638 


25539 


A 


11740 


7 


337 


APS PDAMGH FAEEDKAT I TS LWCKVNVE 
DAGGETLGRLLAVYPWTQRFFDSFGNLT 
SDSAIMGNRKVNAHGTKVLTSLGDAIKH 
LDDLKGTFAQLTELHCDKLHVDPENF 


11639 


25540 


A 


11741 


182 


360 


SRHS I S PPVNQ I QLGASVTEELTWTKT 
ARVSRAQWLMPVI PALWEAEAGES PEVR 
NSR 


11640 


25541 


A 


11742 


174 


1 


HFSDHSLFVCVRQDLTLLPRLECSSLIT 
AHCSLHLLGSGDPPTAASV7VSGNTGVHY 
HA 


11641 


25542 


A 


11743 


75 


218 


KI I LGRAQWLMTV I PALWEAKVGRS P VS 
ASQSAWDYRREPPCPSCTI 


11642 


25543 


A 


11744 


168 


2 


LAPLWSLGPVLGGVGQGSPGPGFPFFFF 
FFFFETESCSVARLECSGPILAHCSLR 


11643 


25544 


A 


11745 


83 


2 


REQRFLLPFVQLCPAPRGGVYRGRQAS 


11644 


25545 


A 


11746 


156 


3 


FHSGSGRVENPSFFFSFFETGSCSVTQD 
GECTGATLAHCDLCFLGSSNSS 


11645 


25546 


A 


11747 


244 


330 


KDRAQWVTPVI PALWAAKAGRSLEVRSS 
R 


11646 


25547 


A 


11748 


201 


1 


TSQPLIRITLSTFFFFSRDGGLTMLPRL 
DPELPGSSNPPSCSASRVDGTTGMCHHT 
RLIFFSQTDKK 


11647 • 


25548 


A 


11749 


262 


3 


VYTSLTPPP ISVNLLTQVS PRSVS YEQN 
I LCLL F VTY FTCTF FVCL FVCL FVCF FK 
MES CS VAQAGVR WRDLGLLQAP PPGFT Y 
TT 


11648 


25549 


A 


11750 


225 


3 


PIINFSVPQFLHLYNGIIIESTPPSCCI 
LKCQSLGQAQWFTPVITTLWEAEAGRSL 
EAWTLKTILANMAKPHL 


11649 


25550 


A 


11751 


185 


2 


VSTPFNSFPSPPWDLVFLKGFFFFFFFF 
FFETEFCSSPRLERNGAPLAHCNLRLPR 
SSNSP 


11650 


25551 


A 


11752 


261 


336 


DRVSLWSPRLECNGAISSHCNLRLP . 


11651 


25552 


A 


11753 


282 


381 


TDLLYKKKCGLGAVAHAYNPSALGGQGG 
RITCS 


11652 


25553 


A 


11754 


202 


45 


NPRKVKLQWALILPLPFNLGGQLKSRFQ 
KKKKKKKKKKKEEKRKKKTIiWIC 


11653 


25554 


A 


11755 


293 


357 


LTPVI PALWEAEVGGSPEVRS 


11654 


25555 


A 


11756 


264 


1 


TLPINWCCDESTGPSLISIiPIiLSLPYS 
LRHNW I E I RL INWP TTACKYS CLKKSHK 
SLTLDQKLEM I KLGEEGLLKAQ IGQKIiG 
LLHQ 


11655 


25556 


A 


11757 


326 


1 


ILPGFLKELKIELPFNPAILPLGIYPKE 
KKLLYQKDTCTHMF 


11656 | 


25557 


A 


11758 


399 


226 


IEKGTGRVWWLTPMI PALS E AKAGGS PE 
VRSPRPACIGLLKFWYYRCEPQHPARLS 
EI 


11657 


25558 


A 


11759 


235 | 


380 


DTTVLKLGQLITLQWASKCSSERKSRTS 
LTLTQKLEMVKLSEEGMSKGD 


11658 


25559 


A 


11760 


249 


357 


NRASGQTWCPTPIMPALWEVEAGGSLEP 
RSLRLTWA 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
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U659 


25560 


A 


11761 


168 


372 


KLHCLLCF YLS EFFVFCFLETRS LS VAQ 
SKCSGTVITHCSLKLLCSSDPATSDSRV 
AETTGTGYHAGL 


11660 


25561 


A 


11762 


303 


377 


QFAGCRWLMPVIPALWEAKADGSPE • 


11661 


25562 


A 


11763 


135 


1 


ATTPGLFFIFYFFETESHS I AQAGVQCM 
ISVHCNLCLPGSSDSPV 


11662 


25563 


A 


11764 


219 


2 


KliRSSQbSEPEVRPDSTLSMLMHARAHT 
HTHTHPHLFLPHPCSRPHTSTPTCMPIV 
THSPIYLKFHSHTYIL 


11663 


25564 


A 


11765 


102 


1 


NRPINNPTMASMCSSERKSHKSLTLNQK 
LEMIKL 


11664 


25565 


A 


11766 


245 


3 


PGEPEEDQLVKFRPSLLWVFMFLFFLRR 
SLALS PRLE CSGVECSGVI SAHCNLCLT 
GFKQFSCLSLLSGWDYRHVPPCPS 


11665 


25566 


A 


11767 


350 


97 


GSVPASPQVTRPTI^SIADKWFSLCLT 
SLRVNFGVALI LSGSHSVSQDTLDLLTL 
FSTHLGLPKCWDYKRVPPCSAQFLVTFL 


11666 


25567 


A 


11768 


188 


391 


LGSVAGDLLCFFGGVIFPCSFMFPMSLC 
YYLYIWYNSCVFLFFKTEPHFVTHSGVH 
WCDGLLQPPPPN 


11667 


25568 


A 


11769 


131 


2 


YTLYIPNKSPIENIQLSWAQWLVPVIPA 
LCGAETGELLEPRSS 


13668 


25569 


A 


11770 


165 


1 


VLIPLIHLCIYIYIYTYICVYIYTRIYT 
H I CI HTHVY IH I YVYIHTYI YTYMYT Y 


11669 


25570 


A 


11771 


72 


1 


ILLFFFFFEAESCSVAQAGMQWCN 


11670 


25571 


A 


11772 


121 


3 


TS F VLF C FETE FCS CCC PGLE CNGA I LA 
HCNLRLPGSSD 


11671 


25572 


A 


11773 


152 


3 


HVCLNLTLPFLFEKNIFSLCVCVCVCVC 
VCWLCKKKKCEKDFYFERTF 


11672 


25573 


A 


11774 


190 


2 


GFSPRQRGAPRVPPPLAGP FPP I FFFFF 
FFEMVJSHSVAPAGVQWCNHSSLAALTS P 
DSGDPP 


11673 


25574 


A 


11775 


287 


1 


GAHRRKLFITPGESLRDKQIAGFEHRGG 
EKKKTLFYKKKKKRKKKRKEKKGKEKKM 
LPKCNYTMKNHQVPYS IRPTRECADLRV 
FYFIKFQILKY 


11674 


25575 


A 


11776 


121 


3 


KCASRDLS KFFFFFFLETESRS VAQAGV 
QWCDLGSLQAL 


11675 


25576 


A 


11777 


142 


1 


EKTLHVRNTIHNSEGSLVIKIHGRLGW 
AHACNPSTLGGRGGQITRS 


11676 


25577 


A 


11778 


179 


3 


SHQPVPGTLDLPRGPQKLQSTSEAESEA 
SMSEASSEDLVPPLEAGAAPYREEEEAA 
KK 


11677 


25578 


A 


11779 


147 


6 


KTPGLKKNSNFFFFFFFFETRSHSIARA 
GVRGCDLSSLQPPPLGLK 


11678 


25579 


A 


11780 


184 . 


3 


GFGFYIIPNYRLFFFSLLIGPFFFLIFF 
ETEPCSVARIiECSGVISAHCNIRULGSC 
DSPA 


11679 


25580 


A 


11781 


120 


319 


VKSLAKFLVNYQLHRQTCSGQAGRGALR 
QRFGR PRQVDHLRPGVRDQPGQHGETP S 
LLEVRKLSSG 


11680 


25581 


A 


11782 


91 


3 


Y I CLS LI YTHTHTHTHTHTHTH I Y I Y VC 
V 


11681 


25582 


A 


11783 


221 


332 


SRLGMVDHPRSGVRDHPGQHGETPSLLK 
IQKLARRSG 


11682 


25583 


A 


11784 


243 


379 


LKYSVP I KNCKPWPGTWDHTRNPSTLGG 
QGGWITLDQKFETSLAN 
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nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
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E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, ]=lso!eucine, 
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11683 


25584 


A 


13785 


327 


5 


GRVDSQPIYPKNPPGSFPKIYPGSFKTL 
FPYLQSRHSFSIPRSKLARFHQAKPLFC 
SKPSQSLSPHSELKKKDLSQARWLTPVI 
RALWESKAGRSPEVRSSRPTWPT 


11684 


25585 


A 


11786 


245 


359 


FKGMDRLRSGVRDQPGQHSKTPSVLKIQ 
KLAGHGGMRJj 


11685 


25586" 


A 


11787 


356 


67 


RDI AS ITARLRYDRGVGITRLEVLSKSD 
EYTKGSVLDFSRETEPIGYIYIYIYIYI 
YIYIYLHSLPLNHMEISLGHPPLSNNKF 
VQNFDSPKLSYH 


11686 ' 


25587 


A 


11788 


103 


3 


KYI I P VC I FFFFETESCSVTQSGVQWCE 
LRRRG 


11687 


25588 


A 


11789 


136 


1 


VHQFTIRILNYYYYYFFETESYSVTQAG 
VRWCNLGSLQPLPPGFK 


11688 


25589 


A 


11790 


291 


163 


SLLLLPRLECNSTI SAHRNPRLLGSSDS 
PASGRSLEPRSSKLQ 


11689 


25590 


A 


11791 


107 


2 


I FMYVFFETESRSVAQAGVQWRYLGSVQ 
APPSGFS 


11690 


25591 


A 


11792 


329 


3 


KNPRNIIGKSRGPYITNFPGDKGPKNSS 
PRGNDLS INKNWAQAFQPRGQKKLFFPK 
KKKKKETQSRS I PRLECNGPT I AQCNLK 
LLASSNPPTSASHSAAIISMSHSG 


11691 


25592 


A 


11793 


213 


3 


I S PWLFLLLQATLFRS QYPGPFPPFSLS 
PYLFPFKQKLGSGFFFFFLFFFKTESRF 
LARLECSG I ITAQ 


11692 


25593 


A 


11794 


1 


400 


KRAAPQPAPEQRDLKKKKKKPPLAPSSL 
FRFWEKLLPSFQFRQPRGPFLGTRGVS F 
AFHRRRFWQYGNMGEKQTWGNPGSSSNP 
P PG TAG S LS GNRG PG WGGKI ADR FWLYA 
RDSFGHSRGLKGSCCQAHLGK 


11693 


25594 


A 


11795 


3 


769 


RKSQTRNARAEVLRQAKANFEKEERRKE 
LKRLRGEDTWMLPDVNERIEQFSQEHSV 
KKKKKKDKHSKKAKKEKKKKSKKQKYEK 
NNESSDSSSSSEDEWVEAVPSQTPDKEK 
AW KVKDE KS G KDDTQ 1 1 KRDB WMTVD FM 
SVKTVSSSSLKAEKETMRKIEQEKNQAL 
EQSME I FQS KLEDAEKA^STKEDYRRER 
WRKPTYSDKAQNCQESRESDLVKYGFCS 
RDRYATTDTAKNSNNEKFIGDEKDKRPG 
SLE 


11694 


25595 


A 


11796 


110 


13 


HTHTHTHTHKHTHTHTHTHKDRSGKI KC 
VPTV 


11695 


25596 


A 


11797 


184 


2 


SE I FGKFHVFYILTVHFMPPTQVYVERD 
VLGQEQLLTP VI PALWEAKAGRS PE VRS 
SRPA 


11696 


25597 


A 


11798 


83 


387 


GERRRRRRRRLWAPLQKKKKKKKKKKKK 
KKKKKKKKKKKGGAPLKKPPGGPHFSGG 
RQKNI P PLKGGELKRAPAGDFKNPGRGK 
IARGGFFEKNLSWGGEK 


11697 


25598 


A 


11799. 


153 


2 


I P WEDTTVT SGYPQFLFSHSFFYFYFFE 
TG S CSLTQTG VQW CNHGS LQPR 


11698 


25599 


A 


11800 


93 


3 


VWGRAWWLTSVISALWEAKVGESPEVRS 
S 


11699 


25600 


A 


11801 


115 


1 


NPLFFFFFFFEMESCSVAQAGVQWCNLG 
SLQPLPPRFK 


11700 


25601 


A 


11802 


295 


137 


RCFIFILFINKLYFVYLFIYFFEMESHT 
VAQAGVQWRDLiGSLQAP PPNKLYF 


11701 


25602 


A 


H803 


287 


i 


MGPKAFFKGTCGGFQDFRFKNLEI SAGL, 
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K=Lysine, L=Leucine, M=Methionine, 
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THSPLLAQNFRVGAVPSPRYSPRFGSKG 
GGGFFFFFFFFETESHSVAQQAGVQWCD 
LiGSLQALPPGF 


11702 


25603 


A 


11804 


202 


3 


WS VCCFKQGLGLS PRLECNGAPGFKQS F 
PLSLLSNWGYRGPPPRLNFFFFFFEMES 
CSLAQAGVQR 


11703 


25604 


A 


11805 


3 


315 


FMLLI LTLFLLRNDRLVQCDVRS SWCL 
GFLLGWSVILYPLMAAFMPTMWIFLFIL 
FSMLFF Y VFFLFLLFFLLFLYS FPLFCF 
FFCFYLFRFFFFFLI FFSSP 


11704 


25605 


A 


11806 


126 


3 


KRGFFFFFFFFFFETESRSFAQAGVQWC 
DLGSLQALPPGV 


11705 


25606 


A 


11807 


130 


2 


QS I KMTCS SLYFFI KKNVAWWLTP WPA 
LWEAGAGGSHEPKS 


11706 


25607 


A 


11808 


149 


1 


GGKKAILFFFFSLSPRLECSGAIPTHCK 
LCLPGSRHSPASAFRWGTAG 


11707 


25608 


A 


11809 


258 


3 


KYYYQGNRQIKFVNAKSNKNLPSYSQKA 
SWFSSRNIRMMGQHQPMHLDNPTSFIKK 
LGQAWWLRPVTPAIiWEAEAGGSLESRSL 


11708 


25609 


A 


11810 


153 


285 


CASP I RSHQKLNKNGWVQWLMPVI PVPW 
EAETGGSLQLRSSRIA 


11709 


25610 


A 


11811 


160 


3 


NHLFLLLNR YKS I F IGRVQWLMPVSSAF 
WEAKAGRS FEPRS SRPAGQHGKTL 


11710 


25611 


A 


11812 


89 


2 


AFFFFFETVSCSVAQGGVQLCHLHSLQL 
N 


11711 


25612 


A 


11813 


254 


336 


LLGQPQWLTPVI PALWEAEAGRPSEVR 


11712 


25613 


A 


11814 


108 


2 


CVGVLGFFVCLFFETESCSVAQAGMQWH 
DLLGSLH 


11713 


25614 


A 


11815 


166 


289 


SHSGWNAVWRDGSLQPLTPGFKRFPCLR 
VPSSWDHKCAPPC 


11714 


25615 


A 


11816 


192 


2 


D KDPG KVTR P FLLL» C I F YL F I CL FLRRT 
LAVSPRLEFSGRGCSEPRSHRCTPSWVT 
EQDSCRK 


11715 


25616 


A 


11817 


228 


343 


LLQYS IQNEDAG WAWWLTPVI PTLWEAK 
AGRSLEVRSL 


11716 


25617 


A 


11818 


322 


407 


VLiRMLliHCLRECKLVQPLiWKTVWQFLKD 


11717 


25618 


A 


11819 


119 


1 


WSKIVRGKKCQKENTHTHTHTHTHTHTH 
THTYRES KRERLV 


11718 


25619 


A 


11820 


269 


1 


FVQVFY I LWTQS FCLTCCQWFLLG YHLS 
SVSNRFMRSGKSTFFFMTSEFFFFEMES 
RCVTQARVLECSGS ISAHCKLHLPGPRH 
SPALV 


11719 


25620 


A 


11821 


217 ' 


389 


EH I ET VYGKLGI S EH WGKGGLFNKWLT I 
WKKDYSWQWWLMPVI PALWEAEVGGSRV 
H 


11720 


25621 


A 


11822 


188 


1 


G S HMP CRV I S S VKE SMNEFPAVPT S Y PA 
NPQPRERAWRNQREKEDKKERSQRSVGR 
VQAGLV 


11721 


25622 


A 


11824 


126 


3 


KLQGQVQWLTPVI PAPWEAEAGRS PE VR 
SSRPASTWRNLV 


11722 


25623 


A 


11825 


224 


3 


ALII KKEVS AMELEGSNRI QAYGVQS I T 
GARDYHAAASRVPAI KGTHHHARVI FVF 
LVETGFHHVGQAGLDSC 


11723 


25624 


A 


11826 


3 


364 


HELPEPLRVLWTAHRVAMAPGSRTSLLL 
AFALLCLPWLQEAGAVQTDPLSRLFDHA 
MLQAHRAHQIiAIDTYHEFEETYI PKDQK 
YS FLHDSQTYFCFSDS I PTPCNMEETHQ 
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KSNLELLR 


11724 


25625 


.A 


11827 


2 


376 


ARELPEPITVLRTAHLKAMAPGSRTYLL 
LAFALLCLPWLQEAGAAQTDTLYMLFDH 
AMLQAHRAHQLAIDTYHEYDETYI FKDH 
KYSILHDYQTSFCFSNSITTPYNKEETQ 
QKSNLELLRISLL 


11725 


25626 


A 


11828 


288 


3 


IHTKNPSVHHHHQRPKVDKTTKMGKKQS 
RKTGNSKKWSASPPPKEHSSS PATEQS W 
TEND FDE LR EEG FR RSNYS E LQE E I QT K 
GKEVENFEKN 


11726 


25627 


A 


11829 


107 


1 


KRSGVRLQRGLVRWLTPVI PTLWEAEVG 
GSFEVRC 


11727 


25628 


A 


11830 


220 


354 


QGNKS F KN 1 1 AP WLGTVAKACNP S TLGG 
QGGRITGHEIETILANM 


11728 


25629 


A 


11831 


273 


352 


ENLKTGQIQWLTPVI PALWEAKAGRS 


11729 


25630 


A 


11832 


239 


488 


SQTHCSGKTENSHGTNSDRNQVPTALHK 
RPS S P ARFLTEAVPVPPVDYLQS VAAFA 
VSAVASQWERTGKP FNPLLGET YEL I R 


11730 


25631 


A 


11833 


189 


294 


DQTRWLTPVIPALWGAKASGSPSVRSSL 
HFPLLF 


11731 


25632 


A 


11834 


1 


332 


GTS PE PLTVLWTGDLLAMAPG SRTS LLL 
AFALLCLPWLQEAGAVQTDPLSRLFDHA 
MLQDHRAHHLAIOTYHEFEETYI PKDQK 
YSFLHDSQTFFCFSDSIPTPSNMEET 


11732 


25633 


A 


11835 


330 


175 


PRELAQLGSPNVRVSRVKPRGQPQMFFF 
FF WRQES CS VAQAGVQWHE PEQQE 


11733 


25634 


A 


11836 


96 


1 


VFVETGSRSVAQDGGQWCDLSSLQPQPP 
RPRA 


11734 


25635 


A 


11837 


267 


334 


SWVQWLTPVIIiALWEAEAGGSP 


11735 


25636 


A 


11838 


180 


2 


SVLEKKEKNLYKNLFTYKLLKKVQGKTQ 
CEGRAQICSVCVCVCVSVCTCVHVCAYV 
SSC 


11736 


25637 


A 


11839 


183 


2 


AAPLTSSAPQAPGWKPHLAPNSPFKNFT 
GRGQAQWLMPVI PALWEAKVGRS PEVRS 
LSSC 


11737 


25638 


A 


11840 


54 


330 


DPNGQLPEPLKGLWTAHLVAMAPGSRTS 
LLLAFGUjCLPWLQEAGAVQTVPLSRLF 
DHAMLQAHRAHQLA I DTYHE FEEPYIPQ 
DQKNSFLD 


11738 


25639 


A 


11841 


117 


344 


IQYVKQINETEMRNNESYLNHTSLTITI 
HTLCLMGSYLEHFQNQCKGRARRLMPVI 
PALWEAKAGRS PDVRSSKPA 


11739 


25640 


A 


11842 


750 


968 


RAQGPRWILKIPFSSPFRLFPLIPLVFL 
YNSPPLFPPFPRTNATNNSPFPPPLPPP 
LPFSPPPPPSPSRPPPS 


11740 


25641 


A 


11843 


757 


1000 


WELW YCCLFS P FFPLPS P PLFP PPP P 
SPPHSFPPPSPFRPPLPLAPTRPPPPFS 
LFQAPPLLPSFCSPPPPPPPSLSFL 


11741 


25642 


A 


11844 


278 


361 


TLLFFFFFFEMESHFVARAEVQWRDLGS 


11742 


25643 


A 


11845 


260 


2 


GSCLLEGKLTNRKDIHTKNPSVHHHHQR 
PKVDKTTKMGKKQS RKTGNS KKQS TS P P 
PKERSSS PATEQS WMENBFDELREEGFT 
RA 


11743 


25644 


A 


11846 


194 


2 


TKFAKEPSPPPCWQEVFKTSPLGFFFCP 
RS FP VNG F PRL KNL F FFFETES C S VAQA 
GVQWCARA 


11744 


25645 


A 


11847 


219 


1 


SLGHTLKQSESSLSFSPGGTPLLBFKPS 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
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od 


SEQ ID 
NO: in 
USSN 
09/515,1 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, JU=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=G!utamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














VYDPSVPPFQVSKGGCVGFFFFFQTESC 
SVAQAGVQWHDLGSPRA 


11745 


25646 


A 


11848 


126 


3 


RFFFKGLFLHFFFFFFFFCFETESHSVA 
QAGVQWCDLGLV 


11746 


25647 


A 


11849 


216 


339 


KCFKVGKVYKLNRETFYLAQNFFDRNWA 
TQKMWKTLLELI 


1(747 


25648 


A 


11850 


190 


334 


VS YH I EMS FFEVYNE KIHDLLVCKDENG 
QRKQPVRLKQFI ICFEFLFL 


11748 


25649 


A 


11851 


198 


352 


SRQGTWGIHGLECFFVCLFVLFFETESH 
FVPRLECSGAI SAHCTLHLPGEL 


11749 


25650 


A 


11852 


371 


1 


PPKKLGIQVFTPFAPRAFWPVLiVFKKKG 
FPQGNLVLPGFFSKVSIQGVPPFCRKPF 
SGWGEAFQIGFFINRGKKFFFFPLGFFL 
KKVFLKFIiAKFFFFFFFFFETESKSVAQ 
AGVQWRSLGLV 


11750 


25651 


A 


11853 


176 


3 


KKGFPLDWNSMVKNI IVKFGEDLQISKT 
LHTFFFLTESRSVAQAGMQWCDLGSLHS 
C 


11751 


25652 


A 


11854 


281 


375 


QLTFKKYFLGWARWLT PVI PALWEAKAG 
RSP 


11752 


25653 


A 


11855 


146 


356 


KCGALI AE I E VPLFSELRDFLLAYSRMT 
ELCFKRCVPSLHHRALDAVDDDCLHSCS 
CTDYP INRSLMNAY 


11753 


25654 


A 


11856 


136 




LS I LCE FFNWLI EMEYRS VSQAGVQWCD 
LSSLKPPPPRLQQCHSC 


11754 


25655 


A 


11857 


277 


361 


I VLRGAMWLTP VI PTLWEAEEGGSPEVR 


11755 


25656 


A 


11858 


295 


152 


WFGFGFFETESHFVAQPGVQWCNLGSL 
CSLRLPGSSNSPASAFQVA 


11756 


25657 

• 


A 


11859 


1 


342 


GTRLPEPLTVLWTAHLGAMAPGSRTSLL 
LAFALLCLPWLQEAAAVQTAPVSRLFDH 
AMLQAHRAHQLAIDTYQESEETYIPKDH 
KYSLLHDSQTCFRFSDS I PTPYNMEDTQ 
HK 


11757 


25658 


A 


11860 


244 


332 


TIEMMLDIKQIQVIFLFEFKMGRKIAET 
T 


11758 


25659 


A 


11861 


1 


339 


GTRWT I CQ VQLHAYAHFLYFFEMEARS 
VAQAGVRWCDLGSLQPPPPGSSSSSSSS 
S 


11759 


25660 ' 


A 


11862 


311 


394 


GMLGAVAHACN PS TLGGLGGW I TCSQE F 


11760 


25661 


A 


11863 


130 


1 


VPPSVRTFFFFFFETESRSVSQAGVQWH 
YLGLLQAPPPGSTSC 


11761 


25662 


A 


11864 


3 


320 


IMMYALFLLSVGLVMGFVGFSSKPSPIY 
GGLVLI VSGWGCVI ILNFGGGYMGLI V 
FLIYLGGMMWFGYTTAMAIEEYPEAWG 
SGVEVLVSVLVGLAMEVGLVLW 


11762 


25663 


A 


11866 


165 


309 


GLILLPMLECSDRISAHCSLYLLGSYDP 
SDSGSHVAGTTGTCHYAWLR 


11763 


25664' 


A 


11867 


208 


314 


GSRNKLSGQAQWLTPVIPALWEAEVGGS 
PEIRSSR 


11764 


25665 


A 


11868 


102 


1 


KKFFFFFETESKSVAQARVQCNLGSLQA 
PPPRFT 


11765 


25666 


A 


11869 


126 


3 


NNALQLHSSYCKKPFFFFLKTESRSLTK 
AGVQWCDLGLLQ 


11766 


25667 


A 


11870 


1 


273 


KQLPVNFLNWVRLELTGLLCTLNLSKPC 
MIFI ILVI VKYWFLLFCNIFKNHI FSQA 
QWPTPVIPALWEADMGG3HGYWITTIVD 
FMCATLT 



1259 



WO 01/64835 



PCT/US01/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 
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od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E-Glutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, 1=1 so leucine, 
K-Lysine, L=Leucine, M=Methionine, 
N^Asparagine, P=Proline, 
Q=Giutamine, R^Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 


11767 


25668 


A 


11871 


126 


2 


FP KALL WFF F FF F F FETE S RS VAQARV 
QWHNPGSLQCPLR 


11768 


25669 


A 


11873 


269 


9 


QVSGKFWPDIFYKTGEKGLQCPNFPQGH 
? PRGLKKKS FS KKKKKKKKKS I AVGQAQ 
WLIPVI PALWEAKVGRSPE VKSSRPACP 
TW 


11769 


25670 


A 


11874 


114 


1 


LEAKPGVFQSGVLLVIFFFFETESHSVS 
QAGVRWCDLG 


11770 


25671 


A 


11875 


146 


1 


GNHLS SRVG VQDQPRQHGKTLSLQKI QK 
VAGPGGAHLQSQEVQVGGSL 


11771 


25672 


A 


11876 


2 


159 


S LQFFFFF FRWS FVL.VAQAGVQWHDLGS 
PQPPPPGFKHSPASASQVAGRGGR 


11772 


25673 


A 


11877 


124 


240 • 


FRAFSSGQAWWLTPVI PALWEAETGGSL 
EARSSRPDWRT 


11773 


25674 


A 


11878 


135 


5 


QVLFYFYFSDSFTLSPRLECSGVTLPHC 
NLCLPGSSDSCASAS 


11774 


25675 


A 


11879 


170 


2 


GFFFPEKWGYKNSFLCFFFFFFFEPEFC 
S VAQAGVQWLHLGS LQ PPP PVFKVDAAP 


11775 


25676 


A 


11880 


97 


267 


GHGHATLRGLCVLSLFFHIPAPSVSGTS 
DAEECCLCVTYRLI CGYISRNCLYFSDH 
R 


11776 


25677 


A 


11881 


1 


292 


LP EP LR VLWTAHLQAMAPRS RTTLLLAF 
ALLCLPWLQEAGADQTVPLSRLFDHAML 
HAHRAHQLAIDTYQE FEQTY I PEDQKYS 
FLHDSQT Y FCFS D 


11777 


25678 


A 


11882 


93 


2 


KYQMGWAWWLTPVI LALWEAEAGRSPET 
SC 


11778 


25679 


A 


11883 


145 


2 


PPLGLRLQVQAPTPGFFFFFFETESRSP 
RLKCKGAI LAHCNVCLLLV 


11779 


25680 • 


A 


II 884 


202 


I 


TWWRWGVTVLVRLVMNCRPCDRHKSASQ 
LIGRVRQENGLNARVGGCSEPRSRHCTP 
VWVTSENPSSC 


11780 


25681 


A 


11885 


42 


155 


GERSGLSPGVQDEPGQHSGTSSLQKILK 
LAGHGGTCP 


11781 


25682 


A 


11886 


215 


1 


STMARHCPLSPMLFNSAMEVLVRAISQE 
KEIQGIQIGKEEVQLSLFTDDMIFNLEK 
RKDCS KNLLQLMNLV 


11782 


25683 


A 


11887 


214 




GYFFGLNEVLGKLEKPSLKVPQNPSPKR 
PFFLGFFFFETESRI VARLECSGAI SAH 
FNLCLPGSSDSPVSC 


11783 


25684 


A 


11888 


215 


3 


WGPGFPFFSFFFFFLRQGLAVIRLKCSG 
T I TAHCS LN FLGS SD P P AS VS LVATTTG 
HEPSITQFHSHGSC 


11784 


25685 


A 


11889 


118 


1 


RFFIMGENPTKFFFFF3TESLLPRLECS 
GVISAHCNLSC 


11785 


25686 


A 


11890 


267 


83 


HCLRSGVQDQPGQHGKNPSIQKIQKLAT 
SFKKSLKIVIPPLPFKNDKTDSKSSCSS 
LSPHS 


11786 


25687 


A 


11891 


134 


2 


DRLAVLPRLECSGMIFLLPLPSRFKRFS 
CLSLPSSWDYRGAPRA 


11787 


25688 


A 


11892 


116 


1 


SKGVGKFLFFFFFFETESRSVAKTGVQW 
CDLGSLLCLV 


11788 


25689 


A 


11893 


397 


475 


RFVCSTIKVLRDLSSDRSNPGRFLST 


11789 


25690 


A 


11894 


115 


331 


KNVCLFVF VEMKNHLN PGDEGCS KPRS R 
HCTPSWAAQQDSISKYICIYRYLYLDIY 
LSKLWRLRINQLPNS 


11790 


25691 


A 


11895 ! 134 


251 


INPPVSRKKKKKKKKKKKKKKKKKKKKK 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Iso leu cine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=G!utamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unkno\vn, *=Stop codon, /=possible 
nucleotide deletion, \-possible 
nucleotide insertion 














KKKKKKKGGGL 


1179! 


25692 


A 


11896 


186 


402 


PGKNLTLENVPRENKVGEKAPGQWEPPV 
LGGGEYQDPGGNVKGVGPPPAPGFGKDG 
PKRLADTFDLIEGDGD 


11792 


25693 


A 


11897 


100 


2 


KGPPFFFFFFFFFFFFFFFGQSGQVKLK 
SPKCK 


11793 


25694 


A 


11898 


92 


3 


SVARIMPYALFLLSVGLVMGFVGFSSKP 
S 


11794 


25695 


A 


11899 


283 


362 


MTYTMTIVHALWASVCLLLNHAHDPLD 


11795 


25696 


A 


11900 


248 


1 


VMSAQPGLS VI LRFMGKWSNWTGMQ YTE 
SEVER YRDRGRARETERQRDRERERENP 
RNRKLFFYGRFKHSPQDFFMPQKVHF 


. 11796 


25697 


A 


11901 


253 


2 


KLQENPFKLLNFIINLCSVSLNVPI ILN 
INIKPSFIAPKPRCPSKFQRNYIYIYPS 
I YLS I YLS J YLS IYLPITIS KVLKDVE 


11797 


25698 


A 


11902 


179 


3 


LAVTKNE IMGFSTTWMEMGDINFFFFFF 
ETKSHPVMRLECSGVISAHCNLRILVSS 
SC 


11798 


25699 


A 


11903 


1 


403 


GTSSQESFGGCCVSGLIAMGTKAQGERK 
LLCLFILAILLCSLALGSVTVHSSEPEV 
1 1 PENNPVKLICAFYDLSFFSSS FSYFF 
HLSFFIYYPYYLFVSCFTFYIVILLSFI 
YSHHLSSVS Y YS YVLLI YTFIL 


11799 


25700 


A 


11904 


37 


242 


KGPTRDQLQHPKARLPAPLRVLWTAHiiA 
AMAAGSRTSLLLAFALLCLPWLIYDGAS 
HIVSLFMLFI IF 


11800 


25701 


A 


11905 


1 


298 


GTSQHRGRKDSRTGSHSSSDHPGAKLLS 
TEEKQAAETMRPPSAPPRRGCIPWQGLL 
LSSS 


11801 


25702 


A 


11906 


225 


30 


ACYLQKDGAFAI LPDVLPDMFKGHS S LY 
PCQHAECDHI KNI YNCVCVCVCVCVCVC 
IVICKLNV 


11802 


25703 


A 


11907 


196 


320 


HSSSSTTPSQDTHTHTHTHTHTHTHTHT 
HKI PQRELLPS VPD 


11803 


25704 


A ! 


11908 


130 


3 


NGFVFSPFPFFFFFEMESHSVTQARVQW 
CDLGSLQSLPPGFK 


11804 


25705 


A 


11909 


381 


2 


ASI CGQKLIFFQHWKKI ILVQQVGLQKK 
NSTLKRAWEFLSFFQPAPPLGSPLKGIN 
ILNKGAKKLKGKTNFPGPIVFFFSVLPP 
FFGLKKFQKKVFHQGKSPFFFFFFFETE 
SRSVAQAGVQWHDL 


11805 


25706 


A 


11910 


138 


2 


SVPLKEFI ISQARWLTP 1 1 PALWEAEAG 
RSPEVRSSRPASTWRNP 


11806 


25707 


A 


11911 


231 


54 


IRASFGIQIRICLTIFFFFETEPHSVTQ 
AGVQWRDLGSLQPPPRRQEQNSVSKQTK 
TN 


11807 


25708 


A 


11912 


3 


461 


DAWGRVEGPPLRPPATSRRWAGPTLWRM 
EVTGDAGVPESGEIRTUCPCLLRRNYSR 
EQHGVAVSCLEDLRS KACDI LAI DKS LT 
PVTLVLAEDGTIVDDDDYFLCLPSNTKF 
VAMAINEKWAYNNSNGGTGWISQESFDV 
FEAYSGATLLFFF 


11808 


25709 


A 


11913 


318 


407 


LCGRLWWLMPVI PALWEAEAGGLLKLRS 
MR 


11809 


25710 


A 


11914 


340 


5 


DRVAKFSEARLFRLFFPFFPLKI FCFPR 
GFKIFRGVCPLLFPPEFWGLFQKGPGMR 
LFFPPLGGFFFFFFFFFFFFFFSFFFFF 
F F P FF FFF F F FFCKKN rCTLLKKVKNS I 



1261 



WO 01/64835 



PCT/US01/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown 7 *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


11810 


25711 


A 


11915 


274 


385 


IKPQRWGETRAEKLKSLKIRVPLLLQRN 
AAPHQQWNK 


11811 


25712 


A 


11916 


259 


441 


DTKLPKVNI KLNFHALKKKKKKKKKKKK 
KKKKKKKKSSSLRG 


11812 


25713 


A 


11917 


254 


402 


LIVSVIDFLRWRLPLLPKLECSGMISAH 
CRLRliPGSYHSPASGCQLLGR 


11813 


25714 


A 


11918 


223 


1 


NTNSPRKKFFYEEVSSVPVFFGVFPPSP 
LKGSPRAFLKLAWRPPLPLFFFLEMESR 
SAAQPEVQWCDLGSLQPP 


11814 


25715 


A 


11919 


97 


3 


KPFFFPFFFFFFEIESRSVTQAGVQWRD 
LGS 


11815 


25716 


A 


11920 


154 


384 


KEFFLMLFFFLPPPPPFFFFFFLGKGLF 
FFPPGGGGGGQFFS I GPPPPRGKKDPPP 
SPPKMGGKRGTPPPPGYFFFF 


11816 


25717 


A 


11921 


225 


369 


GTLNLLTYKI KS WSGAVAHYNPSTLGGW 
GKWITSGQEFETSLANWKP 


11817 


25718 


A 


11922 


116 


372 


MEYTWNKMAMSYNLHIYQTYKNSVIEHP 
TGVLPHSRVIMVWNIVLCISKQLKELLR 
RLM WE EHLS PGGGG CSE PRSRHCTPAWA 
T 


11818 


25719 


A 


11923 


271 


409 


KKKKKKKKEKKKKKKKKKKKTGGGA 


11819 


25720 


A 


11924 


121 


228 


KKKKKKKKKKKKKKKKKKKKKKKKKKKR 
GGGFKKKPWGGQK 


11820 


25721 


A 


11925 


55 


423 


NKPKKKNFLKKKKKNFFFPFYPLKFFFF 
PKSLNFFRRVVPKISPPKKKFFFKNSPS 
VFFFPPLKKKNFFFLTPLKFGPPKNFFK 
RPPPLFFFFFFFFFFFLSGNCIEKL 


11821 


25722 


A 


11926 


253 


443 


YQHQRPKVDKSMKMGRNQRKKSENSKNQ 
NTSS PPKDHNS S PARQQNWMENE FDGLT 
EVGFRRW 


11822 


25723 


A 


11928 


175 


413 


KKKKKKKKKKKKKKKKKKKKKKKKAGGG 
A 


11823 


25724 


A 


11929 


490 


182 


RKQKI KGCKKPNPLAQGGVKKKGKKGPP 
FNFFLKQKGQDFLKNWNLGQKKKI P I P P 
QVFSLRQKGAFPGRI FFFFFFFETESCS 
VAQAGVQTbSYKNKNKIK 


11824 


25725 


A 


11930 


no 


2 


KKIPRKPPFFFFFFFFYKAGSHLVAQAG 
VQWHDLGS 


11825 


25726 


A 


11931 


169 


987 


YLE KI MS EHSRNS DQEELLDEE INEDE I 
LANLSAEELKELQSEMEVMAPDPSLPVG 
MIQKDQTDKPPTGNFNHKS LVDYMYWEK 
ASRRMLEEERVPVTFVKSEEKTQEEHEE 
I EKRNKNMAQ YL KE KLNNE I VANKRESK 
GSSNIQETDEEDEEEEDDDDDDEGEDDG 
EESEETNREEEGKAKEQIRNCENNCQQV 
TDKAFKEQRDRPEAQEQSEKKISKLDPK 
KLALDTSFLKVSTRPSGNQTDLDGSLRR 
VRKNDPDMKELNLNN I ENI RK 


11826 


25727 


A 


11932 


161 


389 


SVQTHPNLRSCSVLKNAMHFYLLGYVIS 
GCTEPAKAIKPIDRKSVHQICSGPWLS 
LSTAVKKIVGNSLDAGATNI 


11827 


25728 


A 


11933 


134 


289 


ASTQNMGQMLRLMS S PTLLSLSLSHTHI 
HKHTRMHTHTHTHTHTYTPTKKV 


11828 


25729 


A 


11934 


128 


1 


IKKGEFFFFFFFFETESHSPRLQCSGTI 
LAHCNLHLLGSSNS 


11829 


25730 


A 


11935 


131 


19 


M YIG WVQWLTPVI PALWEPKVGASLEAR 
SLRPARTSE 


11830 


25731 


A 


11936 


104 


3 


LKGvIj I RAQWLHPVI PS LWEAEMGRSPE 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
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sequence 


M 
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od 


SEQID 
NO: in 
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09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lso leu cine, 
K=Lysine, L=Leucine,M=Methionine, 
N=Asparagine, P^Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unkno\vn, *=Stop codon, /-possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














VGSSR 


11831 


25732 


A 


11937 


146 


2 


GG RLG 1 1 YHAWNPEG FFFF FE ME S CS VG 
QAGVQWCNLSS LQP PHPGFK 


11832 


25733 


A 


11938 


149 


1 


DKCFLCVCCLFIFIYLLIYFYFFETESC 
S IAQAGVQCCNLGSPQPLPLA 


11833 


25734 


A 


11940 


176 


380 


QSVAQLPRLECNGMILAHCSLHLLGFKR 
FSCXjSLPSSWDYSRLRNYQGGRRWSKTH 
QAWCQSLTILWN 


11834 


25735 


A 


11941 


133 


415 


KKKKKKKKKKKKKKKKKKKKKKKKKKKK 
KKKKKKSG 


11835 


25736 


A 


11942 


222 


420 


QGDKFLDNNNSTIbYMEKKKKKKKKKKK 
KKKKKKKKKKKKKKKKKKYSR 


11836 


25737 


A 


11943 


163 


425 


KNPPKNSQLSPSTLKKKKKKKKKKKKKK 
KKKKKKKKKKKKKKKTG 


11837 


25738 


A 


11944 


143 


3 


KMGKKQSRKTGNSKKQSASPPPKERSSS 
PAMEQSWTENDFDELREE 


11838 


25739 


A 


11945 


200 


3 


QNSQVGYKGAPHKEKTMALQARVNLGPP 
RGPLKRPALFFFFFETESPSVTQDGVQW 
HDLGSMQPP 


11839 


25740 


A 


11946 


221 


3 


RKIEGGPSTSRKSVRGVRCFQPLGLGFP 
GGLFFFFFFYETESCSVARLECSGWSA 
HC I LHLRSTNEGRRGR 


11840 


25743 


A 


11947 


68 


177 


ISSS FKNKCLCEKKKKKKKKKKKXKKKK 
KKKKKRKR 


11841 


25742 


A 


11948 


210 


329 


RHTHTHTHTHTH I LS LS LSHTHTHAHTL 
MYFLAFFDLRS 


11842 


25743 


A 


11949 


183 


20 


LG TVAHS CN FVF S LS RG FLHVG KNGLRL 
PTSDDPPTSASQSAGTTGMSPCTWPE 


11843 


25744 


A 


11950 


232 


329 


VGIKMSISSDEVNFLVYRYLQESGFSHS 
AFTF 


11844 


25745 


A 


11951 


412 


239 


LFYEKGSRFVSQAGLELLELKQSPCFGL 
PKCWD YRHE P PRQADLPFCVP SS AWASW 
L 


11845 


25746 


A 


11952 


310 


393 


PHTDISGTPEIMHYVHVHRVTTQPRNKP 


11846 


25747 


A 


11953 


193 


3 


LSSWPTLISGAFKAEENAVEGNDDSRIL 
RGFFVLFCFVLFFRQSLALSPRLECSGV 
ISAHCNL 


11847 


25748 


A 


11954 


185 


416 


S VQTHPNLRS CSVLKNAMHFYLLGTE PA 
KAIKPIDRKSVHQICSGPWLSLSTAVK 
KI VGNS LDAGATN I DLKLKDY 


11848 


25749 


A 


11955 


100 


1 


AHLSKVFP I FFFFFEMESHSVTRLECSG 
TISAH 


11849 


25750 


A 


11956 


35 


521 


KEKFFFHAGVYWGPPRNFLKRAPLFFFF 
FFFFFFFFFFSFFFS 


11850 


25751 


A 


11957 


214 


1 


KKKIFFFKEIFYPKLSRKKKNFFRGAGK 
I LNLS I FPEKKKI FS P FS FFFETES CTV 
AQAGVQWHVLGSLQP 


11851 


25752 


A 


11958 


114 


1 


R YFSGQAQWLTPVI PALWEAEAGRS PEV 
SSSRPAWLTW 


11852 


25753 


A 


11959 


188 


12 


SKCQCTFFSNFKKTEMESHYLAQAGLEL 
LGFS HLP TS VSQTVG I TGVSHCAQPNAH 
LY 


11853 


25754 


A 


11960 


241 


556 


S S I PFLPNKHLLLS FSTLSSLGREVS IQ 
DMCQGTYQKQPFPFPLPHVFHLS IRDGE 
LCVGRDVHLTCQTDLAQVFVCLFVLGVL 
LSLPRAGVQWCHLSSLQPPPP 


11854 


25755 


A 


11962 


362 


462 


KIGQALTPI IPLLWEAEAGASPEARSSR 



1263 



WO 01/64835 



PCT/US01/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
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SAWPT 


11855 


25756 


A 


11963 


43 


2 


CCCYCCCCCCYCC 


11856 


25757 


A 


11964 


30 


415 


GLGFKQKI FFFFFLGQGPNFDLLRAKKR 
PFLPPPFFPKNPKNRGVFLLGPSGVPKP 
RPRQGDVPPLKPRSLGVFLTSKKTPLGP 
WANPPPPPGDGP 


11857 


25758 


A 


11965 


149 


2 


SKTWLKLAKYLKNTYPGWVRV7LIPIVPA 
LWQAGAGELHEPRNLRPAWAT 


11858 


25759 


A 


11966 


179 


3 


KKNIFFPPPVKFGPPQGFFKRPPPLFFF 
FFFFFFFETESRSVAQAGVQWLSIGSLQ 
AP 


11859 


25760 


A 


11967 


245 


382 


DVTDS KMHPDFRSSHVKMWLG WAHACN 
PSTLGGSRGQEFKTSLAN 


11860 


25761 


A 


11968 


264 


10 


LSTLECKGVSPGVFSPKGWRKGLGYIFS 
PFLEKPHIPPLGISMYFFFFFDGVSVAQ 
AG VQGRDLGSLKLP S PGSS YS CAS AS LS 
S 


11861 


25762 


A 


11969 


326 


406 


RLLKGWALWLTPVI PTLWEATVGRSPE 


11862 


25763 


A 


11970 


120 


1 


KGYGFPFPPKKPLGPPFFFFFATESCSV 
AQDGVQWRDLGS 


11863 


25764 


A 


11971 


84 


370 


RNGAE LKT PLAGQTQ I T I N CR YS W VTL V 
LGDLESILCWHI FKKTNLFI CGFFSSFL 
LGAGRMKSHCWRLECRGMI SAHRNVCL 
LGSNDSPCSAF 


11864 


25765 


A 


11972 


196 


3 


SRGKMGEYPFVLFKNPSWAKAGGNKGNP 
FFFFFEAES CS VAQAAVQWCNLGS LQAP 
PPRFTPRA 


11865 


25766 


A 


11973 


559 


644 


KIGCSAYGVAILLFLYFFNKLAFTLWKK 


11866 


25767 


A 


11974 


3 


391 


HEAQLPEPLMVLGTGHLAEIVIAGGSRTYIi 
LliAFGLLCLPWLQEAGAAQTVPLSRLFD 
HAMLQAHRAHQLAI DT YQELEETY I PKD 
QKHSFLHDSQTSFCLSDSIPTPSNMEET 
QQKSNLELLRI SLLL I E J 


11867 


25768 


A 


11975 


138 


365 


L KFECHS TL CANHG KQT FTF VF YNKFLL 
KYTFYFSVLTYAINWYI KNL INVFKKKK 
KKKKKKKKKKKKNF KKKKN 


11868 


25769 


A 


11976 


275 


433 


MS ILDLSKARNFFLSFLETGSCS I TQAG 
VQWLNKSSLQPQTPGLRDPPASASQ 


11869 


25770 


A 


11977 


108 


3 


RCGLFFFETRASPCRPGWHNLGSLQPLP 
PRFKRF 


11870 


25771 


A 


11978 


270 


408 


FENNVWLGRAQWLTPVI PALWEAQAGRS 
PGPENQNHPDYKGNPWPY 


11871 


25772 


A 


11979 


228 


441 


QALKFVIEMLLSLKKCLDVSVI FNRHKK 
IELLQKKKKKKKKKKKKKKKKPSQKKKD 
SSRGKDS 


11872 


25773 


A 


11980 


2 


447 


GALALNKTTADVWRLN FLVSGLH WKRWL 
QQTSLSKWVKIKECS I LKKKKKKKKKKK 
KKKKKKKKDKK 


11873 


25774 


A 


11981 


124 


445 


KYGEMSQNPARGGPKFSXNQKYSEHLRI 
HCCPPFTFLISKKE IGDRKYS I CKSGCF 
YQKKEEDWFCCPCQKTKTSRRAKSLXRP 
KQKPVAPPGGVKAPAKPRSLPRF 


11874 


25775 


A 


11982 


48 


429 


KESNGSQDRLLPK I HS PDVSGGKSGGMS 
QNPARGGPKLSKNQKSSEHLRIHCCPPL 
PFLISKKEIGDRKYSFGKSGCFSQKKKE 
DWICCPCQKTKLKGKIRPPPKKNGPGGS 
LNGRTTWVSGLFVHN 
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11875 


25776 


A 


11983 


419 


500 


S LLVKVE KQW PGAVAHACNPSTLGG PG j 


11876 


25777 


A 


11984 


362 


496 


LWS P I RI LLLRWAWWHVPTVPAI WEASV 
GRLLEPRSLCNMVRPIS 


11877 


25778 


A 


11985 


142 


433 


RFVCS Tl KVLRDLS SDRSN PGRFLSTSN 
SSLKKKKKKKKKKKKKKKKKKKKKKKKK 
KKKKKKKKKGGGGV I KRFLGGP I FGGGG 
KEKFFFFGGGFIN 


11878 


25779 


A 


11 986 


177 


2 


CQR R WVTI>S HHTHS I TR PhGQLKKG WS P 
GAVAHTYNPS FLGGQGGWVTYVHRTRGR 
TR 


11879 


25780 


A 


11987 


178 


443 


DKKKKKKKKKKKKKKKKKKKKKAGGGLF 
KKI 


11880 


25781 


A 


11988 


329 


410 


LIFFFIFFERESHSVAQAIVQWHDLGS 


11881 


25782 


A 


11989 


146 


1 


DRAS ALQPGRQGRNFV S KWKNNNVI YI I 
NTLTTTI FLGTTYDALS PEL 


11882 


25783 


A 


11990 


240 


420 


GVFAPLLGDVSQSGYTGFRDPLEEAVCL 
FLELSCHAERTTALFRAVRQGCLSLQNL 
SVAF 


11883 


25784 


A 


11991 


239 


410 


GTLFIYLLGDLFIYFRNKSLTLLPRLEC 
GG VI VAHCSLDLLGSG YPS ISALVSS WD 
C 


11884 


25785 


A 


11992 


126 


3 


YTGEYKSFCHKDTCTHMFTAALFTIAKT 
WNKPNGRVGGRV 


11885 


25786 


A 


11993 


300 


22 


SQLLGRLRQENHFNWGGRGCSEPRSCHC 
I PAWATRANS I FCGFQASS VEVRRSARK 
KL FS D I LKRHNT I S WR VS GLLLVDS Y FG 
RLATPVRAQ 


11886 


25787 


A 


11994 


303 


2 


EGEEIFLPSPPFPPGQENPGSNCSPPLF 
PFGTFLDIRVAGSHKVQAKGVFKKRP PS 
LHLFLIKKRFFFFFFETESRSVPQAGVQ 
WRDLGSLQTPPPGFKR 


J J 887 


25788 


A 


11995 


171 


457 


SFSDHLILGWIGFCNNLSKIALCSSPIK 
NKNDDLQKKKKKKKKKKKKKICKKAKASS 
SYQDSS 


11888 


25789 


A 


11996 


357 


204 


EKTGFPHVGQAGVDLLTSRSTRLSLPKC 
WDYRHE PMRLAGHLYYF YPQMK 


11889 


25790 


A 


11997 


219 




PRVFWAPPPRYPPGALFWAPRPVGVSLG 
ARAPTKAGPKQKGARGTFFFFFEMESHL 
SPRLECNGTIWAHCNLC 


11890 


25791 


A 


11998 


254 


1 


AHLRGNRQLPKHTFFQYMTTNLKCAFSV 
GRQSYS IPWRSFYTSLFFKYSCVFKTHT 
HTHTHTHTHTLYFQIMVLLLPSLRKKGS 


1J89I 


25792 


A 


11999 


167 


2 


NFKSFFQGLSRGVLNPKVNLFFFLETES 
CSVSQAGMQWGDLDSLQPPPTRPPTRP 


11892 


25793 


A 


12000 


104 


3 


RPWTFFFFSETKSRSVAQAGVQWRDLCS 
LQPPP 


11893 


25794 


A 


12001 


178 


2 


KIFQRGAKNSPVJGLPFRGGENKKGAPPC 
QKGFGRFFFFFETESRFITQTGVQWCDL 
GP 


11894 


25795 


A 


12002 


163 


282 


GIGGEWCLSKYVI KYVKKKKKKKKKKKK 
KKKKKKKKKKK KKKKKKKKK 


11895 


25796 


A 


12003 


195 


2 


KWGRIPLKKGGFCFTKKKSGQTLLIKSP 
PPAFFVFFIFFFLETESRSVAQATVQWC 
DLGSLQAP 


11896 


25797 


A 


12004 


152 


1 


CFVDVSAEKLRPKDFMTKTPKAMATKAK 
I DKWALT KLK S F CTAKET 1 1 RV 


11897 


2579S | A 


12005 


359 


1 


EWPIWPPPKRRVLSKVSKQFFISAPIR 
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nucleotide insertion 














KKPFPCQVGLILAPFRVLLIGRPFFFFF 
GVLWE I QRWS F I S LWVWRAR I MM Y A 
LFLLSVGLVMGFVEFSS KHCP VYGALVL 
VSIGGRV 


11898 


25799 


A 


12006 


189 


12 


DGRLPGS VCY YLI FLRQS FTLLAQAGAQ 
WCHLGSLQSPPPGFKRGRRVSRIRAVRG 
DQ 


11899 


25800 


A 


12007 


131 


3 


KI FFFFFFFEMESHSVAQAGVQWRDLGS 
MQDPPPGFMPLLFY 


11900 


25801 


A 


12008 


150 


2 


IILFLVPIEAVFFFFETVAEAGVQWHNL 
SSLQPPPPGFKQSPASASRVA 


11901 


25802 


A 


12009 


305 


3 


KFFFSLKASIRGRWFGLFFTPPKKGFLP 
KIPHQVFKGGPLWEKLQLGKAGLNLGPY 
KG FFKGAAR F FFFF FETES RSVAQAGVQ 
WRDLAHASVHASAHAS 


11902 


25803 


A 


12010 


263 


517 


DKGFQLPPRWCPGQDPLFFYGGIVFHGG 
YGPPFFFFFFKAESRLVAQAGVQWQDLG 
SLQPPPCRTRGSP 


11903 


25804 


A 


32011 


107 


374 


WMGTWGSLPVLLTTRSTCPPRSPRLHRR 
TGATIWIFFRLFETGTVLPMLECSTVIM 
AHCRLQLPGS S S S PTIAS QVAGTKAHLS 
DCFVY 


11904 


25805 


A 


12012 


144 


261 


FPKKINNFYFTFLQDPTGI FSLDKTI GL 
GTYGRIYLVS 


11905 


25806 


A 


12013 


371 


3 


I FHLRK I FTFLPG I F WP I RLLS EKKKPK 
RFWVKIKKFKNGYPLKNPIWPKYKGPRF 
FFKGEKNPNGKVPVLKKI KI P S FS KALG 
FLQKKASKPFFFFFFDTESCTVDQSGVQ 
WCDLGSLQAP 


11906 


25807 


A 


12014 


101 


3 


RGAFFFFFFETRSRSVIQAGVQWCDLGS 
LQAP 


11907 


25808 


A 


12015 


205 


2 


VFNPSGINVMYVDAIILNKRLATRIQQH 
I KQRI LHDQVGF IHGMQGW FS I RKS INV 
IQHINRPKDKN 


11908 


25809 


A 


12016 


168 


1 


GCVCQFQKKLGNGGLNGFFFPLFFFFFF 
EMKSRSWQAGVHWRYLSSLQTPPPEFK 


11909 


25810 


A 


12017 


311 


2 


RGLRFGQWKTQMENPNTPFSALRPGQLS 
SIRSGLLHTYPGLGDRSEPLSCSILSSS 
KYWWQAVTSALSSNKPGASQGHWKDDF 
FLFFETESCSVTQAGVQWH 


11910 


25811 


A 


12018 


383 


247 


LVEMGFRHVGQDGLDLLTSQSAHLGLPK 
WWDYRREPPRLATI EVL 


11911 


25812 


A 


12019 


2 


379 


RVLWTAHLAAMAAGSRTSLLLAFALLCL 
PWLQEAGAVQTVPLSRLFKEAMLQAHRA 
HQLAIDTYQEFISSWGMEAYITKEQKYS 
FLHDSQTSFCFSDSIPTSSNMEETQQKS 
NLELLHISLLLIES 


11912 


25813 


A 


12020 


3 


389 


PEPLRVLWTAHRGAKGAGS RTSLLLG FA 
LLCLPWL.QEAGAVQTVPLSKLFDHAMLQ 
AHRAHQLAI DT YQE FEETY I PEDQKYS F 
LHDSQTSFCFSDS I PTPSNMEETQQKTN 
LELLR I FLVLI ELWLDP 


11913 


25814 


A 


12021 


247 


386 


PMLGHVSQSGGNGVRDPLEEAVCPLAKL 
KHCSGRSTALFRAGRQKR 


11914 


25815 


A 


12022 


17 


371 


PLRDLWTAHLEAMAPSFRTDLLLAYALL 
CLPWLQEARAVQTVSLYRLSDHAMLQAH 
RAHQLAIDTYHEIEETYILKDQKYSFLH 
DSQTSFCFSDSI PTPSwKEETQQKFYLE 
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LLPISL 


11915 


25816 


A 


12023 


3 


359 


LRAIiWTADLGAKAPGSRTFLLLASALbC 
LPWLEEAGAGQTVPLSKLFDHAILQAHR 
AHQLAIDTYQELEETYIPKDQKHSFLLD 
SQTSFCFLDS I PTPSNMEETRQKSNLEL 
FRTSLLL 


11916 


25817 


A 


12024 


2 


363 


PEP LKGLWTDHLGALAP S S RTS LLLA YA 
LLCLPWLQEAGAGQTVPVYTLFDHAMLQ 
AHRAHQLAIDTYHELDETYMPEDHKYSF 
LHDSQTCFCFSDSITTPYNMEETQHTSN 
VELLRISL 


11917 


25818 


A 


12025 


314 


393 


GQAWWLTPVI PAFWEAKGGRSLEARS 


11918 


25819 


A 


12026 


80 


392 


PLIACSFFLFLFFFFGKKILFLPPRWKG 
GGGI WVNGNPGFRGQAFFLAS S S KKPGM 
GGPPLTPGKRFVFLKKKGFSPGGPGGSK 
FSAPGT PPTGPPKGWE I PAK 


11919 


25820 


A 


12027 


165 


2 


WQKLLFHFGTESCSVARVGVQWRHFSSP 
KP PPPE FKQLSAPAS RVADRRPPDAW 


11920 


25821 


A 


12028 


1 


338 


VFVALKFLMCLLSVCI FSLENI YSNSLP 
QKNCLPLI FFFLFFEMEACSVTQAGVQW 
GDLGSLQ P PVSHLNLGGGGCCE PRE CHC 
TP CS R PGDRDFVFNKN TM I QDNHLM ELT 


11921 


25822 


A 


12029 


2 


315 


HEERERERERERERERERERERVGRTVG 
GVGETYKAELPRVGGAAQKRAHFSARGR 
LFMEICGDMCGEKPGKPPHSLTEECLSR 
CGERFFDTSLAITRGCAQSV 


11922 

t 


25823 


A 


12030 


216 


3 


ERIIPRGVRQRNRLFSGRDTASPPLFYP 
P PRQ I H KRG V EDREH E RRGERERERERE 
RERERERERARAAR 


11923 


25824 


A 


12031 


114 


329 


QTERNS ININKKDIHTKTPSVGHQHQRP 
KVDKTTKMGRNQ SRKAENS KNW KAS S P P 
KEHNS S PAREQN WMES 


11924 


25825 


A 


12032 


177 


1 


IKKKTFFWQNIPLFSKKKTCRQKFFFFF 
FFETGSLLPRLVCSGAVLAHCSLCLPGS 
AFL 


11925 


25826 


A 


12033 


127 




PSFFFFFFFNETESRSVAQAGVQWRDLD 
SLQAPPPGFITPSC 


11926 


25827 


A 


12034 


47 


314 


GAPVASVSISCPSCSATDGWRWGKSTA 
GHQRYLCSHCRKTWQLQFTYTASQPGTH 
QKI IDMAMNGVGCRATAR IMGVGLNTI L 
RHLKN 


11927 


25828 


A 


12035 


15 


408 


GAIPGAMGHFTEEDKATITSLWGMVNAE 
DAGGETLERLLVDYPWTQRFFDSFGNLS 
SASAIMGNPKVMAHGKKVLTSLGDATKH 
LDDLKGTFAQLSELHCDKLHVDPENFKL 
LGNVIiVTGMAIHLGKEFTP 


11928 


25829 


A 


12036 


122 


2 


KLVDVLPPPGGARVFFFFFFPLETEHCS 
VAKAGVQWHDFG 


11929 


25830 


A 


12037 


154 


1 


GKPPPKLGFWGGPRQRVRGRGPPFFFFF 
ETES PS VAKAGVQWCDLGSLHPL 


11930 


25831 


A 


12038 


58 


254 


DPRVRQQRTTAHCSLNLPGLQRPPSLSL 
PSGWDYRHAPLCATQKI LAYRVAQRERS 
AHGSYYQAS 


11931 


25832 


A 


12039 


270 


124 


DGVLLLL PRLECNCAI SAHGNLRLPGS S 
NSPASNKTKQNNNKKTLSNKF 


11932 


25833 


A 


12040 


63 


355 


LGRGTAHRRLW RPLS RP ASRVS Y L S AAT 
NKRSFAPPSRAFPWDNKWERGGFYFFET 
GSHRRRPGRi-iECSGAITAKCSLDFPGSE 
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MGFTMFPSLVSNS 


11933 


25834 


A 


12041 


184 


3 


GVSRLLGGVSQLGYMGVRDPLEEAVCLF 
S KLKHH PGRTTAL FRAVRQG CL 3 LQ KFL 
LLFA 


11934 


25835 


A 


12042 


118 


3 


DRVLLCYIiGWECSGTISAYCNLHLPGSK 
RFSCLSLPSS 


11935 


25836 


A 


12043 


40 


436 


LSEGKLTNRKDNHTKTPSVHHHHQKPKV 
DKTTRMGRIQSTKAENSKNESTSSPPKE 
HSTSPATEQSRMENDFDELREDFRRSVI 
TNFSELKEDVRTLCKEAKNLEKRLDEWR 
TRINSMEKTINDLMELKTMA 


11936 


25837 


A 


'12044 


101 


2 


HLGQVRWLTPVI PILWEAEVGGSPEHRS 
SRPAW 


11937 


25838 


A 


12045 


123 


3 


LTVI KELPLGQARWLTPVI PALWEVKVG 
GSPEVRSSRPA 


11938 


25839 


A 


12046 


3 


432 


P I FGGGGER I FPF FFFFSQGGGVFFPKT 
KNKGFFPFLGFFKKIFLRNFFFLFFPFF 
FLNPFFYFGAPPFFFFFPGGGFFFPLFS 
RFFKFFFQNFPQGFFPGGGFFFFFLGFF 
FFFFFL 


11939 


25840 


A 


12047 


325 


164 


KNKRKNRGQARWLTPVIPALWEAEAGGS 
PEVRTTGVSHGTRPHLLLNTVSEYI 


11940 


25841 


A 


12048 


187 


381 


LMDKRVSLWGDENILKVESNSGCTTLYL 
LVCFEMESRSITQAGVQWRDLSSLQPPP 
SRFKRFSCL 


11941 


25842 


A 


12049 


84 


464 


SYRVPSHPDTLVLSRISAQEAGEKSPFC 
FPERVWPCPRPLSDLGRRLKLECGPDLL 
DSTFLSFFFGEGLKTGSHSGALECSGVI 
RDHCGLCLSGSGDPPISACLRVAGTTGV 
SHHCI FCRQDLPGS 


11942 


25843 


A 


12050 


232 


20 


LLQCSSRAKIHTSLTGNQKLEMIKLSEE 
GiMSKAEISQKLGLLHHLGLVANAKQRFIi 
KVIRSATPEFRHAE 


11943 


25844 


A 


12051 


401 


518 


FFETES C S VTQAG VQWRNLS P LRA 


11944 


25845 


A 


12052 


191 


1 


LFPLKKEKIFFLGPGTNGAPPMFFLKAP 
PLFFFFFFFFFFFFFFFFFFFFVNbFLL 
LSFLHILEP 


11945 


25846 


A 


12053 


12 


369 


PLPIYSACIEVGSNPQGPGIDAKSVSHN 
NCYLKEKKKKKSNGSQNNLLPKIHTPNG 
SGKKTGEKIQNPPRGGPKLSKNQKYSKN 
LKKHCCPRLTFL I S KREKGNRKYTFGKS 
GCFYQKK 


1 1946 


25847 


A 


12054 


89 


513 


NFTVRGS I PLLNWI PMANLLLLIVPILI 
AMAFLMLTERK I L»G Y I QLRKGPNWGP Y 
GLLQPFADAIPLFPKEPLTPAPSAFALC 
FAAPALALPFALFLCPPRPLGRP 


11947 


25848 


A 


12055 


158 


3 


SQLTWPIMAQTDSMCLWIVYFDDNIGWA 
WWLMS VI PTLWEAKVGGSLEHSN 


11948 


25849 


A 


12056 


113 


1 


KSHGGHGVEVLVSVIjVGLiAMEVGLVLWV 
KEYDGAALA 


11949 


25850 


A 


12057 


44 


263 


ARIARSAHEGKMPRKYTGARKNAENRPQ 
RELQLKASRSTIDLAEHACNALWYQLFL 
ISVGSWKNYILFYLTYT 


11950 


25851 


A 


12058 


142 


1 


KKIFFRYKKEGFSFPKFFRISLFFFFFF 
ETESRSVAQAGVQWRDLGS 


11951 


25852 


A 


12059 


12 


347 


QTERNSININKKDIHTETPSEGHQHQRP 
IVDKSTKMRKNQCKKAENSKNQSASSPA 
KDHNCLPAXEQNW TENE FHKLTfi VGFR I 
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ITNSVELKEHILTQCKEAKNLEQRLEEL 


11952 


25853 


A 


12060 


245 


3 


MPEGSF FLEIPEI TR VFVKKPGKFLGG V 
GPS LLFFFPFFFFFFMRQS LAVS PRLEC 
NSAISAHCNLCLLGSSDSRSSGSR 


11953 


25854 


A 


12061 


94 


3 


KGGI FFFFFFFFLRQSPSVAQAGVQWCD 
LG 


11954 


25855 


A 


12062 


108 


5 


NNLS SG VRDQPGQHGETTS LLK I QKLAR 
HGGRHL 


11955 


25856 


A 


12063 


298 


1 


KKKKINTHAQKRGPPFFFKPPPEKGKAP 
PPQRVTKKGGEKDSFIPKRREKKNPPPL 
SFFFFLVRVHVRERERARARERERERER 
ERERERERERERERAR 


11956 


25857 


A 


12064 


122 


1 


VCEGVCVGGWVRDRERERERERERERER 
ERERERERERAR 


11957 


25858 


A 


12065 


126 


1 


ATCTKFRNDNRALLRVRVFERERERERE 
RERERE RE RE R ERE S R 


11958 


25859 


A 


12066 


48 


351 


FFFFFFFFKKGPKIVPPGGGGGGDPIFL 
EPPPPPL»KKFWGPPLRGRGNKGGGPPGW 
VNFGI FXiKKKGLPWGPGGVKTPAFKGSP 
GPTPPKGGNNRKNPPPW 


11959 


25860 


A 


12067 


1 


365 


GTRLTVLWTAHLVAMAPGSRTSLLLDFA 
LLCLPWLQEAGAVQTVPLSRLFDHAMLQ 
AHRAHQLGIDTYQEDEETYIPKDHEDSF 
LHDCQTSFCFSDSIPTPSNMEETRQKYH 
LELLRISLLF 


11960 


25861 


A 


12068 


174 


1 


PAWVMQHNPVSLFFCFFETESRSLTRIiE 
CSGT I SAHCNVRLPGSSDS P VS PSRVAA 
RA 


11961 


25862 


A 


12069 


270 


375 


TRI KRCNGGRAQWLTPVI PALWEAKAGE 
SPEVRSS 


11962 


25863 


A 


12070 


197 


350 


KKIYIFLTINTIMDLIIPFLLDNILGFW 
LGAVAHACNPSTLGGRGGWITRS 


11963 


25864 


A 


12071 


3 


378 


HEGQLPEPLKGLWTAHLLGMAPGSRTSL 
LLAI DLLCLP WLKEAVAVQTVPLSRL YD 
HAMLQAHRAHQLAI DT YQE I EET YI P KD 
QKLS FLHEYQTSFCFSDSIPTPSNMEET 
LHKSHLELLR I YL 


11964 


25865 


A 


12072 


219 


1 


FSHPFPPGVFQTPLVFPRPWEVLGSPGG 
GGGPPKKWAGVQKWVFFFFFFFWEMESC 
S VAQ AGVQWYDLGS PRA 


11965 


25866 


A 


12073 


134 


3 


VFGPPFFFFFFCETESHSVAQAGVQWRD 
LCSLQAPPPGFMPSC 


11966 


25867 


A 


12074 


175 


357 


MPINQPVKKMCVCVCVCVYIYIHTPLYI 
RVYMYMCMYICVYIHVYIWMCIYIHTHV 
STYW 


11967 


25868 


A 


12075 


193 


350 


KIPKLTSLYHTQNYLKYC IGQAWWLTPV 
I PALWEAEAGGS PEVRSSRP AWPW 


11968 


25869 


A 


12076 


252 


1 


GVFGPFPKKGGFFWVKGFLGTPQIFWVG 
VFFKKKGGALWVKGLLGKPPPPPLFFFF 
FETES CSVARAGVQWRDSVKKKITIARA 


11969 


25870 


A 


12077 


273 


366 


IFIYLFIYLFETESCSVSQAGVQWPNLG 
RLR 


11970 


25871 


A 


12078 


127 


2 


KKKKTLFFFFFFFFEMESRSVTQAGVQW 
CDLSSLQPPPPLV 


11971 


25872 


A 


12079 


206 


334 


HISTRVTIVNNNLIVHFKITKRCWTQWLTP 
VIPALWEAKAGGSSE 


11972 


25873 


A 


12080 


135 


2 


KMKRKFGEDSQI PKTLQTFFFLTESRSV 
AQAGmQWCDLGSLHSC 
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11973 


25874 


A 


12081 


147 


3 


ATTPGLFFIFYFFETESHS IAQAGVQCM 
I SVHCNLCLPGS SDS PVLV 


11974 


25875 


A 


12082 


106 


2 


Y FI PINEGYCFI YLFFETESRS WQA6V 
QWCDLV 


11975 


25876 


A 


12083 


164 


346 


CP YKKRKKANQAKWLTP V I MLFGRPRRV"" 

DRPSSGVQDKPGQHGETPSLPKIQKKSQ 

AWWCA 


11976 


25877 


A 


12084 


186 


1 


KSLFSKGNPLFKGPPPPPFFFFFFFETR 
S R S VTQ AGWG AV I MAHC CLNLPG PS D P P 
TPAPRA 


11977 


25878 


A 


12085 


98 


2 


CPFFRVNFFFFEMESRSVAQAGVQWRDL 
GSRA 


11978 


25879 


A 


12086 


154 


2 


SPTPEKGVWELPSPFFSWGPKIWGCIFF 
FLYETESGSVAQAGVQWRDLGS 


11979 


25880 


A 


12087 


326 


3 


KTSVITCGLPMGWSWGPFPKGPWWCKKF 
CVCKFWANQTLKKNLGAGRFQKPFSAAA 
FLEMGGFPFKFFRFFQGSQFFFFFFFET 
BSCSVTQAGGQWCNIX3SLRPLPF 


11980 ' 


25881 


A 


12088 


154 


3 


KE FS FFAPGGKQRGE I RSLRGPP PQVKP 
LFFFFFFETESRSVAQAGVQWR 


11981 


25882 


A 


12089 


174 


2 


SWKAILQYSLEIILYLPSFQIFLCFSHT 
HTHTHTHTHTHTLYSQ I MVLLLPS LRKX 
GS 


11982 


25883 


A 


12090 


265 


30 


W F I VYS E I S QL WGLPS FNTLG TWQ S LS F 
I FVAEMRSCCVTQAGLELliASSDP PVSA 
SQSARI RAMSPS VAWVI PGRSR 


11983 


25884 


A 


1 209 1 


257 


1 


GSVMRMHTEEQYPENKVEQSSSGFIRPH 
LVYRVCFYLSVCLS VCLS I YLS I YLS I Y 
LSFFLSFFLLRKSLYLSIFLSFFLLSPR 
V 


11984 


25885 


A 


12092 


311 


404 


LYHQNPWLYSVFFFLETESCSVAQAGVQ 
■ WCD 


11985 


25886 


A 


12093 


331 


83 


GDDYKGARENSRDDKLFYVLIMMWIQL 
HVFVKIHKTLQLNGYILLCKLYLINLTK 
NNFQKPKTNSQKTLHIFHPDSTAANI 


11986 


25887 


A 


12094 


82 


2 


LREPNLLNPGGEGCNEARWCHCTPAi. 


11987 


25888 


A 


12095 


255 


393 


HGI LFSS FVCNLNQFLSQH WLMPVI PAL 
WEAEVGGSLEPRSSTLAW 


11988 


25889 


A 


12096 


80 


3 


PFFFFFFFFETESRSVARLECSGMI 


11989 


25890 


A 


12097 


367 


1 


PRVDKPTKMGKKQGRKTGNSKKQSPSAP 
PKERSSSPATEQSWMENDFAKLREEGFR 
RSNYSELiQEE IQTKGKEVETFEKNLDEC 
ITRIPNTEKCLKELMELKPKAQELREEC 
RSLRSRCDQL 


11990 


25891 


A 


12098 


94 


236 


MWADYFCMSPS IDEGLKKKKKKKKKKKK 
KKKKKKKKKKKKKKKGGAP 


11991 


25892 


A 


12099 


410 


212 


NSLHPSTSLFRFSSHFHIKLI SGQAQWL 
TPI I PVPWEAKAGGLLETRSSRLAWAAQ 
EDPISMPVCK 


11992 


25893 


A 


12100 


310 


324 


NLGQAQWLMPVI PARPWEAEVGRSLAVR 
SSSNSPASASQSAGPTGPSKHNWLMFLQ 
LLTRL 


13993 


25894 


A 


12101 


247 


357 


KQIQGRAWWLTPVFS PLWEAKAGES PE V 
RSLRPAWPT 


11994 


25895 


A 


12102 


245 


2 


D VLVGGL CL PAGGTLHLTS I FLLSGCWC 
GSKDEEAPCKQRISVQRESQSRTLRAGV 
S PKKAHPCEMCGLI LEDVLHFPDLQ 
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11995 


25896 


A 


12103 


230 


379 


KEASFFETESHSVSQAGVQWRSLNPLGS 
GNPPTSTSRVAGTTGMSHHVWL 


11996 


25897 


A 


12104 


86 


2 


DG VS VLS PRLE CNGL I LAHCNLRL PG S S 


11997 


25898 


A 


12105 


116 


2 


SQKTPFFFFETVSLLLPKLECNGAISAH 
CNLCLPGSSD 


11998 


25899 


A 


12106 


271 


416 


QFTFS KAFNLVF IKPSFHIS FYRTHYS Q 
AWWLTPVI PEL WE AE AGGS P 


11999 


25900 


A 


12107 


194 


3 


HVLGTVLVAEDTASNKI RSLQGCLLLPL 
LFSWLEVLARAIRQEKEIKVIQIGREE 
VNRRRGR 


12000 


25901 


A 


12108 


142 


3 


LCLL YWDCKRHAWWRGQVRWLTP VI PAL 
WEAEAGGS PE VRS LRPAW 


12001 


25902 


A 


12109 


233 


346 


HQRSLIGWAQWLMPVI PALWEAEAGVSP 
EVRSLKPAWP 


12002 


25903 


A 


12110 


234 


349 


GFS P F YDFCLFVFETES CCW QAGVR WH 
DLGSLQPPPP 


12003 


25904 


A 


12111 


136 


333 


RLNF F Y F Y FFETAS C S VAQAGVQWHDLA 
HCI PAWETEQDS I SKNTVQKRKKKVTRA 
GISKTKNENF 


12004 


25905 


A 


12112 


.345 


110 


QDGLCLVMTLQETQP I LAYS LWFPIVFP 
TTKACNVQGDSKFLLKKKYLGQVQWLMP 
VIPTLWGADVEGSPELRSLEPA 


12005 


25906 


A 


12113 


108 


285 


YNAMKNR FLKT I LWNKNS I G WAR W LMAV 
I PALWEAKAGRS PE VKS SRSRPGAVAQV 
DAA 


12006 


25907 


A 


12114 


1 


364 


RWAAEMGKFMKPGKAALDLAGRYSGRK 
AVI VKNIDDGTSDR P YSHAL VAG I DRYP 
R KETAAMG KKK I AKR S KI KS FVKVHNYK 
QLMPTRYSVDIPLDKTWNKDVFRDPAL 
KRKARREAK 


12007 


25908 


A 


12115 


120 


3 


TQ I WGAFLTLFFFF F FEAKSHS VAQAGV 
QWCSLGSLQA 


12008 


25909 


A 


12116 


120 


3 


TQIWGPFLTFFFFFFFEAKSHSVAQAGV 
QWCSLGSLQA 


12009 


25910 


A 


12117 


231 


1 


FLSPFSCYSAITKLLSL3YFLLGHILLL 
LGSTPEAAAQWQWVSFADSD I VPPAST 
WVFPTLGI MHHNKQATENAKE 


12010 


25911 


A 


12118 


216 


1 


LI FPPLLNFCYGEVLGFQGFLALFFACM 
FFWPRFSPKTFFFFFYEKEFHSAAQAGV 
QWCDLGSLQPLPRGFK 


12011 


25912 


A 


12120 


267 


1 


KKKKKAFP S P PLLG CF I RLQKHLLG I LV 
YRSHLISSLLCLEGI ILSLFIIATLITL 
NTHSLLANIVPIAILVFAACEAAVGLAL 
LVSIS 


12012 


25913 


A 


12121 


167 


372 


ISGQDLLKETKRVKRPFQQDDVPFINIF 
VPTPGAPRSLRQILELRGKTDPSPI IVG 
DFNTLFSALDRS 


12013 


25914 


A 


12122 


124 


2 


GLLKFNI FAPFFFFFSETESHSVTQAGV 
QWCDLGSLQPPP 


12014 


25915 


A 


12123 


121 


I 


R FTAAS ANS GAVS AAVG YMGD PG S E 1 1 E 
SVPPAGPVSSW 


12015 


25916 


A 


12124 


176 


2 


RDLRGFSRFSNYKFWNFGRVGGI FKFVF 
GKGVKRFFFFFFETESCSVAQAGGQWRD 
LG 


12016 


25917 


A 


12125 


137 


2 


KLGFFGGGGGKIFAPQKNFFFFFFFFFE 
ME S CSVAQAG VQWRD I G 


12017 


25918 


A 


12126 


103 


3 


LEENLGNTIQDIGRGKDFTSKTPKAMAT 
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K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop cod on, /^possible 
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KAKID 


12018 


25919 


A 


12127 


212 


351 


L I TFE I LN I CLGDNI LDLGLARWLT P VI 
PALWEAKAGGS PEVRSLR 


12019 


25920 


A 


12129 


187 


3 


RGSTMQQTNTRTPGVRVLYYILMIATMA 
IPYIPMANLLLLIVPILIAMAFIMLTER 
KILGY 


12020 


25921 


A 


12130 


13 


329 


ASRVTIRVMASNSTKSFLADAGYGEQEL 
DANSALMELDKGLRSGKLGEQCEAWRF 
PRLFQKYPFPILINSAFLKLADVFRVGN 
NFLRLCVLKVTQQSEKHLEKI 


12021 


25922 


A 


12131 


231 


330 


MDTDEI YLGRAPWLTPVI PALWEAKASG 
SPEVR 


12022 


25923 


A 


12132 


95 


2 


KGKCFPPFFFFETESRSVAQAGVRWRDL 
SSL 


12023 


25924 


A 


12133 


129 


1 


DLALLPRLWCSAMIMAHCNSELMSSSDP 
PISASQIPGTIGMCH 


12024 


25925 


A 


12134 


196 


361 


TIMEYYAAMKKKSESPLLHFQGKKLNQD 
GGWAWWPTP VI PALWKAKTGGSLEPRS 


12025 


25926 


A 


12135 


120 


1 


PPPPGLFFFFFFETESYSVAQAGVQWCN 
LGSLHPPPPEFK 


12026 


25927 


A 


12136 


235 


3 


KKGIRIERDRLKQLSVCQRFQLAKFPPF 
PSGVGEKQNFFFKKKKKKKGRAQWLTPV 
I PALWEAEAGGS FEVRS SRPA 


12027 


25928 


A 


12137 


185 


1 


IWCPFFCPGPQICSFRCKFFFPFFFFFF 
LEAETRS FAQAGVQWCDLGSLQS SWVTE 
LAAAS 


12028 


25929 


A 


12138 


158 


1 


LRRGGVFSIFFCGGTMVLSPADKTNVKA 
AWGKVGAHAGEYGAEALERMFLSF 


12029 


25930 


A 


12139 


76 


3 


KVLARAIRQEKEIKDIQIGKEEVK 


12030 


25931 


A 


12140 


157 


2 


FFSSPRLLKRRPGNFLGARENFFFFFFS 
ETESPS I AQAGEQWRDLGSLQAP 


12031 


25932 


A 


12141 


234 


388 


LSQLMSANLFFVLLFEAEFHSVNRLECS 
GMIWAHCNLHLPGSSESPASTSQ 


12032 


25933 


A 


12142 


112 


453 


LGRRQAASMREG I S I HVGQAGVH IGNAC 
WELYCLEHGIQPDGHMPSDKTIGGGDDS 
FNTFFNETGAGKHVPRAAFVDLiEPTAID 
EVCTGTYRQLFHPEQLITGKEDAANNYA 
RG 


12033 


25934 


A 


12143 


44 


443 


AKLGTRKLPLKAKMGKEITLINIWIGH 
VDWGKTTTTGHLIYKOGGIDKRTIEKLE 
KEAAEMG KGS FKYAWVLDKLKADRERG I 
TIDISLWKLDTSKNYVTI IDAPGHRDFI 
KNMITGTSHADCAVLIDAAGV 


12034 


25935 


A 


12144 


3 


386 


REAATMRECI CIHVGHAGVQNGNACWEL 
YCLEHG I QPDGHMPSDKT IGGGDDS FN? 
FFSETGAGKHVPRAVFVDLEPTG I DEVR 
TGTYRQLLHPVQL I TGKEDAANNYARGH 
YTIGKEIIDLVLDRIR 


12035 


25936 


A 


12145 


3 


386 


AGAT Y I DRLRVALFEATTFTLWVRPELS 
SGEATTMRECMS IHVGQAGVQNGNACWE 
LYCLEHGVQPDGHMPSDKTIGGGDDALN 
TFFSETGAGKHVPRTEEVDLEPTEIGEE 
GTGTYRQLFHPEQHMM 


12036 


25937 


A 


12146 


3 


385 


GRATYIDRLRVALLEATTSTLRLRRELG 
S REATTMREG I C I HVGQAG VQNGNAC WE 
LYCLEHGIQPDGQMPRDKTIGGGDDFFN 
TFFIETGAGKHVPRAVFGDLEPTDIGEI 
RTGTYRQLLHPEQIiI 
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nucleotide deletion, \=possible 
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12037 


25938 


A 


12147 


11 


381 


HTPEGRTRVPVLIAGWYCQEALRDWGR 
VTASSTGAMAFLRSMWGVLTALGRSGAK 
LCTGCGSRLRSPFSFVYLPRWVSSVLAR 
CPKKPVSSYLRFSKEQLPIFKAQNPDAK 
TTELIRRIAQR 


12038 


25939 


A 


12148 


321 


509 


YSMDQYFIPFYCQIMHLCFFLKTRVSLL 
FPWLECSGATSAHCNFCLLSSSSFSPASA 
SRVTGIT 


12039 


25940 


A 


12149 


51 


400 


AATMRECI S I HVGQAGVH I GNACWEL Y C 
LEHG I QPDGQMP S DKTIGGGDDS FNTF F 
NETGAGKHVPRAVFVDLEPTVIDEVRTG 
TYRQLFHPDQL I TGKEDAANNYARGHYT 
IGKE 


12040 


25941 


A 


12150 


25 


399 


EATTSTLGLRHQLGSRESTAMRECISIH 
VGQAGVHIGNACWELYWLEHGLHPDGQM 
PGDQTIGGGDDS FDTFFSETGAGKHVPR 
AVLVDLEPTVIDEVRTGTYRRLFHPEQL 
ITGPEDAANNYAR 


12041 


25942 


A 


12151 


1 


402 


TKEEELCLACRQRRADGGWRSQMNAGSD 
P VG I VSAARTI IGS FNGALTAVPVQDLG 
STVI KKVLKRATVAPEDVS EVI FGHVLA 
AGCGQNPVRQASGGAGI P YS VPAWS CHM 
ICGSGLKAVCLAVQSIGIGDST 


12042 


25943 


A 


12152 


3 


400 


TDRLRVALKKTTTS S FCFPHHLGSRETA 
TMRECI S IHVGQAGDHIGYACWELYCLE 
HSLQPEGQMPGDKTIGGGDDYFNTFFSE 
TGAVKHYPRAVFVDLEPTVIDEVSTGTY 
RQLFRPERLITSKEDAVNIY 


12043 


25944 


A 


12153 


2 


398 


GRCALDRNTFI FARFGCYLIASGHPGEK 
LMDMDMSPLRPQNYLFGCKLKASNDYHF 
I VANDENEHQ I SLTTASLRAGANDDMN I 
VEAEAMNYEGTPIKETLATLKT>ISVQATD 
SLGGSEITPPSVLRLKCGSR 


12044 


25945 


A 


12154 


203 


405 


FTCPSRI ICVISRR I SPCCAPDLNPMLS 
ANAMLDFVFTVEDPGGWDSKNLQKKWSH 
YSFLTGIRPKI 


12045 


25946 


A 


12155 


229 


397 


TKIAHHKAGFALISKKITKNLKMFLSKF 
LPIHALWVTGSSGMQPYPLVWGHYDLGK 


12046 


25947 


A 


12156 


2 


399 


GGVPHCVWATAWGMRPGLPGPTGLCAQT 
S SRGQKS VLKQKES CGIWQLYHFLSRKQ 
EPRWEPCVSGSSSGDGAVADLADELRGY 
PALCCTLPVHSYRS WAGIRPQ IMNGPLH 
PRPLVALLDGRDCTVEMPIL 


12047 


25948 


A 


12157 


85 


414 


ALLPQSEALQGAVTMPHSYPALSAEQKK 
EVSDIALRIEAPGKSILAADESVGSMAK 
RLSQIGVENTEENRLLYRQVLFSADDRE 
KKCIGGVIFFHETLYQKDDHGVPFVR 


12048 


25949 


A 


12158 


85 


407 


GLLPHSEPLQRAVTMPHSDPALYAEHKK 
DVSDIALRIGSPGKGILAAYESMGSMAN 
PLNQMGVENTKQNRRIjYRQDL FRAEDRE 
KKSLAGVI FFHDTLYHKDDNGVP 


12049 


25950 


A 


12159 


85 


406 


VLLPHSEALEGAVTMPHSYPALSGEHNK 
ELSDIALRIVAPGKGILAADESVGSMAK 
RLSQIGVENTEENRPLYRQDLFSADDRG 
KKSIGGDLFFHDTLYQKDDNGGP 


12050 


25951 


A 


12160 


275 


146 


EKTFI ISFFFFFETESHSVAQTGVQWHD 
LGSLQAPPPWFRRIA 


12051 


25952 


A 


12161 


212 


2 


KRS R P FF F FFKE MG S C YI AQVGMQ WLFV 



1273 



WO 01/64835 



PCT/US01/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
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nucleotide deletion, \=possible 
nucleotide insertion 














GTITACYDPELLGSKDPPTSASQVTGNT 
GTHTHTQLI FVFLV 


12052 


25953 


A 


12162 


253 


2 


GRVDS ECCIANGNAEKS I FCCLLVYLF Y 
LS S RQSHS VAQAGVQWHDLGSLQPP PPG 
FSLLSRVAGTTGARHHDWLI FVFLGET 


12053 


25954 


A 


12163 


228 


1 


LLGLWFEQLGRFTKIAKTKGERSQTQSG 
HSMDMRVPAQLLGLLLLWFPGSRCDIQM 
TQS PPAVSASVGDRVI ITCR 


12054 


25955 


A 


12164 


122 


1 


RLLFFETESRS VARLECSGAI SAHCSLR 
LPGSGSSPASAS 


12055 


25956 


A 


12165 


239 


350 


GQARWLTPVIPALWEAKASKSL.EVRSLR 
LAWAGHGGS 


12056 


25957 


A 


12166 


235 


359 


TDKAI S KRDLS VLRE I IFFEMESRSVAR 
MECSGVI SARCNL 


12057 


25958 


A 


12167 


209 


396 


QFSGIPFDQYKCGFKNYIYIYIYIYIYI 
YIYIYRKYTRAYTKRYMCVFSRATHNIV 
YLCAYI 


12058 


25959 


A 


12168 


327 


130 


GGVGFLGGPGEKPPRPSPFFFFFFEKES 
RSVPQARVQWPDFGSLPAPPFGFTPFLR 
ESFLFSSLMI 


12059 


25960 


A 


12169 


317 


422 


RGEKPLFFFLQKKKKKKKKKKKKNIKKG 
GRRYKPS 


12060 


25961 


A 


12170 


260 


375 


EKVTCSWTQWLTPVI PALWEAKAGRSLE 
FRSLRSAWAI 


12061 


25962 


A 


12171 


304 


3 


KFFFFFKGFFFLGGVGPIFPPPKKRFFS 
KIPPGVFFFPPLKKKIFFFFPPVILGPP 
RVFFKGAPLFFFFFFFFFFFSETEFRSC 
CPGRLECSVAVSAHCK 


12062 


25963 


A 


12172 


105 


3 


LI FLRQS S AbLPRLE CNGAI SAHCNLCL 
TGSSD 


12063 


25964 


A 


12173 


146 


17 


RIFFFFFLGQSIiAVAQAGVQQRDLGSLK 
APPPGVHAILLPQEY 


12064 


25965 


A 


12174 


234 


2 


LFPRKPKPSVSLSLSLSLSLFRLRKLSS 
FMRHNNIEVRPINNPAMTSKCSSERKSL 
VS FTLNLKPGM IRLSE EGMLK 


12065 


25966 


A 


12175 


292 


380 


VRGLGRVAHTCNPSTLGGQSGW I TGGQE 
F 


12066 


25967 


A 


12176 


109 


1 


GRIKKVI KGQAQWLMPVI PALWEAKVGG 
SPKVRSSR 


12067 


25968 


A 


12177 


170 


291 


LINFYFYFLRQSHSVTQTGVQWCNLGSL 
QPPPAARRRRRG 


12068 


25969 


A 


12178 


260 


380 


LTYIALFI FLRRSLTLLPRLECSGMISP 
HCNLCLLGSSDS 


12069 


25970 


A 


12179 


54 


166 


PKRGFHRVTQEGLNLLTLGSPRLGLPKG 
WDHRQDPLP 


12070 


25971 


A 


12180 


273 


359 


APAGHGGSCLQSQHFGRLRQADHLGSGV 
R 


12071 


25972 


A 


12181 


27 


225 


IGQAQWLTPVILAFWEAKAGGSLEVRSS 
RPAWP I YFLVLLAS YLRRFCL I QGHSDL 
LYTSSTRGS 


12072 


25973 


A 


12182 


1 


155 


VHKIFIAYKYSGFQTVFRGRAWWLTPVI 
PALWEAKANRSPEVRSSRSGRRG 


12073 


25974 


A 


12183 


91 


2 


FNLFFFFFFETGSRS VARLECNGAI SAC 
C 


12074 


25975 


A 


12184 


122 


2 


GQAQWLTPVIPPPWEAKVGKSPEVRSSR 
RTRGRTRGRTRG 


12075 


25976 


A 


12185 


389 


20 


TDRGRRARRACFCGKVFDGELS FALKLA 
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REMGRPDWRAMLAGMSSTEYADWHRFYS 
TH Y FHDVLLDMH FS GLT YTVLS LF FS DP 
DMHPLDFSLLNRREADEEPEDDVLMQKA 
AGLAGGVLFLI 


12076 


25977 


A 


12186 


180 


393 


LNPGSGGCGEPRSRHCTLAWVTERESVS 
KNKI KIKINKQIEEGFLFQPFGQGFI WC 
YFPAKEFIIALSKT 


12077 


25978 


A 


12187 


37 


443 


PDF P I P FPPR KVQHLARPS PKTARPAWT 
IRRILPPPPKKRHPPGLARGKFGPRELA 
KAGLAKE CPAGG P T PAVR FATPTRGGGG 
RVAGFQTPMGI PMGKARLGLLTFLGFAS 
GWIGAYRPRETLCGGELVDTLQF 


12078 


25979 


A 


12188 


3 


411 


AFP ENAATG S T FQD P VPAS KGRAEVGNM 
RLSVAAPISHGRVFRRMGLGPESRIHLL 
RNLLTGLVRHER I EAPWARVDEMRGYAE 
KLI Y YGKLGDTNQRAMRMADFWLTEKDL 
IPKLVQVLAPRYKDQTGGFTRMLQ 


12079 


25980 


A 


12189 


2 


414 


QEFGTRKRATFI SLLFFFSSVYSRGVFR 
RDAHKSEVAHRFKDLAEETLRAL.TLIAF 
AH YXjHQC P FADHVKLGNEATE FAKT CVA 
DESAENCDKSLHTLFGDKLCTDAPLRET 
YGEMADCCAKQEPERNECLLQHKDD 


12080 


25981 


A 


12191 


3 


514 


PRLLMEAGPHPRPGHCCKPGGRLDMNHG 
FVHH I RRNQI ARDDYDKKVKQAAKEKVR 
RRHTPAPTRPRKPDLQVYLPRHRDVSAH 
PRNPDYEESGESSSSGGSELEPSGHQLF 
CLE YE AD SGE VTS V I VYQGDDP GKVS E K 
VSAHTPLDPPMREALKLRIQEE I AKRQS 
QH 


12081 


25982 


A 


12192 


182 


3 


RGLFFIPPPPIKTKIWGPFWKWFMGFG 
PPLFFFFEMESRSVAQAGVQWHDLGSLQ 
ALP 


12082 


25983 


A 


12193 


53 


400 


PETPSWLGPVRRFYFIIIKSVWKWKKKK 
KKKKKKKKKKKKGGGPFKKTLGGDKFYW 
GVKKKI FFFLGGS KKHPWGF FE KKLFFG 
GGKCGAPPPKDI SCLWGKKNFLGAIGEK 
TCCC 


12083 


25984 


A 


12194 


182 


3 


KGIFIQIAPPKKKKNWGPLWKWFKGFG 
HPI FFFFEMESRSVAQAGVQWHDLGSLQ 
ALP 


12084 


25985 


A 


12195 


200 


378 


KSSKSQWLGYLFIGLLIYLFIKFFLETR 
S CFVTQAG ILAHCNLKLLGS S DP PTSAS 
QAA 


12085 


25986 


A 


12196 


341 


54 


SLSFHGGLSVLCIFSTYKKLSQEQWLTP 
VISVLWEVKVSRSPQVRSLRPGDPPASA 
SQSAGITGVSHHARPHLSLNPRLEMIKL 
SEEGMLKRMTG 


12086 


25987 


A 


12197 


1 


462 


GGPPRPFRMKAAVLTLAVLFLTGSQARH 
FWHQDEPPQS PWDRVMDLATAYVDGLKD 
SGRDYASQFEGYALGKQLNLMLLDNWDS 
VTYTFSKLREQLGPVTQEFWDNLDKETE 
GLTHEMSKDLEDVNAKVQTYLYDFQKTW 
QKELTFTAESEPVP 


12087 


25988 


A 


12198 


80 


1 


SNKVFFLETESCSVAQAGVQWCDLGS 


12088 


25989 


A 


12199 


223 


3 


TMG I M LD KKS I QA I FS PKFKMGHAVAE T 
TCNIKNTSGPGTANKVTVQWWFKKFCKG 
DESLEDEEHHGRRLEV 


12089 


25990 


A 


12200 


239 336 


FGAL P I TTCKFGKKKKKKKKKKKKKKKK 
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KKKKKKKKKKK 


12090 


25991 


A 


12201 


279 


366 


KKKKKKKKKKKKKKKNKKKKKKKKKGFK 


12091 


25992 


A 


12202 


153 


40 


GFLFLI FFFFETKSRSIIQAGVQWCHLG 
CTLAWATETD 


12092 


25993 


A 


12203 


317 


406 


LGKLKCNGAVSGHCNLRLPGSSDFSGSA 
S 


12093 


25994 


A 


12204 


2 


925 


RSG PDHELTLDSGKGCTRS MV PASAAS E 
DRRKLPI IVEDEGGPTRSRACSSPARGS 
RPP PS A I GCS PVAQASDSAAGPARRTAL 
QSLSSWLGYQIDRHSVPVYVFKSPLFSV 
IMAPKHKSSDAGNLDRPKRSRKVLPLSE 
KVKVLDL I RKD KKS Y AE VAK I YGKNE S S 
IRE I VKKEKE I RAS FAVS P PTAKVTATV 
RDKCLVKMEQALHLWVEEMNRKRVP IDS 
NMLRQKALSLYQDFSKGCSETDTKPFTA 
S KG WLHR FRKRFS HHY KKKKKG IMAQVA 
VSTLPVEEESSSETRMWTFLVSALESM 


12094 


25995 


A 


12205 


272 


1 


KMARAG LLV I EGKVWRTV Y YRFATREER 
EGKM S TNLMN KLDT IG FDN KKDLL I SVG 
DLVDRGAENVECLELITFPWFRAVRGNH 
ERQPGO 


12095 


25996 


A 


12206 


166 


478 


ISASGLLPTSPLTGTSKLQDPNEHLNLL 
MLNRVSLLLPRLECNDTIIiTUTOILRLPG 
SSNS PDS AFQVAGI TES FTCE INALKDS 
SQVALWS FRCEMDHSS ICNR 


12096 


25997 


A 


12207 


102 


497 


PLLAIiL ?GS Q KT FLCLQLSL KP TRS S LL 
SPGSAGNPENEAPCPQLNPEATSLKKKK 
KKKKKKKKKKKKKKKKKKKKKARG 


32097 


25998 


A 


12208 


131 


2 


FGFLGGDKAKGKGPGPPFFFFFETESPS 
VAKAG VQWCDLG S LH 


12098 


25999 


A 


12209 


221 


499 


QTRHKLTNSRSGRTGLQILIKEQHSSGR 
WCGTGDILWQSLEDTICYSVPKSLLDFL 
LFIYFETESHFVTRLECSGAISAHCNLR 
LLGSSDSPA 


12099 


26000 


A 


12210 


297 


398 


HNFFF FLXTES CS I SQAGGH W YDLS S PQ 
PPPPG 


12100 


26001 . 


A 


12211 


341 


452 


NWSPGLKQSSRLSLPKCWDYRREPPCLA 
GLLAYLFRR 


12101 


26002 


A 


12213 


214 


468 


FFFFFFWGKGVFFFFPPLEGRGGNSVYW 
TPPPRGKXNFPPLGLGKMGNNPPPPPPR 
FFFFFFKKKGVFFLGPGGVKTPSLRETP 
P 


12102 


26003 


A 


12214 


223 


477 


CPTETQLQLIFAREKHKPSPVLLDLDIiY 
GFFFEMESHSVS PLECSGPI SAQSNLYL 
LGSHNSPA 


12103 


26004 


A 


12215 


263 


55 


T P KRYG KMQTAT YL P KSK I Y 1 1 ISDLRK 
QDYI VYKQSKKPEGWAQWLTPVI PALWE 
AKAGRS P KVRS S R 


12104 


26005 


A 


12216 


328 


471 


GVIHAQVLHAHTHTHTHTCTHTQVHAHS 


12105 


26006 


A 


12217 


434 


1 


WDHSASPPTSVGTPLRADGPHPRLTAPG 
HWGS S AMLS APAAGRRGLS LGR WCDRD 
KLSQRGPQPKWILWWGNLFGGGFLHRHI 
HKPLPTI PIPSTVRGEEVLQQLLHSASF 
I LFIFI FLDRVS LCHPGWS AVARS RFTT 
TSTS 


12106 


26007 


A 


12218 


304 


43 


EKPIGVAPSRVKKRAGG3EKTECCLLKG 
PLQGFQGVLKGPLKLKTGPLFFFFFLRRS 
LALFRLE CNGT I LAY CNLCLLG SSDSPA 
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C=Cysteine, D=Aspartic Acid, 
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SBI 


12107 


26008 


A 


12219 


171 


365 


ALVQPRAGREAVTVPACCSCSDVFLYET 
NKVARI QSIN YGTI KFPHVI VFSYVSKE 
HLKRKLEK 


12108 


26009 


A 


12220 


171 


452 


ARYKFLHPKFAFGLYPSSSQIQGARHQL 
WILI KCSFYLLFLRQCLTLLPQLECSNM 
ITAPLQPQLPGLKQSSKLSLPSSWDYRH 
EFP 


12109 


26010 


A 


12221 


323 


458 


FRVQRSLFLFIYFFETESHPWQAGVQW 
HDFGSLHRISP 


12110 


26011 


A 


12222 


189 


56 


DRVSLVQPTLECSGTISVYCSLNLLGSS 
DPPTSASPVASTTGGI 


12111 


26012 


A 


12223 


246 


494 


FLCSSYAYDDHFLRPFFPIHLILDITAF 
SQDT 1 I SNLLFQDACSVP KKAANLGQAQ 
WLAPVI PVLWEAEVGRLLEVRSSRPAW 


12112 


26013 


A 


12224 


55 


353 


RIQGCSCWVKLCPKVEKWVQRTDAEQES 
QTKAE I QDMKQELSAVNMMDKFARSARL 
ERKINKMMNKLKTHVKVQTAQSGMLKWV 
ISVAFYKLPGTVIRL 


12113 


26014 


A 


12225 


199 


23 


LKLS S I Y YVPGT I LKYFTGW AR WLLP V I 
PALWEAEVGGSPEVRSSRPANSTSWIER 
HQ 


12114 


26015 


A 


12226 


369 


14 


DCVRVGLSYPQTVCPSCESVRKAPFSSR 
PHRGVLSSGQPRCKSRRVDI FFSMERER 
DRERERERERERERGQKRTRRVWGGERE 
KLGYSHAWGWRGGRGLGWGGMCGI PPPC 
TSDVHQ 


12115 


26016 


A 


12227 


206 


2 


QNGHMNGGPCRVSLSLFLETRSFLFFLA 
SLSMLECSGAILAHCNLRLPGSSNSPAS 
ASQVAG I TGACH 


12116 


26017 


A 


12228 


231 


501 


RSWEGQALHGSDPLASCTRIQSNYMALQ 
RINQELEDKLYRMGQHYEEEKRALSHEI 
VALN S HLLEAKVT I DKLS E DNV S AAHT W 
PMEDCR 


12117 


26018 


A 


12229 


86 


455 


DLPTFPVPPHPGFSGPLLGQSEIiPSNCQ 
TGGRSHLSVSVLCPTLATTPLGLDFPTC 
GTSQLPLGPLQLAHRSELAGLSLPGLST 
NCTSQRLPDRPHRHSDAPHTHTHTQTHT 
HTHTHTPATVT 


12118 


26019 


A 


12231 


211 


96 


KSHFKAGGGGPPRYS PPFGGGGGGVFRV 
GGFPPPGLTQ 


12119 


26020 


A 


12232 


228 


3 


KRWFIKGFLSKGLGPPPPKLKFLGFFFF 
FFFEQSFVLVAQAGVQLECSGTI LVHFN 
LYLPGSSNYPASASHPRV 


12120 


26021 


A 


12234 


137 


3 


VPLPGTliKNHSMVPCFLFVCLFVCYETE 
SHS VAQAECSGAI SAH 


12121 


26022 


A 


12235 


139 


1 


FKLCFGQAWWFKPI I PALWETDASRSLE 
VRSSRPADAWADAWADAW 


12122 


26023 


A 


12236 


80 


1 


VLLI FIFLEEMGFHHVGQDGLDLLTL 


12123 


26024 

j 


A 


12237 


187 


494 


TERNS FNINKKDIHTKTPFVGYQHQRPK 
VDKTTKMGRNQSRKASNSKNQSTSSPPK 
DRSSLPATEQSWTENDFDELTEVGFRRS 
AI TNFSE LKEHVLTRRTE 


12124 


26025 


A 


12238 


252 


503 


CQGVGADDLQSQAEIQCELCAES IDRFD 
ILFFFFLGLGPPHKNPPPPTPQKWKPGK 
APPLALWD I PRG AKG PGRQ PGR P AI HTC 


12125 


26026 


A 


12239 


2 


471 


RIALCPAVRIRHEERERERERERERETP 
GKTQLS PGARRTP PLERERAFNDCFSliN 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=A)anine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














YPGNLCPGDLIEVFRSVYQHCALYLGDG 
YFINITPVYGIPAFFTSAKSVFFSRALE 
KTHLLKDWGSDTYRINYICYETLPSLS 
VEEI ITRSEFVIGQEA 


12126 


26027 


A 


12240 


14 


149 


FMTACRIRHEMGRGLLRGLWPLHIVLWT 
RIASTIPPHVQKSALPH 


12127 


26028 


A 


12241 


588 


671 


RSRPSFQEQAFESSQKYKEGKYI IELNH 


12128 


26029 


A 


12242 


384 


464 


RGGQVWWLKPWPALWEAKAGRSLEAG 


12129 


26030 


A 


12243 


325 


459 


TYLFIYLFIYFETESCSVAQAGVQWRDL 
VSLQPLPNSPALPTN 


12130 


26031 


A 


12244 


323 


457 


SLQRECGPADTFFEMDSCSVAQAGVRWC 
DLGSLQNSP 


12131 


26032 


A 


12245 


241 


466 


GALFFFFFFFFFFFFFFFF FFLSNKAV Y 
F 


12132 


26033 


A 


12246 


385 


37 


QARRRCLQTSRMEALGRWS Y KVT I EF P L 
QI LFFPP ILKYKVI FG I KITPNFHFLEE 
YEEGQRLGKSTVSWVFNEDTRLINIWNR 
HITVNPRTNSQNRMLKMAHESKYSQVPF 
PLQA 


12133 


26034 


A 


12247 


318 


456 


L I LK Y FMYVCM YVHM YVOVJ Y I KS CSVAQ 
DG VQW YDLGSLQ P P P PGL 


12134 


26035 


A 


12248 


362 


473 


S PCQNNMN S AKTE ART N I KFMAKLMWKN 
GAIIDALQK 


12135 


26036 


A 


12249 


310 


463 


FSIKTRKNALGKLQSLLRCHQFYLYVCV 
CVCVCVCVCVCTRTYAILTCIAV 


12136 


26037 


A . 


12250 


169 


54 


EAKSGQARWLTLVI PALWKAKAGG CPE V 
RSSRLAWPIW 


12137 


26038 


A 


12251 


49 


449 


GIPATSTSCVQVILLPQPPLLIiGLQDSF 
LPQEIIIKVEGEDTGSLTIPSQEGVNFK 
IVTVDFTREEQGTCNPAQRTLDRDVILE 
NHRDL VS WD LATAVG KKDST S KQR I FDE 
EPANGVKIERFTRDDPWLSSC 


12138 


26039 


A 


12252 


130 


428 


RWGLAMLPRLVLNSWSQVILPPQPQVGA 
TFGG F PNP PGHWKLLGG I SSGEEAG VEE 
AEEGQALGFLGQFPPSSPQLLLQTSWCIi 
RTTRTCRWSLLWKAH 


12139 


26040 


A 


12253 


204 


54 


PGMHKTQKSDLGQVQWLTPVILGLWETE 
AGDS PE S RS S RPAWAKHKI VR 


12140 


26041 


A 


12254 


375 


470 


SLCLFFSFFFQTEPRCVAQAGVQWRHLS 
SLQP 


12141 


26042 


A 


12255 


432 


512 


GQVRWLTPVIPALWEAEGGRS PEARS S 


12142 


26043 


A 


12256 


232 


1 


KGALSKKTQSFWGGQKFFPRPGVFKGVF 
FWGGGFCFFVFGGFFFFFFFFETESCSV 
AQAGVQWCSLGSLQPLPPRFK 


12143 


26044 


A 


12257 


22 


123 


RFVCSTI KVLRDLSSDRSNPGRFLSTSN 
SSSTRP 


12144 


26045 


A 


12258 


71 


362 


QTERNS I S I NKKDVHTETPSKGHQHQRP 
KVDKSTKMRKKQRKKAENSKNQNASSPP 
KDHNSLPAREQNWTENEFDELTEIDFRR 
WITNSSELKEHV 


12145 


26046 


A 


12259 


375 


3 


SPNIEAPPKAFFFFNQRHFVFCFFSCRA 
ENTHEIVWVKVYPFSNQSKAKLFFSFFV 
QRDRSSPDSYLMPLQLQMWDTAGQERFR 
SIAQSYYRSANALIIiAYGITCEESFRCL 
PEWLRQIEQYA 


12146 


26047 


A 


12260 


98 


1 


KYQIDLGGRGSSNS PAS ASRVAG I TGTH 
HHTQ 
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SEQ n> 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine ? M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


12147 


26048 


A 


12261 


125 


3 


LFLIQKYITGQERWLTPVIPAPWEAEAG 
RSPKVRSSSPAW 


12148 


26049 


A 


12262 


228 


385 


S CL YLQM I PS S I E KS QGM Y KKATRGRAR 
WLTPVI P ALWEAGAGRSPEVRS SR 


12149 


26050 


A 


12263 


227 


369 


MLKH KSNKNLQDLGQMQGLTP VNP VLWE 
AKVGGS PEVKNSR PAW PTW 


12150 


26051 


A 


12264 


102 


487 


QQNRFFGTERGRGTGRGVSSFPQLSPGS 
LELASSPSPSVGRLGSAASQRSRGGQER 
PKAQLRRPPRPSSSPRLRGRENCAHHAR 
TPGGILDRPPPTLGTNLLKGGRPAWMAG 
NPPEAHKTANGPGEQA 


12151 


26052 


A 


12265 


391 


476 


PRS YRVECSGS I SAHCNLGLLGSSNPSS 


12152 


26053 


A 


12266 


378 


476 


MSSRLGSLEVGILGRAWWLTPVIPALWE 
AKEGG 


12153 


26054 


A 


12267 


166 


58 


NFFFFFFETDSCSVAQAGVQWHDLSHCN 
LCLPASSN 


12154 


26055 


A 


12268 


209 


49 


TPPNFYIKISRSGQAQWLTPVIPALWEA 
EAGRS PEVRSSRPAS PTRRNS I SNS 


12155 


26056 


A 


12269 


127 


1 


SSSFFPLKTFGFLFFFLCFFESESRSVA 
QAGVQWCHLGSLQA 


12156 


26057 


A 


12270 


154 


27 


IQTPGKKEFFFFFFETKSCLVAQAGARL 
CPKKKEKENSNFVL 


12157 


26058 


A 


12271 


180 


22 


VLKIHNIGRAQWVTSVT PELWEVKAGGS 
QALRSSRAARTRWDACNFVEKTQVY 


12158 


26059 


A 


12272 


207 


422 


GDSSTRGADEKPKEGVKTDNNNLINLKV 
MGQDCSMEQFKIKSHI PAGYGGSHRESQ 
HFVMPRWADHLRSGV 


12159 


26060 


A 


12273 


59 


479 


NSLGGGGVYGSRFRFTFPGCRALSPWRV 
RVQRRRCEMSTMFADTLLIVFISVCTAL 
LAEGITWVLVYRTDKYKRLKAEVEKQSK 
KLEKKKETITESAGRQQKKKIERQEEKL 
KNNNRDLSMVRMKSMFAIGFCFTALMGM 


12160 


26061 


A 


12274 


354 


439 


VKTGE CS I AKTWNQ P KCPTT I DW I KKMW 


12161 


26062 


A 


12275 


366 


499 


TRAQYFWWGIGSQCVTLECSGMIMAH 
CSLDLLGSSDPPTSAS 


12162 


26063 


A 


12276 


136 


37 


KFPGQVHWLKPVI PALWE VKAGGS PEVR 
SSRPA 


12163 


26064 


A 


12277 


419 


48 


DGKLNVS FLTVSS SGCFLI FLLIVRPPY 
FLRYNNSEIRPINNPTMTCKCSSKRKSR 
ICLTFNQKLEIIKLSEEGMSKIKTGQNL 
GILCHVSQWDEQHKFVKATKSATAKIN 
LSISKYIMSGA 


12164 


26065 


A 


12278 


190 


1 


RKGVFSPQMGPGLGKGKSPFLAKLNFPF 
KKTFFFFFETGSCFVAQAGMQWLDPSSP 
DAWADAW 


12165 


26066 


A 


12279 


276 


389 


WFNLSFFCFCFLFFETESCSVARLECSG 
AISAHCNLT 


12166 


26067 


A 


12280 


425 


1 


PPHPPPGKPPPPPRPELFFPKTKNKKER 
VLFFS P P P KXNFFSLTT PRFFFS PPQKK 
KKKLNPPPQKGPPPPVPIKPPPPLFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFPFFFFSFGLTVRVIMTCWLIVDIGLL 
I 


12167 


2606S 


A 


12281 


253 


90 


IWPPQGSFKTAAPFFFFFVFLVEMGFCH 
WQAGVKLLTS S SP PAS ASQTAE KLF 


12168 


26069 


A 


12282 


61 


387 


PKSSSSAPFQISSWFSLPASTSASTMSI 
RVTQQPYKVFNSGPWAFSSSSYMSRPSA 
KISSLPvSRVGSSSTSFQGGLGAGMGIjP 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

<JIIIIIIU «It.IU 

peptide 

OCIJ UCI ILC 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 

/linn if\ 
11 III g iU 

lief aniinA 

«fIIIIIJU 

acid 

rfQiflup nf 

1 Lj lUUv VI 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Pbenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M==Methionine, 
N=Asparagine, P^Proline, 

V£ V»1U 1*1 IIIIIIC, -TV — /-VI gllllllC, O — CJCIHIc, 

T^ThrpAninp V=Vnlinp 
x j it 1 cv 11 1 lie j ▼ t an iii;, 

W=Tryptophan, Y=Tyrosine, 

^» 1/III\1IUYYII, OlUjJ LUUUMj / JJLIiilLIie 

nucleotide deletion, \=possible 
nucleotide insertion 














liHSSLANRARLSQKKKKRGGRS KEP 


12169 


26070 


A 


12283 


186 


305 


LHKSTVQVREEEHKSLHTQVHTHTHTHT 
HPHLVIMSSHS 


12170 


ZUU / 1 


A. 


17704. 

J zzot 


7 
z 


ton 

iy\j 


PRVRFDRVNNRS LNPL I SRTMRI E P I PE 

X XV v AVI l-'lv V XV w m t V XJ JL X I Jiv, JL. X-J X JL X Hi 

NPKFSVKKKKXKKKKKKKKKKKKKKKKK 
KKKKKKKKKGGGA 


19171 


zou /z 




J ZZo J 


1 7 




SLGVGGGOGGGGVHFHPCKPFIVL^PV^ 
LSKHCSYLPKKKKKKKKKKKKKKKKKKK 
KKKKKKKKKKK 


1917? 

JZ J /Z 


7A07^ 

ZOU 1 0 




177R6 
J zzou 


I JO 




GVA*? YAEORKYFTiEMRYFPRnFAT ^ TVF 

vJ V iiiJ X iXLiy/i\I\i J. 1. ,1 J 1. j I'll j X .17 X VJ VJJ J_jX* JL iJv JX V £0 

MTTEDLHYS INLDDKAPTGFERVYSHFE 
RTCTDDKMLS YC I TCYRE I FCERNKL 


12173 


26074 


A 


12287 


364 


142 


GCAFFLGGPLKERRKRRAVPPLFFFNFF 
FETE S CS VTQ AG VQWCNLGS LQPLL I DW 
WGOPGTVAHACHPGTLGG 


12174 


26075 


A 


12288 


171 


380 


VFEAGHAWLQENLTSPDFWGFS FFPTDT 
GMLEKKKKKKKKKKKRKKKKKKDR 


17 ns 


ZOU/O 




177CQ 
IZZO7 


104 


7 

z 


GWFFFFPKRGFPFPSFLFFFDRVWLCHP 
GWISVARS 


12176 


26077 • 


A 


12290 


179 


416 


LIFLFTFACAVCFQLARFHFKIHIETAL 
RCLI I LSCPFTHPLYKKYREGQARWLTP 


12177 


26078 


A 


12291 


3 


408 


LAYG FHDRTT YDT P S ALMMVLLVDEL I T 
GNKNGSGEAPEYLPEDFRDGEDDAAVTL 

iZ> iw^JlZi LJXj l l Jj V J. Ltir v J. JJoiinyKyKCj J. 1 » r- 

AKLLKKRLELGSLLDLVBDLELI IQLKK 
KKKKKKS PLiWGAP FKGPT.GAP T 


12178 


26079 


A 


12292 


119 


278 


PKSIEAGVKSVLDHPLPNKAKTHLSCKK 
LKKKKKKKKKKKKKKKKKKK KKKK KKKK 
KKKK 


12179 


26080 


A 


12293 


119 


3 


MLFTGFHTKVMSLNSITGWAWWFTPVIP 
ALWEAEVGKS 


171 Rrt 


zOUOl 


A 


lZZy4 


JAjj 


1 


TT,W7?Y c i r FFTKOPl?'f?T .PT,LT.l?V<?nrFnPlf 

X LiTLC luiljl l\ylr £\.V7J-IC 1 iAA ) 1 r 2 C5 ±J V^Hj iy{ Hi I\ 

RKGGKLKKIFKKAQGQARWLTPVIPAFW 
EAKVGRSPEVGS 


1 Z 1 O J 


ZOl/OZ. 




1 £.£7 J 


71S 

Z J J 


1 
1 


AFKI KT.TMHT FSSPLCT.PnDFFFrPFTFA 
LLPHVRAIAYTWFNLQARKRKYFKKHEK 
RMSKEFHGARLEGNQ 


12182 


26083 


A 


12296 


185 


411 


D VP LM FHW LHLHS D FRI I S TE I G VET VG 
KRVLHIRILLLLLLLFFEMESCSVTHAG 
VQWCDLGS LQAP P PGFTP F 


12183 


26084 


A 


12297 


281 


388 


RILFFFFFETKFCSVPQAGVQGRDLNSL 
QPPPPGFM 


171R4 
JZ tot 


zDUOJ 


j± 


177Q8 


1 7R 
I /O 


-1 
3 


TT ,T ,9 VTOV P T T . PR ^ A ciPT.T ,V^Tl KTl T V T 

PTSVFFSFQTEPRSVTQAGVSWCDLGSL 
QP 


12185 


26086 


A 


12299 


199 


450 


TLIRHMICNYFLLSHRSPFSSVDYFLCY 
REIYSLIKTLNKLGIGGFFLNI IKVMYD 

V PAA Q T T P Q QPT^riTf a PDT ,P Q<TTP T P P PT 


12186 


26087 


A 


12300 


3 


407 


EFKDHSTAMDTEPNPGTSSVSTTTSSTT 
TTTI TTS SSRMQQ PQ I S VYSGSDRHAVQ 
V IQQAIiHRPPSSAAQYLQQM YAAQQQHL 
MLHTAALQQQHLSSSQLQSLAAVQASLS 
SGRPSTSPTGSVTQQSSMVQTSI 


12187 


26088 


A 


12301 


227 


3 


KRGTEGQTLFFPAPRARKSFPPPKMPKK 
VEKEFPKGFFFFFFFETESAPSPRLECS 
GAISAHCSlaNPHASAHAS 



1280 



WO 01/64835 



PCT/US01/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Iso!eucine, 
K=Lysine, LHLeucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


12188 


26089 


A 


12302 


292 


380 


KCYRWARWLMPVI PALWEAKAGGSLEPR 
N 


12189 


26090 


A 


12303 


13 


186 


VDERH IRLSEMAWVALYLLSLLWATAGT 
STQTQSSCCEYDVFPRRSGKQLPWGRGC 
DL 


12190 


26091 


A 


12304 


253 


409 


ITAVIAAAAAWGVTIIMTNFRSLPRIiE 
CNGAI SAHRNLLLLGSSDSPASAS 


12191 


26092 


A 


12305 


153 


64 


GRAQWLMPII PALWEAKAGGPLDVRSSR 
PA 


12192 


26093 


A 


12306 


265 


420 


GTLMSRNTKLQLCRMNKSRTLGQPWWLT 
PVI PTLWEAKAGGS LEARCSRPAW 


12193 


26094 


A 


12307 


295 


1 


RPCLKNKKTSPHWKKKFANPLFGKAPVF 
PLDKKPFKPHYRESPRPLKKKNGPKGFF 
FFFFETE SCS VTQAE VQWRDLGS LQAP P 
PGSYHSPSKLIIKN 


12194* 


26095 


A 


12308 


247 


421 


SWCCLKKKKKKKKKKKKKKKKKKKKKKK 
PGGGALKKKKI FS 


12195 


26096 


A 


12309 


168 


3 


GQ ALLLLCLTVAFS KTTWCHAP I LGWA 
WWLTPVI PALWEAE VGRS PE VRRSRP 


12196 


26097 


A 


12310 


117 


3 


VAYNIAVFRGGVSLLLFRLECSGVISAH 
CNLRLPCSS 


12197 


26098 


A 


12311 


256 


101 


EP PTP FFFFETGS CS I TQAGAQRRDHS P 
PQSPTPGLSNTPAPASRACESGR 


12198 


26099 


A 


12312 


146 


3 


FVALCIQLFPLVNVI CFHKKHKTGRARW 
LMPVIPALWEAKAGGSPEV 


12199 


26100 


A 


12313 


207 


3 


VRVRTTGVIMALRGLCFSVEESMTKDEL 
IARLRSLGEQLNRDVSLTGTKEELALRV 
AELKQELDDSR 


12200 


26101 


A 


12314 


282 


2 


NCLAE KVKFIWS AI S LQPQG I S I LFLS F 
FFEKESCSVPQARVQWPNFSSLQSPPPG 
FKLFFCLSPGGGGCSEPKSCRCIPGWQQ 
NETPSQKKK 


12201 


26102 


A 


12315 


298 


84 


KTPQPPVFALSKNFGPPKTAPIFFFFFF 
FETKS YSVAQAGVQWRDLGSLHS SLGHR 
ARLHLKKRKRKS INL 


12202 


26103 


A 


12316 


124 


1 


SKGHFFFFFFSETES CSVAQAGVQWHNL 
GSLQVPPPRFTPF 


12203 


26104 


A 


12317 


132 


398 


WTNAVFS I VTFS PS VCHS E VALAAYKW 
LVCYLLRETYQKLNQEIKPPTLSVPKKK 
KKKKKKKKKKKKKKKDKKK 


12204 


26105 


A 


12318 


262 


392 


PH YCQNQ PQARR S GS HQH FGKQRREDHL 
SLGVRDQPGQYSETP 


12205 


26106 


A 


12319 


168 


3 


EAPKAISNCTTALQPRDQGKNLSQNKEG 
QVQCLTPVIPALWEAKVGGSPEVRSS 


12206 


26107 


A 


12320 


48 


385 


DIX3LSGHSTLAS S F I S LLLLS YKKCLS P 
STKIMRLPLPCGSPIKPPFLPKKKKKKK 
KKKKKKKKKKKKKKTGG 


12207 


26108 


A 


12321 


218 


83 


NLS P I FPEGNFFFFFFEMESRSVAQATV 
QWRDLGSLQAPPPRRKF 


12208 


26109 


A 


12322 


197 


1 


KETWPFPPKKEQGLFSFFKKRIFFFFSK 
KIPSPPFFFFFFFETESCSVAQAGVQWH 
DLGSLQAPP 


12209 


26110 


A 


12323 


128 


36 


KGLFFFFEMGVSLLPRLECSGTTSAHCN 
LP 


12210 


26111 


A 


12324 


140 


3 


ILNSNLKPIKKNTFFFETESRSIAQAGV 
QWHDPSSKQPPPLGFKR 


12211 


26112 


A 


12325 


273 


369 


RKEWFLKKKKKKKKKKKKKKKKKKTKKK 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=€ysteine, D=Aspartic Acid, 
E-Glutamic Acid, F=Phenylalanine, 
OGlycine, H-Histidine, I^Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














KKK 


12212 


26113 


A 


12326 


140 


415 


RTGLLGPGRLQGPPYGSREYKGSGLLDL 
NASLLDPEVWAPSLSGPETKP PAHS PSD 
PGVHSSCSASQGSQGPGPLTPSSRIHEP 
KPPGPFFK 


12213 


26114 


A 


12327 


2 


506 


FVPLSETEAPGCSGSRPPELEPERSLGR 
FRGRFEDKDEQLEEESELEEEEEEEEED 
MSHFSLRLEGGRQDSEDEEERLINLSEL 
TPYIIiCSICKGYLIDATTITECLHTFCK 
S CI VRHF YYSNRCPKCNI WHQTQPLYN 
IRLDRQLQDIVYKLVINLEEREKKQMKD 


12214 


26115 


A 


12328 


1 


419 


FRVPSCARCQYLPLLRGASQLGYSGVRD 
P VEEAVCP FSDLKLCAGRTTTLFKAVRQ 
GHLSLQRFLLPFVWLCPAPRSGVYRGRQ 
AS LS CGGLH PVGAS WPRCLPTQASAMAG 
TPPPASLPPCSLISDCCASNERGSMGV 


12215 


26116 


A 


12329 


259 


414 


NMENSLRCVWVPKLAFVLFGASLLSAHL 
QVTGFQI KAFTALRFLSEPS0AVT 


12216 


26117 


A 


12330 


302 


404 


SCYRVSVISQARWLTPVIPALWEAEVGR 
IPEVGS 


12217 


26118 


A 


12331 


94 


375 


S RDMS PG LLTTR KEALMAFRD VAVAFTQ 
KEWKLLSSAQRTLYREVMLENYSHLVSL 
G I AFS KPKL I EQLEQGDEP WREENEHLL 
DLCPGWSAMA 


12218 


26119 


A 


12332 


7 


246 


N P I VD P S PCGG I R VRTP AGRGG PALRVR 
PETWEEAGEKMPSESLCLAAQARLDSKW 
LKTDIQNCFITRKISLLPLFCHHL 


12219 


26120 


A 


12333 


150 




YIYFFHLVRGLPGSGYDEYFVWWNTE 
SPSVTQAGVQWCHWSVQHKPP 


12220 


26121 


A 


12334 


34 


426 


E PG F L FVF FFL I RG KKTPL F FWG TRKNP 
KSGKPLFTPPGKSPSEPPKSRGGPLFFQ 
TRSRLTPQSWFGGAPKPPLGGEPPKFWE 
AGLGGDP PT PRAPHRADQTLEQP I QAPT 
RIIQNEQWAIKPFQPMPGG 


12221 


26122 


A 


12335 


185 


3 


TVITPLHFSLGKRARLWSRVPAPPAENR 
SMEGGLGRAVCLLTGASRGFGRTLAPLL 
ASHE 


12222 


26123 


A 


12336 


60 


316 


GWGP I PSTDVTVLS PQKP I LLQGHERAI 
TQIKYNREGDLLFTVAKDPVSVGWRGSG 
RGGILLPGRWVDTPVLPLAGNQRASHCE 
K 


12223 


26124 


A 


12337 


3 


167 


SFRIQVQGNHTSKHYPKIQYSDQAQWLT 
SVIPALWEAKTGRSLEVRSLRAFRAP 


12224 


26125 


A 


12338 


238 


401 


RPLS VMCGRLTS I RVEDSGARCW FYLF I 
YETEFRSVAQAGVRRRDLGSLQTPPSG 


12225 


26126 


A 


12339 


354 


433 


GWAQWFMP VI PALWEAKAGVS PE VRS 


12226 


26127 


A 


12340 


238 


382 


LLIVYNMMLLPDAVAFTCNSSTLGGQGR 
RITRGQEF3TSLANMVKPHL 


12227 


26128 


A 


12341 


101 


1 


KTKQTNKKTRSVGQT WWFTP 1 1 PAYWEA 
QAGEL 


12228 


26129 


A 


12342 


276 


23 


GSCLLEGKLTNRKDIHTKTPSVRHHCQR 
PKIDKTTKMGRKQSRKAENSKNQSSSSP 
SKECS SLAATEQS WMPHDFDELREEGFR 


12229 


26130 


A 


12343 


122 


1 


YMGVNERGCGQS I FKSSLSSQLWWWAPI 
IPATQEAEAGEW 


12230 


26131 


A 


12344 


180 


360 


LHWYFFSVGDFLPPPPPPLDDSSALPS 
ISGNFPPPPPLDEEAFKVQVRAEVKVML 
GKS 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Gtycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
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W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


12231 


26132 


A 


12345 


35 


411 


TMTIVDKASESSDPSAYQNQPGSSEAVS 
PGDMDAGSASWGAVSSLNDVSNHTLSLG 
PVPGAWYSSSSVPDKSKPSPQKDQALG 
DGI APPQKVLFPSEKI CLKWQQTHRVG A 
GLQNLGNTCFANA 


12232 


26133 


A 


12346 


278 


386 


IIYCYKQTIIGQARWLTTLIPSLWEAKV 
GGSPEVRS 


12233 


26134 


A 


12347 


111 


2 


KSQTGQVQWIiTPVI PALWEAEVGGLLEL 
RSSRLAWA 


12234 


26135 


A 


12348 


215 


316 


LNXRLMI SFlKKKKKiCKKKKKKKKKKKK 
KKKKKKKGGPL 


12235 


26136 


A 


12349 


265 


404 


SNISYFLNNNFSKALSLNIKYMISQARW 
LTP V I PA1»WEAKVGGLPE 


12236 


26137 


A 


12350 


372 


1 


PVFPLPPKKLGEQLPPPALRFXiAVSPLP 
KAAHEQEIKEKVLAVHKNPIDPVYGFPK 
KGPTNFFLFPLFQRVIFLGAQKGVSWFG 
FYVKGLSLAPKLGGPPFFFFFETESRSV 
AQDGVQWCDLGS 


12237 


26138 


A 


12351 


318 


416 


QGRAQWLTPVILTIiWEAKAGGSSSVRSS 
RPAPP | 


12238 


26139 


A 


12352 


144 


1 


NGYAIVECWLFCFVLFLRQSLSVAQAG 
VQWCNLGSMQPPSTSLVQA1 


12239 


26140 


A 


12353 


116 


1 


SGRYSFKKMKCGQERWLASVI PALWEAE 
VGRSPEVRSL 


12240 


26141 


A 


12354 


102 


2 


ETRYKKQPGGWERWLTPI IPVLWEAEVG 
GSPEV 


12241 


26142 


A 


12355 


112 


294 


LISPSCPMWPSPHGPTWLSRLISQHSPS 
NLKKKKKKKKKKKKKKKKKKKKKKKKKK 
KKKKKKKRGGGAF 


12242 


26143 


A 


12356 


138 


2 


SFFLKVFFFFFFEMESRSVAQAGVQWCN 
DGSPQAPPPGSRHCPVD 


12243 


26144 


A 


12357 


148 


394 


PGEP PGE KGEGGKGKKPGGRG WGEP PCR 
GGFKEKFPRGKEKAPPLKENSRENLGWG 
TKI PPLGKKKPPPPQEQEM I LGPPQF 


12244 


26145 


A 


12358 


97 


3 


EKFSPCFVRARTHTHTHTHTHTLSLSLF 
GFH 


12245 


26146 


A 


12359 


204 


2 


KKYTPAACFAPCLPDEAPI I AAAKPATT 
TSEQKMAVPPKYANLGKSARNVFNKGYG 
YGLINLLKTKS 


12246 


26147 


A 


12360 


156 


391 


NRGLFXVEESFS I PCS VRCS I HPSALVS 
PTPPTTDTTNWALFFETESCSIHQTGMR 
WRDLNLLQPLPPGFKRFSCLNL 


12247 


26148 


A 


12361 


179 


1 


ALKKLLI I DDNSHKTN1VKHFSFRNFTF 
LFLLETESRS IARLECSDANPTHCNLRL 
SGP 


12248 


26149 


A 


12362 


218 


391 


T I LKDCTFKM I CLSQAQWH I P WPATWE 
AESRGLLEPTS 


12249 


26150 


A 


12363 


2 


356 


TNSHVDNSITQKPEFCIPRCCGYIVCQL 
QLVERAFIFNFFFFFLKRSFVLLPRLEG 
RGAIFRVTQEGSNLLTLGPAPLGLPKGW 
NYRGDHPGPAYF YFLMGKRLLQ I QGGRG 
LKRNLL 


12250 


26151 


A 


12364 


94 


1 


KNPPLrFFFFFFFESESCYVAQTAVQWHD 
LGL 


12251 


26152 


A 


12365 


237 


2 


YSSLLFKPAESAKGILRHHVTPKSTSAI 
SKILFCFLFFVETEFLSPGLKCSGAIKV 
HCSLNLSGSSNSPTSASQVPGS 


12252 . 


26153 


A 


12366 


340 


442 


GL»FVCFFVWFLETESCSVTQAGVHWCDL 
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SSLQPS 


12253 


26154 


A 


12367 


208 


45 


EGGEFFFFFFSVFFFVRAGVLPCCPGWS 
ETPGS SDPPALASQSAG I TGATVNST 


12254 


26155 


A 


12368 


60 


443 


CS S CVS SKPQS PHLKKQGDRMGAHLI S G 
GCTEATAGKGMRKCLVAWSTCAGARPPS 
RNRGSQGHSARFQGPRLPTWRLPPPPP 
QAKPFCKKWMLGGP IS I EPKPQHP PGPG 
KKKWAGKAAPRLVPT 


12255 


26156 


A 


12370 


341 


421 


SGWVRWLTPVTPALWEAEAGGSPEVGR 


12256 


26157 


A 


12371 


280 


393 


NT I DH YVTQWLTPVTRTLWEAEAGRS P E 
VRSSRSAWPT 


12257 


26158 


A 


12372 


268 


399 


TLCSLSSAESI IHSYPBFSCKDNYLMTH 
NEAY KACMRRAFH I PL 


12258 


26159 


A 


12373 


209 


468 


EAVPDETSHPSPADSFSTSSNLACGTHH 
LRRLLKLYVEEELEYYPRKIQCFHFSFF 
FFESRSLLLLPRLECSGSISAHCNLCLL 
GS 


12259 


26160 


A 


12374 


381 


488 


TITCLFQKCQRGRARWLTPVI PALWEAE 
AGGSQGQE 


12260 


26161 


A 


12375 


276 


395 


GHWKTKKLKLI FVFFEMES CS VAQAGV 
QWHDLGSLQPPP 


12261 


26162 


A 


12376 


295 


472 


VYSSVALNTFTLLCNHKHQPSPEVLTFP 
NKKCRGAILAHCNLHLLGS SDS PTSASR 
VAG 


12262 


26163 


A 


12377 


390 


511 


PGAVAHACN P STLGGRGGQ ITRIPSL 


12263 


26164 


A 


12378 


417 


511 


AHRS FFEMESCS VTQAGVQWRRLGS LQP 
SPP 


12264 


26165 


A 


12379 


133 


406 


KLSLNKRDYKKQVCKETKKKKKKKKKKK 
KKKKKKPGAQKKKSGGS PR 


12265 


26166 


A 


12380 


247 


345 


HMWSSQLNKGLINXKKKKKKKKKKKKKK 
KKKKKKKKG 


12266 


26167 


A 


12381 


156 


15 


NFFFFETESCSSVTQTRVQWFDLGSLQP 
GKQEQNYVS KNKQKEL I H 


12267 


26168 


A 


12382 


275 


1 


KIVFLKGPPLFFFFKKKFVFFLKVGFKG 
GLNF I KTFP PGGKP I SKKRQI FFFFFFF 
ETEPHSVARLECSGTISPHCNLHLPSSN 
DSLASTS 


12268 


26169 


A 


12383 


43 


379 


LLGYDESRSLSWICLCLSCLGQLLSFLN 
LLVYVFCQTCEVCFHYFFQYFSTLFLLT 
FWDSNVLNIRHFG I VPQVLDDLVFFS PN 
KEIFLGQLQWLMPVIPAFWEPEAGRSPE 


12269 


26170 


A 


12384 


303 


J 


KGPFFFFGLSPFFFLKKVFFFPPFFLRT 
PPFFFFPPFFKNFFPPPPKKKNPFFFFP 
PPPFFFFFFFFFETRSRSVAQARMQWGD 
HGSPQPQPPGSSDPTRP 


12270 


26171 


A 


12385 


141 


1 


KKEPLRAPPPFQLFCFFYFETTPCSVTQ 
ARVQWCNLSSLQPPPLRYK 


12271 


26172 


A 


12386 


31 


417 


CRLADSPSPNDTGQDSRGRAGIKHIPPL 
KKKKKKKKKKKK KKKKKKKKKKKKKKKK 
KKKKKSGGGA 


12272 


26173 


A 


12387 


291 


412 


LSS EMLLFCFVYFLRWSFTFVAQAGVQW 
CDLGLLQPPLPG 


12273 


26174 


A 


12388 


310 


1 


MTPCPFLFFCPKKRKAGGGFIKKALFFN 
PQKEVFLGPPPKGPFFLGFQPTPPWGKK 
I PRLN PL FWRS S I F F FFLLDG VS RLLPR 
LECNGVISAHCNLRLLGSS 


12274 


26175 


A 


12389 


117 


2 


KRNPQRGG PLFFFF F FFETE S RS WQAG 
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AQWHDLGSLQ 


12275 


26176 


A 


12390 


117 


319 


SFLKKPKDSTKNLLKLINEFSKGSGPKI 
TIQKSVAFIYPNGEHSEKKIREVITFTI 
AAKT I KCPPQA 


12276 


26177 


A 


12391 


95 


2 


SKSWTSLKLRTLGWARWLTPVI PALWEA 
KAG 


12277 


26178 


A 


12393 


127 


395 


IPGS QG FCL I KQSALHLLNKS S FS FLS F 
IETQSLSVTQAGVQWCYLSSLQPPPPRF 
KLRQKNRLNSGGRGCNKLRAHHYTPGWV 
TERDS 


12278 


26179 


A 


12394 


252 


391 


FFCFCFVLLCFFEMESCSVAQAGLQWPG 
LGSLRQGFAMLARLVSSS 


12279 


26180 


A 


12395 


275 


3 


TFRLVILKKSKETVEIKCLETKSHSVAR 
AGVQWCSQDWWEEKAGFNWRPSKARRCP 
TS VLKCHLGWAQ WLMPV I PAL WE AKAGG 
SPEVRS 


12280 


26181 


A 


12396 


111 


3 


GNNVKIRGLRIRPLKGWACWFTPVIPAF 
WEAEAGG 


12281 


26182 


A 


12397 


124 


1 


VSRGRTVHQE KPTERAAS PRLECNGT I S 
AHHNLCLPGSSNS 


12282 


26183 • 


A 


12398 


259 


387 


YQQESFSCLFIFIFLFFIFYDTESRSVA 
QAGMQWHDLGSLKPP 


12283 


26184 


A 


12399 


103 


1 


KGRFFFFFFYETESRSVAQAGVQWRDLD 
SLQAPP 


12284 


26185 


A 


12400 


308 


416 


CGNNFEKAGRVRWLTPVI PALWEAKAGG 
SLBVRSLR 


12285 


26186 


A 


12401 


92 


1 


HTWGLTQWLMPVIPALQEAKVGKLFEPR 
SS 


12286 


26187 


A 


12402 


265 


388 


GLSWLFRRPGHLFYFIFFETMSVSITQA 
GVGCCDLGSLQPP 


12287 


26188 


A 


12403 


279 


1 


SLSSKMESGSLTFSTIDIWGWIITLSCC 
PGHCRMFSSIRGLYPLDASSTTSSSCNN 
QNS PDMGTCSLVJSQLLRRLRWEDCLNTG 
GRGCSEPGS 


12288 


26189 


A 


12404 


387 


152 


NPPPGFNFGGPLKKNFFSPPRGEKFVFL 
KRPPPFFFFFFFFFETESHSVAQAGVHW 
RDLGSLQALKS FCDLVSAFKGF 


12289 


26190 


A 


12405 


262 


464 


IIFHWLACILVIHSSFDGHGLFLPLAVA 
DSAAVKHSCTSICliNIEYIPGNGIAGSC 
GISVLNFLRNC 


12290 


26I9I 


A 


12406 


204 


67 


KCFGQAWWLMPVIPALWEAEAGRSQELR 
SSRPAWAKELLNNRTRAS 


12291 


26192 


A 


12407 


134 


1 


IQRPCIQLFSTFFLILFYFIFTEIDSCS 
VAQAEVQWHDFGSLQP 


12292 


26193 


A 


12408 


145 


403 


TYMYSLINYYKANSHVTKVNKHNTARFL 
ESLLCAHPNPP IKLII SLLLRE KKKKKK 
KKKKKKKKKKKKKQK 


12293 


26194 


A 


12409 


65 


415 


RNKRIISQSWNGPSRKPQSSAIaLCGLGH 
LTSGVRARVSPVQGCLVRIKGGSGPSKP 
KKKKKKKKKKKKKKKKKSSKKKKAQKGG 
ALKKK 


12294 


26195 


A 


12410 


124 


404 


PWV3EFLHHRNCQMWPGTVAHTCNPGTLG 
KKKKKKKKKKKKKKKKKKKS S S S S S 


12295 


26196 


A 


12411 


164 


2 


TGAGLTIMDMVGVGPDLLQLRFCFVLFC 
FLVFFFETESDSVAQAGVQWCNPSSL 


12296 


26197 


A 


12412 


114 


3 


PGMVAHACNPS TLGGQGGW I TRGQ VYKT 
GLAKTVKP 
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12297 


26198 


A 


12413 


583 


391 


LADTGFHHVGQADLKPLTSCSTRLSLPK 
CWDYRCE ? PHLAVPDTVLSTRGLI SEKS 
FLPLCTFR 


12298 


26199 


A 


12414 


280 


2 


KVSSMNAQTSVLFSTMLGISETLKYRGR 
KKRRERGREGGREGRERKDECRKVRSER 
IEGGKKRGRKEYREEGRRRIEVSANEET 
PRDHPANK 


12299 


26200 


A 


12415 


264 


352 


R FL VHGCD I Y I C I N I YTHTHTHTHSHTH 
T 


12300 


26201 


A 


12416 


359 


485 


TDARMDWCYQEGTGSQAWWLTPVI PAFW 
EAEAGGS FEVRSLR 


12301 


26202 


A 


12417 


171 


1 


PLFRWWGFELFPPFGYQKKKGYPPFFFF 
ETEPCCVTQAGMQWYDLRSLQPPPPEFK 
R 


12302 


26203 


A 


12418 


265 


1 


WHKSNEELAAEASAPVKARASNTILGPH 
THGKER C P VS LRNVLDEGVKS TNFFS F F 
ETDSHSVTQAECSGAISAHCNLHLMGSR 
DSGA 


12303 


26204 


A 


12419 


278 


388 


S SLTG FF I RRRNE E PGQHGET P CLLK I Q 
KLAGCGGVL 


12304 


26205 


A 


12420 


205 


413 


WNLFVYLYQTHKYARVYSYNI ILFTNKK 
RAIGQAEWFTP 1 1 PAVWEAKVSRLPEVR 
SVRLSLPKWRITS 


12305 


26206 


A 


12421 


103 


2 


RSLSKHRTETILGLPAAVLIILFPPLLI 
PTSKY 


12306 


26207 


A 


12422 


372 


478 


NIFVFLKETPCQSQWLMSVIPTLWEAEA 
GGSLEPR 


12307 


26208 


A 


12423 


174 


416 


KMHY CVLS AFL I LHLDTVAL I LS TCSTL 
DMDQFMRKRIEAIRGQILSKLKLTSPPK 
NYSEPEEVPPEVI S I YNSTRDLLQE 


12308 


26209 


A 


12424 


572 


733 


RFSCLSLPGSWDYKNCLNSGSRGYSEPR 
SHHCI PAWVTEQDSVSKQNKTKQSA 


12309 


26210 


A 


12425 


123 


2 


GYIFI FI YFFETGSCSVAQAGVQWRNLG 
SLQLPPPSSSDS 


12310 


26211 


A 


12426 


224 


417 


TADF YG VNY I S I KLL P KKKKKRGG P FKE 
S KFTAAGLQKNI FFLS APNFIALAWLK 
RRDWETPG 


12311 


26212 


A 


12427 


115 


3 


FFETGSHPVTQTGMQWCDCNSLQPLTNR 
LKQSSHLSL 


12312 


26213 


A 


12428 


250 


398 


PWAGHLTSLGLSLPI FNIGIAGQACWLM 
PVIPALWEAEAGRSLEVRSSK 


12313 


26214 


A 


12429 


215 


3 


KKGLVW7GANKLWVFTPQKFFLNQFSGW 
ALTPRGKSWFSKRRAPAPFFFFFFWETE 
SHSVAEAGMQWHDL 


12314 


26215 


A 


12430 


345 


2 


FSHHPAKPQGGLFYRQPLGLEGKETRPR 
LNDRFMAGPSFPAQQGLRMPFLGKEWER 
EGCSFVQCPVEGFAMWPYPLGSPPGPRL 
FFFFFFEAGS PSVTQAECSGS I TANCSL 
EL 


12315 


26216 


A 


12431 


348 


3 


LYSSLGDRVKLRLEKTKTNKWCLHSYIS 
TCKI INLDHYLIPCTKTNSEW I KYLNVR 
TNSIKLLEEDISVNLHELGSDNSFLAMT 
PKAEIIKEKIYKLHFIKIKIFHVSEDII 
EK 


12316 


26217 


A 


12432 


160 


52 


NTAIGHPHQAI YIjSYLSI I YLS I YLS I Y 
LSIYLSIYL 


12317 


26218 


A 


12433 


180 


2 


GRSKLHICREHSICAIEHVCGCDTENNT 
W LCQ V QWLM PVI LA I WEAEAWRS P EVRS 
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LRA 


12318 


26219 


A 


12434 


193 


3 


PGAENPPPKGGPPFFFPFPRGFPPGVFF 
LKKKGFPFFFFFFFDTESRSVAQAGVQW 
HDLGSLQ 


12319 


26220 


A 


12435 


321 


422 


NVEGDFLQR I KNI RYWPGAVAHTCGPS T 
LGGRGG 


12320 


26221 


A 


12436 


116 


2 


HDRFCLFVYDRVSTLS PRLE CS SAI SAH 
CNLRLPGSSN 


12321 


26222 


A 


12437 


302 


400 


THTHTHTHTHTHTH SGLLYYLP KG KQS H 
EFCW 


12322 


26223 


A 


12438 


329 


399 


LALQARLECNGI ISAHCSLCLPY 


12323 


26224 


A 


12439 


310 


2 


KP I LSRGLFCARPCFRRSNMRS FNPSQS 
YQVGKDTFGFTVKVTKLECVLGPSRGPKG 
ARQGVHKGKCLLTLPKMLRFLFFFETES 
CSVTRLECSGAI1AHCNL 


12324 


26225 


A 


12440 


151 


401 


EGMPHLPGSHSSSNIluKRGRESGREAGR 
KGRKERRKEGRRERRKEEEKRGRGKIRR 
RKDRGENEEEENNRLSSKVSLEISTCP 


12325 


26226 


A 


12441 


276 


1 


FHPFGRPSVSPRVQPKIGI I IHTCVARI 
EKGGSCKALSLCPQYVGSNFFFFFETRS 
FSVAQAGVQRCDLGSMQPPHFPGSSDPP 
APASRVAG 


12326 


26227 


A 


12442 


140 




NSHFPKNLGSPSSFFFFFYLETESRSVA 
QAGVQWLDLRSLQPPPPG 


12327 


26228 


A 


12443 


310 


3 


LPPCSWGLYGAFARCPLMDDKQLSQVPL 
WACPLSTAGRTRLCVACIKAAGKAQGFF 
F FFETESHS VTQAGVQWHDLGS LQPPTH 
AKPKHTHTHTQTHT I PDT 


12328 


26229 


A 


12444 


337 


2 


GFFAIVPKKGQKKMFSRFPKEGIFSGPC 
FISSTPTLLGVKKILPPNTSEIRPVLSS 
IINTSVHIREQKKTTYVGIFTGKSRFFF 
FETDSRS I AQVGVRWRHLGS LQAPPPG 


12329 


26230 


A 


12445 


112 


9 


GRVRWLTPAIPALWEAKAGRSPEVRNSR 
PAWPTR 


12330 


26231 


A 


12446 


172 


3 


I PGELLIGRAKFKAKNWPWAPRGI PLGR 
QKFS FFFFFETESRSVAQAGMQWLNLGS 


12331 


26232 


A 


12447 


221 


41S 


RKAERLEVFFRRLCQRALRPFPHCLAAA 
PMPLIVLKKPILGRAWWLTPVI PALWEA 
EAGGWLESRS 


12332 


26233 


A 


12448 


280 


392 


PLGRVRWLTTVIPAHWEALAGGSPEFRN 
LRPAWPDMA 


12333 


26234 


A 


12449 


410 


129 


PSQEI FPYPGPPGKTPFPLKKKKKNRGG 
GGGLPFPPPQKVKKKKFFYPGRGRFKQP 
KFNPLPPPRGEKKNPKPPPPKKKKKKEK 
KRKEKKLVT 


12334 


26235 


A 


12450 


159 


1 


KKKRGGFLGLTFIKKTGPDPFFFFFETE 
SRS VAQAGVQWYDLG SLQAP P PG LM 


12335 


26236 


A 


12451 


270 


1 


NPLKFFFFSLFQNPLKKDPAIFNPFFFF 
FPRIFWPFFGGVSKAGIWFFFFPFFFFF 
FFFFFFETESRSVAQAGVQWRDLGSLRT 
RGRTRG 


12336 


26237 


A 


12452 


249 


33 


DKLVKLPLILKKLNSFFFFFETESRSVA 
QTGV QV7RNLGSLHPPP PGFKPNAWAYAW 
AHAWWPGLPAQNLR 


12337 


26238 


A 


12453 


190 


86 


TNDLGWTQWLTPVI PALWDAEAGRSLEL 
RSTRKA 


12338 


26239 


A 


12454 


318 


2 


PPTFPLKTLFFPVFLPXLFSTLFSPKKK 
FSN FliVS AP PFF FLTLGNSQG VLLKGGP 
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FFFFFFFFETESRCVSQVGVQWRDLEKK 
EKKKCNSNGGRHYISTHLKYK 


12339 


26240 


A 


12455 


204 


288 


IQLNTHTHTHTHTHTHTHTHTiSrETGKMW 
I 


12340 


26241 


A 


12456 


409 


13 


QGGAP P I P I MLS FLQEAGLLHVAQPGVK 
LLTSGDPPSPPSKSAWRQRQETRLNLGG 
RGRSEQRPPPRPPAWATQQDSVSKTNKQ 
TKNNKKIRKSLCKKTLEKEMLAVMI ITK 
YLLCTDVPSTVLCTHLYYLT 


12341 


26242 


A 


12457 


204 


3 


KKKS PTPQSAKFPLGGGVFFFFLVGMRS 
PFVAQAGFKPLGSSDPPILASQSTAITG 
MNHHTSSAFH 


12342 


26243 


A 


12458 


289 


3 


KAREKKKVGATKNIRIVDLSKGRTSLCK 
KTEMPPVGNRNKQKPQNPSNLRISTRPE 
TESRSVTRLECSGAISAQVILPGSSDSP 
ASNWRQSETLS 


12343 


26244 


A 


12459 


261 


388 


SWHFGRSKQVYCLSLGVRNQPGQNPVST 
KTTKISQAWWHAPW 


12344 


26245 


A 


12460 


313 


391 


RVGRVHWLKPVI PALWEAVAGGSPEV 


12345 


26246 


A 


12461 


112 


7 


HICIYFFETESRSVAQAGVQWHDLSSLQ 
PPPAGFK 


12346 


26247 


A 


12462 


389 


3 


ALHEGAARGSPPPGGKKKKRGSPFPMGN 
QAPQVPGKLGGGFGFWFFKKSLPLQPGG 
KPGVFYKPKKVPTPKDPPPPTLWGGWGT 
TGGPPGPLFFFFFFFETESRSVTQAGVQ 
WRHLGPLQPQPPAHAS 


12347 


26248 


A 


12463 


26 


349 


CIM IDSTTGI FIHCWWECEMAHPFWKAV 
WQFLFFFFKRGVYFFFPGGIKGPGFGLR 
EPLPPGIKGVPRPNPLGGGDLRPPPPPP 
INLGFFKGKGVFMVWFGVGNFLI 


12348 


26249 


A 


12465 


187 


54 


GYKNKPGQAQWLTPVIPALWEAKAGGSP 
EVRSSRPAWPIWRTRG 


12349 


26250 


A 


12466 


199 


55 


I IQLRQQNS FLFICFD1 BSCS VAQAGVQ 
WHNLS S LQLE VPGFKHAPSH 


12350 


26251 


A 


12467 


107 


3 


FAHCLIGNSFFSFFFFFFEVGSCSVAQA 
GVQWSD 


12351 


26252 


A 


12468 


120 


1 


GNVJCRAQWLTLVI PALWEAE VGRSPE VG 
SSTRLGLPTHRP 


12352 


26253 


A 


12469 


105 


2 


YYLYF I FFLFFLETGSCSVTQAGVQWKD 
LRSVTS 


12353 


26254 


A 


12470 


238 


405 


FHI CLFILKNQVSKMYTHTVSCSLLE I I 
YCCLLQWLTPVIPALWETKVGGSLEVRS 


12354 


26255 


A 


12471 


425 


1 


LNPAPAIFFGGPKKKNFFSPPRGDKFFF 
FKRAPPFFFFFFFFFFFFQKJCKKPFFPA 
QKKGRGFPGRALSQWPLAGFLKITQFPP 
WSRVPFSPLLFFFFDMESPSVAQAGVLW 
HDLLHLPGSSDPPTPASRVAGITGVCHH 
A 


12355 


26256 


A 


12472 


122 


2 


QNKYSELRWNTFGRPQWFTPVI PALWEA 
EAGGSPEVRSSR 


12356 


26257 


A 


12473 


144 


420 


GAPRP I SRPRGKTPGGGGKRGKPNKNGL 
GENPFFPKGVKGKNPPI FFPFLGQNGPT 
PLVLGSANMGFYLYLKRKGPGKGREALR 
AEPGQPFY 


12357 


26258 


A 


12474 


80 


1 


KGGQVQWLTP V I PAFWEAE AGG S PEV 


12358 


26259 


A 


12475 


54 


294 


LIKSWQHRQEYVSLMSFIFFPCHIWYQS 
QRS PGTTQI I LEI SDTGKNKYFNCTGSH 
RKKTKTHTOTHTKTHTHAKTH'L'L, 
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12359 


26260 


A 


12476 


364 


234 


EEMGFHHVSQDDLNLLNSWSACLGLPKC 
WDYRSETSRLASLPF 


12360 


26261 


A 


12477 


288 


450 


PGTVAHACNPSTLGGPGWSRTPGLKRI I 
CLGLPKCWDFRRELL.PSLPCLLI FL 


12361 


26262 


A 


12478 


271 


396 


TISFLSFFFLFLFFFFFFWRKSGRESRF 
SQKKKKYKKKNIYY 


12362 


26263 


A 


12479 


289 


2 


GSCLLERKLTYRKD IHTKKFPSVPHHHQR 
PKVDKTPKTGKKQGRKTGNSKNQSAS PP 
PKECS S S PATEQKWMENDFA KL REKG FR 
RSNYSELQEEI 


12363 


26264 


A 


12480 


323 


1 


YSMVYPYVFLTAKKDQLQVNNTQLTCKS 
YQLYHCXNHSTLQTHNISTLMI LGHI PG 
LCIPVNLSEAWAATPALHFVKLLLTELT 
HHVCRALDI I ILAI VSLVALITS 


12364 


26265 


A 


12481 


279 


429 


MNGQRKCVYVYTLYTYTTEYYSTLKRRK 
WPGTMAHACKPNTLGGHGGWTA 


12365 


26266 


A ' 


12482 


308 


423 


LTFFKNITGFFFFFFETEFCFVPQAGAQ 
GGDLSSLKAP 


12366 


26267 


A 


12483 


272 


474 


MVSWGLSRAKIPWVLSFFFFFEKKSHFF 
FQGGGRGGDLSWEQPLPPRSKGFSCFTL 
PSTRDYRPAGP 


12367 


26268 


A 


12484 


84 


229 


LFKSVCHLLPLSSCSSHVRQACLPFTFC 
HDCKCSEASPAMLPVQPAEL 


12368 


26269 


. A 


12485 


392 


3 


TGKKKRFPFFPKKK I FNSKPQFS WRKKK 
LFK I FS PPGGAPF P PGVQKKKT FFFFYP 
FSPRTPPFSIFFPGKGPPKGGGLGPPFF 
PQKRGGGFPNSKWETWEKFPPLFFFFFF 
ETESHSVAQAGVQWHDL 


12369 


26270 


A 


12486 


76 


I 


KL I S WT W WHM? WP ATWRAEAGDFS 


12370 


26271 


A 


12487 


204 


8 


FFI FFFFFFFFFFFCFKTGSHSVAQAEQ 
EWHDHGSLQPQPPQSFHLSLPCATMPGL 
FFLFLFLFT 


12371 


26272 


A 


12488 


96 


3 


KKPLGQAMFFFFFETESCSVAQAGVQWC 
DL 


12372 


26273 


A 


12489 


126 


1 

i 


KPPHPFLPFFFFFETESHSVAQAGVQWC 
YLGSLQAPPPGFTP 


12373 


26274 


A 


12490 


107 


2 


I AGRAWWLMLA I PTLWEAEAGGSLEPRS 
SQPTCAT 


12374 


26275 


A 


12491 


302 


2 


GGFGFFPPREKGGFFQTVLFGVPPGFFS 
PPVFKTGPGVFFLGAQKKKIFFPPPGGK 
IWFFLRGAPLFFFFFFFFFFFEMTSCSV 
AQAGLQWCDLGSLQHP 


12375 


26276 


A 


12492 


248 


398 


PTEQVTLGITAQSYSRVHINNRVYDLAV 
GSGHPDGAAAIKGSFVQRLKSY 


12376 


26277 


A 


12493 


341 


1 


KTSHFREIYPLEHASSLSKKISTGALPC 
SQELLNS QENS VMNKFPCLNQIiPFQDVN 
SCI IRFQTKDTLKHTAIQPEAKTLSLPC 
YTHTHTHTHTHTNNHS I FELLCMQCDS Y 
N 


12377 


26278 


A 


12494 


95 


3 


PLFFFETKSCSVAQAGVQWQNLGSLQPP 
PP 


12378 | 


26279 


A 


12495 


62 


420 


CAAKLCTEAHTAAS ADTHT S PHGSVSGL 
FCFHFPPHRREQRSRPGRKPGAHRLAGR 
ALSQKPVGSGATPHNLHHQIRTQTNGLI 
QLLGDRQAP WVT P VL P VLW EAE AGG S LE 
ARSLRPA 


12379 


26280 


A 


12496 


58 


492 


NSPPPPAPSQRTSPPAAAASPTTGSSSA 
PCPASS SWFRSS I All ITFYPDQP CGLS F 
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NN LAV7 KQNVL CFKERLLEKMFRRI Q VLG 
LLVWTL I AGTE YFRVPAFGWVMFVAVFY 
WVLTVFFLIIYITMTYTRIPQVPWTTVC 
LCFKR 


12380 


26281 


A 


12497 


2 


198 


RLQGI LS AGFLVPGDTSVS S PRPRAAGS 
QSSTSRVLWKLSGAMLRLYVLVMGVSAF 
TLQPAAHTG 


12381 


26282 


A 


12498 


71 


346 


GM FMT S KE T F THY Q PQGN S D PAHTATAP 
GGLSAKAPAMTPLMLDTSSRKLVAWDGT 
TDGAAVG I LAVAADQTSTTLTFYKSGT F 
RYEDVLWP 


12382 


26283 


A 


12499 


122 


59 


HQIYIYIHTHTHTHIYIYIYI 


12383 


26284 


A 


12500 


220 


3 


PKKESVRCWAMKWILLILCQTSLHWSL 
PQSSLAATPCCLNLGTVDVLQGLGAVAH 
AWN P S T LGGRDG W ITS 


12384 


26285 


A 


12501 


125 


1 


HVLVLSHFLFVLFLFFETESRSVTRPKR 
SSAVSAHRNIRLS 


12385 


26286 


A 


12502 


373 


476 


ILTRDGVSLLPRLECSGTISAHHNLCLL 
GSGDYP 


12386 


26287 


A 


12503 


440 


573 


STWEDHLSLGNRGCSEPRLHLCTPTWVT 

E 


12387 


26288 


A 


12504 


146 


5 


HPHLQTMNGAP I PQELTPLLEKBRDGLR 
CRGNRSPVPGIPPRWSRDL 


12388 


26289 


A 


12505 


140 


44 


AQGLTPVI PALWEDEVGRSPEVRSSRSD 
WPTR 


12389 


26290 


A 


12506 


247 


420 


LEISVTFSCSFGCLPKKHIHTHTHTHTH 
THTKTHTHTHTYIRTIYTTLARPLSALH 
NF 


12390 


26291 


A 


12507 


346 


473 


KAY I IGLNCFYETKSHSVAQAGVQWRDL 
NSLYS 


12391 


26292 


A 


12508 


87 


414 


RTGVYRVGKDGRSRSPDLLICPPLGLPK 
CWDYRREPPRPACliPLLSHPSPPPHFSF 
LSPFLSFSLPCSPFLYIFFPPPLLPISL 
RWRLI PEGRFMAPLCLQMPLGHDT 


12392 


26293 


A 


12509 


386 


467 


LLLFFEMESHS I TRLECSGAI LAHCNL 


12393 


26294 


A 


12510 


344 


472 


KFLPFDPAVSLLGIYPKESKSFYQKDTC 
TRMFI AI LFTIANTW 


12394 


26295 


A 


12511 


424 


132 


RGEGPPRSGILSPVGPPGETPPFLKKQK 
I TRGGGGG P LF P P L KRVRGENS FP PGG K 
SFHGAKFPFCPPPV7ATKRNSVSKKKKKQ 
TKKGSR 1 1 L YNNE 


12395 


26296 


A 


12512 


351 


510 


GGTDFCKQS KLQSLQPAVEI F I PGRVQW 
LTPV I S ALWEAEAGGS PEVRRSRPA 


12396 


26297 


A 


12513 


362 


463 


NTITWLGAVAHACNPHTLGGPGGRITWG 
QEFETS 


12397 


26298 


A 


12514 


137 


1 


EE I SLLLPRLECNGAI TAHKNLHLPCS R 
NRRPGKAVHVRQAYS FQ 


12398 


26299 


A 


12515 


251 


592 


GAFTGLAFTMAGGRPHLKRSFSI I PCFV 
FVESVLLGIVILLAYRLEFTDTFPVHTQ : 
GFFCYDSTYAKPYPGPEAASRVPPALVY 
ALVTAGPTLTILLGELARAFFPAPPLAG 
PV 


12399 


26300 


A 


12516 


3 


413 


SWGGRKFLCPPRSLSGSRELHPAQGDRP 
GPLSSSGKRETGTHRETLGKKKVSSAPE 
AQGAGLRLSQALGGLCYDLSPLTEPRLP 
LAATAFPRPCPALPHPQPRVTMGSVSSL 
ISGHSFHSKHCRASQNKLRKSSHLK 
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12400 


26301 


A 


12517 


115 


2 


KSKGGHGVEVLVSVLVGLAMEVGLVLWV 
KEYDGAALA 


12401 


26302 


A 


12519 


444 


34 


YCLRQKHTEYRRRSNLEMASSCLSGRKT 
CTFFTLYQNLKMIKLSEEGTWQAEWLKS 
RPLAPNSKWEAKEKFLKKI KSVTSIoNT 
YILRKPNSLIADMENILVIHIEYQTSYN 
IPSSQRLIQSKALCLFNYLEMECRI 


12402 


26303 


A 


12520 


1088 


935 


ILIIFINYRCEFLRSKKSSEEITQYIQS 
YKGFVDRTVMYNSNFFLKFLDYL 


12403 


26304 


A 


12521 


2 


347 


AMAAGSRTSLLLAFALLCLPWLQEAGAV 
QTVPLSRLFDHAMLQAHRAHQLAIDTYQ 
EFEET Y I PKDQKYS FLHDSQTSFCFSDS 
IPTPSNMEETQQKSNLELLRISLLLIES 
WLE 


12404 


26305 


A 


12522 


50 


205 


WGESDRQSSGFWASSAHACNPTWEVET 
GGSLEARSWRLQCTMFASVKSTRP 


12405 


26306 


A 


12523 


105 


2 


RFVCS TI KVLRDLSSDRSNpGRFLSTSN 
SSLYER 


32406 


26307 


A 


12524 


279 


455 


KTKNWLALGG P P CFPGQ YG E T P SLLKNE 
KLAGHGGAPIiYSRLLRRLRRGNSLTPGD 
RGC 


12407 


26308 


A 


12526 


305 


454 


LYFLLFPFQI INRMVLFFVFCFFFETES 
HSVPQAGVQWRDLGSLQAPP ?G 


12408 


26309 


A 


12527 


363 


459 


CT I LGQAWWLTPVI LALWEAEAGRS PEV 
RGSR 


12409 


26310 


A 


12528 


182 


36 


QHI WFHKEHGQVRWLTPVI SALWKAEA 
GRSPEVRSSNAAWHACRNSA 


12410 : 


26311 


A 


12529 


388 


530 


KSFLNFFPRDGILNCVPQAGLELGSSDP 
PASAFRVAGTTSVWHHSQI 


12411 


26312 


A 


12530 


134 


1 


GH KKG VL F F F F KT ESRS VAQAGVQW CTL 
GSLQPPLPAHATRPRV 


12412 


26313 


A 


12531 


230 


1 


KKFGFFPQAGPQGGLHLFTTLiAPWGQAI 
SQKREKPFFFFFFETEPHSVARLECSGT 
ISPHCNLHLPSSNDSQASTS 


12413 


26314 


A 


12532 


145 


3 


KKLE ILSAI I PPKFRKEI FLFFF YETES 
CSVAQAGVQWRDLGSLQAP 


12414 


26335 


A 


12533 


365 


2 


LFFFTSLIWQITLGFLILNCPCIPGVCI 
YIQFANTFRKFAFLFTNEIAYNILILPL 
TKQDYTNLI KL I KKHS FFSNFLKQLLFV 
FCFVFVSLFAMESCSVAQARVQWRNLGT 
LQPPPPRFK 


12415 


26316 


A 


12534 


313 


475 


FFWSCSSFQQGFGGGITI III ILRWILA 
LSPRLERKWRDLGSLQPSPSGFKRFF 


12416 


26317 


A 


12535 


124 


3 


MGSLIMGAELSVYRQVCDCKPQGVCVCL 
CVCVCVCASTCM 


12417 


26318 


A 


12536 


333 


58 


ACNPSKVRSCRPAWSNMVKSCLSKNAKI 
TKEWWWANFFFVFVFLVEMMFHHVGQLS 
LKLLTSSDLAASASQSYGITGVNHYAQP 
ARTRGSFR 


12418 


26319 


A 


12537 


214 


3 


SDPRRAGRCPKEAPEQQRRLPGGSVPAL 
SIFFFFKTESCSVPLSPPXECSGVISAH 
CKLRLPGSCVPPAS 


12419 


26320 


A 


12538 


146 


1 


MNFIiAFTNPPRPPQQILKPLbFILRRES 
HSVAQAGVQWCDLSSLLPPA 


12420 


26321 


A 


12539 


195 


1 


IFIPPQAQKRGDPFLSFFFFETRSPSPR 
LECSDAITAHCSLHLPGPGEPPTFPIPK 
SWDHSHVPP 


1242! 


26322 


A 


12540 


151 


334 


LLGRLRHKNHIiWPGGGGCSGTIMALCNI 
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LLPRS S S S PTSASKVAG ITGACHHACCL 
RYFLY 


12422 


26323 


A 


12541 


314 


402 


NLNSGPAWWLMPVIPALWEAEAGGSPEV 
G 


12423 


26324 


A 


12542 


339 


3 


G S CLLEGKL PNRKD IHTENP S VHHHHQR 
PKVDKPTKMGKKQNRKTGNSKTQSVSPP 
PKERSSSPATEQSWMENDFDELREGFRR 
' SNYCELREDIQTKGKEVEIFEKNFERV 


12424 


26325 


A 


12543 


31 


365 


RSAVMPTEDRATWfCFNYFLKI IQLLDDY 
PKCFIVGANDVGSKQMQQIRMSLRGKAV 
ALKGNNTMMRKAIRGRLENNPALEKLLP 
HI RGNEGS VFTNEDLTE I RDMLLGNKV 


12425 


26326 


A 


12544 


278 


410 


FS FKNVTAGRWWLMPVI PALWEAEVGG 
S PEVKRSRTTLANMVK 


12426 


26327 


A 


12545 


88 


276 


EVPQAHRKLPS APQLRPLQAI STKGPCP 
PQSPQPSPRPEGSSAHSRGEKRTRERGK 
EEKRKKK 


12427 


26328 


A 


12546 


251 


3 


PVEDNSKRRFQTGEEKVTQEEQEALTPW 
RTNFFCFSDRVSVAQARVQCCDSGSLQP 
QLPGS SDLPTSASQVAGTTGMRKHAC 


12428 


26329 


A 


12547 


106 


2 


DEHLPFLTSFETESCSVAQAGVQWRDLG 
SLQAPP 


12429 


26330 


A 


12548 


225 


1 


GFFKKEPHPRPCKKKTPPKLKPFWGGLN 
KKIPLSFVFSKKGPLPKKFFFPFFFFET 
GSCSCHPAGVQWCDHGSLQ 


12430 


26331 


A 


12549 


218 


3 


VPPPGLVPGLPQIIPKGFPKTKGPPKGP 
PPGDKQKSGPPNKKNFFFFFLRRESHSV 
AQAGVPWCDLGSLQP 


12431 


26332 


A 


12550 


276 


24 


GPLKMVEAPPQNPLSPKNKIFFFFFFET 
ESHSVAQAGVQWRNLRLPGSSDFPASDS 
RGAGITGARPHVQLI FVFLVKTGRKRR 


12432 


26333 


A 


12551 


331 


440 


KSASSWS FNSKGWGWWLTPVI PALWEAE 
VGGSLEVR 


12433 


26334 


A 


12552 


136 


1 


RHTGSKTTATALVDVFLLLFFETESHSL 
SRLECSGTTPGHCNLYE 


12434 


26335 


A 


12553 


84 


1 


I F FCLFS FEMESHS VAQTGVQWCDHGS L 


12435 


26336 


A 


12554 


3 


349 


HASGPEELSKDPSLVSQGQPHRKPGLKR 
CS S RPLGP PDKLGGEGKQGLLG FTLWLS 
GP I KPCDDEEKNKKKKKKKKKKKKKLGG 
GPFKKKLFFPPGGGRNFFFLGAPKFFWA 
GRF 


12436 


26337 


A 


12555 


257 


417 


KQLHLLQGRLFSPS FLPS I SKLFFFEME 
SRSVTQAGVQGCDLASLQAPPPGFT 


12437 


26338 


A 


12556 


268 


403 


YMRLSFQDLVMFKDVAVDFSQEEWECLN 
SYQRNLYRDVILENYSN 


12438 


26339 


A 


12557 


286 


426 


DMLIKTCCHLVCEKKKKKKKKKKKKKKK 
KKKKKKRGGALKKKPLPPP 


12439 


26340 


A 


12558 


114 


1 


PLKEIANRHMKKCS SSbAIRQMQIKTTM 
RYYFTPVTMA 


12440 


26341 


A 


12559 


106 


1 


FFFFFFFFFFFYFFFFFFEMESRSVAQA 
GVQWCDL 


12441 


26342 


A 


12560 


118 

i 


2 


KNYSNNLKTNAGRAQWLTPAI PALWEAE 
AGKSPEVRNW 


12442 


26343 


A 


12561 


; 196 


2 


MCCFSKNWQEGAQMTPGFEPPQDNSEKT 
KLSDTRLLFFFFERESRSVAQAGVQWRG 
LGSLQAPP 


12443 


26344 


A 


12562 


128 


3 


RAPPFFFFETEFRSLPRLECNGAISAHR 
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NLRIPGSSDSPAS 


12444 


26345 


A 


12563 


277 


370 


WSFTLVPQAGAQWCDLGSLQPALSRFKR 
FSR 


12445 


26346 


A 


12564 


55 


251 


AMVIPTVPFNITINSKPLGHISFQLFAD 
KFP KTG ENFHTLNN KDKG FG S C FHR IIP 
EFICQGDDF 


12446 


26347 


A 


12565 


129 


3 


KYSVLRPGTVAHTCNPNTLGGRGGQITW 
GWEFETSLANVAK 


12447 


26348 


A 


12566 


172 


1 


PPPKKKKPRPPKKKNFFFFFGAISAHCN 
LCVPGSSDSPASPSRVAEITGSRCHAEP 
Y 


12448 


26349 


A 


12567 


257 


380 


GICRPLLGGVLQSGATGVRDPLEEAVCP 
LAELKHCAGRSTA 


12449 


26350 


A 


12568 


120 


r 


GVFLFCFVLFETESCSVARLECSGMISA 
HCNLRLLGSSNS 


12450 


26351 


A 


12569 


250 


2 


I GKP KT PQFWFLKTKRGGPNAAFS FKKF 
FFQLKPPPRVFLKI FSCQKKI FFFFFFE 
TESRSVPQARVQWHDLGSLQAPPPGF 


12451 


26352 


A 


12570 


338 


3 


VSTPEKNFVFHTRGFSKKSSKYKVCNFF 
FQKKLGS P PFS LKKVFPGKP P YS CCSGR 
VSTAPYFLNPGPSTKILPPFFGPLGPEE 
KFLSFFFFFEMECRSVIQAGVQWCDLG 


12452 


26353 


A 


12571 


5 


556 


I CCCLCFKIN I FQLHFNKINF FCVTRSG 
PVTQAGVQWCNLGSQQPQPLSSKQS 


12453 


26354 


A 


12572 


115 


2 


DRVS LL VLAHCNLCLRGSSDS PAS ASRV 
AGITGARHD 


12454 


26355 


A 


12573 


279 


377 


GQVQWLMPWPAFWEDEAEGLLQPSSSR 
PAWAT 


12455 


26356 


A 


12574 


117 


1 


KTPLFFFFFFFEMESRSYAQAGVQWYDL 
GSLQAPPNGFT 


12456 


26357 


A 


12575 


253 


374. 


S LPGMWHACNPS PLGGRGRR I TWGRKF 
GTSLANMAKPCL 


12457 


26358 


A 


12576 


130 


2 


KFPGP FLG FFF FFFEMESRS FAQAE VQW 
RDLGPPQAPPPRFT 


12458 


26359 


A 


12577 


132 


3 


RVPGP PL LFFF FFFEMESRS FAQAE VQW 
RDLGPPQAPPPRFT 


12459 


26360 


A 


12578 


3 


370 


LRKN C LNLGGRE CS ELRSHHCTPAW VTE 
RDSVS KKKTLS VLMEKWPNFPLRPLALS 
RKKFPRGPFEQVTHLVKEWSLTKPCCA 
EGADLDCYDPRTS PLFAKSWBRNFPFPV 
HPGPVEGCPK 


12460 


26361 


A 


12579 


131 


2 


F I YFL FY FLFLE TE SLS VAQAG VQ WHD V 
GSLQTPPPRFTPTRA 


12461 


26362 


A 


12580 


202 


2 


RMRKNQHKRAENS KNQNTSFPPKDHNSS 
QTREQNWMENEFDELTEVGFRRWVITNS 
SNLKEHVFSC 


12462 


26363 


A 


12581 


109 


2 


KI FFFFFFETESRSVPQARVQWHDLGSL 
QAPPPGF 


12463 


26364 


A 


12582 


105 


375 


L I M I LLPLQ I LVHSGTFCFLFFG FETEF 
HSTPRLECNGVMS AHCNLCLPVKTS PAS 
TSQIS 


12464 


26365 


A 


12583 


106 


3 


RPIFFFFFFQTESRTVARAGEQWCDLGS 
IQPPPP 


12465 


26366 


A 


12584 


484 


700 


SS S WNRAFSRKKDKTWMHTPEALSKHFI 
P YNAKFLG STEVEQPKGTEWRDAVRKL 
KFARP I KKSEGQKKKK 


12466 


26367 


A 


12585 


301 


1 


TQPKVRTQMKNNIFEHAFLSSTFPWPLP 
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NLFADFYSATSQTKPDILQSMQKAFKNS 
GIGQAWWLIPAI PAALWKAEVGGLLEPR 
NLRPAWVYGKTLSLQK 


12467 


26368 


A 


12586 


76 


379 


PLHPPAS CPAPSLS CLIPGS PSDPLFSD 
SARPGWGRGWGRAPQLPPS PPFLYNL I K 
KWSRFC1IQKKKKKKXKKKKKKKKKKKKK 
KGFFFWRAPPFFFFGGGFFS 


12468 


26369 


A 


12587 


109 


376 


LEFFKMSGWPTAPEQPAGEMENQTKPP 
DPRPDAPPEYNSHFLPGPPGTAVPPPTG 
YPGGLPMGYYS PQQPSTFPLYQPVGGIH 
PVRYQ 


12469 


26370 


A 


12588 


323 


1 


KKFCPPRFFPPLFGFQKKGTNPISPFSG 
GFKKNLGGFNPPPRPPWSPQIGFFLGRS 
KELILGIiFPTRIGIGGFFSFFFFCETKS 
HP VMRLE CSGV I S AHCNLRLL VS 


12470 


26371 


A 


12589 


262 


I 


GDPPTTSGPQTNQPKEHLMNFKSDSQLY 
EDTLAGRSVLIKNLTPQTLQPRWTGPYL 
VIYSTPTAVRLQDPPHWVHRSRIKLCPS 
EKK 


12471 


26372 


A 


12590 


284 


374 


T ESRNWG WAQWLT P V I LPALWEAEAGGS 
PE 


12472 


26373 


A 


12591 


297 


387 


GQAWWLTS VI PAFWE AKEGGS PDVRSSG 
LA 


12473 


26374 


A 


12592 


206 


407 


VKPQNPRKAQKLKGLVYIH I YV Y I YVYV 
YI YI Y I YM YI CRY I SLT WYTVNS KEKG 
LDTAAHTCNSR 


12474 


26375 


A 


12593 


222 


379 


LTTGSIMGNFSFLILVYTRKVMGSVQWL 
TAVI PLLWEAEAGGSLGSRSSRPA 


12475 


26376 


A 


12595 


301 


189 


KKIQSQAQWLPPI IPALLEAEAGRSLEL 
KNSRSAWAS 


12476 . 


26377 


A 


12596 


209 


3 


SVKPLFFINYQSQWLFLFFRGRLKRSG 
T I S AKOTLHLPGS RFS CLGL P KCWD YRC 
APPTLKIDLTKD 


12477 


26378 


A 


12597 


306 


3 


NGKKGFYMAPEPPFFFHFFFFS PLFPTL 
FFFF I FP L P FCFALS FLVGS S S LS PRLL 
CPMQHYFCDAWNTFDALI WGS IVDIAI 
TEVNVSTWRLSLTVRV 


12478 


26379 


A 


12598 


1 


249 


NTGAQCPLEFSIQELQLLFLGGGEDGVS 
LL.LLRLECNGAI SAHCNLRFPGSGDPPA 
SASRVAGITGACRDHTCEPPRPVPPSS 


12479 


26380 


A 


12599 


228 


404 


RLSSLSPVTEFGCLSPPDLILKCDPWPG 
TVTHTCNPSTLGGRGGWITRGQEFKTSP 
VNiM 


12480 


26381 


A 


12600 


101 


580 


LSLTKNCAIiLGEETMMEQEMTRLHRRVS 
E VE AVLS QKEVELKAS ETQRS PLEQDLA 
TY I TECS S LKRSLEQARME VS QEDDKAL 
QLLHDIREQSRKLQE I KSQEYQAQVEEM 
RLMMNQLEEDLVSARRRSDLYESELRES 
RLAAEEFKRKATECQHKLLK 


12481 


26382 


A 


12601 


464 


3 


CAIQQTLYEHPMKSSRLGPTQLKIFTCE 
YCNKVFKFKHSLQAHLRIHTOEKPYKCP 
QCSYASAIKANLNVHLRKHTGEKFACDY 
CS FTCLSKGHLKVH IERVHKKI KQHCRF 
CKKKYSDVKNLIKHIRDAHDPQDMYCGR 
SRGSARMSRALPS 


12482 


26383 


A 


12602 


140 


1 


WIGSFLSSESDGTWQKISGVQRSLSLY 
VYIHTHTHTHTHTHTHTV 


12483 


26384 


A 


12603 


132 


2 


E S L RPG AVAHTHN P S TLGGRBG W IRKDK 
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EFKMCERV 


12484 


26385 


A 


12604 


139 


1 


KKRVKRGPPGGFKGLGGKFFFSPKTFFF 
FFFETESCSVAQAGVQWR 


12485 


26386 


A 


12605 


204 


460 


SGQGNGEICHPRGFCTLPKAISSLLIAR 
VCLKHFNKGVASLSQTANGCFFEGFSET 
ESRSVIQAGGQGGDPGSLQPPPPGSCLS 
L 


12486 


26387 


A 


12606 


317 


1 


KRPLFKNIFQKPFFKKGAPRKKGPRCIF 
GPNQKNPLFFPAPLSPPFWETKKGAGFF 
FWGFPFPPTKIIGGQKVPFPPTFPPFFF 
FFETESLSVTSLECSGAJSAH 


12487 


26388 


A 


12607 


248 


367 


TQILSSGSVQSSLHHPPICFIHTHTHTH 
THKHKHPHS 


12488 


26389 


A 


12608 


2 


387 


QKQQQRAGRETSTCSLR1 ISAPTMATFV 
ELSTKAKMP I VGLGTWKSPLGKVKEAVK 
VAI DAGYRK I DCAYVYQNEHE VGEAIQE 
KIQEKAVKREDLFIVSKVQWCIWLAFNT 
IVPKSQWQSLQTPQRC 


12489 


26390 


A 


12609 


258 


23 


MWERGI IRPQTQTLTGEPEGLDHGRS IS 
PNSVMNGIVPHISILTLNINGLNAPLKR 
PG I AEW IRI YQTSMCCLQETGR 


12490 


26391 


A 


12610 


176 


433 


DALS SAWGTVLGTQI P S H FTE I LMLS P P 
AWGSSSLTQTLFYVPSGAKRTGSYVLAR 
VGQ K YKTL WLGAVAHACN P S S LGGRGRW 
IT 


12491 


26392 


A 


1261 1 


312 


410 


AHMYRTCSRGWVS WLTPGI PALWEAEAG 
GSLEP 


12492 


26393 


A 


12612 


3 


289 


VFEFLSRKLSYILRMFWTFKEWFWLERF 
WLPPTI KWSDLEDHDGLVFVKPSHLYVT 
IPYAFLLLI I RR VFEK F VAS ? LAKS FG I 
KETVRKVTPNT 


12493 


26394 


A 


12613 


294 


432 


FMKLFFFFFFFERRSCFVAQSGMQGGYN 
ASLQPLPFGSSNFSSLPL 


12494 


26395 


A 


12614 


138 


375 


S PNATCGHADPLPLCCLHI CQAIiKRFTR 
QTFFFEGKP YLG I LLYAVFFFLRLES CS 
VAQAGVQQPNLS S LETP PPS A 


12495 


26396 


A 


12615 


232 


1 


GVPQRAKLRAGL C P PQAMDT FS TKS LAL 
QAQKKLLS KMAS KAWAVLVDDTSS EVL 
DELYRATREFTRSRKEAQKML 


12496 


26397 


A 


12616 


236 


419 


LEMRNGWVWFFPPVNPTLWETEGVRDQP 
GQHEETLSLKI I FKISRHGGMRLGSQLL 
RRLRQ 


12497 


26398 


A 


12617 


197 


1 


LCHESEGFLSRVSALLFTLLLLVXiMES P 
PSVSQPGVQWPHFSSLQPLLPRFKQFCN 
TLPSGWDYK 


12498 


26399 


A 


12618 


132 


3 


NS FMFHLFTYFEMESCS VAQAG VQWRDP 
GPLQAPPERVGGRV 


12499 


26400 


A 


12619 


255 


2 


MNSLNSFTDIAVIPLHYKRILPHFKI IL 
SYFLKSKFQVKGIGRVKWLTPVIPTLVJE 
AEVGRSLEPRNCPAHQPGQYGK I LLYKK 


12500 


26401 


A 


12620 


141 


2 


FHFTYI IISCVCVCVTESRFVARLECSG 
AISAHCKNHRPGSSNSPA . 


12501 


26402 


A 


12621 


274 


421 


TYVMEVCCTYYFLAQVLSIAPFNYFI IR 
PDPRTDPRPDPRPDPRPEPRP 


12502 


26403 


A 


12622 


132 


2 


NQSI KKKIELQGFQRSGGRGKGQVQWLT 
HI I PARWEAKVGRS P 


12503 


26404 


A 


12623 


390 


86 


IFPFFYQNTNGMAPGNQIPQGFSSRFFF 
r LRQ VLALW PS LE Y TGE I TDQCRLKCS S 



1295 



WO 01/64835 



PCT/US01/04927 



SEQID 
NO: of 
nucleotide 
sequence 

• 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, l=lsoleucine, 
K=Lysine, L=Leucine, M=Metbionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
\V=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
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SLSLCSSWDNRCAHPPLVNILFYVETGS 
HYWHIGLKLLHSRYPG 


12504 


26405 


A 


12624 


297 


396 


I IRVRGRAQWLTPVI PALWEAKAGGS PG 
VRKSR 


12505 


26406 


A 


12625 


303 


381 


KFHLVPS INTMSGSQELQWMVQPHFL 


12506 


26407 


A 


12626 


318 


152 


GDKRRLVSKKKKNLWNFGR IQW VP P VI P 
ILLEGEAGG SPEVRS FRPTGQQSMTP F 


12507 


26408 


A 


12627 


48 


388 


QLGNGKVRXjL YQRHS S S FFFFWGEG F PP 
GPQGGGGGPNLGPRESPPPGVKGTPPPS 
LLEGGEPPGPPPPPGNFWGFWKKGGLPL 
YPGGPFTSGPKGEAPPTPPKGGGGIKGL 
P 


12508 


26409 


A 


12628 


116 


3 


KVATNQAQWLMPV I PALWEAKAGGS SEV 
RSSRTAOAW 


12509 


26410 


A 


12629 


227 


400 


VLF VLFFCKEWHVF I FL I LPVYS YKKKK 
KKKKKKKKKKKKKKNKKDSQKKD 


12510 


26411 


A 


12630 


182 


3 


TNWLFPPPKFGPVPLFFLPPFFFFFFET 
GSHT VAQAG VCSG AI LAHCNLRLPGS SN 
TPA 


12511 


26412 


A 


12631 


134 


473 


ASTIMDLLFGRRKTPKELLRQNQRALNR 
AMRELDR^RLKLETQEKKI IAD I KKMAK 
QGQMDAVRI MAKDLVRTRRYVRKF VLMR 
ANIQAVSLKIQTLKSNNSMAQAMKGVTK 
A 


12512 


26413 


A 


12632 


100 


462 


QLLLCCCCCRQWTAFNVPASPAPPPAWA 
HMAPS LLEP PSLL VTQI CKLS AFSG PS I 
NAFLLSKKKKKKKKKKKKKDRGGAPLKK 
KK 


12513 


26414 


A 


12633 


313 


441 


CWELLKWLVCFLCH 1 1 KTNNCWAQWLTP 
VI PTLWEAEVGGSLE 


12514 


26415 


A 


12634 


129 


400 


VSQQCWPTP PAL YS I SRQALAAS PQGR P 
WDLQPTMPESPLLPPRAPAWPKPLRRAL 
PPAPRCLGPSTAQGLRSAGAWCGTGRHL 
HLRPQC 


12515 


26416 


A 


12635 


308 


399 


E IRGWTQWHVPVI PALWEAEAGGLPEVG 
SS 


12516 


26417 


A 


12636 


285 


390 


IFGVLINSFIYFETESRSWRAGVQWRD 
LGSLQPP 


12517 


26418 


A 


12637 


260 


2 


GQGPFFFGGRGSPKKKPPLKGFFLVGGS 
KTQFFFPRPKKGPFGVFFFFFSQTASRS 
VTRLECSGMILAHCNLRLPGSSDSPASA 
SR 


12518 


26419 


A 


12638 


76 


3 


ALQVQWLMPVI PALWEAKAGGSLE 


12519 


26420 


A 


12639 


219 


484 


LGLQE PDLDPKP I LELPLAELAQQLQTE 
ELSLESILCSYLKQALKVHQEVNCLMIF 
LGECEEELLALKKLKKSERGLLYGVPMS 
LKDT 


12520 


26421 


A 


12640 


33 


462 


EGLSWGYREHNGPIHWKEFFPIADSDQQ 
S P I E I KTKE VKYDS S LRPLS I KYDPSS A 
KIISNSGHSFNVDFDDTENKSVLRGGPL 
TGSYRLRQVHLHWGSADDHGSQHIVNGV 
SYAAELHWHWNSDKYPSFVEAAHEPDG 
LAG 


12521 


26422 


A 


12641 

i 


2 


414 


SGPAAPATPMSIFPELYFNVDNGYLEGL 
VRGLKAGVLIQADYLNLEQGETLEDLKL 
HLQ STD YGN FLAN EAS PLT VS V I DDRL K 
EKMAVEFRHMRNHAYQPLASFLDFITYS 
YNI D15VI LLITGTLHQRS I ABLVPK 
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SEQ ID 
NO: of 
nucleotide 
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SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


12522 


26423 


A 


12642 


161 


463 


KEKARGRPKPLLLLPITSATTAMGLTIS 
SLFSRLFGKKQMR I LMAGLDAAGKTT I L 
YKLKLGEIVTTIPTIVFNVETVEYNNIC 
FTVWDDGGQERIRPLWN 


12523 


26424 


A 


12643 


124 


2 


TVFGGQ AQWLMP I I PALWEAEAGRS PEV 
TSLRPAWPRTRG 


12524 


26425 


A 


12644 


258 


454 


NRPELCDYQGLREDSKRKGEINQTACQF 
LWPRQKMSSQTKFKXDKEI IAE YEAQI K 
EIRTQLVEQ 


12525 


26426 


A 


12645 


636 


182 


APLERPS MI P PRCSGGQPSDLRS S PAQG 
TPRPALGDRKGTPRIAEPGGAGRAEAPP 
PRGSRQRFGPQRRQTNAHTKRVRTRLSA 
APARHS PAFI ACRQVGRPRIiSARRSGKK 
PHLTVWCPFSFQDPIQDTTLQI ILLSLL 
GGDTFSDFPCS 


12526 


26427 


A 


12646 


199 


414 


IYHTSRSLLDRKLRAQRYDELPHYGGMD 
GVGVPASMYGDPHAPGP I P PVHHLNKGP 
PFHAKQNYGAHAPHPN 


12527 


26428 


A 


12647 


130 


3 


DRGFLCVHTPLKKKKNTKTHTHTHTKTH 
TNTHTHS HTHTHTK 


12528 


26429 


A 


12648 


192 


443 


LLLCWAQESLGTLGENTASSHTAGLEWE 
SPLFACWFFWGFCLFVWLFVWFEPKSGS 
VLTLECSSVITAHCSLDLPGSRDPPASA 


12529 


26430 


A 


12649 


393 


179 


LHSRVQRSGCFCSCVSNPVTGGMHFTRS 
S PQSNQEADGQARWLTPVI LALWEAKAG 
GLPELKSSRPAWATP | 


12530 


26431 


A 


12650 


263 


2 


KAPLLVYKWEPNPGGGNFS PGMGTPNTF 
GVRFGGETKPLFFFFFETGSHSVIQAGV 
QLCNLDS LQP PPPS VKPSQ YS WDHRKQS 
NTK 


12531 


26432 


A 


12651 


355 


111 


QDPCFIRLHLPLSFLSVFLECTKISLLD 
WRMIiFPLLGDPPHLAIiSSLVSPINTWNY 
SFQQLMMFRDVAVDFSQEEWECLDL 


12532 


26433 


A 


12652 


271 


3 


RQLKNILIFFSHKNMLSIKHIIYNYTLF 
LKSAF I FNLETDHLNCFPGS I SAKSYF I 
LFYFFETEPHSVAQAGVQWRDLSTLQPP 
PPRFT 


12533 


26434 


A 


12653 


333 


2 


PPSPGVFGNPPSPGDPLGKTFFFPRGPP 
PRFWGGGFFFILAPFPFGGLFPPFPLPP 
SQGPPFFPFFFFFFFFEMESRSVSQAGV 
QWRDLSSLQPPRARFSDSPLYFLEGR 


12534 


26435 


A 


12654 


310 


444 • 


TYLSN I FY FI LFLVETES P S VAQDGVR W 
CGLGSLGPPPPGFGRFS 


12535 


26436 


A 


12655 


336 


411 


DQPGQHGETPSLLKIQKLVECGGAH 


12536 


26437 


A 


12656 


114 


2 


GIFFFFFGLESRSVAQAGVQLCDLNSLQ 
SPPPGFKQF 


12537 


26438 


A 


12657 


191 


1 


KNF I LTFS FRGAKNGDVFF P P PG YIQTG 
ENFFFFFFLKWSFTLVAQAGVQWCNLSS 
LQPPPPR 


12538 


26439 


A 


12658 


1 


445 


LRTGSEFSGRDSKGLAAAEPTANFGLLL 
AS I EDQGAGGGGYCGSRDQ VRRCLRANL 
LVLLTEAAG VAGVALGIXJVS EAGGALAL 
GPERLSAFFFPGELLLRLLRMI ILPLVG 
CSLIGGANSLDPGALGRLGAWALLFFLG 
TTLLASAL 


12539 


26440 


A 


12659 


2 


224 


ESTASRIAFEAWQPETLPKGLNYSGASP 
WLNAVLPLKKKKKKKKKKKKKKKKKKK 
KKKGGGVFKKKKIKGGGG 



1297 



WO 01/64835 



PCT/US01/04927 



SEQ ID 


SEQ ID 
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SEQ ID 


Predicted 


Predict- 


Amino acid sequence (A=Alanine 


NO: of 


NO: of 


etb 


NO: in 


beginning 


ed end 


C=Cysteine, D=Aspartic Acid, 


nucleotide 


peptide 


od 


USSN 


nucleotide 


nucle- 


E=Glutamic Acid, F=Phenylalanine, 


sequence 


sequence 




09/515,1 
26 


location 
correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q^GIutamine, R = Arginine, S = Serine, 
T=Threonine, V^Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 


12540 


26441 


A ■ 


12660 


96 


1 


GLKDGRGQVQWLTPVISAPWEAKVGGSP 
EVMS 


12541 


26442 


A 


12661 


1 


457 


CWVHLQKGCHFTLGRIiHSGQAEALLTSQ 
TGRPGRGAPHIPDGAARQRRSSHPRRGG 
QAEALLTSQTGQPGRGAPHLPDGAAGQR 
RSSPPRRRAAGQRGSSPPRRWAAGQRLS 
SLPRQGGRAEALLTFQTGRPGRGAPHI P 
DEAAGQRGGAG 


12542 


26443 


A 


12662 


175 


351 


KGIFFFPPSWKAGGAIWFKGTFAPRGKG 
NPPPNPSGEGGPKKKKPSPQIPRGPPLG 
PPV 


12543 


26444 


A 


12663 


247 


401 


S SNQE PGCKGPCVLTNFYF I LGRAWWLM 
PVI PAI WEAGTGGS PEVRSWRPT 


12544 


26445 


A 


12664 


152 


251 


RENSCCF I KKKKKKKKKKKKKKKKKKKK 
KRGAP 


12545 


26446 


A 


12665 


2 


365 


APTRSRCLRHASCTACRTVSTDTS SLRR 
ADPKGRSALLADIQQGTRLRKVTQINDR 
SAPQI ESSKGTNKEGGGSANTRGASTPP 
TLGDLFAGGFPVLRPASQRDVAGKEEFI 
LAPLWTVSP 


12546 


26447 


A 


12666 


179 


1 


LELSKSGSHWMRHFLSRSFYSLI S PQLN 
TTVWPTIITPILLTLFLITQLKILNTNY 
HLP 


12547 


26448 


A 


12667 


279 


35 


NQ AP F KARF FTT F KNV F LKE I KKKRC W S 
GAPPIIPPPWGGGGGGPPWGGNLSPPGP 
PVSTP FFNGAQKKKKKKDRYGGACL 


12548 


26449 


A 


12668 


299 


399 


I S I KKQRKG P VQ WLMPVI PALWEAKAVG 
SPEVR 


12549 


26450 


A 


12669 


1 


385 


KMQNRGFFFLTFLALLGLTSGAAKKKNK 
GKKAGPGSKSPNWPWGPCPPSSKDCGGV 
FRKGTWGAQTHGIRGRGPCNWKKEFEAN 
CKSKFKNWGAGDGGPGTKVRQGTLKKAP 
S I AKGQET I RVTKP CT 


12550 


26451 


A 


12670 


357 


1 


AGKIAKI CPVS SMQAPTCGFPVGGNDNQ 
GQAPDGQFQPPLQQNQTSSPDFSNENSP 
ATPPNEQGQGDAPPQLEDEEPAFPHTDL 
AKLDDM I NRPRWWP VL P KGELEVLLKK 
GIDFSKK 


12551 


26452 


A 


12671 


107 


373 


TGNYTPLEDCAQEQMRLIAQVHLLETRV 
KQQQVKI KQLLQENEVQFLDKGDENTW 
DLGSKRQYADCSETFNDGYKLRGFYKIK 
PLLNP 


12552 


26453 


A 


12672 


196 


45 


YLYFHGSREKRLAKKYYDKLFKECCIAD 
LSKYKENKVCFPKIYGKRFLFK 


12553 


26454 


A 


12673 


33 


448 


KEGAERAGAAAPVWS FLLDRRDSARTRS 
GTSLGSADMGDMKTPDFDDLLAAFDIPD 
IDANEAIHSGPEENDVPGGPGKPQPCVG 
SESEHTASASAGDGPGVPAHASDHGLPP 
PDI SVLSVI VKNTGLSRAGWRPWVDV 


12554 


26455 


A 


12674 


250 


29 


KKQTNMWQSPYEDYRIFYTISSSLYHLS 
I YPPIYLSI YLS I YLSLYLPTHLS I ITD 
YVS ILEHELRPLWR I QP 


12555 


26456 


A 


12675 


377 


442 


SDRQWWCMPIVPATWEAEAGES 


12556 


26457 


A 


12676 


1 


553 


RI FPGRRFRVKLPS C PDPAMGTRLLFWV 
AFCLLGADHTGAGVSQS PSNKVTE KGKD 
VELRCDPI SGHTALYW YRQSLGQGLEFL 
I YFQGNS APDKSGLPSDRFSAERTGGSV 
STLTIQRTQQEOSAVYLCASSLATAWKS 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F-Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leu cine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, PHProline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, ^possible 
nucleotide insertion 














RLLPAHKPHPSLSLQLLDTLNRGFSLLL 
PPHGKQVDLDLSLSFG 


12557 


26458 


A 


12677 


232 


3 


G WS SDCQQRARVI EEKHSRQKGH I AENP 
SVHRHHQRPKVDKTTKMGKKQNRKTGNS 
KTQSASPPPKERSSSPATCI 


12558 


26459 


A 


12678 


134 


3 


IFIVEFGGKPFSCTSISLSQWLWCLFIG 
IGELLWGQVSTGTLY 


12559 


26460 


A 


12679 


212 


47 


NGTH PHG YTLHTHKHTHS CAHTHTHTHS 
LLLVRKSFLPQKNTTISFRSCCKLGKP 


12560 


26461 


A 


12680 


224 


2 


MKMAS FLAFLLLNFRVCLLLLQ LLM PH S 
AQFSVLGPSGPILAMVGEDADLPCHLFP 
PMSAETMELKWVSSSQCI 


12561 


26462 


A 


12681 


35 


397 


KPLISQNKIHSVSSSQTGFFFFFFFFGE 
KKPPFAPRGGGKGPPPPLPPGGGNRAPP 
RGEKKGKGKPPPKNPGAFPPQKEKKTPG 
GGGGKKKTPAPPTPKEVDQPLPRREQRG 
PGRLPPRGA 


12562 


26463 


A 


12682 


188 


514 


HTTHLVYVLSMAAFFFFFFEKNFLFAPR 
VEKRGKDLGSLKLPPPGFRHFSGLTLQG 
SGNNGAPPPSPVI FLVFFEKRGFPLVGR 
EGLILPPLQPAPFCISFRGAINGPS 


12563 


26464 


A 


12683 


366 


3 


ISQTQTQWTDWVSLELQGCELAHRDTVA 
I PQTRS FPFPLM I S CMTLFS LLQMDRAF 
PPFPPQTPITIQGQISTSPATWPLTHLH 
STPGLSVEYSSNTHKSSLSPTSQWIRVD 
PMLASPTV 


12564 


26465 


A 


12684 


99 


1 


SSCIRFFLEQAYGQQSYGTYGQPTDVSY 
TQACI 


12565 


26466 


A 


12685 


247 


31 


FLKEKRNQFYLFI FQRWLDPNKP IRKQL 
KRGS PYSLNFRVKFFVSDPTKLQEEYTR 
WVYGYIFLENIVKTLI 


12566 


26467 


A 


12686 


416 


253 


PSPMTIPVTGAPRDADLWSSHAKMLAQP 
LKDSDVEVRFLGSSQIFLCWRPSFNL 


12567 


26468 


A 


12687 


116 


3 


CLLGRMWWLMPVI PALWEAKAGRSLEVR 
SLRPARPMY 


12568 


26469 


A 


12688 


380 


1 


SPESSKLGLWSKHVSPLLCIGVCPPPVS 
MAELRQVPGGRETPQGELRPEWEDEVP 
RSPVAEEPGGGGSSSSEAKLSPREEEDL 
DPRIQEEIiEHLNQASEEINQVELQLDEA 
RTT YRR I LQES AS V 


12569 


26470 


A 


12689 


157 


488 


REFVSQGGSHLKAQVRLEALLLTIGTPP 
WAHLS ILHMTAMGQRENKREAASSRIQV 
LLNHSQAI KQES 1 1 LFFEMESGVWPRLQ 
YSRMI S ASCTLFLSGSNNS PVSAPR I 


12570 


26471 


A 


12690 


514 


3 


PS I RAGLLCGSAENATP FLCG I TMAAGP 
LYTYPENWRAFKALIAAQYSGAQVRVLS 
APPHFHFGQTNRTPEFLRKFPAGKVPAF 
EGDDGFCVFES I AIAYYVSNEGLRGSTP 
EAAAQWQWVSYADSDIVPPASTWVFST 
LGIMRHNKQATENAKEECMRPLRVSSLV 
GP 


12571 


26472 


A 


12691 


304 


4 


AESLRVAHERLDTRSTSSD I FWFPQTQS 
NLEMNSEILESWANYQSSTSYSINTELS 
LFSKVNGKFSTEFQRMKTLQVKDQAITT 
RVQVRNLVYTVKINPL 


12572 


26473 


A 


12692 


221 


2 


VGPATRDLCFADE P WGRQKQKQ W WWVL 
CLSPNRGIiAGRPDTIiHITCASAHMRTHT 
CMHTHRH AHAK VHTH C I 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophnn, Y=Tyrosine, 
X=Unknown, *=Stop codon, /-possible 
nucleotide deletion, V=possible 
nucleotide insertion 


12573 


26474 


A 


12693 


3 


205 


YMLCSILSTQEAFSICLGEKFIYLFTYL 
FRDGVSLLLPRLSCNGVILAHCNLRLGD 
RARLHLSKKQK 


12574 


26475 


A 


12694 


233 


3 


I MG PAHLF KGNH WNGGWGLM YKAF F FL F 
FFKTGSHSVAQAGTIPVHCNLRLPGSSD 
S PAPASRVAGNTNVPHHTLY 


12575 


26476 


A 


12695 


408 


3 


PAREMEKFRVCVRKRPLGMREVRRGE IN 
I ITVEDKETLLVHEKJCEAVDIjTQYILQH 
VFYFDEVFGEACTNQDVYMKTTHPLIQH 
I FNGGNATCFAYGQTGAGKT YTMIGTHE 
NPGLYALAAKDI FRQLEVSQPV 


12576 


26477 


A 


12696 


123 


631 


REAVQGGKGSGGGFGSRDSRS SGAVSAA 
VGDMGDPGSEIIESVPPAGPEASESTTD 
ENEDDIQFVSEGPSRPVLEYIDLVCGDD 
ENPSAYYSDILFPKMPKRQGDFLHFLNV 
KKVKTDTENNEVSKNHCRLSKAKEPHFE 
YIEQPIIEEKPSLSSKKEIDNLGASDCW 
D 


12577 


26478 


A 


12697 


429 


629 


LALL YPLKWVGKP VNFMEED I LGPLPP P 
LNEEEEEAEEEEEEEEEEENPVHKI PDS 
HEITLKHGTKT 


12578 


26479 


A 


12698 


158 


404 


LPLLSVERLVEIYAYTFIQDLFSASNIP 
YSSSVFRIEMDQPRTHSGPTTASNPAPS 
STNSSSAPSATNSKQERSSSSLSKPS 


12579 


26480 


A 


12699 


185 


3 


HRRPISIPSSVIHPAMVRLGLQYSQGLV 
SGSNARCIALLRALQQVCPILLSLMIQP 
HLVY 


12580 


26481 


A 


12701 


181 


3 


TOAS M FLSAALRARAAGLAAHWGTHVR 
HLHKPDMQNGAGGALFVHRDPRENNRDT 
PCI 


12581 


26482 


A 


12702 


373 


1 


ILVQERDSQRVIRMIDKLVSSRDRVGRG 
VEKQMTDHRVIYSVRKLQKRITPEPFIP 
SRTTPKLLFLLVLPSGKTPGHPVSSRTI 
PEPPLPTEPLERIPEHPVPSGTIPKPPE 
PPLPIEPHETMY 


12582 


26483 


A 


12703 


272 


508 


TKLGKKQNRKTGNSKQQSASPPPKERSS 
S PATEQS WMENDFDELREEG FRRSNYS E 
LREDIQTKGKEVENCEDNLEE 


12583 


26484 


A 


12704 


341 


1 


VFQHPHRPLAPPSLGPQSWPLMSGSRPR 
SSLSLASSASTISSLSSLSPKKPTRAVW 
KIRAFGKRGNALRRDPNLPVHI RGWLH K 
QDS SGLRLWKRRWFVLSGHCLFYYKDSH 
V 


12584 


26485 


A 


12705 


187 


365 


WEPSCRGPSAFKPTRCQSYDWACMCGAE 
GRSAMEQPQEESPEVREEEEIEEMAHAE 
GTP 


12585 


26486 


A 


12707 


238 


596 


LiWLVRVKYSIMSGAALGLEIVFVFFLAL 
FLLHRYGDFKKQHRLVIIGTLLAWYLCF 
L I V F I LPLDVS TT I YNRCKHAAANS S P P 
ENSNITGLYATANPVPSQHPCFKPWSYI 
PDGIMPI 


12586 


26487 


A 


12709 


131 


2 


DRVSLSPRLECSGTILAQPSRLKQSSHL 
SLTGSWDYTHAAPCI 


12587 


26488 


A 


12710 


151 


2 


YRQGLILLPRLECRVMIMALCNFESPGL 
SDPLTSASRVASTTATCHHTC 


12588 


26489 


A 


12711 


151 


1 


NFPEFDHFTVGCLRAGSSLIHYWWECKL 
VQSLWKAVWRFI KDLKIDLPLY 


12589 


26490 


A 


12712 


28 


411 


RWPARPAGEPREPHVSWVMKUJPQQAP 
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ding to 
last amino 
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sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, I>=Aspartic Acid, 
E=G)utamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S^Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V= possible 
nucleotide insertion 














L YGD C WT VLLAE ED KAE DD VWYLVFL 
GSTLtRHCrSTRKVSSEXTLETIAPGHDCC 
ETMKVQLCAFKEGLPVFDVTEKYFLLPR 
MYRYYRCLYTYPSAAV 


12590 


26491 


A 


12713 


351 


675 


ENPRHTFIYSLPLGLQKHQVLTVDIGFG 
GTAI MTVGKS S KMLQHIDYRKRWI LQDG 
RIFIGTFKAFDKHMNLILCDCDEFRKIK 
PKNAKQ PERE E KRVLGLVLLRGEN 


12591 


26492 


A 


12714 


326 


457 


ACSHHQRKSQAPPPAHLWPDCPHPGSLP 
LLNPSVCNPGREVSPT 


12592 


26493 


A 


12715 


200 


1 


TITKVCVFTRTCGLKGSLMAKALFDQRK 
DRESETECQVWWLTPVIPALSEAEVGGL 
LEPRSSRPGV 


12593 


26494 


A 


12716 


208 


405 


KKEREKKKHTKKKGRKKMVH I CQWHS S L 
HRKFQGIYRKTTGT6WAQWLTPVIPALW 
EAEVGGPPEV 


12594 


26495 


A 


12717 


157 


3 


AKETPTHKGSCKKKFFFFFFETESRSVA 
QAGVQWRGLGSLQAPPPGFTPCI 


12595 


26496 


A 


12718 


1 


472 


SPA I LPRLAILPYLLFDWSGTGRADAHS 
LWYNFTI IHLPRHGQQWCEVQSQVDQKN 
FLSYDCGSDKVLSMGHLEEQLYATDAWG 
KQLEMLREVGQRLRLELADTELEDFTPS 
GPLTLQVRMSCECEADGYI RGSWQFS FD 
GRKFLLFDSNNRKWTW 


12596 


26497 


A 


12719 


537 


1 


LHTMNGGNESSGADRAGGPVATSVPIGW 
QRCVREGAVLYI S PSGTELS SLEQTRS Y 
LLSDGTCXCGLECPLNVPKVFNFDPLAP 
VTP G GAGVGPAS E EDMTKL CNHRRKAVA 
MATLYRSMETTCSHS 3 PGEGAS PQMFHT 
VSPGPPSARPPCRVPPTTPLNGGPGSLP 
PEPPSVSQACI 


12597 


26498 


A 


12720 


369 


3 


AAK 1 1 PATRKKASLELELPS S PDSTGGT 
PKATI SDTNDALQKNSNP YI TPNNRYGH 
QNGASYAWHFEARKSQILKCMECGSSHD 
TLQELTAHMMVTGHFI KVTNSAMKKGKP 
IVETPVTPV 


12598 


26499 


A 


12721 


230 


3 


KFFWVLAGLTGKNSDASASLNQVHISPF 
LFSRHHPFSLGPLSS PVLLQGSKRRQLL 
ATLRALESASLSQHPHVPV 


32599 


26500 


A 


12722 


26 


110 


REQYAEGNMRGPAPGKKTSGLQQKNVEV 


12600 


26501 


A 


12723 


224 


1 


WQNQDQASDP PKYS FTSQCCLS FARLAR 
RYGDVFQIRLGSCPIWLNGERAILQAM 
VQQGSAFADRPAFAS FRV 


12601 


26502 


A 


12724 


120 


530 


KKVARGRSRSRERSRRRRSRPKAIT\NR 
T * GTRCTPRRWR S TV LGMRS CTRAR * QR 
RSGLSRGHTRSAGLCDHGSVRAGSGDGA 
DGTGGGDRRliGLGRDSLLLSSSQSAAFS 
SSASGSSSFFSATQPLRMLLEYFWL 


12602 


26503 


A 


12725 


434 


222 


KEEKS RGEVKEQD* EKDREEEVEKSRE / 
RRRS RRRRRRSE V * YRKGKRRREREDI L 
VAERSHRS FRNS I PSTL 


12603 


26504 


A 


12726 


268 


370 


SNILIHMESMFFFFGTDIPSV*KNLPA* 
KTPGPDGFI DKLYITFRGELTTFPHILL 
H*FTEGAVLFNSFSK\AASITLTPKPNY 
DI MRKENYS PISSYI WNQCS FFLEQIFP 
LSFKLNCPGLI I PH 


12604 


26505 


A 


12727 


103 


636 


VCFISMKQPHGSRHP?CS/PPRPGS*VC 
Q0SRPLGPQPSAFGQLA*HRRGLLGLOG 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
etb 
od 


SEQID 
NO: in 
USSN ' 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
0=Glycine, H=Histidine, I=Isoleucine 7 
K=Lysine, L=Leucine, M-Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 














PRSPMQPVPHGGRVSGQTLPKGPTPKRP 
PGPAPRHGGHSWLLK*LVCAQPPPRP*S 
*ASQAVFTLQVPGKPQVWTPCPVPVRAP 
TP\ PLSNGGLG VS ERH* DGH * QAPTSP * 
PRCQGGAGEGPQ 


12605 


26506 


A 


12728 


1193 


1638 


GLGFAMLPRLALNSWPQGDPSVPASRAA 
GTRGVHRHTQLQVS FNYYKVLAMHSGSQ 
L*SQHFRKPRRSDHLRSGVRDQSGQHGE 
TPSL/LKNTRISWAWWHTPLVPATREAE 
ARELPE \ PGRQKI ASEPRVKP I CTAS LG 
NTCETPPQKK 


12606 


26507 


A 


12729 


304 


101 


RHLHWPCPSPLAPTPDISHCPEPPKSQT 
SP/CPGT*CLGFLKCPSSCHFA*K/PLH 
PSEPARKSLAPDTC? 


12607 


26508 


A 


12730 


91 


264 


SQRSISGLRVKENLIFVMIMFPPIYSSQ 
TFWSQTFLMLKIVFMGIS IS ICY IIi/NT 
EKNLGQGWWLAPI I PALWEAEAGGLL* L 
RVKENLI FVMIMFPPI YSSQTFWSQTFL 
MLKIVFMGISIS ICYILKLRKI 


12608 


26509 


A 


12731 


1026 


406. 


LAHFRSQ I FS FSH I LVHFERMVLNYR YY 
LACVRRWFRFLHVLAYFGR 1 VLKYRYF 
LVHFRREVFRFRHMLACFRRMVLYCYFL 
VLR K I LL I Y VT FLT Y FRKV FLRHN FLRG 
GY*RKYFGYAFRWLSFFLYRCQCFLHYF 
LFYLWLRLHLQF/ CCFWSFCLFDFLFL 
FSAACVFSVLQIAIVMFPFHALQLFFF* 
WLCFSNSRFQ 


12609 


26510 


A 


12732 


1508 


157 


QDVGGSGFKVDTHPRGRMAS I FSKLLTG 
RNASLLFATNK3TSVLTTGYLLNRQKVCA 
EVREQPRLFPPSADYPDLRKHNNCMAEC 
LTPAI YAKLRNKVTPNGYTLDQC I QTGV 
DNPGHPFIKWGMVAGDEESYEVFADLF 
DPVIKLRHNGY* PQG* LKHTTDLDASKI 
T\QGQFDEHYVLSSRVRTGRSIRGLSLP 
PACTRAERREVENVAI TALEGFKGDL\A 
GRY\YK\LSEMTEQDQQRLIDDHFLF\D 
KPVSPLLTCAGMARDWP\DARG I \WHNY 
DKT\FLIWINEED\HTRVISMEKGG\NM 
KRVI LSR\ FCRG \ LKEVERL I \QERGWE 
F\MWK*AP*EYILT\CPSNLGT\GLRA\ 
GVHVRDPQSFSQDPTAFLKILEKPRTPR 
KRGHKVWDIAA\VADVYD\ISNIDRIG 
RSEGELWQIVIDGVNYLVDCEKKLERG 
QDIKWPPPIiPQFGKK 


12610 


26511 


A 


12733 


261 


487 


TGSETDCAKPQASFLPREVPTVAEMKME 
LLKNKQFW\RGWAHACNPSTLGGQGGW 
IT*GQEFETSLANMAKPHLY 


12611 


26512 


A 


12734 


382 


668 


YKRITDIFVDSETVHILINKRQSCRIPG 
FIQLVQLI SHQLAAPRDYTVSHSVAQAG 
VQW/RNLGSLEPLPPGFKRFLCLS*HAL 
KNLSSCDTPPQY 


12612 


26513 


A 


12735 


401 


27 


GDRAEESAEPRAWSHSDNSHRYTTLFIC 

lththvhnpvhsXhththththththth 
tvshrhtetpplllkqtglkf y * ns rdd 
tprsrpgssglqrlsssppvpfqpgtve 

ASADFCGHDLLTT 


12613 


26514 


A 


12736 


202 


182 


KYLPIFINLITMNILFFFETHSCSVGQA 
EVH*S*LKPMPPG\SSDSPASAFQVSGI 
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SEQ1D 


SEQ ID 


M 


SEQ ID 


Predicted 


Predict- 


Amino acid sequence (A= Alanine 


NO: of 


NO: of 


eth 


NO: in 


beginning 


ed end 


C=Cysteine, D=Aspartic Acid, 


nucleotide 


peptide 


od 


USSN 


nucleotide 


nucle- 


E=Glutamic Acid, F=Phenylalanine, 


sequence 


sequence 




09/515,1 


location 


otide 


G=Glycine, H=Histidine, I=Iso leu cine, 








26 


correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Yaline, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














TGSPSSSWLIL*FFLKIGFTKKG 


12614 


26515 


A 


12738 


319 


2 


QNTPPHI YEYI PRVDAFLLFLEMS YSVY 
WAVIjS PLY FVNKLALTLHCRLALNS fl 
HKIQEPS\LGSGSG\PLSCNTII*RLCL 
RSLHLVFRSLWILPICDSSSVF 


12615 


26516 


A 


12739 


1363 


2000 


DRVFV I PG WKCSGT I KVHCS LDLPGS S N 
PSTSAP*VAC\ATGTYPPCPGLNFIiYF* 
MEERGFTQVA\QAGSQNSWGSKPNGTSE 
AS PKVLGMNKP * NLPNPGRS I LN*DGS I 
RYKDNYPA/WEPPKRS YKCLRQKKSI *L 
SAGPRDYEASQPRKNYPISSLLTFCTSS 
LHFNPTELTCS SQLQKS IRRKLKSEESC 
PPLPACSLKHTQAIIKITF*RLHRTA/L 
FYLLK* NCFRW * S * * KYFLLS VSNLKI / 
QWVRS SLHPGAPGYTPTV* RP KHKSNVL 
I* IQS /ITRCLFYILVYMRTIFPLQLFL 
SKHPSTYKEDFA*VLPLSFFFFEMESRS 
AARRSLSSLQPLPPRLKRFSRLSLLSSW 
DYRCA?PGLASPSILSS*GSCCSIQFIL 
RMS T I SHA I NVLVL KNT YL VLVSAS EHS 
LIKKPC 


12616 


26517 


A 


12740 


1696 


743 


GGQIMRSGVHDQPDQHGETLSLiLKIQKS 
AGPGGMHL*SQLLRRLRQENRIANRGCS 

epr/srhctpawateqdsvskkikk* kk 
*nhles kqqgqpaleppeqaggqlrkte 
qlqdgrrelaadmtl* pghs krlgal pr 
pllsayyfnkvhaarrqayletpgtfts 
yqreegeqalilgveaqassptvfhrer 
rqsqalcshrksqeapvrpahpprrvpg 
lgkpssqglsahlgqdrapaprraswdq 

RSQAPISVTFPSVLDKEESVPCGPGFPG 
HAPA PRG I HGATW EGAH S RG Y PGHFLAL 
PQHNSDEQRPN 


12617 


26518 


A 


12742 


2473 


445 


RGARRSRRRSRHRRRRHQSRPVRAAPRQ 
PEQGRRRGAPTHGPQLIMMDLELPPPGL 
PSQQDMDLIDILWRQDIDLGVSREVFDF 
SQRRKEYELEKQKKLEKERQEQLQKEQE 
KAFFAQIiQLDEETGEFLPIQPAQHIQSE 
TSGSANYSQVAHI PKSDALYFDDCMQLL 
AQTF P F \ VDDNE VS S ATFQS LVP WY SPG 
HIESPVFIA\TNQA\QSPETSVAQVAPV 
DLDGMQQDIEQVWEELLS I PELQCLNIE 
\NDKLVETTMVPS PEAKLTEVDN YHF \ Y 
SSI PSMEKEVGNCS \ PHFLNAFEDSFS K 
HPLHKND PNQLTVN S LNS RM / P T VNTD F 
G*WNFILVF\MAEPSIRQQAWPSPATLS 
HSLS\ELLNGAHGCFLDLFTL/CKAFNQ 
NHPEGTA\EFHGF\DSGISLNTS P\SVA 
SP\EHS\VESSSYGDTLLGLSDSEVEEL 
DSAPG\SVKQNGS*NTMYSSSGDM\VQP 
LSPSQGAEHFTCMDAQCEEHTRGKDLPV 
\SPG\HRK3STPI SQKDKHSS PLGGLISQR 
DEL\RAKAL\HIPIPCRKKSFNLPVG\D 
FNEMMS KEQFNEAQL \ ALI R\ D I RREG V 
RNKS /AAASGI CRKKENWENI VELEQDL 
DHLKDEK\EKLLK\EKGEKDKSLH\LLK 
KQLSTLYL\EVFQHAYRDEDGKPYSPSE 
YSLQQTRDGNVFLVPKSKKPDVKKN 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylnlanine, 
G=Glycine, H=Histidine, l=Iso!euctne, 
K=Lysine, L=Leucine, MNMethionine, 
N=Asparagine, P=Proline, 
Q^Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


12618 


26519 


A 


12743 


6 


457 


RPRNR PG I RVD P R VRGRVGHAPHEGL V P 
ERMYRGSPTACETQAAA* ERAFGPSSPT 
C\RLPIPRMSTSVPQGfiTWTQRVKKDDE 
EEDPLDQLISRSGCAASHFAVQECMAQH 
QDWRQCQPQVQAFKDCMSEQQARRQEEL 
QRRQEQAGAHK 


12619 


26520 


A 


12745 


297 


12 


QERPVRMLVLHFGRLRLADHLKLGVRDQ 
PGQYGETPSLLKI PKLSGM\MWAP* IC 


12620 


26521 


A 


12746 


98 


428 


YNTSFNETVLLLTTPITIDCLYTRKDAI 
APESACGVCPCSLLGVGA* IPKWIRLD 
LS KKHVTTAYGG FM CAKC VLDR INRAFL 
IDEH\KIVPKV*KALSQSSKAFCEHET 


12621 


26522 


A 


12747 


3 


638 


LWLWSLCWVWAVSLPLQFILGSLHPCQG 
QASWREVDLLHEGSQEASSKPSSGSVPV 
. GLLLDQEMVTPLLVCICGILLHQEMVSP 
LL VYTRG I LLKQ E MVTPH PGL Y PWDS P P 
LDGHSPPGLF\RGAAPPS*GG/PLPVLV 
YTRGAAP PSGDS \ PPPPGLYPWDT\ PPS 
GDRHSPPGLYPWDTPPSGDGHSPPALYP 
WDSPPSGDSHSPPGLFPRG 


12622 


26523 


A 


12748 


2 


333 


DMVLLCHPGWSAGS I *KKTKERERMMNL 
HR* * IDRSRERERERQIQ*KE * KRINEF 
EGSKERFTQSVKQRNK/ELEKIRTLDTS 
ETIYLS I YRN * SLRRGQERRTER2RKRH 


12623 


26524 


A 


12749 


30 


333 


KTSYLLPVQWWAQNDNERYSSSKNTIMA 
LPLPLPVFPRS PS DAERKLDCSAAI SAH 
CNLPA\DS PAS ACRVPAIAGARRHA* LV 
FGFFWWRRRFAVMAGLVS 


12624 


26525 


A 


12750 


233 


39 


INDLL / CLF* FKKL I WGR VW * LMP 1 1 PA 
RWEAKAGGSFEPRSLKLQ*AMITPMYSS 
MGGREQDPVS 


12625 


26526 


A 


12751 


49 


273 


HLQVTEVFWFWCVFFRRWGGSHCV/AQ 
AGV*WLFTGTWPCCSPELLGSRDPPAS 
AS * VAGTTGACLAANCRGF 


12626 


26527 


A 


12752 


505 


897 


SYLRVQFYSAFSLFFRHEVLPCHTGWNA 
W* S QLTTASN / CLG PSNLLKAPHLAN* 
KKI F * RQGLSMLPRLVLNS WPHM I LLP * 
LPE*LGLQARATAPGSGFTFCLTQDSIL 
MCSPTVHKLSDS IALETKQ 


12627 


26528 


A 


12753 


335 


542 


CCNEFFLSQVWWLMPWPATR/SAEAGG 
LPGPGSQRLR*ARSKPVNSKCSAGGRCG 
IDPISIKQQKNNNR 


12628 


26529 


A 


12754 


356 


72 


WHEY YLLMEHTKKCHLS *GL YNGLNN* Q 
/WCTHPWPATWEAKAGESLEPRSSRL* 
CTMI TPANSHCP PAWATARS CLNQS INQ 
SSNNWQGMITGGK 


12629 


26530 


A 


12755 


340 


127 


NYLFIYFRDSLTMLPRLECSN*FTGSII 
VH ITLKLLGSSLP / AS ASQVAGTTGTRK 
HVQLSNYFKIKI KPKP 


12630 


26531 


A 


12756 


290 


511 


KKNQPGT/CGSRL* SQRFGRLRQADHL\ 
DQLGQHGKAPSL / LKNTKIS WALWRTPV 
FPSSQEAEMEELIEPTSSRLQ 


12631 


26532 


A 


12757 


254 


549 


YPGAKKQRPGGDSVRGTHLQSRCWCVLL 
HNCQ* PKSNTQTFP * PKNLRE*NAT*KS 
TDL/WPGAAAHLCNPSTLEG*GEWIT*G 
QEFETSLGNIPRPQIY 


12632 


26533 


A 


12758 


617 


451 


NKKRRGLTLLSRLKCNGVI IAHCSLKYP 
GSSEPPTSAF*VAGITY\GT*HHAWAY 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Iso)eucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=G lu tarn ine, R=Arginine, S^Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


12633 


26534 


A 


12759 


91 


393 


KWYTKECItYLLVS I FSRYLLS I YYREVL 
AQTLGKLS KTVMCPALEETVAVNGLQI K 
SKK* /WLGMVTHACDPSTLGG*GGWIAW 
GREFGTSLASMAKPCLC 


12634 


26535 


A 


12760 


10 


221 


QSHNTNGCLDLHFQKGPWQSVGETLNl** 
TIFLCTCVCVCVCVRHVYHM/CYIYSDF 
I CPS I ECEYFS FAHR 


12635 


26536 


A 


12761 


309 


55 


IIRYKNYISIKLEGWKRLYHATTHEKKV 
GVAVLISEKVDFRAKN/MYQDKEGRFLL 
*R 


12636 


26537 


A 


12762 


640 


896 


TIVLKCWFVLFCFRDEVPLCCPGWSAVA 
/QSQTQLW*TTQPGLKQSSCLSLLSSWN 
*RHVPPHLAMFLKTTFTLLYTCGHLETN 
IC 


12637 


26538 


A 


12763 


1 


357 


IVPLHSSLGDRVKPHLLKKKKTCSTSLV 
IRETQIKITRYTYSNG*S*RKNKAGNNK 
CWHGYTATGTLIAD/CKSKQMLWKTF* Q 
FHIKLNIHY*YNSEIQFLGLYPRDKNIA 
RCGGLRL 


12638 


26539 


A 


12764 


129 


320 


RWKCVKLKNTGQAWWLIPVI PALWKNQ \ 
AEGSLEARSLRIiQ*TLIAPLGYSLPG*W 
SGTLSLNK 


12639 


26540 


A 


12765 


617 


435 


GVKGLILPKLQKLLKSIE/I/E*KLPKS 
FYDASLTMIPKLHKD\RPISHVKIEARI 
LHKISINII 


12640 


26541 


A 


12766 


243 


5 


QKIKKRIKKNKQNli*EIWDYVK*PNL*I 
IGIPERGEKVYNTENIFEGI IQ/ESVPN 
L/ AYLVREVDNQI QE I QRTP VRYYTK 


12641 


26542 


A 


12767 


101 


16 


RTQL F VS VFW FC FLFC FETG SCSI TQVG 
VQWCHLRSLQLPPPP\GSRDSPPPSSQN 
RG*TNI 


12642 


26543 


A 


12768 


362 


92 


RVPHMDLGEECTS P YMYI * PMTCVFKHG 
♦DGNFCVTYFATI 1 1 YTYI / YLSVCVS I 
*I*CYLCVCVYIYIYIYIYIYIYFIYKY 
MKERRL 


12643 


26544 


A 


12769 


1161 


824 


QAKDLSTHFTKENM*MG/NKQMKRLSES 
LVI S EMQ I KTTMRSHLTHTRI T I IKMTS 
DKrCP*RCG*MRA/LLQC*WKYKIVSPL 
WKIWPFLKT\INMNISCNLMILFLGFY 
PIY 


12644 


26545 


A 


12770 


241 


237 


K*RIWA\GVMAHACKPSTLAGCDGWIA* 
SQEFETNLSNTGRPCLYER 


12645 


26546 


A 


12771 


326 


6 


DTVSRKNKSGKIFQLSSRV* I YERSQSG 
VKVYKCKTFGKAFTQ/HF*AHMRMYTGE 
KPYKY*ECGKFFILVLLLLLMIQKYFHL 
I KIVRLYL I RKKVS CKQPSNK I LQS 


12646 


26547 


A 


12772 


413 


1 


KKTFYFFFFETTQAGV* *CDHNSPWP*P 
P\GSDNPLTSAS * MA\GLQS CMHHHAQL 
S * FVFLEAGSHYVAQTDGDS PTSASQSA 
GITAVSHHTRPILYILIVLSYAISFFSI 
LYSAYFLQPHLPIHNNFLFSYISSSC 


12647 


26548 


A 


12773 


2 


1055 


FFFFLRLNLTLLPRLE/CVILAHCNLHF 
PG * SNS PAS ASRVAE I TGSRHQARI IFV 
FLVEMGFHHVSQAGLE / S PDLK* SAHLG 
LPKC*DYRRDITPGHH*SF**RYFLKTY 
ISRVL/WCSLLGTFMFMIAFDHM*TW+H 
*QNHFHFLLQIWKLISERLRSAQGQQAL 
KTVTSTFS S KYVCRAFIAEPG W I AI LYV 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

etb 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-AIanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenyialanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine r 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, V=possib!e 
nucleotide insertion 














TF* SLDSRRTCLVVAVSNFFYSTLFLKL 
RQQVLLTG/PYHLFSARPSSCGSDKKAR 
ELNLSLSL\CFFFFGKRVSGTQAGVQ*H 
DHSSLQLWPSRLKQCSHLSLLSSWDYRS 
ITPHLANF/CNFLSGWSRTPGLK 


12648 


26549 


A 


12774 


315 


98 


FRCFLNS INPLE S PHVRKTCAGRAQWLA 
PVIPPFWEAEA\SQMPQVIHSR*LEARS 
SRPAWPTWQKPCLKYKN 


12649 


26550 


A 


12775 


275 


59 


NPSPYKIQKISWAWWYMPWLATR\RLQ 
*AEIAPLHSSVGDRARPCLPKKQKTNKQ 
TKTSGE PVLVQGPATVC 


12650 


26551 


A 


12776 


73 


381 


SLLNSELSSIPRPTWESPSTSDSAWPMQ 
NLASEPLPLP*TTPPPDFPALSA/PE/P 
LPFTPSPRDTLPPPLSPSYKQAPVLSVH 
PPVMPLKYVPSSPLIiLSFLYL 


12651 


26552 


A 


12777 


274 


342 


LNCVIA*WLMPIISALWEAEVGVSFA/P 
/RSLRPAWATW 


12652 


26553 


A 


12778 


2 


478 


TIYYTKYTTFRVPNLGLQLLTAFFCVCV 
CVSIiNGFKNAKDYGGSHLKYSVYITGFL 
LQFSLKFDCFLCVCVCVFP*MA*KMPKI 
M/RGSHLKYSVYMT*FLF* FSLKLKYFC 
DPILRKRI/WPGAMAYTCNPSTLGG*GG 
QIT*AQEFETSIiAKWKPCLY j 


12653 


26554 


A 


12779 


680 


453 


SETDWRKNKFFPSNFPSNLRTNSWTNSC 
SKKTFKKH/RVGLGVLAHTCNPSTLGGR 
GGWS P * GQEFENS LTOMVNHFS 


12654 


26555 


A 


12780 


59 


487 


SLHKHPERGLQS FQGCWTQEMLGGSHVQ 
QRAWELCAPPNLTCPGYLSFFLRQDQAL 
LSKLECSGTIPAHRNPSLPSPQLKVSSH 
LS F P S S WNSRCTT PHQLLLLFF * DR I RL 
FCPSWSAVAQSQPTVT/SSLPSPQLKVS 
SHLS FPS S WNSRCTTPHQLLLL FFNFL * 
RQVFTMLPRLVSNS * VQAILPPWPPKVL 
RLQA 


12655 


26556 


A 


12781 


109 


361 


LSPWHHSRNYTIilYPYTLNFWMSAFPTP 
KEEFFFSFPETRPHSVAQAGAHW\AI IA 
HCGLDLLGS\SDPPTSAS*AAGTTAI IP 
G 


12656 


26557 


A 


12782 


3 


204 


LIDGSLALSC*LFCGGAIIAHCSLELLG 
LGQ \ S P VS ATR VAETTGVCHHAQLAFVN 
LHLRTRS KHCGH 


12657 


26558 


A 


12783 


2357 


6366 


LTGS\NSHITILTLNI/NMGLNAPI*RH 
RL\AKWI KSQDPSVCCIQETHLT\CRDT 
HRLKIKGWRKI YPSPMGKQKKKAGVAI \ 
LVSDKTDFNPTKI KRDKEGH Y I MVKGS I 
QQEELTI LNI YAP\NTGAPRFI KQVLSD 
LQRDLDSHTLI \MGD\FNTPLSTLDRST 
RQKVNKDTQELNS ALHQADlA I D I YRTL 
HP\KSTEYTFF\SAPHHTYSQNWTTIVG 
SKALLSKCKRTEI IT\NYLSD\HSA\IK 
LKLRIKNLTQNHSTT\WKLNNLLLNDY\ 
WVHNEMKAEIKMFFETNE/NKKTPTYQN 
FWDAFKAVCRGKFI \ALNAHKRKQE\RS 
KIDTLTSQLKKTREAKSKQHSKASRRQE 
ITKIRA\ELKEIETQKKTLQP\LKKISE 
SRSWFF\ERINKIS\RPLARLIKKKREK 
NQIDTIKNDKGDI\TTDLTEIQTTIREY 
YKHLYA\NKLENLEGMDKFLDTYVSLPR 
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SEQ IB 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenylaIanine, 
G=Glycine, H=Histidine, l=Isoieucine, 
K=Lysine, L^Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=GIutamine, R^Arginine, S=Serine, 
T=Threonine. V=Valine, 
VV=Tryptophan. Y=Tyrosine, j 
X=Unknown, *=Stop codon,/=possib!e 
nucleotide deletions-possible 
nucleotide insertion 














LNQEEVESLNRP\ITG\SAIVAIINS\L 
PTKKSP\GPDGF\TAEF\YQRYKEELVP 
\FLLKLFQLIEKEGILPNSFYEASIILI 
PKPGRDTI KKENFRPI SLMNIDAKI LNK 
ILAKRIQQHIKKIiIHHDQVGFIPG\MQG 
WFNIRKS INVIQHINRAKDKNHMI IS ID 
AEKGFDKIQQPFMLKTINKIGIDGT/YY 
FKIIRRAIYDKPTANIILNGQKLEAFPL 
KTGTRTGMPSLTSPLLFNIVFCPIVFWA 
R\AI RQEKEI KG I QLGKEEVKLSLFADD 
MIVDLENPIVSAQNLLKLISNFSKVSGY 
KINAQKSQAFLYTNNRQT/EEAKS *LMS 
ELMSELPFTIASKRIKYLGIQLTRDVKD 
LFKENYKL\PLIKEIKED\TNKWKNIFI 
PCLWVGRIS\ IMKMAIL\PKVI YRFNAI 
PIKLPMTFFTELEK\TTLKFI WNQKRS \ 
RIAKSILSQKNRAGGITLSDFKLYYKAT 
VTKTAWHW YQNSMVLVPKQRY I DQWNRT 
EPSEIIPHIYTILI\F\DKPLLEKNKQW 
GK\DSL\ FI KW\CW\ ENWLA\ I CRKLNL 
\DPF\LTPYTKINSRWIKKKDLNVRPKT 
I KTL EENLG I T I QD I G VGKD FMS KTP KA 
XMATKAKIDKWDblKLKSFC/TICTAKE 
TTIRVNRQPTKWEKIFATYSSDKGLISR 
IYNELKQIYKKKKKTTPSKKWTK\DMNR 
HFSKE \DI YAA\ KKHMKK\CSS \ SLAI R 
EMQ\ I KTT\MRYHLTPVRMAI I KK\SGN 
NRCWRGCGEIGTLLHCWWDCKLVQPLWK 
SLWRFLRDLELEIPVDPVIPLLGIYPED 
YESCCYKDTCTRMFIAALFTIAKTWNQP 
KCPTM I DW I KKMWHI YTME YYAA I KNDE 
FMS FVGT WMKLET I ILSKLSQEQKTKPR 
IFSLIGGN 


12658 


26559 


A 


12784 


787 


926 


PQAI RRPRPPKALG* HNSVDLGWAWWFT 
PA I P TLWD YTKE P L YLAKI S * C FKKVYK 
FVLNCIQNCPGPHAARHS/SGKTS*V*S 
NYLFFSFCFETKFCSVTQARLQWHDLNS 
LQPPPPGFKRFSCLSLPGGWDYRRLHTR 
SANFCIFSRNRVSPSWPGWSPTPDLRRS 
AVLGLPKR WDYRRE PP CPAKI YT I MAPQ 
KVNSHSSQPVSFLS 


12659 


26560 


A 


12785 


248 


285 


C*VIF*CVMILYVLIYVEKITSIRLAVY 
W/CI ITRLFVK* I CSCLLGC* FLFFNV 
C* LY I FI I FVFFFFFFFFFFFFFFFELN 
ILY 


12660 


26561 


A 


12786 


243 


37 


RRSAGHGGSCL* SQYFGRPTRADHKVRS . 
N/ RGETLSLVKIQKVSQAWW * EPWPAT 
RQAEAGADAWVDR 


12661 


26562 


A 


12787 


75 


289 


DYRHEPRFFFLYTFFEMDSHSVARAGEH 
WCDLGSL*PST\*GSSDSSASTSR\QRH 
HAWLIFGFL*RRGFAC 


12662 


26563 


A 


12788 


83 


256 


RKMYWGLFVTAQN* K* PKYPSTEE/YI 
K / RL W YLHTMEY YS AI KI EKLGTRAQNR 
LILY 


12663 


26564 


A 


12789 


197 


499 


QSLKSVEETVFRNWKQ/SIPTFQVILMP 
WWLIPII*ATWEG*GRRSPLNPGV*DQP 
GQHSKTPSLQKKTKFS * AWWRVPWPA\ 
IGEAEVSG I PLSPERSRL 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted' 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I~Isoleucine, 
K=Lysine, L=Leucine, M-Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \— possible 
nucleotide insertion 


12664 


26565 


A 


12790 


394 


91 


FYNSTKFKWKNIQFLITRCRGDQIQVIR 
AHAVQLGFHVL Y F / CL.SQNLS I YL»Y I S I 
YVYTYRYIHTHMCIHI I* YLYI YTYIVI 
KH*KBHWWIKHCFPTERK 


12665 


26566 


A 


12791 


735 


75 


FFFRQEGFLSPPKRGGVPPPHTKKYFSP 
RGSFLWGWGTSRPPPRKCFSPPPPPVFL 
PPPKKKKIISFSPTKLAPPPEYFKSPPP 
PS\PPPPSSPSPSTFFFFKNFPFSPSSF 
SSPPFFFPSPAWEDPPVQSRRI Y* FLPP 
PF*IPEIPR\RVLKKKPGEGRSWGLFPP 
PKGPKRRVPLGPEFKTRLGHKTKPRFFK 
KKKKKKRFLISCLWWHMPAI PSTLGG 


12666 


26567 


A 


12792 


482 


342 


MGVFVFFFETGSHSSVTWAGVQ* CNLSL 
LQP/PASSDPPTSDSRVAGST 


12667 


26568 


A 


12793 


405 


169 


FYKKKKTGSGLALS PRLECSGTI IAHCN 
LELLGSRSLRL\SNTPTLAS*AARTVSV 
CHQTHLI F * FFVETGS CYVASA 


12668 


26569 


A 


12794 


270 


94 


YVCYSYKQIKICSHVKRRE*NTFKIYLL 
KYIALGLTGHFFPL* TFSKTVTLWPGAV 
AHACNLS PLEGQGRWI I /R /GQE I KTI L 
ANM* IPVFLHVNI FLSVCSYS IHTTLDY 
LFYYSCYILISL 


12669 


26570 


A 


12795 


284 


415 


NFIKI IKKKVMGLFYGKTFLLNLKKKKR 
KKK*WPGTVPHA*NPST\LGGLGGWITR 
GQEFETSTLATN*NPTKSISRGF 


12670 


26571 


A 


12796 


1387 


32 


AP SS FAIRS FFSGPMNAFFSSMVWKRPW 
PNLE WSMNLRS I FS RARR FVCTS KDLR 
RVSTRFLVHLTAFLGYKAGMTHIVREVD 
RPGSKVNKKSWEAVTIVETPPMWVGI 
VGYVETPRGLRTFKTVFAEKISDECKRR 
F YKNWHKS KKKAFTK YCKKWQDEDGKKQ 
LEKDFSSMKKYCQVIRVIAHTQMRIiLPL 
RQKKAHLMGDQVERGAPVPEKAD\WAPR 
E ALSS KVLVTQ VFWAG I K* SNF I GGDPR 
AKGYKGGQPVCWAHPRKLPPQRPHPRAL 
RKGGPVLGAW\HP\ARVA\FSVARRWGR 
KGLPFKRTEINKKIYKIGQGYLIKDGKL 
IKNNASTDYDLSDKS INPLGGFVHYGEV 
TNDFVMLKGCWGTKKRVXiTLRKSLLVQ 
TKRRALEKIDLKFIDTTSKFGHGRFQTM 
EEKKAFMGPIiKKDRIAKEEGA 


12671 


26572 


A 


12797 


86 


364 


EQDNRIFFSFLSLFFFFGETEFGPFAQG 
GGQGAILGPLKPPPRGL\SSFPT*GSQE 
VGTTGAPHDI FCFFNKKGETPRLYKXNK 
NNRGGGATP 


12672 


26573 


A 


12798 


101 


440 


HCSRYATGIFEWYSGILVLILLRFGIYE 
PMNLNIGDPYDSPPSRLYKMRLWERMAK 
S L I EVS LKNSHFWLGMLAHACNPRTLRG 
IGGRIA*TWGF\KTSLGNIARPHLYLKK 
I 


12673 


26574 


A 


12799 


364 


195 


NPPGDLGEETLLWGGE I MGTTPP * FPPP 
KEGFLPQDPPGGFNSPPI / QGRSFSFPP 
PGKFGPPQGFFKRPPPFFFFLNNNQ 


12674 - 


26575 


A 


12800 


86 


489 


PTAMAEEG IAAGGVMD I NAALQELLKTA 
LIHDGLAHGIRKATKALDKRQAHLCVIA 
SNGDETVYVKLVEALCAEHQINLIKVDD 
/NKKLGEWVGLCQIDREGKPRKWGCSC 
IWKDYGKESQAKHVIRENF*CKK 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=A!anine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L^Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q^GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptopban, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 


12675 


26576 


A 


12801 


64 


211 


GNHKKSNYSLAPWW YTTWPAT* EAEAG 
GSLE\L*CAMITPVNSHSSLA 


12676 


26577 


A 


12802 


300 


3 


NNVTTGSN I QFHS F PLIRKD I CVS Y I KN 
DAWNYIKNSNPNGETQVYQ*YSFSN*NL 
KIWLGKVA\NPSTLGGHGGRIA*A*KFK 
TSMGNIVTPCLINKQK 


12677 


26578 


A 


12803 


208 


394 


SQHFERPRREDCLS \QEGQYSE I PSVQK 
NVK I S WAW * CM PA I PATRE AE AGGPLE P 
RRSRLE 


12678 


26579 


A 


12804 


183 


348 


AF I LG KGK YE P YN P CLGWA * W CM PWLA 
SWEAEAGG\SLEPRSSEL*CAMLIGCLH 


12679 


26580 


A 


12805 


397 


238 


ETEFPFCCPGWKWGDLGSQQPLPPGFKR 
FSCFSLPSK\KGQ*SLAPVGKGCK 


12680 


26581 


A 


12806 


1 


293 


PTRPRTS YEKQGYLLLPPVFS IVLEVLA 
RAISQETEIKSVQMGKEEVKLSL\FVCV 
YIYA*ENPVESTKTVDANLFDTRDWFHA 
IFPQTVGEVGMVLG 


12681 


26582 


A 


12807 


297 


281 


AKNPRRQPRE I PGGVFPTPG FPPYFKAK 
MPEGPFPGGFFFKGGVLWGTPPPFFFFF 
FFSETESRFVTR\LQCSGAI SAHCKLRL 
PGSCHSPETC**R*G 


12682 


26583 


A 


12808 


255 


59 


AGCSGSQLFGRPRGVDHLSPGV*DQSVQ 
HGEILSTKNTKI S WVWWSVP WG\EVGR 
SLEPGR*RLE 


32683 


26584 


A 


12809 


247 


310 


PMS ILGTPTTPAPTPGKPLLIRTSILY 
* KPILSQAWWCAPLVPATQ/ eaeargsl 
EPRDSRRS*AMIKPVNSHCTPSWALPRP 
LPQPLESHC 


12684 


26585 


A 


12810 


101 


477 


I GKE E I KLLLF PDNLM E YT VN F * M YKP L 
ELI S * FS KVTE YQVNTEK/ SNC I YT*QL 
QIENEIAKTI * FMIAS KS I KYLKISLTK 
CN\KWRDILCICIGRLSIIKVLVLPKLM 
CGGEKIFNPNPTGFW 


12685 


26586 


A 


12811 


1022 


696 


CF FVLFF KEMG SH Y VAQAGVRWCDHG S L 
YPPTP\GSSDPPTSAS*VAGTIGAHHHT 
WLIFKFFVETRSCFLIQAGLKLLASSYP 
SQPRTSPKCLGFTDVKSLHLAWLCP 


12686 


26587 


A 


12812 


166 


143 


RKEERFQINNLGFHVEKLEKEEQI \NPK 
TNRKKKIIKTRV* INEIEYRKTIGK\IN 
KNKSWFF*KINKIDKPIiAMLDSGRRDSN 
Y*NITKS IEEGRRRKEEAGKRKEEGRKR 
RRRWRQRWR 


12687 


26588 


A 


12813 


163 


423 


KAFWE F INHKKTGP F * RGP I GKVS P PGG 
NIoASSI*KG/DPFPSPLKGPQGSSGGPP 
PKKPPTGFNLAFGGLRGRETKRGPFFLA 
GLLP 


12688 


26589 


A- 


12814 


2521 


2842 


INTMYFPP * EMLVGWAWWF / RASNPQHF 
GRLRQADRLRSGVRDQSGQHGETPRLLK 
NTKI S WAW WRAP VI PAT WEAE AGES LE P 
GREGCSEPRSCHRTPVWVTRMRLY 


12689 


26590 


A 


12815 


173 


395 


LDQPGQQRETLSLIiKKKQTNKQTKKKNG 
FKKFL* LDTWVHTY\NPTPLGGRGGGFF 
* GQDFETTLANMGKRGFY 


12690 


26591 


A 


12816 


43 


346 


CVYVS / GVCVCVCVC VCLAS VCMYVCLH 
L*VLFGIFYV*DHNICKLRSFCFLFKNT 
FSHVYF*KIIWPGVEL*VTVFLFLSSLL 
HTSCVYEKSYAILI FSAL 


12691 


26592 


A 


12817 


647 


931 


SQHFGRPRRADHLRSGVQDOPGHHGETP 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
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od 
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NO: in 
USSN 
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26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of . 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


•Amino acid sequence (A= Alanine 
C=Cysteine, D=Aspartic Acid, 
E?=Glutaniic Acid, F=PhenyIaianine, 
G=Glycine, H=Histidine, l^lso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P-Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *— Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














PLLKI \QKLAGRGGA* L* SQLLGRLKQE 
NRLNPGIRSCSEPGSHHCTPAWATE *DS 
VLKKKIGADKY 


12692 


26593 


A 


12819 


392 


2 


PLFFFFFFFLETESHSVTQAGVHNPPGP 
SKPPTSAP*AAGTTGMHYSAHAQ/LIFF 
L*TELPSCRPGCSAMAVHRHDPFTDQHG 
S AS FL I WASS PLLRQTGDS SLIiGGHHVD 
AKFSVDTQLALCTKAQKS 


12693 


26594 


A 


12820 


1183 


870 


DRVS LLT P RLG VA VARNLAQ W K P L P PGF 
KRFSCLSLPSS WDYKHTPPTPG * FH /RG 
* FFVFLVETGFRHASQ\AGLNSLPQSDP 
PTLA/FPKCW\DYR\HDHLAWPRKM 


12694 


26595 


A 


12821 


172 


298 


IYLSIYLPI/YLCIHLSIHPSIYPYLYL 
SINPSYWVSFSREL*LT*L*YLSIYLPI 
IYVSIYLSIHPSIHISIYPLIHPIGFLF 
LENFD 


12695 


26596 


A 


12822 


51 


254 


YDFLKKHLLSSYRAKDLESLFLLHLKI F 
SGN* SVS YICVCVCVCVCI CVCVYI CI L 
I KSWCS * S / CGVL * AICAGVGWC* L I FG 
VL 


12696 


26597 


A 


12823 


199 


668 


QPLPAPSPSAQKGRAHPTPSPARPQAE/ 
SLPSQSLGTRTRPPLGRPP * SP PWGQVD 
LWGWLGPS PTLPAPESLS PASLSTEWL 
CGTRYLVWGRSGAGQA 


12697 


26598 


A 


12824 


33 


442 


GRGKLLKKSQKQRKNALLKYDS KTRVGT 
LFHWTSVPVTRPLLL* INIRLI KLDVI K 
CS IHKI FHSE FL FFKQHSGL / C I * LS K* 
LCS * KIHQSGLGTVAHVCNPSALiGGQDR 
RIT* GQEFETN * VNMVGPCLYPPLPP 


12698 


26599 


A 


12826 


363 


75 


HWEPRPAGPTLPSAPGSPSGRPCPPHTT 
SG/PPPPNTHTPMSRSPWSWEGSDQRPQ 
PPHTDTGSPPN*GSRLRHERPPWGRERQ 
RPPPTTLVRPLHSH 


12699 


26600 


A 


12827 


76 


486 


CTVNLFI Y FEMES CS WQTGVQ WCDLS S 
LQPLIWFSCVPTQISS*IPMCCGRDLVE 
GP*MMGAINSFLCCSYDSK*VS/ARSDL 
FFFFFFFF 


12700 


26601 


A 


12828 


345 


672 


KMKCTADLS LLE KDT * I KLE ENR * QER I 
MLRAEVNE I ENKI ENINKTGGS S SEKYY- 
KIDNPLTRSIRRKVN\KITSIRNEKGSV 
TTNPTE I KRTVKE Y I MKNFM P I NLTT 


12701 


26602 


A 


12829 


208 


1 


PSNSTPI FKRNENI CPHKDPCTLMLMAA 
LFVMSKK*KQLKCSSNDE\KLWYIHMME 
*YIAIKRNKLLI 


12702 


26603 


A 


12830 


649 


172 


FPLKYSLSTTSVMAPSFTSCGGSQQVLL 
APLSRLLAWPNV/HPPPQPRHPAPHSFL 
SPPRSPPNYSP*SSQGPLLQPQSKHVRP 
LPQANARLPSAKNPARGPPGAAQAPRDR 
PPA/VCPPLPPHS PLRHERAPRLFLPLP 
ATEPAALHAVHPGRRQERVT 


12703 


26604 


A 


12831 


1068 


667 


KLSWFFCSRLRLHFGSIFKVPSL/PVPK 
LLLPIPSLCPSLPLQIiQPAQPVPFNTAT 
WLCFC* / P * R V* PAPGYQRVGHFNFFPL 
QUSTFIPIILQ*HLGF*KSASLSSSSSPK 
KVS LCHPGCS AWRS AHCNLKLLI S * SA 
CLGLPKC * D YRLQS PRP P/ PT I FKVTKM 
CLLK^LTGSVSF 


12704 


26605 


A 


12832 


268 


1 


KCFFLLALNWPEFILDYFILNW*NSLRT 
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SEQ ID 
NO: of 
nucleotide 
sequence 
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beginning 

nucleotide 
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correspond 

ing to first 

Amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
£=G)utamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, l=Iso!eucine, 
K=Lysine, L=Leucine, IVNMethionine, 
N=Asparagine, P=Proline, 
Q = Glutamine > R=Arginine, S^Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /—possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














VTEKFLFMTLFLTY*NMNRPGMVAHACN 
PS/TLGGSGGRIT* AQES * TSLGNI VI P 
CLYQKL 


12705 


26606 


A 


12833 


654 


321 


LVGIRFLKVKYFNIKFPHLLAIISPSLE 
VFNLQHFYIYFLLFIYF*ETRSCSVA\R 
LECSGVIIVHCSLKLiLGSSNPPALASQV 
AGT ITAHHCAQQHFCILEKSNLiCTYFPL 


12706 


26607 


A 


12834 


867 


201 


RQMSPTLRTKSRDVASRSSFSAWAWGKL 
FTFWDFSMSLVSLQGPGGPFHS PQP * TQ 
GR*VENNQEPLATPFPGPERSLPGQPRL 
EPAHSGKRRGLPLEIRPPVNVSDIHGDP 
SGAQWLVRG VEAG P WAVGGQAP * SEG P S 
PSPPLDVSPDPGSSLPRSSSP*MDLRA* 
LSIiTF*TFQKGSGN/DPASSPA\GKPG* 
EWRKIKGPPQGLLLGPEMWGSQARVLS 


12707 


26608 


A 


12835 


328 


1365 


YAFCLMMLDKKQI \*AVFLFEFKMVHKA 
AKTTQNISNTFGPGTANKHTVQWWLKKF 
CKEESLEDEERDMGHGKVDNDPI*EP\Y 
*TT*KITEELSVDYSTVIRWQHSKQIG 
KVKKIiDKWVPHELSGNQNYRFEV*SSFM 
LRNNNPFLNRI VTCNEK* ILYN/RPAQW 
LGPRGQLQSQFPKPNIiAPKKKRSLVHW/ 
^PVGT.LAPTLDPTY^FLNP /GGRTTT^F 
KLCSGKLGKMHG KLQYLLPALVNRKGP I 
\LLHDNT/RDCVFAQPVLQKLKELG\YK 
VLPHPPYSPDLSPSDYHFC*HLDNFLQG 
KH/SQHGAENAFQEFVKS * STDFYATGI 
NK / L FS HWQKC VDCNGS CFD 


12708 


26609 


A 


12836 


7 


328 


RRERE RERERERERE RE RERERE RERER 
ERERERERERERERERERERERERERER 
ERERERERERERERERERERARV*EKST 
RYKSPPPRIPPLRR/RRV*RKHPHARAL 
SFFLYKKCTGGERAPSPVCVGGKFTW 


i i. i\jy 


76610 




1?5tt7 

1 /.OJ / 


?1Q 


JJU 


PFNHTYATWS III SNVOVCFMRAOD I YLi 
LIYLFEAGSCSVA\RLECSDMIMLHCSL 
DLPGSNNPPISAT* I 


12710 


26611 


A 


12838 


189 


359 


LGLDGVMRVGPCRALGPS * EEKS SRVQW 
LTPVI PA/LLEAEAGGSPEVRSLRPAWP 
TW 


12711 


26612 


A 


12839 


368 


77 


KNPNFIiKFGSKPMGPPIYSPP\LEG*AG 
GFINPGF*TPPGYMGKPPFFLKYSIMLPG 
IAAPGGCSPFPGGLGRKISFTPEMEVSI 
NPGS PLS LPPGEQN 


12712 


26613 


A 


12840 


16 


173 


KTDVHSKTCTKLFTAALFLIVKKWKHLK 
PP* VD/E INKMWYNHI VE YYLAI KS 


12713 


26614 


A 


12841 


425 


45 


NS FVF FF FLFENR VXjTPVAPRLE \ CTG V 
IL/APHCTLPALPEFKRFLVPSALLSSW 
DYRPVPVMPWLNFCI FVETGFH\HVAQS 
VLKLVSLKL/PSPTWDVPKC*DFRYSVR 
CCGLFF 


17714 


9661 S 
ZOO I J 




IZ.O'tZ 


7^7 


375 


LIiLWVY I LKKLW \ LG WAHACNPSTLG 
GQGGWIL*GREFETSQVNM 


12715 


26616 


A 


12843 


21 


325 


TSFFFLESYSVAQAG\QWCDPGSLQPQP 
PG / S Q / DN PPTS G * VAVTTG / MHHHARL 
I FVFFCRDGILLYCPGRSKI I FSSCIRE 
LFKKISDFFLLTMLIFCNSNKG 


12716 


26617 


A 


12844 


369 


58 


PISPLQFMLPFLKNQSPYGPFFWKKKNF 
PPPFLGGAVLKTPGF*NFLFKKIORGFP 
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Predict- 
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otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Giutamic Acid, F=PhenylaIanine, 
G=Giycine, H=Histidine, l=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














PFPKKKKPRERPKPCPPKFG/SPPFFPY 
PGFPPPKKGPPPPFFFFFFF 


12717 


26618 


A 


12845 


211 


409 


NALKPKIHFFFSGGGLKGIWGNTLLDI 
GLHKTFFF\*GDFYYAL*I*AENALFGG 
GGDCAPHLQVK 


12718 


26619 


A 


12846 


200 


33 


QKT3SINTKCWQGWGATGTLIHF* GEFTRV 
QSHWKIVWKFL*/SLNILP/PSSSSSSS 
S SSSSSS VKNLCLHKACTWMFIAALLLL 
AKTWKQP*CE 


12719 


26620 


A 


12847 


416 


125 


K I PTR PGKGGP PL Y PR PFRGLNKQ I GLT 
PEFGTPLGNKGKSFPFFQKNKN\NWPRG 
GGPIGPPSPGG*VGGLTLKGEVSTKLDS 
PPAPPPGGQKWPLP 


12720 


26621 


A 


12848 


85 


397 


KIHI FFI FLVSLLKGLSFLLTFSKN* VF 
/ SFIDFLILCVCVCIEFYCFILILFHYF 
CLFGFCLFLMFVS * CQSLDFYFNS F/YI 
F*YKIQCYKF/CILIF 


12721 


26622 


A 


12849 


1 


246 


RPRRHLVASLVLTQIiNGSSPLPI KI ISW 
AWWLMPVIPTFWEAEVGGLLEPRSSRQ* 
*AMI\SPCPPAWAT 


12722 


26623 


A 


12850 


161 


1 


NKTTSSQ I W* PMP I VSTT*EAE VGGSLE 
PRS/L/KLKCAMISPVNSHCTPTWTTK 


12723 


26624 


A 


12851 


65 


244 


STYYLHSLPFLDISYKWNHTTC\SRYFM 
GALFLIAIK+KQPKRPSADEQI*T/IMW 
Y I YTLEYDLAI KRNE I LI HAAT WMNLKN 
IMLSKRSQTQKT\HW* FHL 


12724 


26625 


A 


12852 


235 


34 


TTIAVSVLILKELMEP*TLQPDFLG/WS 
LITESVDHVQWHAPVI PALWEGEVEGLL 
EPRSLRPAWATW 


12725 


26626 


A 


12853 


168 


453 


CQLRGASGTQGPGLSESQCQQPCP/ PEC 
RPSKPRPCGCRI*SPARTSPQPSPAASM 
ALPTNERTPPPALKW*PK/CPKQSPQSA 
KSKSPVKSTERTAK 


12726 


26627 


A 


12854 


244 


3 


ELQVYMHI * I YI/ C* S I YTKANI YIKYM 
HMCVY I YTHTHTKVC I HLH I YLLRHGFG 
CWLMLG KVRGCEGMAGLMKAQQGWGW 


12727 


26628 


A 


12855 


376 


428 


KNEPGGHTSFFFFKGLVLNFGRGVLQKT 
RP*GGGQGGKFNPNFSGPREPPPP/PPP 
GGGKKGPPPPPGVFVFFLEKGGSPILPR 
GVLNPGAPKGVYTQRGG I KSGNQGATPL 
SFFLKVWF 


12728 


26629 


A 


12856 


129 


395 


APPNTHPFLSEAPQSLSLEARPPSAPSN 
SSSR/PPVWRASQTHPPQDWSSHCSPPW 
GAPPQI*PFSDSLAHPPQDWSSHCSPPW 
GAPPQI*PFSDSLAHPPQDWSSHCSPPW 
GAPPQI 


12729 


26630 


A 


12857 


363 


97 


GLATLRRLVSNSQAQVIRPPQPPEVLSL 
PKC*DYRPGRPPASLIHVNSRQT* KVAP 
/CSKPLAAAAHQGS PGATETTRRCPPSR 
CGPIF 


12730 


26631 


A 


12858 


322 


361 


KS E KQS * VMLAVC TLDMM KMT \ F I SWF 
LPKTHN/ LMSN YKKNTRQ I PMEGHST I Y 
LTRI PQNCQGHQKQ * KSEKQSQQESKKK 


12731 


26632 


A 


12859 


88 


417 


HFTFFFFFFFFKRGGGFFSPRLKRLGKN 
FFFLDPPPPG*RDFSPSPFKEGGF*KP/ 
S PPPLVFFFFF* KKKGFPLGAGGF* TSG 
PGETPPLYS PEVWFFYGGTPGPTPFFF 


12732 


26633 


A 


12860 


217 


2 


L I I.il NK UVNMH CFL F V FET S LT * AGVQ W 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=G!ycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M= Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S^Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














HDL/GLLLRLAPPRFKQFFCLIIiPSSWD 
YRCALLHPATREDEA 


12733 


26634 


A 


12861 


375 


145 


LSFFFFF*KPGSCSVAQAGVQWHAHSSL 
QLQTP\GFSDSLNSAFSVARPVRVHCHT 
RPLS P FN I CWRARPKQKVFL 


12734 


26635 


A 


12862 


525 


852 


NLLS I YFFETESRS VAQAGVQWCHLGS L 
QPLPPRF*IKKPSNTALFM*NIjLSIYFF 
ETESRSVA\RLECSGAILAHCNI>CLPGS 
SDSPASTSRVAGVHHHSHLSSWDYSCAP 
PCLANFFVFF*VETGFHYVSQDGL/DTS 
* PQVI CPSRPPKVLGLQA 


12735 


26636 


A 


12863 


3093 


204 


EPDKTGPVLWKVGGGARVPGMAETLSGk 
GDSGAAGAAALS S ASS ETGTRRLSDLRV 
I DLRAELRKRNVDS SGNKS VLMERLKKA 
IEDEGGNPDEIEITSEGNKKTSKRSSKG 
R KPEE EGVEDNGLEENSGDGQEDVETS L 
ENLQD I D I MDI S VLDEAEI D \NGS VADC 
VE DD \ DADNLQE S L S DSRE LV\ EG EMKE 
\ LPEQLQ \ EHAI EDKETINNLDTS S SDF 
TILQEIEEPSLEP\ENEKILDIL\GET* 
RSEPVN*ESSELEQPFAQDTSSVGPDRK 
LAEEEDLFDSAHP\EEGDL\DLASEST\ 
AHAQS S KADSLLAWKREPAEQ PGDGER 
TDCEPVGLEPAVEQS/ SVAASELAEASS 
EELAE APTEAPS PEARDSKEDGRKFDFD 
A\CNEVPPAPKESSTSEGAD\QKMSSPE 
DDSDTKRLSKEBKGRSSCG\RKFLGLVG 
L S S TTRATDLK\ NL FRQI WGRVGGAPRL 
WTNARS PGS FAVYG FVTMS TAEEATKC I 
NHLHKTELHGKMI SVEKAKNEPWKKTS 
\PKRGSDGKKEKSSNSDRSTNLKR\DDK 
CDRTDDAKRCDDES VEKSK\DKNDQK\ P 
GPSERSRATKSRKSRGPKRTW\MDKSK 
\GVPVISVKTSGSKERA\SKSQ\DR\KS 
ASREKRSWSFDKVKEPRKSRDS\ESHS 
RVSERSEREQR\MQA\QWEREERERLE I 
ARERLAF\QRQRLERE \ RMERERLERER 
MHVEEERRREQER I \HRE\REELRRQQE 
LiRYEQERRPAVRRPYDLDRRDDAYWPEA 
\ KRAALD ER YHS D FNRQERFHD FDHRDR 
GRYPDHSV\DRREGSRSMMG/SIREGQH 
YLERH\SDPEPH\GQDSLR*LGWGYEL* 
Q * RLS \EGRGLPS FP PGAGRDWGDTWLE 
DEDDPVMGKGTAERGPMMDR \ DHK\ RWQ 
\GGERSMSGHSGPGHMMNRGGMSGRGSF 
APGGASRGHPI PKGGMQGGFGGQSRGSR 
PSDARFTRRY 


12736 


26637 


A 


12864 


2 


456 


IHLGSGSEGDGSSGGLGRGNSNTSRFSS 
SSWARGDNVPRPPPPAVCSWISEGDVQK 
PGLGEAGAGSSTPGDGGELRY/WPGLLG 
A*GRGIGGDDDDLRTLGLAGVQQCGKG/ 
RRGPRGPGRGQEPRRPRDLGLRGPRRAQ 
GAA/S ATAAP PP 


12737 


26638 


A 


12865 


388 


I 


LIKMTKVKKTAITKC*RGWGRTGAFIHC 
SWECKMIQPLWQTVWQLLKKLNIYL\WK 
QPKCPLM/VKWIKKLYISMDSYSAVKKK 
ETLMYTTA* MTLEVALLSERSQTRKGER 
GHI LWDS I YINFQKMQTNL 
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26 


Predicted 

beginning 
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ing to first 
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peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E~GIutamic Acid, F=Pheny!alanine, 
G=Glycine, H=Histidine, l=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N^Asparagine. P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknovvn, *=Stop codon, /=possible 
nucleotide deletion, Y=possibIe 
nucleotide insertion 


12738 


26639 


A 


12866 


198 


3 


KKQKQTKNP PKI S * ALWC VP WPATWGA 
KVEGSPE\PRRSRLQ*AMITPLHSSLCN 
RVRTRGRTRG 


12739 


26640 


A 


12867 


415 


973 


NPVNCAQFSLLPTTEYMGHRVEGATGHG 
TCP S VP PNTHLHTGWWCLQHHRSCRAWG 
RGGSHTHRCPQRVPDGEEP* IYIAVHGE 
PP/ESPPQPSPLSCCPFQGNYIALREPP 
QGLPLPGTLPSHPPPFWHICKTHSSESR 
HPPFPGFCGLELEKGVGDFIIHPPLTLP 
KPLPNPLKPEPTPTPQTHI 


12740 


26641 


A 


12869 


715 


893 


PCVAPHS IREERFRLTSESTNQKVLWWG 
ID* T/ VILILTGNWQKRKLKSFFLGQKP 
GG MAP 


12741 


26642 


A 


12870 


342 


58 


KKKKPKENEQSIWDMWDNFRTSDI*TTG 
VLEEQFRE*GRKIIFKEIVRENSSNLTK 
/QINLLTQEV*TTT/HKNMNKSTPIHII • 
IKLMKTTTKEKI 


12742 


26643 


A 


12871 


110 


401 


SFYSVMILKASRIVICX>FT*LYAYIHTL 
YI YTLCX YYAYYVNI YICTHI \NIHI I I 
HVYTYYIVYASIYYSMYSAWYNIKVSKD 
RQTYENIALITYY 


12743 


26644 


A 


12872 


439 


256 


S Y FMR FGGDK I S KLYQF I LS RRWKQS TC 
PSMDEWINKMWH/ITIMEYYSAK*RNEY 
SYMRHE 


12744 


26645 


A 


12873 


356 


43 


GT PERHSHSHS QGS P I IG AFMG I RAS FV 
YSLLS CVLLMLS PQRVLFYFS AFKQQQS 
\WSGKVAQAYNSNTYGKAGS IT*GQEFK 
TILGNIARPCFYKKKFLANS 


12745 


26646 


A 


12874 


341 


205 


RE I KKMVLNKKFFS LQFLGKKKNFWEFF 
LKKKKKVFKAPPVFQF*VPIF/CSF*KK 
KYFGFFFSPKI FFFFFFFFFFETESTSV 
TQE FNGT I LAHCNLGLQGS S S S PASAF * 
VAVS QDCAI ELLGDRGRLCLEKKKKKKK 
KKNFGGKXKSKNLFFLKRKKMGT 


12746 


26647 


A 


12875 


474 - 


206 


PIRANGPVFSPPPPPPAPGTCPHPLVIM 
LSRKLRQSHPHTTVQPWTSGLLPSPPA* 
LSLETQHFKTAHFIFP*H*SSGLGPGWG 
SPVGEGPG/HPKLCLQPAAAAEAWPPRE 
GTKARRHEGLPPAPCAWSPAPRDAGSRL 
EFPTLPTI PVSQSPGR* HGAGGRPSCLR 
AFVPSRGGQASAAAAGCRQSFGDLALLP 
LGSPI LGQGQNFSARGR 


12747 


26648 


A 


12876 


257 


467 


YR VFTLL PRLE Y I MAH CNLDLG S S DP / P 
QVAGTTLK* LSRASRVAGTTGVHHHSWL 
IFK/YFL+RRWRRFSL 


■ 12748 


26649 


A 


12877 


867 


1409 


GPTVLVGGQDWNSVLQQPRAHSFTPRGG 
AGGLGGGYVLHKP I PFSQELLGAFPCQK 
QYRVPPPLEAQLLRQARSPAFIY*AAVL 
CQACGPSPSPCSPMVSPRPSGRAPHLGA 
APQSGQKALASTSSPDAEPV/SVG*WCP 
SNPALQGGQNGAEPRAV3ITSGALSLYD 
CFLDGWWEGRSLGP 


12749 


26650 


A 


12878 


1416 


740 


IRNPRWVQLDVFAAAPMGRSSPLFLLLF 
SFLKTVLRGSMIPSTVPS/FHKGPQACL 
YS LSLRS * SQAAPSPAPS SLPTPS PHPG 
LTQPGRGRRKADIS/CSHSWAFYVP/SL 
PHLPWACPLSLPQPVLF3II:FCFVLF*D 
GVLLCHPG\WSAMARSWAHCNLHLPGFK 
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Amino acid sequence (A=Alanine 
C~Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Osoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T— Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X— Unknown, *=Stop codon,/=possibfe 
nucleotide deletion, V=possible 
nucleotide insertion 














QFSCLSLPSSW\DYRCMPP\GLG*FLEF 
*VEDGGFTILVRLIWNS*PQVIHLPQPP 
KVLGLQA 


12750 


26651 


A 


12879 


961 


31 


PS P VRDSCVAWAGERRGRLGGSGVWRSH 
ARGPGLMRGLPASEPAPCVLGSRPCFRV 
AFETPWTSPASCMPERCPLWRPPALVAA 
GLDPFLLSSTLTLAVCPRDLIRSWKKGC 
DVCDSRNCWS THSRPACRDAI LAP ILAG 
RGE C I GL WG / AAAACTGRV / TP VSGLTF 
STVFDFS S ARG /V*GGLPRSTGGPRVRG 
PGQPVPLPGLPAGVALAPLEGRPQEPAP 
ALSHSLTGSLFLGAWLCGCTRHGRAPVL 
LEVS I VAGGQLCEAAPQI FIQPGLQQLP 
TPSATS PAYLWVDVLLTL PS VI KQKRVI 
SFLTKE 


12751 


26652 


A 


12880 


232 


28 


P ALW VP WAR FDGME ETGQNH W S KE KAW 
PGAVAHTCNPNTLGG* GGH/MT* GQE FE 
TSLANMAKPRIPP 


12752 


26653 


A 


12881 


331 


1 


KNGPFFXNFPEFFPPPKWGVPPNPTPYF 
PWPEPIF*LTPGPLKKQKIPFQ\KGEIL 
GWGGKMGNPLPFWKWGGPPRESKSSPRF 
YKTPPP I FKKKKKGGSSRS RTS PRV 


12753 


26654 


A 


12882 


370 


287 


VS I IiAGCCG VAM I DNT VLVH KE VK I L R K 
*IIKK*CI** FRGHSK/HNFSG * AWWLM 
PVI QHFGRPR* GSLEARSLRPAWATWQD 
PHLYEK 


12754 


26655 


A 


12883 


656 


1168 


EPS RGVW PHEDAR I NGS KKKKS KDKKR K 
REDEETQLD/IVGIW*TVTNFGEISGTI 
AIEVDEGTYIHALNNGLFTLGAPHK/ER 
I ALKPGYGKYLS INSDELW/GRSDAIG 
PREQWEPVFQNQEVRNGGPAEMGEEKRN 
GTKWREDTDHTSFPLFPSTGGQPKAKSN 
WRKVCH 


12755 


26656 


A 


12884 


205 


452 


ASGS AHR FRTF S FE TGRVLD KP S QTDHT 
NRKRGGPNVK* SFL/ WLGAAAHACNPNT 
LGGRGRRTA*AQEFRTSLGNTVRPCLY 


12756 


26657 


A 


12885 


112 


290 


FCYNI I Q YN I VFS CHDPLVS FWL FFFF FF 
GKGGFPLSPRLKARGRVWGYLNPPLPGK 
RNFWAPPPQGFEIKGLPPP*SPPFGGVG 
QAKNLG PGFQP P PAPMGKTP FF * KNQNN 
PNWGGQ PLNFKTLGGGGP K I S F/ PPGGG 
GLNNPKPSPGPSTWGKGGNPLSQKKKKK 
KD 


12757 


26658 


A 


12886 


406 


190 


TRSLARGGFGRPLQKPYWNEQSFIMAKK 
\YEQPKCPWTDKWVNKMWS ILTVG* YAA 
MKRRDVLTCAT I TCR 


12758 


26659 


A 


12887 


440 


160 


RVCVCVCVCVCVGVFVCFALWLSSLIYN 
* VYLFLFVHNLGRVS FS YFLNL I * FFEH 
VNHV/HNYVQMIQNTKLYLKSYLETFH/ 
C pc*FPPILLH 


12759 


26660 


A 


12888 


201 


411 


HIWDVILIRHALGNFliTSISCYIVKIWL 
DAVAHTCNPNTLGGQGG/W I T * SQEFKI 
ILGNIGDPLSTKKNK 


12760 


26661 


A 


12889 


138 


464 


SCTMNPSEMQRKAPPWRWRHRS *APSTH 
KMNRMVMSEQMKLPSTKKAEPPTWAQLK 
KLTELA/ TKKKSLENTRGTQTS KNMLFA 
ALMIVSTGCAGVPSSSKETATIEDKP 


12761 


26662 


A 


12890 


146 


366 


FS VSMKYFV* IH* FYHL/KLLLMVYL* Y 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l-lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=G lutamine, R—Arginine, S^Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X—Unknown, *=Stop codon, /—possible 
nucleotide deletion, ^possible 
nucleotide insertion 














GFLAINTSQNISCSQKIMKLHL*GRGW 
AHACNPGTLGG* GEWMA* AQEFNTGLSN 
TAKPYLDF 


12762 


26663 


A 


12891 


368 


50 


KGRQSGVSSFPVGTVTNYTNLVAHNNRY 
LFPHSPGCQTSEVKVLA\GCSLLKASRG 
EGFFPLPGSGGSKCPLAYGCITPISASS 
CDLLLWVCVFSSYLAY*DTCHWI 


12763 


26664 


A 


12892 


227 


456 


KLTTLLKLFEK\ iedrtlsksfyevsit 

*IPSSSSSSSSP/lSLINTDAKKINKIP 
RNR I QQC IKKII FHKFSTG I YS 


12764 


26665 


A 


12893 


461 


89 


I YQNLSRXPDNI KCW\ *RCTE I QTLIHC 
W*QCKIMQLLWKI I WQFL* /ELNTEVPY 
DSI PLLDMY /PKKTEC/YTSTQTLI YCM 
NPIEYYSAIKRDKVLIHATTWMNLENI/ 
ISKRRQSQKSTYCMI PLI 


12765 


26666 


A 


12894 


222 


185 


RIVMQYLHLNLGACVCVCVFVCD/CVCL . 
CV*MCVHIHIYISTHTYIHIYVCM*ICR 
LS**CEFYS 


12766 


26667 


A 


12895 


186 


166 


I FFFFFEMGSKS VTQARVQWHNHSSKQH 
PPP\GASDPPTSAY* VAKINIFFFF 




ZOOOo 


A 




j j 




TCWDCRDEPPRPALSSVFIiTSFLjEDEFS 
IEIADLVPHPTPCSHPLQPP**VIIYFC 
S\KPLVTGSSSPPRKPFPSLHEPKSASL 
*PFSSTIQPVRSSPXG*PSPPLA*VQFP 
KGMTIPTLCLSPVLPVPSTIPLPNVSKP 
LPFY 


12768 


26669 


A 


12897 


923 


312 


VDVIRKVQIEITLRCHLIKVRIAIIKKT 
RANKCW * GCGEEGALVHC * WE C/NY ILV 
QPLWKTVWRFLS KLKI ELPYDPAI PLLG 
ISPKGNEI I/CCTPKFIAAQFTIVKIQT 
QPRCSSMGKWIKKL/W/HIERSIDR*LE 
Y YLAFKKKAVLS FAKTW IDLED I MIjNE I 
SQTQKEKYCV I S L I CGI KKKKVKYIE I E 
NKTVITKVRIRWRK 


12769 


26670 


A 


12898 


320 


3 


ESWRWGRIVRPHSVPGETWLCPVPPGPP 
NAPDGSFPLAICPVPPSCRSL/CPP/DD 
CLASAGV*GGFPLPRTEAS PHHSWACPQ 
EQPWPPQCKLKVGAPNSTALSEGS 


12770 


26671 


A 


12899 


138 


447 


I FVFHYKTQNI YNEDTLHVI I INLWCYP 
GS YI KSTGRLRPEVERGLGPTTMCC* FS 
S I KN I F Y F * KLKS YRKL / W LGTVAHA YN 
PSTLGG*GRWIT*GQKFNRD 


12771 


26672 


A 


12900 


279 


1 


INQSIHNNIKSTYISQCKLIQP+WKIIW 
RFI R /NLK I ELL YVPATPP VGV * LKELK 
SAC*RDNCTT/M/FTAALFCIGYGSNLG 
NSTALIGSRVLFP 


12772 


26673 


A 


12901 


235 


183 


I INYGGGPPEKKKKGGKKIPLI FYKKIK 
KFLWPKKKKLFGGAKNPI V* AHHI FTV* 
IDVDTRADFTSATI \ 1 1 AI PTGVKVFS * 
LATLHGSNMK* S AAVL * ALGF I 


12773 


26674 


A 


12902 


180 


19 


LFIYLFIYLFICEIRCHFVTQAGLQW\L 
1 1 ASCSFALLGSSDPPTSAS * VKGI 


12774 


26675 


A 


12903 


406 


949 


APVPGHGFCVSF/CGENGVSLYCPGRS* 
TPGLKLCSCL\CLPESWDYR/R*APVPG 
\LGFVFHLTL*KPPFFKDYMKSLFQFLK 
YL I QG * CTL V* G VR YR S S F I F FF F FLRR 
SLA/SVTQAGVQWRDLGSLQSPPPGFTP 
FSRLSLLSSWDYRRPPPCPANFAFVFLV 
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G=Glycine, H=Histidine, I=Isoleucine, 
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EMGF \ TVLTRMVW IS* P CD P PAS AS QS A 
G I TG VS H RAR P KFVL Y FKDNGE PLGGVL 
HRSHIAVHWM 


12775 


26676 


A 


12904 


265 


385 


GNGS P SLRW IGVLRGRS PTLELRH * P I F 
LTAASRRSLDRVSVPMWGTFLSEPLSIE 
GLVSRYLTNNLMERIPILYRN/PFNNET 
M/PESLCYRVLI FLSKGYPRVKGRLDTC 
YSPVRRSPAKKASFLPCAPRLACVKPVA 
SVHPEPGSNRTRG 


12776 


26677 


A 


12905 


363 


137 


APGMQA* WCM/HCI PGAQEAEAGEWTEP 
RSSRLWCTVIMPMSSHCTPAWATW 


12777 


26678 


A 


12906 


315 


565 


TPVDHGWLECSGAIMAHCSLKFLGSRDP 
PASASRVAGTTGVCHHSQLI FLFFVGTR 
SNFVAQAGIELLG\ *VILLPQSPKVLGL 


12778 


26679 


A 


12907 


376 


412 


DAWPTWRNL* FGVYVMTDR* EPRQMLTY 
LTA\ PLKYFKSGVMAHSCN \ PSTLGGQG 
RWIT*GQEFET\LANMEKPCLY 


12779 


26680 


A 


12908 

< 


215 


2 


NI H INGQKLFL/ WLGWAHTYNI STLGG 
RGGKI S * TQEFETSLANTVRPHLHKKQK 
IALPLPLPLPLPTVR 


12780 


26681 


A 


12909 


307 


379 


kr*kqpqyplteewinkiv;rhv\lqyysi 

LKKKEIL 


12781 


26682 


A 


12910 


279 


123 


KMILSWARWLMS I IPAAWEAEAGGSVEL 
RS*\LRLE*AMTVPLRHFTPAWARE 


12782 


26683 


A 


1291 1 


\ 


194 


AKQHIPLLARLECI /GLFTNTVTAHCSIi 
ELLAS SHP PAS AS *VARMTECVHRAWLE 
WIFYLCVIL 


12783 


26684 


A 


12912 


206 


388 


IELSKQGPVYYKWYVRITPPYFPFFFFF 
ETGSCSVT\RLECS/GTITAHCSLNLPD 
LSYSPTAAS*VAW 


12784 


26685 


A 


12913 


238 


426 


TPLLPSQQWLKKES LWLGRWHACNPST 
LKG*GGRDS\RGQ* FETSLGNIARPCLI 
REKFLKI 




zuuou 


A 


19014 


IIS 
i j j 




AKDRHFTEV* TANKOMKRCS^SLiGMREM 
QS KI VRYHLSE * /R * LFFFNGGNCWQG/ 
CKK/IQDLIHCWECKMVQPLWNIIW\RF 
L*NHTCICHASGIALLSIYPREMKTCVH 
TKTCI RMFI AALFVMVETWRQP I YPS LG 
G 


12786 


26687 


A 


12915 


282 


21 


IKFPPCPSTWGTKKKLFPKKKKEVMKML 
CFHFNLLVQN IF* YRLGNE I SDKAG \ P V 
AHAWNPSNLGS *GELIT*GQE FDTSLAT 
IVKL 


12787 


26688 


A 


12916 


254 


150 


TLPGRSSDFSLPEV/ YLKKNNH / WLGW 
AHSCNPSTLGGKGGWIT*GPPHMLKVSF 
LPTDPTSRVKQLYNWVPHSTVLIALFPT 
KNPG 


12788 


26689 


A 


12917 


181 


24 


AQWCDG I P VSKGNL / WLGMVAHACNLST 
LGGPGRVJI T* GQEFETSLANE FHRTD 


12789 


26690 


A 


12918 


75 


298 


LFILFTS*VD*MRPTHIKEKNL\LAQST 
NPNVHLI Q / KHPHRHSQNRWPNVWAPH 
G\QS£CHIKMTIMARPSGSQL 


12790 


26691 


A 


12919 


251 


9 


KFMFLARCGGSRL* SQLSGRPRRADHLR 
PGVQNQPGQHSESQYLPKSTNFCRVWWC 
TPWPATREAEA\GESTEPGRQRLQ 


12791 


26692 


A 


12920 


198 


211 


RFCFSVGDEREGKRA/ RERNRETERKKQ 
TEDTETQREERDRDRKR\ERE?DTERER 
PREERIITER5KX* KAAFP FQTLG CRAQD 
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FPKQLFPSRGWGVGPMTVLL 


12792 


26693 


A 


12921 


438 


127 


RWMER I FVRPH I KDSM W WVRSTLLGRG 
DALNDLFLYCWEKEDPRYKERP\RERER 
ERERERERDRCWE K* APGWRERP VQS LR 
ACHPPWHRFALAGPVPSAQ 


12793 


26694 


A 


12922 


158 


460 


EARVELLLSSRCKPRAVITAETQPAVYR 
LQLEIENFPGLSPRLECSGAITAHCSLD 
LLG*SDPPNSASR\AGTTGMCHAQIVNS 
CKADWLVLLSSRGLTHR 


12794 


26695 


A 


12923 


33 


484 


GIQAACIPGLRLSPPAEAHCHRHIPSPP 
PSSTAPRDDVGPMQITPDNFPPQDPPL\ 
TPPSQSLLPHRAP*PQGSSVDLPGLLFK 
LPQMPQTPGPEEVDTCVQIHKTSKARED 
VQMR KTPMAREGRHRAAHR W IGALGKAP 
QSQRRLRHTVG 


12795 


26696 


A 


12924 


335 


332 


RIKVYVRGHAKE * LRPGP / VAHTCNCHT 
LGGQGMRITQAQEFETSLGDIGRPHLYK 
KKKKKGGPF 


12796 


26697 


A 


12925 


302 


407 


AQWLMAI I PAPWEAEAGGLLEPRS* \ AW 
VTW 


12797 


26698 


A 


12926 


766 


244 


RVPRLPPPAL* LEFS PSGLRIARSHS FQ 
LRSSPPPVSPPST /PFSPHRSPPLSLLS 
LSVPS/PPLHHSFFSQRAPHILPPPPLI 
FPSSPPPAPPPPPPSSSSPSSSSSPPPP 
/PPINWGTPRFFPPPPFKNPPPEINFGG 
PKKKKIFS PPRA* KFVFLKG PPPFFFFF 
FFFFLGINLL 


12798 


26699 


A 


12927 


127 


720 


WCGLLESTELQLQSIiPGLYVSVI FGKI F 
YISGLGICCTKIFPSALPSAQPAASLPL 
PARSALGIVFFLHFC*IE/CNYKKLPIF 
HHI*LIKTFSYGLT*LFFFFFKTKSLIF 
S PRLN CRG P FLVN * NLCLRG * GNS PA* P 
FGGAG I EGAPHKAGL I FGGGGGAFLKKT 
G\LHHVAPGGASNSGTGNISHPPLNPPK 
GAGE 


12799 


26700 


A 


12928 


308 


49 


KEHW * AVGHFNI CVTGGP *GERKKGTER 
VFKEIITPNFPNLIQTIKPKTQETQHTP 
SRRNTKI / TPRYI 1 1 KFLKTSNKEKILT 
TAR 


12800 


26701 


A 


12929 


375 


2 


GCMLSLKTINILEVLANVLAQKKIKRRK 
LND\ EQADTTFTI * FCVRQKTKKSTDKL 
VKLI /ERFYKVATYMINFLKKSNSFHTP 
ATINC+KQIPFIMATKTIKYLRINPRKK 
CVRFVGKKLTFPER 


12801 


26702 


A 


12930 


202 


379 


SSENFSGKTVNLCCLGRLHRMISWLHTV 
AHACNPIT\GWIT*GQEFETSLADMVKP 
CLY 


12802 


26703 


A 


12931 


209 


376 


TVGEKMYLKVKRI \ WLG WAHASNPNTL 
GGQGGWI T * PQE FETS LGNMAKPHLYKK 


12803 


26704 


A 


12932 


280 


31 


I S CKVTKKVTLFELKYI WNKFY I YERVS 
IVGTTEDAACEVTSNVITGPGAVAHACN 
PST/RGRWIT*GRELKTSPANIAKPRLY 


12804 


26705 


A 


12933 


1 


189 


QMYTESEP * CKLWTLGD YNV* M * FI S * / 
IQCTTQAGNVNIRE/RLCMCSGGEYMRN 
LCAFLSILL 


12805 


26706 


A 


12934 


108 


409 


HHSHLLKLPNVWFSFCHFIVTVFLSL*R 
VFFVFY* I FLSSSHFLFSFCFCP IMFIM 
FPFKFILSCNIVFI*NIIFFLFA/H^/*G 
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* STLVSLFVFFFFLLQF 


12806 


26707 


A 


12935 


225 


11 


ILLCMEPSFLFWFGFICFVKTGSHYIAQ 
VEVLWLFSGMI KVHCSL/ KLLHS SNPPT 
S AS * VAGNTGS S YCTW 


12807 


26708 


A 


12936 


74 


12 


GTTHASAGVINI LVF I LS VFFLTHSLC I 
HISKYMFVYICVCICTRTDFYTEFKSFC 
TFCFFGSSFFFLEKKFPLVAGLEEQGKD 
LG*LKPPPPRLKKFSCLTLPG /AWE*RT 
NG 


12808 


26709 


A 


12937 


166 


311 


PHS * WECQMVRAPALENWQFLTKF IRGL 

piAdaalplhm/ chrksctqlfi IALL I 

VAKKCNHHKNL * ANEWINKM * Y I HTTDY 
PLAI KQNE I L I HATT WMNWTL 


12809 


26710 


A 


12938 


51 


296 


LCMYTLHTLYMHTCIYAYICYTCKYIHR 
Y/LKT* I PEDANSVKHKYTHTHI YVHTL 
VYPHTTYLMCS Y I YS WVCVCVCVY I YT 


12810 


26711 


A 


12939 


426 


28 


KGFNLVNPRVFPSPRF*TPAPVFYFCGP 
IKKIFFFKAGGVKFDSLKRAPLFFFFLK 
TGSCFVP\ RVECNGI ITAHCNLCLPRSN 
DPPPSDLQVTGVKPPP\AKLFFFFLW*M 
GFHHILDYQGETCLSQLLGWR 


12811 


26712 


A 


12940 


11 


213 


ATAPGLF S FFE TGFCS VDQAGVQ * HDHG 
SLQP*PP\GSSDPPALASQSARITGVSH 
CKGPEHTSLLT 


12812 


26713 


A 


12941 


416 


3S 


WLNNHSRLGFPNCWDYICKLPRLGPDVA 
SLL I HNS S \ GTW YGATKLE SNWYFS I P I 
YKDHQK* FAFTWQGQQYTFTVLPQDYVS 
SAL*HNTVHRDLDHLDNPQNITLVHYSD 
DTMLLDLMSRKYQAL 


12813 


26714 


A 


12942 


361 


74 


1 1 HPS FS VS SRN I HKNAYPAALGG YS VY 
GVAI LLFHYFINKLAFTLHCGLA/ LNS F 
LR* IQEPPLG\SGSGPFSCNVFLAPTEG 
TIVQKPDPMATFG 


12814 


26715 


A 


12943 


125 


401 


SSFEPRGIPLNPEVAMQKESVNILCSPR 
SQEFLESRIKIKVLTDLTQDEL*GQAQ* 
LT PI I PS \ I WEAKAGGFLEPRSLRLRSC 
HCTPAWVTE 


12815 


26716 


A 


12944 


92 


306 


KRTTNSPWGCSAY/GVAILLFL* IL/KL 
LAFTLLHGLPSNS FLRKIQE PSIiG \ SGS 
G P I FGNKLVAFGGLM I WS 


12816 


26717 


A 


12945 


322 


24 


NFKKNINLNPFLRGYFIiFLKGGFGLTFK 
KVGEPLKKKKKKLNQQFSYFLSTYTVK* 
NN* /WLGAVARTYNPSTLGGRGGW I T* G 
QEFGTSLANMVKKILAK 


17X17 


a\j / 1 o 


A 


12946 






GGAPM KRHG S LDRN YR VDTTRP S Q L FLF 
QLYHNYLKFAYEATKEYMHRKETLCLPC 
P AI QFS-FPKSNLC YRGYLCMRTCAHTQM 
* M YRY I YTYVYMHI Y I CM/ YRAI CTYYT 
FE * KY I LQKCILLC I FLFFLQYLKYVS Y 
LNRHHFLIAA 


12818 


26719 


A 


12947 


155 


415 


LLETGGNAGCSSWMHVLLGKIHRYGMGS 
RPISLMAG* \GPP WLRRVAGLRGR PATL 
GIiRYGPDSYGRQQW\EYCTMGASLMQRR 
RVRD 


12819 


26720 


A 


12948 


247 


250 


KYVYKY I H I VYKC VY I FTHVYGYT YLYT 
DVYKYTYLYTDV* I CI CI YKCI YLYWYI 
YIHLCINMYIYIHLCINMYIHIHV* IYI 
HIYI\m/YIFIYIFLY1^/YK*IHIYRvc 
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EYIYTLWKL 


12820 


26721 


A 


12949 


25 


407 


EALiS LLLLLVL WGLLLG I AR KM YQQIVDNJL 
VQYCLLVL* P*NL*EQKRFEREVGLDHR 
EP*FHVLGCIHSEKSFRFWSW*EIi*IRK 
LTLVA\GG VAHT YNPNTLGHRGRR I T * G 
QEFETSLGNWRPLFL 


12821 


26722 


A 


12950 


370 


106 


I LYLETLLKSF I S IRS FSAVDSHFEKSS 
TVGKMLPNSITCTDPI \FNKKKSQLMWQ 
SSLLPFSQKLPQSPQPVSQQPSPSRQGS 
I PATR * QLA* GS VQVMLFGS ILPTVELF 
SKWESTAEKLLILINDFSKVSRYKIYV 


32822 


26723 


A 


l L~ J J I 


980 


J17 


DQPGQHGKTLF*VSH*KKKKGWM/PGVY 
A YNPRFLGGRGG W IT* GQGFETS LANMG 
KPCFY 


12823 


26724 


A 


12952 


1^7 


113 


I QLAI SLLGICPRDLNTVTCS \SDTCTQ 
MFLSVLLMVAKR* KQPKYLQIDEWINKM 
QCIHMESYLVSKGRKYLCKL 


12824 


26725 


A 


12953 


9 


186 


DPTVCCLQDTHFTYKDTNKLKVMGWTKY 
/CHANGNQKRAGWIQIEYP*SKSLKSE 
IVQN 


12825 


26726 


A 


12954 


46 


381 


NENTYSYKKNTQMFIAALFWTPNWKQP 
TCPS/SGEWINKLL\YHLMVYYSAVKTN 
/ILVYVATSVNLRI 1 1 LiGKO PDKKR VI/F 
V* FHLYRKYKLISSDRKQIHGRLEVENK 
GK 


12826 


26727 


A 


j 2955 


19 

1 y 


250 


CS DMVLKAC I KKLMS YS KMGH I QAMDY Y 
TAL/NKEL/ LMYATI *ANLEDIMLiSKIR 
QTQKDNCLMVDC I Y I R YAEQ SSL 


12827 


26728 


A 


12956 


173 


163 


G AKH S AS GTG WI RKD FTR PGAVAHACNP 
STLGG*GGWLSRSEFETSLTNMVKL/CL 
YXS I CCTRCLLGCFPQALSKAID 


12X28 


26779 


A 


179S7 

1 LJ J f 


j\j i 


79 


EKDNOPLLTPRNCNVLHAI KESENHRIT 

X-* l\l/Ll V: X XJXJ X X ▼ XJ± inx IVU^UIY llX\X X 

ELKVTLRPDAVAHTCNPSTLGGRSG/WI 
A * GQE F E I S LAKKVKP CL Y 


12829 


26730 


A 


12958 


314 


126 


KHHHFKKHNFRVLCVCVCVCVCVCYKHF 
YLFSLSFSQ*P/CICC*CVCCSFLLCCF 
* SATYLKI I 


12830 


26731 


A 


12959 


529 


1697 . 


VPFMGHISNFFFSPFLRQKYLALLPRLG 
VQWQ*SRAHCNLRFPG\SSNSKASASRV 
AG I TGACHHGWLLFVFLVETWFHHVG\Q 
AGLKTP* PQV\SACFSL?KCCDRRCEPL 
CPATSS ILNL*TMDRL/PGEPKNQ*NLL 
VSFMCAVFYREGQQHVYFQRSLWCKREF 
I VRSPWAINLEKGGLRMGAV* IRIMLNL 
TLT*GFCTEDEKTVNSRVNQKLTPT*LG 
DE * DVKRE KN I NDE I KLLTCNSE YTKS Y 
RLGR* I KLGAFCFFM\ VPMGQTGRNGQL 
KVQV*TGTVAHVCNPS TLGG* DRR ITR V 
QE FETNLGDHS ES S SLQKI KLARHGAAR 
L * SLSL* VS WDYRCSPPWP \SNFC I FK\ 
RDGVSPYWPKA/ WS * TPDLQRSAHFSLP 
KVLG 


12831 


26732 


A 


12960 


1 


395 


GTREFRILSEKFKKEIDFFFFFKKQSSP 
PPGPKKQTL/C* KQTTPTKPQKTNLGPK 
KFGGQPKKPFKISHPQIGPPKKKNYNP* 
GQPVWNPPGEKKEGPTPPKFKDPGANLK 
VMGLKGGVKRKRGGKTFPQRA 


, 12832 


26733 


A 


12961 


90 


436 


LWRI.TS PKS AGW AGNTjETQEESMLQFKS 
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nucleotide deletion, V=possibIe 
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EGHR/PAESPI*A*SSVLFSLSTDWMRPI 
L I / MEGD LL YS KP I HFNVK I S S KNT YRN 
IWNWVlSrPHV*TL*PSQSD/CLK*TITPI 
VLLTCTH 


12833 


26734 


A 


12962 


415 


463 


LRAKDQVLPKI L VC * TLLFFCAQ I KKKK 
K/WPGAVAHAYNPSTLEGQGGWIT*GQE 
FKTSLGNMVKPSGIPPHWI 


12834 


26735 


A 


12963 


774 


316 


SIS PTCSGKIGGKLNRHFS KEDKTG \R Y 
MNKCFISLVISECYLKP*EDTH*TP\LR 
MAKIKRADH/DKC*QRYKMTGTLIQC*R 
ERKMVQPLWKTV*QFLKRLNIHLPDDS I 
PLLG\ I Y* RKMKACVHTNTI IWMPIASL 
LVKAK\AKKQPKCPSTR 


12835 


26736 


A 


12964 


44 


395 


MYFTliVWGSLGPKFLGGHQPLQGQGPFL 
HFLKAL* * FPFPQTPGNGGGTPPAKTFP 
PFPTPLSSSS PCPSPAEGGSREPSLSTP 
/SCLHLYIGGAASNPSPPSPLPPPQCSG 
LGYPVC 


12836 


26737 


A 


12965 


424 


50 


MAFP FLS CAP DA I NNNNKKCCRE INNE E 
PPNE PLKWRH I M I FFFEAESHS FAR \ LE 
CSDAISAHCKLCLPGSRHSPASAYRCPP 
RHIiAN FS AFL VQTG P PR VS QDG KDLQTS 
*SALLMPPLDPAIP 


12837 


26738 


A 


12966 


245 


44 


QSKELGNYPLPSPSPSSPSPAFHGLPPP 
SSPSPAFHGLPLLP\SWTVQPQ*RLTAT 
SLPDPPASPRPG 


12838 


26739 


A 


12967 


205 


30 


Q EGAS LS R I KRG PG VGAWLGMVAHACNP 
STLGGQG/ SWIT*AQEFETSLGNPHLYQ 
KYKN 


12839 


26740 


A 


12968 


114 


465 


DPVSNPETTKPE PPKKTTES * P/ EPKSL 
LWLPS\P*NSS*CANREQQPPSPQPHLP 
HPLPTSLQVHRLP* * SLPPPNLFTPLYD 
/SPEPSLLPVIQFPA*TSPP*PPVYSSP 
PSPTCPLH 


12840 


26741 


A 


12969 


1618 


1038 


VHMVQDKDINLNNQFLSGTMLLFFKETS 
HR*DRGPGDF*SCLYSAPIPTQPLLF*Y 
LPQNFIEG*PS*NSRIKGTSLKTYISRV 
PSFFFSFL*DGVS\TVPQAGVQWHDLGS 
LQPPPPGFKRFSCLSLLSSWDYRHPPLC 
PANFLDF* * RRGFTMLARLVSNS * PCDL 
PASASQIAGITRMSHHTLPHVFPLNV 


12841 


26742 


A 


12970 


197 


411 


EQALRSTVCYCLNPLPPAPLRPRQDC\G 
PCRFRWMPGGRTRWLAPVIPTL* EAEAQ 
VWLEARSLRQAWATW 


12842 


26743 


A 


12971 


616 


777 


MGIHGVRHNAQLNFVFFVEDGGEWQLTC 
TGAGDSS WLGEP * VFAC/ PKNVLFYLH 
P * LFG* 1*1 YQLKITF*NYEGMVP/ FVF 
RFLFETRSLLSPQLECSGTITTHCSLEL 
VGS INSS ISAS *LDGNT\GVRHNAQLNF 
VFFVEDGGEWQLTCTGAGDSSWVLGEPL 
RYYKERLIKWYL 


12843 


26744 


A 


12972 


222 


389 


VITEALGSDELEGRQLWW* IKIHSQVQW 
LIPVIPAL/LEAKAGGLLEAKTSRPAWE 
T 


12844 


26745 


A 


12973 


425 


145 


Q I I FFLFQK I KNQ FNKAS FFKKMTLEQM 
TSRQKNSAQISVFKKKNSKWITDLNVTH 
*TIQLTGKN\NIGKNLQDQGLGKEIIjNL 
TLKAOSTKG 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=G)utamic Acid, F=Pheny la Ian ine, 
G=Glycine, H=Histidine, ]=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, Y=possible 
nucleotide insertion 


12845 


26746 


A 


12974 


322 


411 


NNKKIFMSTS*KFFYCFIVQLIFETILS 
VRVLESIILINYL*YCIICIFPFLFL*T 
*EIRKVTKFTSFPQIEEK* KCRR/WLGT 
VAHT YNP S TLGG * GGR I A* AHEFETS LG 
NTGRLPSVQKIL ! 


12846 


26747 


A 


12975 


428 


1 


SRKSSSQFRLSFFPSHFLDLGAKGAVSQ 
DRAIALQPPGPRSR*S*RPPSRPRCFRR 
PRPSRSRSNKERAGEKRRPRRSLQPRPP 
HIAGP / VPRRQGS PSAGPEAPLATDEPP 
PQRRS G IRGGSANI RSKQTS SLRPRVRG 
RVG 


12847 


26748 


A 


12976 


60 


294 


NHCTETVLFQ*LTCVCVCVCVCVCVCVT 
ERI /CFFKFLIGKGTYGCIYLWPKTFIL 
IFIYILYSPRGRSILKKKFFFLT 


12848 


26749 


A 


12977 


6 


341 


DSLTLLPGLECSGAISAHCSLCLSG/ FK 
QFSCFSLPSSWGFTG/VPPRMPQLIFPY 
FS VKTGFH\HVG * AGLKLL I PPASAS PK 
CWDYRREPPRPGDLWNF* PTGFKLRFPQ 
FPL 


12849 


26750 


A 


12978 


296 


42 


SQPASLGFKSVDLPLLAVCPLGKITLTL 
WDFSFI IYIKQE*WPRAGIFMPY\NPST 
LGGRGGRIS*AQEFKTSLGNIVKPHYIiV 
K 


12850 


26751 


A 


12979 


20 


373 


KLYGG I WGF FFFRRGLNS ATQAEMQGHN 
LG*LKSPPPG\*RNFPASASGEPGITGS 
HHKAGIVFYIKKKPGLKTFTLGKPTPPP 
S FRHWHQPAPMGVYS EILGEKSLTNGWD 
PQTKKG 


12851 


26752 


A 


12980 


380 


96 


PVFPPPWRAQKGGSIGREIRPPRGNQGK 
PGPP* KKKKNYPGGGGGPPSPPFPEG\ * 
PKKWHNPGGGTPP*TEFPPWPSTRGGEK 
KPPSKKXKKEK 


12852 


26753 


A 


12981 


331 


71 


ELLIYSWVWWCTPWPATW+GRELVGRI 
T\KSRRLR3jQ*AKIVPLHSSLGDRVRPC 
PPPPTKKKIANLCPWTEGSLSFSYLLEQ 
FHSC 


12853 


26754 


A 


12982 


383 


189 


RRALKSCAHSTPAGPKNAALSQAQWLTP 
VIPTL*EAEAGGSLELRSPRLL*AMIEP 
\CTPAWVTE 


12854 


26755 


A 


12983 


144 


6 


ILAQ* KTWLSVVAHTL*S / TLGGQDGRI 
TGAQEFKTSLGNTRRPCLYK 


12855 


26756 


A 


12984 


341 


509 


ETLYTME * YTAI EKNEIMS FAGTWLELE 
A\VIL \SNLV* EQKTKMLPCS PLMGWEL 
K 


12856 


26757 


A 


12985 


387 


125 


DLPPTPPKLEPRTCSVAQVEVWCL/GSL 
QPQPPGP\SDPPA* AS * IAGTTGTRHHA 
*EMFPF*VYFVQTSEHILGINPPGRHTK 
VAWEV 


12857 


26758 


A 


12986 


151 


350 


GRGGF*PLP* I FPPGQERPGFTPPKAPP 
PLGFFFPG/RFFFFLRRSLALSPRKWRD 
LGPLQPPPPPPTPGY 


12858 


26759 


A 


12987 


251 


1 , 


RMS LLE KRRSNN * EAR V / CGR V I YVATE 
VIKIMIGISMVAHAYNPSTLGGQGRRIA 
SAQEFETSLGNMTNPCLYKKYTHKKSRA 


12859 


26760 


A 


12988 


320 


3 


TPWGVKDFSPPPPPGEGVFPGRAPPPR* 
KFFGNFWKKRGFPNWVRGVLKFWG\KNF 
FPPNLPKSLGLRFKPPGPP*KAPPFFPL 
FFFFFEMEFRLCCPGWSAVAOS 
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Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
KHLysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
r^=GIutamine. R=Arpinine S=Serine 
T=Threonine, V=Valine, 
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Y=f In If nown *=Stftn rnrlrvn /=nf>s<;lblf* 

nucleotide deletion, \=possible 
nucleotide insertion 


12860 


26761 


A 


12989 


353 


183 


E YGTHI LTAAI FS I AKRWAHR * GSLTDE 
OTRKNSSLQSMEYY\SAIKKNEVLLHAA 
P 


12861 


26762 


A 


12990 


377 


2 


LSIILIMFPRFSLQRGKYPGPPPPKKNP 
PQKTPPRAPKKKKPFFFFFFFSFFKKNF 
PFFFKGQSKGGFFGPP*PLIFRFK\NPP 
PPPPQKIGFKAWVFSTPPPPIWGFFFFF 
FLRWSLALLPRLEC 


12862 


26763 


A 


12991 


134 


298 


PPNSYCDGSITLIPKSDKNITRKEKYL/ 
P/IYLMTIDAKVFNKVLPNRIQQ* IKRI 
I 


12863 




A 


1799? 




171 
i / 1 


QT* SH / SLSRLEYSDTI IAHCSLKLMGS 
SDPPASASQVAKPTGHCTTMPQPHTEGF 


12864 


26765 


A 


12993 


269 


2 


GLWCFVKQLKLTETCPHFAVPQFPLGIS 
KIPISILFFKKLLL/WPGTVAHTCNPST 
LGGRGGWTT*GOEFGLMFHHWPSKFHHG 
QHGET 


12865 


26766 


A 


12994 


324 


4 


SLYPLLFQAEIYFLVTIHRSSMESCIYF 
FSVWPLDSLVQIHVKI /HIKGIGML* V 
N*QNCFKINISE*L*YIWPSTVADTLGC 
QGEITRAQGFKTSLSNMTKPHLY 


IZoOO 


ZO/O/ 


A 




1UO 


j j i 


WLLNLFAFLNDKGFI.VPMMRSRI FSOOK 
VKQRSFEGCSSFNDVGSCYVTQAGEQWL 
FTGVIGTL*PGTPG\SSDPPASASQAAG 
TTGS*HHIQIiNLILYSSG*QTI *LL*TT 
DSTHLI FASLYAKEQLITFYFQATSLQP 
CT 


12867 


26768 


A 


12996 


1 


367 


NTE PFLN I FFKDKVS / CS I AQAGGQ WRL 
YSRCSLKLLGSSNSPTSASLVAGSKGRH 
HYTRL*LFFYVSRPQDYLPWSPVFFFSF 
FLSFFFWKGSFFLPPSWKARAPIWVNGS 
LPPRVKPIFLA 

XJ X * X v V XVX7 -X- X Ui* 


12868 


26769 


A 


12997 


3 


379 


YRPSPSETKAVLVFFS FFS FFFLGTPGG 
KGPPKGAPNPTLTWRGKENFFI * KKKKP 
/RLI*KKGPPPKMGFKRERFWVLKPKGP 
QFGKGEKG / SPNYPHPRGDPKAPS LWKK 
PQFPGGGGGGAPKKAS 


12869 


26770 


A 


12998 


230 


409 


SCRPKKLIiAFLKSSTEISQRWKQPIYI/ 
PSTDE* INKMWHIHTMEYYSALKRTKVL 

XT tkJ X XV XV XII XVI iFVXX tl»X 1 X X IXj X X, Mfil IXVXX X ^ V V XJ 

IHATT 


12870 


26771 


A 


12999 


310 


343 


DPVCTII*IICDIINL*IECSKVW*SLKF 
YKI LILFNVCVWPS VGNCHKWLCDCW* 
KQW/LWPGMVAHTCNPSTVRGGGGRIV* 
GQEFETSLGNIVRPSLYNNKSCI 


12871 


26772 


A 


13000 


390 


289 


I KLCQPRGVKLDSFKRAPPFFFFFETRS 
CFVS*AGVQWCN /LGSLQPQTSGPNNPA 
TSA*LSLIFMCSNSYVCIVK 


12872 


26773 


A 


13001 


1 


289 


G I LRQI SWYLPVGRSLAQAGVQRCDLG 
SL* PLPPKVKQFSCVAGTTRGCHHAWLI 
F T FSREG \ GHPVKGAS SDS PTFG FS KAG 
ITRLSPRALACT 


12873 


26774 


A 


13002 


359 


105 


KQKKNPS PPPGGGGGQGKPG/ PPKPFPN 
SRDFPPKNQSPFFPKLKKKFFNFY*KAK 
NP*KPNPPLKKGKKPKGFFSPPPPLKKP 
PNV 


12874 


26775 


A 


13003 


243 


293 


MNSQFLKRL\RIESPYGSVISLLGTYST 
EK*KACPCKTCTQMFTSAi>YIIAKKW\N 
KCPSTVRW / 1 KLWH I NTME FFCQ * KKFH 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Methiomne, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknovvn, *=Stop codon, /=possib!e 
nucleotide deletion, \=possible 
nucleotide insertion 














CIDMPQFVHLTVEGHLSPLFCYYI 


12875 


26776 


A 


13004 


465 


307 


SEACFILKLFMFCG*CCLFVC\SILLVF 
YFVFCFVLFCFAYYMQFFNQCLFG 


12876 


26777 


A 


13005 


207 


3 


DGSLTMLPKLEC/RWLFTGVIMAHCHLE 
/RLGSNDPPASASRVAGTTGVCHHA*LI 
FIFFVKMRSHYPTRP 


12877. ^ 


26778 


A 


13006 


2 


56 


DLALLFRL* CKGM I TAHCNLK / RPG* LL 
GSRDPPTMVS* STGITGMSHEAHRPSPS 
CDF*RSCSWQAWQWHDYSSL 


12878 


26779 


A 


13007 


402 


74 


FGASLLKRHFLFFLS KPLVYNKKI PRPL 
VFLKKT\PPPPGRATPFYYCCFFHKEGP 
GFGVKKEG*PKG/SPPPPQRGSTPPFFL 
GAFKKPKVTTPFFKKNPPKTLKGPPFF 


12879 


26780 


A 


13008 


356 


145 


RPPPFFFFFETVSHPVAHAGVQWHDHSS 
LQL * T P \ S SRE P PAS AS * VAG TTG VH Y L 
PGSQLPILEHSPRR 


12880 


26781 


A 


13009 


227 


353 


GWSELT* / SN*AQWLTPVILAL*EAEAG 
GSSEPRSLRSAWATW 


12881 


26782 


A 


13010 


249 


32 


NPDSHSWAWWCASWPGTQKARVGGSLE 
SRSLRLR*AMIVP\CTPAWVAQEPLSLK 
KPPTIETLFPSPQDPSQ 


12882 


26783 


A 


13011 


247 


403 


TPGGGGKYFFFWGPQKKNRGGGFKNGGG 
GKTRGPP * KQPASSSPFAPPPFFWGAPG 
FSPPPRF\KPPPPVFFLGPPKKKIFPPP 
PGGLFFFFLGGPPSPFFFFFFFIiGGGPL 
FFPGLGAMGPMAGFRSLAPPGNSLSKKK 
KSEGLGEGGNSVLTRVLLISS YQI PGNP 
R 


12883 


26784 


A 


13012 


206 


386 


TREAQTFWFPLYS * KAL/ WLGTVAHT*N 
PSTLGGQGRRITSSQESETSLANMVKPH 
LLLEI 


12884 


26785 


A 


13013 


354 


31 


YKMNIV7DI IHI*HT\YIPWPKGKERKR 
KKLEEYL/QDIMTKNFLNLMKNMNIYFQ 
EVQQTPSRINSQTSTPRHI I I KLSKYKT 
GREQWILSVMLALWEAEWGPPEVRS 


12885 


26786 


A 


13014 


1 


419 


EETS FGKARCLPG YPGFHPRRLRSRSPL 
LS*MPLSHPQGPSQPPAGSMSSSPPATT 
STCSCSPPAAGCTRGRTTGRPFWTASGP 
ASLDWASGSSV/EALTSGLPARGLWGQY 
AGLGVPPRIiLGPMTSCOISLPFPSPARP 


12886 


26787 


A 


13015 


109 


426 


TSLGNIARPH*QTHMATIRSVPPPSWKD 
ST*LRLHPPPPHMCF/PD*SLKSN*Y*H 
YLVNP \ I WPMLTLLFVCLFVCFRDRVSL 
CCPGWSAWPSRLTAASTLWVQVILPP\ 
RLAN* IFFLRWGLAMLPRLV*NSWAGDI 
LPP*PPKVL,GIiQA 


12887 


26788 


A 


13016 


149 


422 


LKRIFFLKWFTHDTAPGDPDNMCPYSC 
F*NQI IKSGLFFFKKPHL/WLGEGAYPC 
NPTLLGGQGGRIT* FQEFETTPANMAKP 
LLYKKYKN 


12888 


26789 


A 


13018 


162 


377 


QS PGYPERVS KLPT I TQRGMWMHQPS FV 
CTSL*NV*TGWAQWLTPVTPAF*ED\AL 
AGGLLEVKNLRPAWST 


12889 


26790 


A 


13019 


205 


427 


IQTGGPPARPFLWPWAPGKK*RVPPEGG 
PPKKKKKTPPPHPGGEREGS/PFSKKKK 
KKTPKKRAGVFFPGGPRPKEGPGWGPPR 
LDSKPVGAPPEPGRIjGNGPGKI FFGPGP 
FFPLOPOKGRP* PGGEGGS F F F FLG VP p 
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T .nj? T)CT T7r~"T TIT DTTXTA 


12890 


26791 


A 


13020 


248 


358 


SRCGGEAQWLLPV/ 1 / PAF * SAEAGGS F 
EPKSLRPAWAT 




Zo/yZ 




1-H/ZJ 


L 


434 


KAQVKKKVWWHM* PQLLGRLEMGGSLEP 
RSLRLQGAMTLPVTSTT\QAWAT 




lb /yi 


7\ 


1 inn 


162 


410 


LilUK.1 AASyiliiSb bbJ blih»Kl PISSES 
GGGGSVSVTQAAEQWRYHSLLQP*TPGD 
KRSSHLGIPHP\SSWDYRISKDWVMPV 


12893 


26794 


A 


13023 


250 


40 


LIKLNI FSCSLAI WTSSCMTSLFKYFAH 
r or F *XDFYSSYYIIjDTNPFFRN/GITN 
ILSSSVTCSFILFW 


12894 


26795 


A 


13025 


264 


1 


VSSTKHGKLRLEPEY*CFIHH*EEKVFE 
SYFEY/PCQTFLRKETVRPGAVAYACNP 
STLGGQCRWII*EQEFETSLTCIAAALR 
VTLR 


12895 


26796 


A 


13026 


110 


523 


CIDSSSWTQTDRCKEDRLCPGTTGTCAP 
GLLFLSAPPLGPF*PSQAHPSFHVSIjSG 
PSPGLRPQLDIPLHPSTSIAPRPLPGPA 
b \1jPRIjSPHPSIjPSPTGACPFIjPPCIiGE 
QRHWPLPTSLLWLGLDGHCLATPSVS 


12896 


26797 


A 


13027 


52 


53 


CPTLLQLTLC/LPPQASAMVDAPP*ARP 
LPPSSI LACCTSGEQGS WVGLAEPVKG 
YNJjLVCRLLRPIjE kcgi wv avs rfs lys 

LSWLPLARKG\NPLTACASWVR 


12897 


26798 


A 


13028 


262 


381 


HFGRLRQVDRLS pgvrds lcnkvkp * Q I 
IYTF* fsvsls vintylkgpstvahtyn 

PSTSGG*GKWIA*AQEFE/TSLCNKVKP 
CLY 


12898 


26799 


A 


13030 


1 


846 


FRPIQAGRAPWALTLPCMGFTEGSPALL 
SHSSRCDIQMGCNTHAVDDSTGEGPFHK 
GRMRESCWLRTPRCGLYTKGVGGPGMEG 
RERRSRSRRQHMPGRGHSTMASWSLRQC 
MRLKLQWASDLLSQLEMQI PSLPPGESG 
SLGLTGPRSLH/CEQTIGLSPTPGESWS 
EELG\GG I VGGGRGTVAGRGGHREAGNK 
PSGTWGRQCCASGDKAMSI *LTSLY\AR 
RGEPGPGRPVRKWLRESSQNRIGGGEGR 

LMWS 


12899 


26800 


A 


13032 


308 


1 


KHYHDFWCWIILALSDAQSLDHTCV\CV 
CVC VCVCVCVLRDQG SHS CL* CEG*LVP 
HFVPGPVLCLLSPVIYVCFICSLTCKVE 
SLSNFFKKLNKALLRHCI 


12900 


26801 


A 


13033 


763 


244 


FLGPRI IGLRHE I S VETQDHKS AVRGNN 
1 HDN YKNVEAGPPKAKGKTDKELYENTG 
QSNFEEHIYGNETSSDYYNFQKPRPSEV 
PQDEDIYILPDSY*L/CQNIDFCYWMIN 
I HCN FSTAKTRNQTKC* STVDW I KKMWY 
i Y 1 1 EYYAAVKK/ DTKLjTWEQKI KYHI F 
SLKSGS 


12901 


26802 


A 


13034 


326 


58 


EHYKCNYCQKIFQKL*/HL*SKKHVYAE 

D FTV V TTV KTVfUn/DT .T .VQ Ci T T T . VT T? VTDr IT 
i-TVX IJM IVLAj V VlrlJij Vo^JL X Jj I X r I 1KVjJ\ 

P*SSCSNFVQNQI I Y I GQKS CKCNKFGN 
TFFKKP 


12902 


26803 


A 


13035 


199 


3 


RFI YNHKVCITTFLPHFYF* KLYL/WLG 
TVAHAWKTSTLGG * GGRI T * GQELSTGL 
DNTAR PHLHTV 
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residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, ]=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q^GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, V=possib!e 
nucleotide insertion 


12903 


26804 


A 


13036 


94 


448 


GS CATSMAFKDTRKTAVE PGVWHG I R I 
KPLE KVCADLI TGKKFKNL/ KVKGPVCM 
PTKTLKITTRKTPCGESSKA\WDGFQMR 
IQK*LTNLHSPSKIVKQIISISGAPGID 
SEVTNADG 


, 12904 


26805 


A 


13038 


964 


1717 


HKLSCVKDLKDFMRQAGE\VTFA\DAHR 
PK\LNEGWEFASYGDLKNAIEKLSGK\ 
EINGKNINLIKGSK\RPSRSRSRSSDPG 
TQKSPLGPRSRSPSPWRKP*QPVQEAR 
SREPGARSQVPVPVK*GPPPCPEKEPQK 
\RGSLQSKILSPPSILWIRPEVSGPRSK 
VPRSVDSGN 


12905 


26806 


A 


13039 


2 


725 
i 


SLLFS PVLPLFLFLPLAYI LQNLKPLQF 
SPDLKSKHLI FFCNAA*PQYKLDRSSK* 
PENGTLNFSILQDLNNSCRKIGKWSEVP 
EVOAFFV SHOSLPSLiCAOCNS /CPNLPS 
FPP\PVPSVPTPSVTESF*SSFSTDPSD 
LSLPPQALLLARPS*VPILPQPPLLHPI 
ILLSPPLLTPRPAYSFIP*L/CPPPPAQ 
QFTLKKVAGAKGIVKVNAPFSLSQIR*H 
LGSFSSNIKIQPSSWLVWQQP 


12906 


26807 


A 


13040 


1131 


2 


DGVSIiLSPRLECSGTISAHCNLHLLCLN 
DS PAS AS * VAG I TGPATTP PL I FVYF F F 
LVETGFHHVGQSGLiELPTD\SDEPPASD 
SQSVRITGVSH/AYLACSQF/CTRC*FP 
II*EQCVNMNINSSK*IFILISYLYRWM 
NKDPI/VIIERLLVRMPEGQSHAAW*NK 
TNKQT+ ILELVLSVTSCGVSVRFNLS IP 
*VFPSTK*EGWMRCVNPSTLT+FIPQVA 
* TCQEHGLMK I QT * NPGLD PS I VS DHQT 
Q * E PS VKQVLD I SHDKLPMS LEELYHCR 
HTLASGKQ I SGVPRTI FLNHFHGFYFI Y 
/CI FEMES CSV/ SQAG/VQWRDLSLLQA 
RS/PGSR/RFSCLS/LPVQHPPPGPANF 
\CIFSGDGVSPC*PGWSQSPDLVIR 


12907 


26808 


A • 


13041 


97 


626 


I CFTQVKVLiNESQVQWCMP WPATWE AG 
AGGVLEPGS*EL* / CPVNSHCPPAWAT 


12908 


26809 


A 


13042 


2 


431 


HDPDTHSQIKVI KLMQSEFCWGWAW*LT 
PVI PALWEAE / AGGS PEVRS S 


12909 


26810 


A 


13043 


405 


1319 


RYRKKLQNKPLWLH / CLLLLPNSATR * T 
LQK+ SK* I SPLESSA* *KEPPFLSVLSV 
TSVKT/SAIPMPPHWENVNTQVPYQLIP 
LHNQTHE YNE VANLFGKTMDRNR IKRIQ 
RIQNLDLWEFFCRKKAQLKKKRGVPQIN 
EQMLFHGTSSEFVEAICIHNFDWR I NG I 
HGAVFGKGTYFARDAAYSSRFCKDDIKH 
GNTFQIHGVSLQQRHLFRTYKSMFLARV 
LIGDYINGDSKYMRPPSKDGSYVNLYDS 
CVDDTWNPKIFWFDANQIYPEYLIDFH 
*FHFQISWKEALFFFAGRFALQSSSH 




ZOO 1 J 


A 








RSRAAA I QLGQRLRLPGPDS PRLTGRQL 
SPGTGRGSRAKRGPCSIFRFRRETPPQR 
EGGEALGATSRQAPPPCPPRQPWELDAS 
AKAAAVAP PKKVG FKAVSRD PRAEALGG 
KGSP\GGKGSLKGVAHR*GGLGPL*GAP 
EAI 


12911 


26812 


A - 


13045 


172 


16 


AHVS P E VD KTKF W \ LG P V AH I CNS S TLG 
GRGGQIT/R/GQEFETNLTNMEKPHLY 



1326 



WO 01/64835 



PCT/US01/04927 



SEQID 


SEQ ID 


M 


SEQID 


Predicted 


Predict- 


Amino acid sequence (A=Alanine 


NO: of 


NO: of 


eth 


NO: in 


beginning 


ed end 


C=Cysteine, D=Aspartic Acid, 


nucleotide 


peptide 


od 


USSN 


nucleotide 


nucle- 


E=Glutamic Acid, F=Phenylalanine, 


sequence 


sequence 




09/515,1 


location 


otide 


G=G1ycine, H=Histidine, I=Iso leu cine, 








26 


correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


K=Lysine, L=Leucine, M=Methionine, 
N^Asparagioe, P=Proline, 

0 = {"Jlutairiinp R=Arpininp 53=&pririA 

>c uiHiaiiJiiit) jix i» i guiiiic, iw/ life, 

T=Threonine, V= Valine, 
W=Tryptophan, Y=Tyrosine, 

X=llnknnwn ^=Sfnn onilnn /=:nncc i l-i l» 

nucleotide deletion, \=possible 
nucleotide insertion 


12912 


26813 


A 


13046 


181 


9 


YKWGPAMLPRIiKCRAI \I6SNMAHCRLD 
LPGSSDPPSSASQVAGTTGTCHLGARGG 
MV 


12913 


26814 


A 


13048 


2 


3507 


YVRVSLPPPPPAAGRPGAAVAJDDAREEE . 

EEAAPPPPPPPPPRLAAARPPGSQPRPP 

AAGEAQ7AAADMNHQQQQQQQKAGEQQLS 

E PEDMEM E AGDTDDP PR I TQN P V INGNV 

ALSDGHNTAEEDMEDDTSWRSEATFQFT 

VERFSRLSESVLSPPCFVRNLPWKIMVM 

PRFYPDRPHQK\SVGFFLQ\CNAESDST 

SWS CHAQAVLKI INYRDDEKSFSRRISH 

LFFHKENDWGFSNFMAWSEVTDPEKGFI 

DDDKVTFE VFVQADAPHGVAWDS KKHTG 

YVGLKNQGATCYMNSLLQTLFFTNQLRK 

AVYMMPTEGDDSSKSVPLALQRVFYEIiQ 

ESDKPVGTKKLTKSFGWETLDSFMQHDV 

QELCRVLLDNVENKMKGTCVEGTIPKLF 

RGKMVS Y I QCKEVDYRS DRREDY YD I QL 

S IKGKKNI FES FVDYVAVEQLDGDNKYD 

AGEHGLQEAEKGVKFLTLPPVLHLQLMR 

FMYDPQTDQNI KINDRFEFPEQLPLDEF 

LQ KTDP KDP AN Y I LHAVLVHS GDNHGGH 

YWYLNPKGDGKWCKFDDDWSRCTKEE 

AI EHNYGGHDDDLS VRHCTNAYMLVYI R 

ESKLSEVLQAVTDHDIPQQLVERLQEEK 

RIEAQKRKERQEAHIiYMQ VQ I VAEDQFC 

GKQGNDiMYDEEKVKYTVFKVLKNSSIAE 

FVQSLSQTMGFPQDQIRLWPMQARSNGT 

KRPAMLDNEADGNKTMIELSDNENPWTI 

FLETVDPELAASGATLPKFDKDHDVMLF 

LKMYDPKTRSLNYCGHI YTP I S CKI RDL 

LP VMCDRAG F I QDTSLI LYEEVKPNLTE 

RIQDYDVSLDKALDELMDGDI IVFQKDD 

PENDNSELPTAKEYFRELYHRVDVI FCD 

KTI PNDPGFWTLSNRMNYFQV7AKTVAQ 

RLNTDPMLLQFFKSQGYRDGPGNPLRHN 

YEGTIiRDLLQFFKPRQPKKLYYQQLKMK 

I TDFENRRS FKC I WLNSQFREEE ITLY P 

D KHGCVRDLLE ECKKAVE LG E KASGKLR 

LLEIVSYKIIGVHOEDELLECL.SPAT c ;P 

TFRIEEIPLDQVDIDKENEMLVTVAHFH 

KEVFGTFG I P FLLR IHQGEH FRE VMKR I 

QSLLDIQEKEFEKFKFAIVMMGRHQYIN 

EDEYEVNLKDFEPQPGNMSHPRPWLGLD 

HFNKAPKRSRYTYLEKAIKIHN 


12914 


26815 


A 


13049 


30 


477 


VRAEHCAVWFRNFEETVPKT < ?VK , RT»MTQ 
PEIASLSWGQMKVKGSNTTYKDCKVWPG 
GSRTWDWRETGTEHSPGVQPAAC\DVKE 
WEKGVOTLVIGRGMSEATjKVP^ ^TVFY 

lkkhgi dmrvlqteqavkeynalvaqgv 
rvggvfhstc 


12915 


26816 


A 


13050 


179 


389 


NI ETIQSMFPI / DNEMKLEINLKKAIWG 
IHKYASQVLWFMPIFPTLWEAKVGEFLE 
PRSSRSVWETWRDPI 


12916 


26817 


A 


13052 


466 


1489 


pgkgemrtrspsplaivprpqrasrpll 
cavspmasasgatakheqilvldppidl 
kfkgpftdwttniiklrnpsdrkvcfkv 
ktt vper y cvrpns g i idpgstvtvsvm 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A^AIanine 
C=Cysteine, D=Aspartic Acid, 
E=G!utamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 












i 

i 
j 


lqpfdydpnekskhkfmv/qifaplnts 
dmeavwneakphelmdskwr/ cpnendk 
lndme pskavplnas kqdgptpqphsas 
lhdtetrklteeckrlqgemmklseenq 
hlrdeglrlrkvaysdkpgststasfrd 
vtsllipsllwi aai ftgflfffwrrsl 
tlvaqagvqrrdlgslqppspgfkqfsh 
lslpsswdyrrppphpanfcifsregvs 

PCWPGWS 


12917 


268 IS 


A 


13053 


892 


189 


SYDPGALGCRPAPRSGRGXGSGATRDMH 
PMSRERAKFVKSGL\YCKTTRELMLHAR 
CCLNQKGTILGLDLQNCSLEDPGPNFHQ 
AHTTVI I DLQANPLKGDIiANTFRGFTQL 
QTL I LFQHVNCPGG I NAWNT I TS Y I DNQ 
I CQGQKNLCNNTGDPEMCPEKGSCVPDG 
PGLLQCVCADGFH/GIQVYAPGLVLTAY 
VLRDSGSHHSIRLHSALGDPAPKSQDFM 
NYIGLTIDLRSI 


12918 


26819 


A 


13054 


423 


12 


SFNQKNPLRQKCSLRPTRPASQSliATWP 
GQSTRFWEQALPWALWDPQSKRAE\MQL 
EP?HAKTWTHAHTCGHTGA\ FCSAHTEV 
Y^/HLHSPVHAHTHAHTHSPPVHTCGCRIi 
HAHTH\PQTPSPPGLLSLLAVARPQS 


12919 


26820 


A 


13055 


16 


349 


RRSGKNDPWTDQSPRAAASKTRCPHPAV 
AAAAMPKRKTEGDAKGDKAKVKDESQKT 
S / ARMS S KS AAS KAYAKP I KAPAMNGEN 
VPNGIQRKLEAFYDLQACRHLTRLIDFA 


12920 


26821 


C 


13056 


716 


594 


MMQTETGVMPPQARQCLEPPGAGTGRKD 
SPLYPSEQSWPL* 


12921 


26822 


A 


13057 


353 


2 


IGNPGKTTFNKPPPPKKFIFSKFFMFLF 
FLVKTRSCYVAQAGLQTPG\ SSDPSASA 
NQSAG I TGVSHHTQS SIIiKYTVFCLTFQ 
TPEYTI PSTWSI PCCSHFSCYLCFLFAC 
FLRQSF 


12922 


26823 


A 


13058 


864 


529 


EVSTS KTAGCRGCALVISMLCI I FPQGL 
FTRALLRTCS ALLHADWPE I S F VWWF 
FKTESCSVAQDGVQWCDLRSLKPPPP\G 
SSDSPASASPVAKITGMHKHARHGLKS 


12923 


26824 


A 


13059 


2714 


1376 


GRYDGLVSQLGGRATPAVGFAMGLERLV 
LLVQAVNPEFKADP WD I YLVASGADTQ 
S AAMALAERLRD E L PG VKLMTNH GGGN F 
KKQFARADKWGARVAWLGES EVANGTA 
WKDLRS ALAVGV I LGVGAL I GWRYWNS 
HQVDSARSASLAYQNAVTAVS EGKPDS I 
PAAEKFAAENKNTYGALASLEIjAQQFVD 
KNELEKAAAQLQQGLADTSDENLKAVIN 
LRLARVQVQLKQADATUiKTliD/TPI KGE 
GWAAI VADLRGEALLSKGDKQGLLS VTL 
LSGCSLFNSEEDWKMSPLPTVENQFTP 
TTAWSTSDGSG I GNFYSNLHPALADNW 
YAADRAGLVKALNADDGKE I WSVS LTE K 
DGWFSKSPAliLSCRVTSVWPNNQTEKFN 
KQRTEGPAVAEPEQAGVRRH YLY PFTGG 
YRRRRAACQAERPAARCATEEHGSRN 


12924 


26825 


A 


13060 


99 


254 


RNHHLVAFKE LS ALKVKRVDW AP WLMP V 
IPAF/LEAKVGRSLEPRSSRPAWAT 


12925 


26826 


A 


13062 


127 


386 


FSYYYYFFFLFFLCVFVIPGEPFLKMKL 
GKPKTIMLKEKKGGESYSELFFETDQME 
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SEQ 10 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Aianine 
G=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=H ist id ine, I=Iso leu cine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S^Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














WL/TPVIPTLWEAEVGGSFEHRSSR 


12926 


26827 


A 


13063 


1 


209 


GKYILLLKVFCCFLRQGLCHLAHCSLRL 
PGS SDSRASASQVAG/ 1 S I TGVHHHARL 
NFVFCFFFLVCFCF 


12927 


26828 


A 


13064 


132 


1 


SHAMCCNYLKRFGWARWLMPV/ IPALWE 
AEAGRSPEVRSSRSAL 


12928 


26829 


A 


13065 


269 


134 


TLWS VFASS / WAWWIiTPVILAP WEAKAG 
GSPEVRSLRICKQKVLS 


12929 


26830 


A 


13066 


853 


461 


RLWRSVSQFLCACVHMCVPSVCVCGLAH 
AGT YA VST C VH / M CV CVCM CAR I TLAK I 
CS ETSGNLLFRKI I SLLDLFTLNFS I KY 
KVQLVLKSRSLAWWGRIKLLTLSRPFSD 
VNTFERINYWLINDIS I IR 


12930 


26831 


A 


13067 


57 


2066 


AQPTGRQTSTVAPTPATARSRGGRVPIF 
PCPHPAPTLL.STSRWTPAS PAASMKAL 
RLSASALFCLLLINGLGAAPPGRPEAQP 
P PL S S EHKE P VAGDAVPG P KDGS AP E VR 
GARNSEPQDEGELFQGVDPRALAAVLLQ 
ALDRPASPPAPSGSQQGPEE3AAEALLT 
ETVRSQTHSLPAAGEP\EPAAPPRPQTP 
ENGPEASDPSEELEALASLLQELRDFSP 
SSAKRQQETAAAETETRTHTLTRWLES 
PGPERVWRASWGEFQARVPERAPLPPPA 
PSQFQARMPDSGPLPETHKFGEGVSSPK 
THLGEALAPLSKAYQGVAAPFPKARRPE 
SALLGGSEAGERLLQQGLAQVEAGRRQA 
EATRQAAAQEERIADLASDLLLQYLLQG 
GARQRGLGGRGLQEAAEERESAREEEEA 
EQERRGGEERVGEEDEEAAEAAEAEADE 
AERARQNALLFAEEEDGEAGAEDKRSQE 
ETPGHRRKEAEGTEEGGEEEDDEEMDPQ 
TIDSLIELSTKLHLPADDWSIIEEVEE 
KRNRKKKAP PE PVPP PRAAP APTHVRS P 
QPPPPAPAPARDELPDWNEVLPPWDREE 
DEVYP PGP YH P FPNY I RPRTLQP PS ALR 
RRHYHHALPPSRHYPGREAQARHAQQEE 
AEAEERRLQEQEELENYI EHVLLRRP 


12931 


26832 


A 


13068 


282 


53 


PGFPQGKFRFKKKILPQDYPFQGAPNSK 
ARPGQGVPPGI PA\LWRVKKGGPLRSGG 
PGPPGAKGGTPFLPIPNTKKS 


12932 


26833 


A 


13069 


177 


2 


VSLCRPGWMECSGVISAHCNLRLPGSSN 
S/PALASRVAGITGARHHALLNSLFKVL 
SRA 


12933 


26834 


A 


13070 


789 


302 


PHPEFYYSFQYFIYFFISFHFTRHFIVH 
YFISSHFISCHLASSSSSSSSSSSSSSS 
SSPPAFHYSFHHFISFHLAIHY\HFIIS 
FHLFHLFS SFHSI SFRDRDLLYCPGWS V 
PPHPAFHYSFHH/VHVMSRHFILRQTRS 
IAQAGIQWRDLCSLQPEVGSSRSA 


12934 


?6Jtt s 


A 


13071 


193 


2 


PPYLKSHSQVDLSPGVQDQPGQHRETPS 
/LTKN/TKVSWVWWCTPVI PARWEVEVR 
ESLEPRCSTRA 


12935 


26836 


A 


13072 


122 


1495 


LLSDFFFFETE/SRSIAQACMQWCYLSS 
LQSLPPGFKRFSCLSLPSSWDDRCPPPC 
LANYCIFSRDRVLPCWPGWSRTPDLR 


12936 


26837 


A 


13073 


178 


347 


CCLCNDNSVLLCMYYLLVCMHACMHALY 
IKTWK/MQLGAVSHACNPSTLGGQGEWI 
T 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W^Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 


12937 


26838 


A 


13074 


165 


34 


FILSFPFETVSYFVAQAGVQW\LIIAHC 
SLDLPRSSDSPTSAP 


12938 


26839 


A 


13075 


205 


105 


GGW VQWFMS IIP / AI WEAKAGGLLEPRN 
SRPVWAT 


12939 


26840 


A 


13076 


3 


357 


HEREYTQAEKRKFIAFFFFKMGSCSVIQ 
ARVQRHDLSSLQPP / RVAGTTGTHYHTW 
LIFVFSRTDSRLPSSPRSPLTSVISLI I 
PLSLIPTTYLTTPSRPSSTSPPSHPSFR 
APLTSDP 


12940 


26841 


A 


13077 


3 


477 


S TPALTARGLS GGS RAS VAAM V L LES EQ 
FLTE LTRL FQKCRTS GSVY I TLKKYDGR 
TKP I PKKGTVEGFEPADNKCLLRATDGK 
KKISTWSSKEVNKFQMAYSNLLRANMD 
GLKKRDKKNKTKKTKAGSNSS /ATTAAP 
AAAATAATTAATTAATAAQ 


12941 


26842 


A 


13078 


1077 


1566 


RQVTSLDPLKTFPRTSLEFTSPTGQLDS 
GATAFTWWPSGAPTPRGRESAAHQGN/ 
YPVGQRS \RMAGLS PRSFCWWEVSFSR/ 
SHSC/ ICWVQLGKSAVLPQQSRGPGAKE 
GSRRRGGSSPPCPPLQTQLVFSPQELSM 
SAP I HRLS VTLQGD C I STGGAPSRCRIiA 


12942 


26843 


A 


13079 


3 


396 


HERGRKMVS VT \ RL I QRLRNWASGHDLQ 
GKLQLRYQEISKRTQPTTKLPEGPSHKL 
FiSIN YY \ CTRDGRRE S VPP FI IMSSHKAL 
VSRM PAYS FAEAATD KKAVTPA P S I YRG 
EQSSDHPYLRHCTLTVTRLL 


12943 


26844 


A 


13080 


467 


109 


SLVPWSPHLKEPSCRYLIENVMLVFKE 
QEFWS KEAGSLFSPPVPFYCWTPS I VFS 
SSFFFLFRCVCFFFFFFCFFFFFF\LIiF 
FGTKKALYLLRARGCKQYKNQKLIWYLT 
FLSLLVK 


12944 


26845 


A 


13081 


458 


51 


VSLFCTVAPLLLLPEGIFFSCSLLEVKA 
PMRDVSGVF LTL FL YS VL F FPLG VFF FL 
FFFLILPPPKIFFFFFVCVALFFFSPAF 
GAVFLF\SLWSSVLPGVGFFTACSFFFF 
FFFFFFFFLRQSFALVAQARMQWR 


12945 


26846 


A 


13082 


23 


327 


RSDSEGGRRGHKYI FI PIYTH7HTYIFI 
YTQIYICLHIYACLS/LICTHIFISIYI 
HTYIYISIYAHIYLYIHTHIYLHIYTHT 
YIYIYLHIHIYSSWVCKGS 


12946 


26S47 


A 


13083 


352 


153 


KKLVE YYSAI KKNSVLIH/VSTWMRLKN 
ITLSLKTQSQRHLYYIIPSVRNVQNRQV 
HKYRSRLVDSED 


12947 


26848 


A 


13084 


121 


417 


DYD Y YC YYY PK I YLTKLL Y F Y \ VGN Y I Y 
THIYVRHVYICETYMPVYTHIHTYICIY 
IYIWKETTWLFSCGISKTYRTHDIAYTY 
LVFCLTHVTFHVSST 


12948 


26849 


A 


13085 


242 


12 


LSSRLPMLLETKEITSKILPSRKL/LPY 
GWCFKS FKI TWVSQ I RNRL\ AQWLTTVI 
PQLWEA\SGLLKPRSRRPAWTTW 


12949 


26850 


A 


13086 


274 


340 


GGEKKTPGGFLEKKSFFGGGIFGPPP/P 
TKRGVFPPSPKKFFFPPKTKNFGRGGGP 
KIPPPKKDFFSKNPPGVFFSPP/YKKKK 
IIFPPPVKLGPPKDFLKSPPPFFFFFYF 
FFFFFFSSQEVNLKALSSTSEI I FFYLV 


12950 


26851 


A 


13087 


241 


405 


P PPS KIH IGNFRKMS DVKNATG WAW WLT 
PVI PARWEAE AG \VSLRPRNSRPAWAA 


12951 


26852 


A 


13088 


537 


375 


KHSLTLSLRIJECRGTI lAHCS /L/ CTSL 
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SEQ m 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D-Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=JLysine, L=Leucine, M=Metbionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S^Serine, 
T=Threonine, V= Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possib!e 
nucleotide deletion, \=possible 
nucleotide insertion 














UGSSDLSLQPQQ IAGTTGACHHTHLEKI 
F 


12952 


26853 


A 


13089 


448 


186 


GALVFPSSQAPPCPPKMEQSFFFFFFFL 
ETGSRSVSKAGVQLHHGSLQPPPPGFER 
SSHFSVTTI/TYLYHHRLALHVFELHIN 
GIMHR 


12953 


26854 


A 


13090 


365 


191 


GQEREKEREREKRRQKKREVE/REEBKE 
RVRDLESESNRALERERSALVDRAPLSR 
PG 


12954 


26855 


A 


13091 


148 


316 


DKMESCSLAQTGVQWHD\LG\YLQPSPL 
GFKRFCLSLSSSWDYRWETSTEPFQQAV 
F 


12955 


26856 


A 


13092 


712 


229 


FVAVJVKMAE YLAS I FGTE KD KVNCS F Y F 
KIGVCRHGDRCSRLHNKPTFSQEVFTEL 
QEKYGE I EEMNVCDNLGDHLVGNVYVKF 
RR E E DGERAVAE LS NRWFNGQAVH/G E C 
TRGGFCNFMHLRPISQNLQRQLYGRGPR 
RRSPPRFHTGHHPRERNHRVFP 


12956 


26857 


A 


13093 


835 


405 


ELIERLGPNQKPPILMWKPMPSPSNMKA 
SAALIiCLLLTAAAFSPQGLAQPVGINTS 
TTCCYRFINKKI PKQRLES YRRTTSSHC 
PREAVIFKTKLDK\EICADPTQKWVQDF 
MKHLDKKTQ\TPKLLNIKDWNLENQAMT 
LRKPN 


12957 


26858 


A 


13094 


1137 


424 


LCPSHFAPTTLTQPGAHKNMCCI KSRFK 
RDLGLCRTCLVNKMFTSS ILGKSHRHSL 
VSINQGNNALWKAAG\PLSWKAGYC\QG 
FSPCDSLKYG\SWDEKDLTVPQPDTHKG 
SVLRWISKRGKPIiAVEIEGRATGLPGLA 
PWGTE \ CLGYKTPI V\HLFNS EMG\ENR 
PYGGEARHVCSNAALLFFTPLRCLGGEK 
HKSGLRAHPVIVLSLELNYDIDSFAHMF 
FADLLLI ITLLSCYI PFC 


12958 


26859 


A 


13095 


] 


324 


ARGERERERERERERERERERERERERE 
RERERERERERERERERERERERERERE 
RERERP/RPRVLFFVRGREGSHPEKCEN 
KKERAPFSRALRKISIETRGGVSITTEA 
CVFVIGAHRVCRSVYIYI 


12959 


26860 


A 


13096 


508 


840 


DGVSLCRPGRTADCSGAISAHCKLRFPG 
SRQSPGLSLPSSWDYRRLKPMRPANFFF 
CIFF\VKTGFHLVSQ/AMGLDLLNS/SI 
PPRLGLPKCW\DYRREATAPGQELLLNR 
TGM 


12960 


26861 


A 


13097 


53 


254 


WPQTASCLVAQAGVQWSDHSSLQRQTPG 
\SSHPPASASQVARITGMHHCAWLHLI I 
LLRAHGS S PLC 


12961 


26862 


A 


13098 j 


329 


67 


GPGAPLLKGEKLLPKEPPPRISPIVSAF 
FAWGPPPG YCFLKKGGPFF\VFPPS FFG 
KKKNSPPPPHINGGAKTFPNPPLLFSSF 
FFF 


12962 


26863 


A 


13099 


253 


444 


EGA F S LGT F YF I FFLVLSNTFLLS I KNQ 
QIWNKKTVW\PSPFLPTLIALTTLLLPI 
SPFILIIL, 


32963 


26864 


A 


13100 


341 


590 


PMASSQICIGRI PPFLYLKWYFWPGKW 
HTCNP KTLGGRGAL I TQGQKFENS LAN \ 
MAKPHLY 


12964 


26865 


A 


13101 


55 


184 


RLREPTPLYSE/HAPTKYYRMAHHNYPH 
S PP I PQPRHHYRVFLL 
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NO: of 
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sequence 


M 
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od 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
tast amino 
acid 

residue of 

npnt i (\ C 

IF V 1^ L 1 w *» 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine y 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q^GIutamine, R = Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \— possible 
nucleotide insertion 


12965 


26866 


A 


13102 


433 


762 


RNLLNCKRGLERKGCRASWPAVPLFLQR 
RVWPRPTAELNEACMLAQHQNCQNVKME 
YFQLSFTKRQE I /WLGWAHACNPSTLG 
G\QGGGIACSQEFKTIPSSKVRPCFYLK 


12966 


26867 


A 


13103 


349 


250 


DLFFSTKPPPPPPPPKNLGPP\GPFGPP 
FQEG 


12967 


26868 


A 


13104 


3 


236 


YPQHYPSCPTFQYTLKTKNHNFIirFFEV 
GSCSWQARVQWCHHGSLQPP/NSSDPP 
ASAFQVAGTPGVSHRAPPKNQNF 


12968 


26869 


A 


13106 


154 


4 


KKVKNTVHYNG KYFLKSQNQS TFLKE I L 
/ WLG G VAHTCNP S TLGGRS GR I T 


12969 


26870 


A 


13107 


391 


150 


RPPKKPNKKYSQITSKGIKMVYTRKYLT 
QKKKAMSQALWY/V/HAVPATQEAEVGG 
SCEPGSSRPRCTMMPMNTHCAPAWAT 


12970 


26871 


A 


13108 


90 


252 


KVGDGGR CL VF C FY FKRQG F T VAQAG VQ 
W\ATIAHCSLKLLGSSDPQNYEDSWGT 


12971 


26872 


A 


13109 


197 


414 


LYEF I KITVLLGLGCPLGRYGCSNQRFF 
S LEME FHTVAWARVQWHDLDS LQPLP PR 
PNRFSCLR/YPQSSWDYR 


12972 


26873 


A 


13110 


2 


317 


GRVGANVHKGHRQRTYGSVI PHILPLHV 
LKKTFSLRDFHFSVSLKK\NLVLTCLHL 
FL/GVRTPRNDPFVSMMLLFTAFLDRPS 
T I LjGTGLLYTEGLT VALRIiAYLR 


12973 


26874 


A 


13111 


413 


162 


L I LLPQP AECLQLQASAATLD CLGLPRC 
RDCRR S LVHSVLNGAQAGVQWRDLGSLQ 
PPPPS \TCLGLPKYRDCSLCPAATPSGK 


12974 


26875 


A 


13112 


40 


296 


ESRLSALYIYHICVCVYNREHLLYGIHM 
/SI HT YTHTHTK TH I YI HMRAFALEDKF 
I CSLLCSRQDNAF I LVSVKLQRKNYFLT 
RR 


12975 


26876 


A 


13113 


2 


288 


FEPRCKNSARGKVPRGFSPNLPHVTVET 
PEGSKTGFELESR/RRHRQIHSSAQCPL 
TAVPGAGDAI PEDASGWHTWLPLHAQNC 
FLLY IQAPSQPPA 


12976 


26877 


A 


13114 


685 


340 


LS PPRAGSARPTGP PTDAPGQRS TWTCG 
ALKPRRRALRDAAENIiFQELQEHFQALT 
ATLiNLRME EMGNR IEDLQK\ NVNDIj\MV 
QAGIENS I KEQMLKTVTANMSVFGDGAS 
YRSH 


12977 


26878 


A 


13115 

! 


65 


3021 


GVLTQMGDEKDS WKVKTLDE I LQEKKRR 
KEQEEKAEIKRLKNSDDRDSKRDSLEEG 
ELRDHCME I TIRNS P YRREDSMEDRGEE 
DDSLAIKPPQQMSWKEKVHHRKDEKRKE 
KCRHHSHSAEGGKHARVKEREHERRKRH 
REEQDKARREWERQKRREMAREHSRRER 
DRLEQLERKRERERKMREQQKEQREQKE 
RERRAEERRKEREARREVSAHHRTMRED 
YSDKVKASHWSRSPPRPPRERFELGDGR 
KPVKEEKMEERDLLSDLQDISDSERKTS 
SAESSSAESGSGSEEESEEEEEEEEEGS 
TSEESEEEEEEEEEEEEETGSNSEEASE 
QSAEEVSEEEMSEDEERENENHLLVGKN 
LPGDRVPVPESRFDRDSGESEEAEEEVG 
EGTPQ \ S S AltTEGDYVPDSLPLS P I ELK 
QELPKYLPALQGCRSVDEFQCLNRIEEG 
T YGWYRAKDKKTDE IVALKRLKMEKEK 
EGFPITSLREINTILKAQHPNIVTVREI 
WGSNMO K I YX VNfN YVEHDLKS LMETMK 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
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SEQID 
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09/515,1 
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beginning 

nucleotide 

location 

correspond 

ing to first 

nmino qpiH 

<HJ11IJV <lvlU 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 

Hi no in 

last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, J=Iso)eucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 

V^vjiu ia in me, rv t\t glume, (j^kJci lllc, 

T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unkno\vn, ^Stop codon,/=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














QP FLPGEVKTLM I QLLRGVKHLHDNWI L 
HRDLKTSNLLLSHAGILKVGDFGLAREY 
GSPLKAYTPVWTQWYRAPELLLGAKEY 
STAVDMWS VGCI FGELLTQKPLFPGNSE 
IDQINKVFKELGTPSEKIWPGYSELPW 
KKMTFSEHPYNNLRKRFGALLSDQGFDL 
MNKFLTYFPGRRISAEDGLKHEYFRETP 
LPIDPSMFPTWPAKSEQQRV\KRGTSPR 
PP\EGGLGYSQLGDDDLKETG\FHLTTT 
NPGGLLPRGPGFSLRVAEVSELDPVIGR 
NSAGDHRRGYLPAWNCDGTRNSSSYFVL 
PC F VFV F LVC K"L «5R T KS F S "LL WRKE I>C F 
LRDL PRAERVG I FRGAHVGQHKPHHHPL 
PLSTRTGLGWAVIWKGTGGSRVDCLIFG 
AGDLFLCWDERCPLAPTKSSRPALSTTS 
PRPPTTOPOTiIjPAGT.FOVFYYICWGVL.K 
CIKIFFEE 


12978 


26879 


A 


13116 


469 


202 


MI I I VGLCEYTKSAVLVQHSDPLPPAPG 
VFFLAVLDWD 1 K 1 WHPVFCL.FVCLYE 

v x i. xm v xju v v u x x\. x v v *x xr v x \<ijr v <w xj x ft 

MESCPVAQAGVQWRDLGSLQPPP\PGSS 
TSCF 


I Ay ly 


ZOo&U 




13 11/ 




j0J 


rtrir\.i T irtO xrr v r vv r w r o i r \ojuidxl vuv 
CVCVCVCVCRMHTVQLYFFQVGCQCSSL 
LNPLFPMN 


12980 


26881 


A 


13118 


146 


350 


HYLVKFLLLAHPGSKAPPLNTLQPPPLA 
TREOPPLTVTFHYPPTSYKMA D \PYLPS 
LTLFGLSPPAPR 


12981 


26882 


A 


13119 


124 


254 


HALLQTLRDLS QAWWHALVI PATW \ EAE 
AGGS L E PR S S R PAW AT 


12982 


26883 


A 


13120 


150 


45 


SLPRLECSFTVLAH/C/NLHLLGSSDSP 
ASASQWGGI 


12983 


26884 


A 


13121 


336 


446 


IPLKISSWAQWLTPV/IMPAVWEAEAGG 
SLEVRSSGPA 


12984 


26885 


A 


13122 


181 


39 


QNLTAQDGVQWCDLGSLQPP\LPSSWDY 
RRESLCPSSFFIFWKRWGFT 




/OooO 




i J 1£j 






GPA VP qfiT ,VT .TCHKPMRRT ,CA 3 H ^ PF <sT_r G 
HDKGQGTLPRIPQ/PIxIiGEGGSR/PLAW 
GAVAG L P SNR P R I V PLPAP TRS GTR VRP 
HTGHQPGMSPGCGCESKWPHSAAAWPEA 
YPHFFLLPFPTQGCEVLGPLYTADPWVI 
CVSLLPSCPNSTAVDVD 


12986 


26887 


A 


13124 


429 


263 


DHFSFLCVTGSQSVAQAGVQW \LI IAHC 
SLKLLASSDPPASVFQSTRITGPFLSFT 


12987 


26888 


A 


13125 


310 


486 


NKKTTRGESSKQRQPRVSKHPGWGTVAP 
SGFTAASAS\RARVILLPRPPE 


12988 


26889 


A 


13126 


124 


461 


GS PLQLLPS PLAALTR DCS EAPMGSCS V 
AQDGVQWRDLGS LQRLPPG FE P FS CLS P 
TPAP\FPSGWDYR 


12989 


26890 


A 


13127 


2 


337 


RGAAPAAMAVTAIiAARTWLGVWGVRTMQ 
ARGFGSDQSENVDRGAGS I REAGGAFGK 
^FOAFFFRYFRAO^REOxxAALKXA hhdf 
EIVHHKKEIERLQKEIERKKQKIKMLKP 


12990 


26891 


A 


13128 


681 


422 


CRSDRVJAKEHRGKRGQDSS KDVMARLME 
APKQTAQYFFIFYFFETKSYSVTQAGVQ 
WLDLGSLQRPPPG\SSDSPASASCAWPQ 
T AH 


12991 


26892 


A 


13129 


299 


579 


LVMFCKVTKIERLNLWRPGTVAHVLWSQ 
H FGR PRWAVHVG WG VRDP / RLTQHGE T P 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
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26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino ncid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenylaIanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, LHLeucine, M=Methionine, 
N=Asparagine, P=Proline, 
O^Crluf amine R=Arpinine S^Serinp 
T=Threonine, V=Valine, 
AV=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














SLLE \NRNI SQAWW CMP WPATREAE AR 
ESLEPGGQRLQ 


12992 


26893 


A 


13130 


268 


438 


KVDKSMKMRKNRP KNAENS KNKKAS S P P 
NDCNSSPARVQ/ SWTENEFDKX.TEVGFR 
KW 




96894 


A 




1 108 


I JzJs 


KI HS S FVLLNAKLVAVAAKTPKFYKKKT 
DLPQTYHSAQTAVPEIPVSAHSTFIRSF 
IHSFI FETES YSVAQAGIQWHDLPS \ PP 
GS \ SDS P \ AS VSRVAGTIGVCQQAWLFF 
V\FLVET\GS FPHLS PLTM IRNKKQLFK 
KGH 


12994 


26895 


A 


13133 


497 


370 


RPSTVSPRLECSGLILAHCS\IjDPPASA 
SQTAETTYGHKHAS 


12995 


26896 


A 


13134 


217. 


373 


CNHTEITYVNKKKHRPGTVAH/SLYNPS 
TLGGQGGLITQAQEFDTSLGNMTLS 


12996 


26897 


A 


13135 


257 


507 


PVS PGFPPL\CPVPHTPHLCPPCRYPES 
PGQPAAQHCGAPRQPQPNPRACSSRGLF 
RCPPACHHAPSCLLCVCPPLPQCCRVQG 


12997 


26898 


A 


13136 


137 


309 


PS FPTHPQFCFLFCFLFVFETE /SSIVA 
QAEVQWCDLGSPDQGSSASPPSASSLPL 
SIW 


12998 




A 


13137 


?97 


419 


GGS AVYY I WE I VQVLWLMP VI PT\ WEAY 
AGGLFEPRSSRPA 


12999 


26900 


A 


13138 


213 


492 


QVG I NYQP PTWP VGDLAKVQRAVCMLS 
NTTAI TEAWGLPGAI SS AKCALVHW\ YV 
GEGMAVGEFSEAREDLAALEDYETCGPW 
NPVETEAERR 


13000 


26901 


A 


13139 


246 


534 


DRVLPLLPRLECSSAILGHCP/APASQV 
TGTTGTHHHTQL I FLLYCWLCPPPLASE 
ASAESPPLLPRGFLCFSSTTSSITAIPA 
TTRDYVDVSRSGS 


13001 


26902 


A 


13140 


1 


206 


SLEWPFFFSFSETGSHSFTQAIVQWCNH 
SSL\SPTSASQIAGTTGVHNHAQI*/VFV 
FCFAFMSPQLSNSC 


13002 




A 


13141 


164 




GLVLVNLYLRS YLMVF I YINS RLLQVTD 
ASRERRENESSWFCRSVCRM/PPVGPGP 
SCVKPCSPRGSISPCTPHCPLKTTQAPA 
\PPRPLPRVGPGPHLDQSCCVAARPLYD 
PILVAAHSSWPAPPSPVPVPQEDTEHGA 
G 


13003 


26904 


A 


13142 


235 


367 


LALNTFCWLGWAHACNPSTLGGQGEWV 
TRGQE \QANMAKTCL Y 


13004 


26905 


A 


13143 


91 


403 


GALQPATAPWEPLSGLAEAGAISPCLQG 
G/SGGRGASENRGCTRPGRVPGGRRLNR 
PRAWSSWPAPPAPGSERPSAINCPRAEE 
CGRRVWDWQAALPAAPAWVPV 


13005 


26906 


A 


13144 


558 


147 


GHSFQNWLLGCPLQLQRHLLHPPDASEE 
CGSPPWTPLRPHIHPSAGPHGSTHK\HA 
GGCLFPLRPSPTEPCGPPVPLPWSQ\QA 
PLCPEPPIHSPHDWECDPGKSALIPPPA 
AQSPSRLIKHAANEPECSPRNRRPGIP | 


13006 


26907 


A 


13145 


321 


421 


GCAQWLTPV/ IPALWEAKAGESLLFRSL 
RPAWAI 


13007 


26908 


A 


13146 


3 


398 


IELLQSHDKTLTDKKLLLLDEQRKWFLE 
MESTPGEDVNNVELTTKDLEYHINLDD\ 
KAGFERIDSNFERVSAVG/ KMLSNS I AC 
YRE I FHERS VKVAN FTVAI LPQPHKLS V 
TIMLI IQQPSTLRQDLLIAKR 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

aminn arid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenyl alanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
KHLysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q— Glutamine, R = Arginine, S = Serine, 
T=Tbreonine, V=VaIine, f 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \— possible 
nucleotide insertion 


13008 


26909 


A 


13147 


228 


261 


SKTQRSSEYYSLSVCVCVCVRVCIYIYI 
FF FFETE FLS \ APQAG VQRCDLS S LRL P 
PP\GSSDSPFSTSQSAWDYRREPQRPTS 
LHYFVQLKHNLHRVKRRDLRNTTR 


13009 


26910 


A 


13148 


3 


275 


KHLSDLQ ILRLGLSS KTRKTRYSHFAQG 
RSIFFLLRPSVDWIKPACMDGNLLFSKF 
TNLNVNI \TKKHPHRNVQNNVWPNVWAP 
WPKEVDT 


13010 


26911 


A 


13149 


218 


382 


AWDHI KLESFYTAKETT/RMKRQPTEWE 
KI FANYPSDEGLVTRMYKELKQLYRKKI 


13011 


26912 


A 


13150 


391 


29 


FCFLFLFMFLFFFEMQDGVQWCNLSSPQ 
PPPPG\SSHSPASASQWEAENCIjNPGG 
GGCSLGDRARLPQRKRERERETERKRKR 
ERASCAFCGAITFWMWCQKHHTCNHLP 
WLTVFETNKL 


13012 


26913 


A 


13151 


351 


157 


FG YSR FLCVY INFRTNLS I YN Y I C I Y IN 
LY L\N I HT YTLI YT YVHT YTH I H I F Y I Y 
IGILTGIT 


13013 


26914 


A 


13152 


402 


111 


TFFGGEKPPFTLRPGPPRREGGGGPPGP 
REGGFFRVPLGAFGPSRPK/RPEKSGPV 
WAPQIFLWGGKSRAIKESGKGFFFSRIT 
LL VFATE KKKKTTL F 


13014 


26915 


A 


13153 


310 


141 


REHEKQKPLS QVRWCTPLVQAAQEAEAG 
GELEPRSSRLQCSMT\TPVNSHRPPARV 
T 


13015 


26916 


A 


13154 


153 


21 


LNKCVISWLGAVAHACN/ PSTLEGRWIT 
QGQEFKTSLGSLAKPHLY 


13016 


26917 


A 


13155 


429 


275 


S VDI PL E KTL I NKD VFKEAAFKC KVS /R 
QARVKFKKRYKPVKTKWFFQKLWF 


13017 


26918 


A 


13156 


303 


42 

V 


YRERAPVYPIIINIAFGPHKQ\TRAHTH 
THTHTRMHAHTHTHPAI I PAFAKNGLGT 
FHHKGRTYNDKVKPRLSCEAAI PSNLKS 
REFR 


13018 


26919 


A 


13157 


198 


13 


S HG VA I LLF FHFLNKRA FT \ Y S PAMNS S 
LCEIQBPSLG\SGSGPLSGNRMSGKGHR 
KYFAKI 


13019 


26920 


A 


13158 


192 


2 


I SNSHRKD X KDPE VLLI EQKY I LSGQMQ 
WLTPVIPAL/V/EAEAGRSLEFRSLRPA 
WATDRDSVSK 


13020 


26921 


A 


13159 


40 


190 


RVDPRVRESRSVAQAGMQWRDLGSLQAP 
P\PGSRHSPASASQVALFLNRK 


13021 


26922 


A 


13160 


1 


178 


PTMVLS PADKTNVKA/ AWGKVGAHAGEY 
GAEALERMFLSFPTTKTYFPHFDLSHGV 
SSYL 


13022 


26923 


A 


13161 


411 


41 


ES LQI C VGLHAKPS WDACRP WLGQAWFR 
YFLLTFLGPLVFLLVFWFSKIYFLYHLK 
NPSLSS CYNLD\ FNTQYIiKYCKKKKRLG 
AVAYACNPSILGSQGGRITRGQEFETRL 
GNMAKPDAWVDP 


13023 


26924 


A 


13162 


372 


218 


FWKRSWISKKGLFKRGVSLSLSLS/CDT 
HTHTHTHTHTHAHSGYLCVLRKVK 


13024 


26925 




ni6T 


947 


89 


QGLVLSPWLKCRGGITAHCGLLGSSHPT 
TSA\QVAGTTGTQHHAQLKFFILI IL 


13025 


26926 


A 


13164 


122 


409 


FPSTAIANSH/ CSS PQGCSAYGVAI RL.F 
LRSSNKLAL\LYGLAL/NSFFCNFQEPS 
\LGSLSGPLSDNKSNWLCCFKSPSSWLF 
AIAVLGNYYSLNTVRSGRRDPGTPNACS 


13026 


26927 


A 


13S65 


69 


313 


PS VSLFFPPiNxiiTIVIKMGLLKINS I r GE 
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GLALSSRMECSVTITVHCGLDLLGSGGP 
PSAAS / RVAVTAGTCKHS WLMFLS FV 


13027 


26928 


A 


13166 


310 


26 


IKSMTPLPALNVSLSYYLPPLRPCRERG 
MVQDRFFCFLFCCCCFI FSETGSHSAAQ 
AVSQNKKTTGWAWWLTPVI PATHEAEAG 
\ ES LERGRQRLQ 


13028 


26929 


A 


13167 


296 


68 


KNGPLNLATKSLLILTMKCLGVVLGGKP 
NCSKLKKNKI KTKKWGWAPWLTPILPA/ 
LLEAEAGRSIiEARNLRPVWT 


13029 


26930 


A 


13168 


2 


235 


GATPCPREPSLFPLAGWAPLSCWAPPPH 
CTQGTTPPGLMGTG\APIAI IGGNARSP 
PMPFLPLSGTRRALSAQECPTF 


13030 


26931 


A 


13169 


248 


406 


TERCGFDLLS LGQVQWLTS VI PALE / E A 
EAGELLESRSWRPAWATWQNLISLKK 


13031 


26932 


A 


13170 


33 


308 


NS LQGAEMAAAQS S LFS AS EG YRGERGR 
GRGRGRPGRG/RSEGDRGGRGRPGRGRG 
RKVLLLRI C VWRRVLRLWNRGRIGVGGR 
VTYLMESEI 


13032 


26933 


A 


13171 


294 


56 


ICKRNYLFFIYFLREGLTLLPRLECSGA 
ITAQCSIAGSSDPPTSASTVSGTTGACQ 
CALSSRDALISLCYPGWSQAPGLfK 


13033 


26934 


A 


13172 


71 


476 


DSLNYDNHIYQMPLIYINMGLAVTISLL 
G\ILVYRSHLISSLLCLEGI ILSLFI IA 
TLI TLNTHSLLSNI VP IAI LVFAACEAA 
VGWEECGFRSRPTALILEGPLRSCQPRT 
CVRRPACTRRGPAFVGKNRHS IGG 


13034 


26935 


A 


13173 


86 


318 


NNCNVCVCVCVCVCVCVCIHICIYV/YV 
YICIYTYKYICIYTYKYICICVYIHVYM 
YMCVYVYICICMCVYICIDSKGR 


13035 


26936 


A 


13174 


466 


726 


NCMKFGAVTRIG\DLPWEINPLSSCSLL 
REKD P PTTSGPQTDQPKKHLTNFKSGKR 
PLFTLFSNLPHYPSTSFSFQSWRHTSIS 
PFS 


13036 


26937 


A 


13175 


128 


325 


FLRVI LI CHKM YGCVCVCM/ YACI CNGM 
CI YMCVCVKSWKLKP ELKCLC I SDE VGW 
VLTQWAGREEN 


13037 


26938 


A 


13176 


134 


311 


INSNSEKMGSHYVAQDGVE/WQGLFTGH 
GI AKCNLKLLGSSNFSAS I SQGAGTTG I 
VANG 


13038 


26939 


A 


13177 


577 


1052 


SPTSTRTGSGVAMWLSQPMGRT\CRSKV 
ASKSRLHGPLIWAHPPIVRCPMLRHHY\ 
KAQAGRGLS LE E \ LR VAG I YKKVAQT I G 
IS\EDARRRNQST\QALQAKVQRLK\ED 
RSSLILFPRK\PLAPKKGDSSAEELELD 
TQLTGPEMPIGNVYKEKARVIAD 


13039 


26940 


A 


1317S 


496 


678 


TLNFW?AQGLKITRPWKNAPVFPVIW\R 
LRRKNLLTLKGEICSDPKWPYCLPPWKT 
KKNLV 


13040 


26941 


A 


13179 


192 


421 


G I YTF FLS FFLFLRQS LTLS PRAGVQWC 
NLCLPGSSDSPASASRVPG\ATGT\CHH 
GSANFLYFLVGDRVFGYVGPR 


13041 


26942 


A 


13180 


144 


5 


LQLAI KLLKMRLG WAHICNPS / TLGGW 
G\GRSQGQEFETSLANIVKP 


13042 


26943 


A 


13181 


351 


51 


KKKKK I FFGTKKRR FF FLKG VWG P PP PK 
VSPPPPPKISPNPPKNNPLLKA/IKSPQ 
NWLFFFGPPSCKFFPP FFLNPKNS PWEN 
PPSGGEFSPKKKKKKSAI 


13043 ! 26944 


A 


13182 


2 


218 


I HFGC FVFLFFETE FCS CH PGCRAG VQW 
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RD\LAHNSLLPPPLG\SSDPPTSASQSA 
EITGMSHHACPIFKKHS 


13044 


26945 


A 


13183 


33 


304 


EKKG I FRAVKI LCM I LQRWT YATHLSRP 
TGYTAPRVNPKLWTLGDNDVSM/SGSLI 
VTNTALGWGLLIRRSSACMGQGDMGNLC 
TFLSILL 


13045 


26946 


A 


33184 


277 


13 


FSFFLIVEMGSCYIAQAGLELLGSSNPP 
S SASQSAGI TGMSHHPAWLATLKRAFVS 
D\NSLSFPSSENAHLSILAGHFCLMGFF 
FFQ ' 


13046 


26947 


A 


13185 


140 


361 


ITCCCVCIYTHIHTYVYVCVYIYTHTYI 
RVC V/ YY IHTHT Y I HI Y IHVF I YI Y I YT 
FMHT Y I YP YVCM VLLQI/TL 


13047 


26948 


A 


13186 


64 


759 


CLS AE S APT S TMPMTLG Y WD I RG LAHA I 
RLLLEYTDSSYVEKKYTLGDAPDYDRSQ 
WLNEKFKLGLDFPNLPYLIDGAHKITQS 
NAILRYIARKHNLCGETEEEKIRVDILE 
NQVMDNHMELVRLCYDPDFEKLKPKYLE 
ELPEKLKALTS\EFSGGKRPWF/SQGDK 
I TFVDFIiAYDVLDMKRI FE PKCLDAFLN 
LKDF I SR FEGLKKI SAYMKSSQFLRGLL 
FGKSATWNSK 


13048 


26949 


A 


13187 


41 


426 


LFFFFFESGPCWVTQAGGRR/WNDHGSL 
QPG FPGLKGS P CLTLRRS WNYR PGMCRH 
TQLVFAFFFFPREKKFLLGAPTGFKTRG 
LRGPSRFGFQKGGNKGQEPPPRAKGLI F 
LEPERGQAFFGVFFGPTQ 


13049 


26950 


A 


13188 


39 


197 


FFFLFKTRSCS FIQAGVQWCEHSSLQPQ 
TPG \ S SAAPAS AS CKAGT TEAALKL 


13050 


26951 


A 


13189 


221 


396 


GIQLQGAEGLWDVTNSPFCWRLYDQKTV 
YHECRMWANSHCP \ P PKGLLRD I TPRCW 
AP 


13051 


26952 


A 


13190 


2 


826 


PGST I S SRRRGACGSRGGHFPS PRGGSG 
VASLERAESWSTEPAKAIKPIDRKSVHQ 
I CSGQWLSLS TAVKKI VENS LDAGATN 
IDLKLKDYGMDLIEVSGNGCGVEEENFK 
GL\TLSALKHKTS K I QEFADLTRVETFG 
FRGEALSSLCALSDVTISTCHVSAKVGT 
RLVFDHDGKI I QKTP YPHPRGTTVS VKQ 
LFSTLPVRHKE FQRNI KKKRACFPFAFC 
RDCQFLEGSPAMLPVQP\AKLTPRSTPP 
HPCSLEDNVITVFSSVKNGPGSSR 


13052 


26953 


A 


13191 


235 


1 


SPCARQCCPPNPAGQEPRRRLERGPGKW 
WPRSI KFPLPAV/RARFPLLPSAPLRQL 
VSGRVGGRVGRPGKAVQVSGGLN 


13053 


26954 


A 


13192 


2 


386 


VIYLLLFFETESCCVAQARDGVSPCWPG 
WFQTPG\SSDLPASAPK/VAGITSVSHR 
TRPDVRGFKKC /GLQLEMSKLALLL/TT 
SHRFWYFVLLFS FVS KGFFI FFFFLFFF 
FFLGKGFFFFFPRGGGGGQI 


13054 


26955 


A 


13193 


749 


506 


GQTKAFVLS YCGAS PS I KQ / P CPQAKDH 
PLEPSMHPEGTQLQSCSTMLGPRQLSSE 
KQPLLPPRSHLKSSPMtiRACKGLTS 


13055 


26956 


A 


13194 


73 


297 


RMPGFGALGSPFSCQRIECPC\AGAGSC 
TYASFCKCKEYKCTSCKKSECGAFPGNQ 
GAGQSQRRE PRAQQAGAGQ 


13056 


26957 


A 


13195 


670. 


390 


PRETY I KP FARS IGCQPPKTIHPTSRRP 
STRPP AR.PHAR PPVHTP /APSVHMSDRP 
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N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptopban, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
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STCPPSIH\RPPSTDVSRPQYSLSPSTS 
LALPGQSCLGR 


13057 


26958 


A 


13196 


2 


438 


LSFTS I EFCHLLFLNQLICNYVI FTKSS 
TISCEALGRLLWYPWTQRFFESFGDLS 
TPDAVMGNPKVKAHG KKVLGAFSDGLAH 
L\PPLKPTFATVSELHCAKLHVDPENFT 
LLGNVLVCV / MKHNFGK 


13058 


26959 


A 


13197 


57 


499 • 


SPSWPRSSARPCSGHCLLPGLWDHGKAP 
CLRAARHSQGHGFRKGRKEGCGLWAAGW 
TKNPVPTWRAG/SRRGPGVTSPPPVPSV 
QNPP\PPQRREGGSQATRPVLFTPPPKH 
MEGPNAPILGDQRITAPPPEILYRPPRA 
ATCDALRRG 


13059 


26960 


A 


13198 


34 


465 


G I LP YS P CVLRAPNS KRVR KCGCTTPRH 
L/ CKATRKGGCTTPRAPLNETRKCGCTT 
PRHLSRQLGNAGALPRGHLNETRKCX3CT 
T PRAP LNATRKCG CS TPRHLS RQLGNAG 
ALPRGHPLKETRKCGGATPADAFYETKK 
IGGGL 


13060 


26961 


A 


13200 


177 


1075 


PTSSSMAFPKKKLQGLVAATITPMTENG 
EINFSVIGQYVDYIiVKEC3GVKNIFVNGT 
TGEGIiSLSVSERRQVAEEWVTKGKDKLD 
QVI IHVGALSLKESQELAQHAAE IGADG 
IAVIAPFFLKPWTKDILINFLKEVAAAA 
PALPFYYYHI PALTGVKIRAEELLDGI L 
DKI PT FQGLKFS DTDLLDFGQC VDQNRQ 
QQFAFLFGVDEQLLSALVMGATGAVGST 
YNYLGKKTNQI V\ EAFEQKDFSLALNYQ 
FCIQRFINFWKLGFGVSQTKAIMTIiVS 
GDSNGPTPASTCRKPPGEFT 


13061 


26962 


A 


13201 


35 


464 


VQEFKTS LGNVAKP CL YKKKKKKS P LGG 
GVPPPPKKTWVGGTLSPQKIRPAGPLIF 
PPPLPRGQQGEPLS P I KGGGGGGSS \ YP 
PPPPPCLTGRA/PPPSPISAKSSPSPPP 
CQ YV YHP PLLPL F FFLHALRRATAP LFL 
FLVRA 


13062 


26963 


A 


13202 


1153 


259 


AGGVLRLG WTGS RMAS DSGNQGTLCTL 
E FAVQMTCQS CVDAVRKSLQG VAGVQDV 
EVHLEDQMVLVHTTLPSQEVQALL\EGT 
GRQPVLKGMG\TGQFSESGGQPVAI PGG 
GLGTVQGWRFLQLTPERCLIEGTI \PG 
LEPGLHGLHVHQYGDLTNNCNS CGNHFN 
PDGAS \ HGG PQDSDRH\ RGDLG\NVGAN 
AAGGAFFRMEDEQLKVWDVIGRSL\ IID 
EG\EDDPGAREGQSLYPKITRELPGSRL 
SCGI\IAKSAGLFPEPQSKICSCDGLT\ 
I WEERGRP I AGKG \ RKES AQP PAHL 


13063 


26964 


A 


13203 


281 


420 


VDGSKKYWEMPVIPALWEAKAGGLIi\EP 
RKS RTAWATREDS VS TKN 


13064 


26965 


A 


13204 


3 


1125 


SDSPQTPRMRVMAPRTLILLLSGALALT 
ETOACSHSMRYFYTAVSRPGRGEPRFIA 
VGYVDDTQFVRFDS DAASPRGE PRAP \ W 
VEQEGPEYWDRETQKYKRQAQTDRVSLR 
NLRGYYNQS EAGSHTLQWMYGCDLGPDG 
RLLRGYDQS AYBGKDYI ALNEHLRS CTA 
ADTAAQ I TQRKWEAARAAEQWRAYLEGT 
CVEWLRRYLENGKETLQRAEHPKTHVTH 
HPVSDHEATLRCWALGFYPAE I TLTWQR 
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nucleotide deletion, V=possible 
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DGEDQTQDTELVETRPAGDGTFQKWAAV 
WPSGEEQRYTCHVQHEGLPEPLTLRWE 
PSSQPTI P I VGIVAGLAVLAVLAVLGAV 
MAWMCRRKSSGGKGGSCSQAASSNSAQ 
GSDESLIACKA 


13065 


26966 


A 


13205 


417 


13 


GHPRVMLQCHMGS ASLSIMTPP P IRSYQ 
PALTRSSATKPA\TSAPSITPMTPVTAN 
IS/VWPTANTTTTKLSNPTSS/STLSTP 
IWHPSILPAPPSTPAPKTTVTNSATTST 
I CFHVQLFQAHHCHHCIAAALEYTS 


13066 


26967 


A 


13206 


1139 


559 


GLSELSIHPSVARVQEGHGATPERSIPW 
PGGKPRRGGRGTGLREWGGGSHRSLGQT 
KKKGAKAKRDHCAHG P IAAFQG P FPRAT 
QLLSPCHGCSWKENQPP*GPPGHACTAW 
E KTVTHGS CR P VTKP S PDPGFVGQAPGT 
KGNPAPGMDGCAGFRI SCHPVP ITKPLS 
NTG I CVLLC YE FG FL I A I VGCWG FK 


13067 


26968 


A 


13207 


306 


332 


ENGE I R I Y I Y I CLYL I KKHWEEAGCGSV 
HL* SQHLKRLKWEDYLRPGVPDQPGQHR 
ETPSQKLNK* INNDF 


13068 


26969 


A 


13208 


48 


233 


GKQNHLKCLWCRLLFKuSAVISRNVYIL 
T*LSLSLSQCVCVCVCVCVCVCVCVAIL 
ENYPR 


13069 


26970 


A 


13209 


250 


2 


Y CEGDLG S S R PG VS KCRE FS PRRDKR I M 
PWPGAVTYAYNPSTLGGRDGWIA*A*EF 
ETS LGN I VKPHL YTHTHTHTHTHTHT 


13070 


zxyy / 1 


A 


13210 


615 


120 


S V FWP LLGGVS W S G YTE VRG PLEKAVCP 
LSELEYCAGRSAALFRAIRLFKDALSLL 
KLCPQPPFPLGAPLS P * EALVCWPLHV 
STCLS*WCPTPCFPERSWCVLFPSMYPR 
VCPDGLLPPVPLRGPGVCWPLHVSMCL 
LSLFNSLLRLRTCGVWFSVPVLVC 


13071 


26972 


A 


13211 


239 


429 


FILLRNKLQPGTVACGCNPSTLGSQGKW 
IA*AQGFETSLDNMVKPCFYLPQLKKKK 
KIKNLGV 


13072 


26973 


A 


13212 


954 


503 


VQLLPLCGRDLYLFAFNLPTYRCWQHFL 
SKPRAP ILAGGSDRWPCPCCPFWLP * WC 
HVSAHAPQPFPCAKPEAPLPVGCPCSPA 
P S LEAALLI LRAGLWLRAPFCFSAHLL 
SCHSQCCQFPCSEKRSCLGGRDRHAASN 
PLSAVWPPGE 


13073 


26974 


A 


13213 


1844 


1147 


LHSQIYSTAKKASLSMKGSRDKTRAASS 
RPVPSVLGVPPWSTLLQHPQNMWPGPAQ 
QQGQPSGRQAWCTPGEAPGAEAAPQ* QP 
HPEEDHSGGPQASAALALPPSPPSQRDV 
QGETGMQGRS APRS AS SS ACCACRWSRL 
PCPQLLQRHPGLRLVS PAHSRPPGPAPS 
S S SGSGLVPG YLPQKGLAGLSAGAVLCP 
PGLLRVAHGAGYGPSAQMLHARLSSSSL 
RGKRFLRF 


13074 


26975 


A 


13214 


413 


60 


LCSKSCHGSFAMDCETLLTFPRLFYPIL 
HM PTVPGQALP VHTP FL * LKLPLLPG I P 
SCHSPLFLD*AQESPPPGGLPGHPPGSG 
VLCLWFKSLKHPVLSGSSVSTPGSASPS 
RSRAL 


13075 


26976 


A 


13215 


67 


196 


GLSLSSPKVTHNATLMGCI FQN* KAFVY 
HSMKQKKI I LLFNMA* P * YPLDFGEQWP 
LHGSHAYST I L * LDLFCKKEEE * DE I PY 
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E=GIutamic Acid, F=Phenylalanine, 
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X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














V* CFMLLWKS TTM* KKI ILLFNMA 


13076 


26977 


A 


13216 


161 


2 


SVFFLF I DRRLYL I KSVWAAYNSSTLGG 
RGRRIT*AQEFKTSLGNKARPRVYQ 


13077 


26978 


A 


13217 


57 


184 


CLTVCKDKVSLCYPGWSPW*SYLTAAS 
NSWSKAILPPQPPE 


13078 


26979 


A 


13218 


292 


400 


LKVKQGRGHLDLNQGPLDL* SNSLPLSH 
TPSHRRTV 


13079 


26980 


A 


13219 


186 


394 


YLVLSFLFYFLFI FLSRVSLPI FFILE I 
*LGIVAYTCNPSTIjGSCGGRIA*AQEFD 
TSLGNIVRPPSLI 


13080 


26981 


A 


13220 


315 


448 


KTKDLGWDEVYNPSTFGG * GGRIT* GQ 
KFE TS LDNRAR S CL YK 


13081 


26982 


A 


13221 


3 


13 


LRPLRSFVSVERLWASCCLGSELWDKT 
KRKKRRELSEEHKQEIKDAFELFDTDKD 
E VI DYHE LKVAM I ALGFDVKKADVLNI L 
KDYDR*VIGKISFGHSIEIVTDFQL*R 


13082 


26983 


A 


13222 


1 


715 


VDMHSRGSPELTHASTHASGKMAAPWAS 
LRLVAPMWNGRIRG IHRLGAAVAPEGNQ 
KKKRT I LQFLTNYF YDVE ALRD YLLQRE 
MYKVHEKNR^YTWT .^KCHGPYGAGAFFI 
LKQGGAVKFRDKEWIRPDKYGHFSQEFW 
NFCEVPVEAVDAGDCDINYEGLDNLLRL 
KELQ SLSLQR CCHVDDWCLS RL YPLAD F 
VAG A FRWAGCPR I FRTGAKHG FHHLQ * D 
LSSGWATCPGTSPT 


13083 


26984 


A 


13223 


1019 


1238 


LTGAEYKKSDGLTESVTANLEPEAPKVF 
SRLDDEAPVTVLPARLPLPAP *HTARPD 
PPQRRRSMQLAPARPFL 


13084 


26985 


A 


13224 


1534 


1181 


RWNSEPDYSAGGPRAGRGARLLPVPKSP 
GGGGAVAPGPQTPLRAERTAVRLSISS + 
NSSTAPADPGAAP PLHPQPAEAGTWVSA 
FLRVIFCFPSLGFGCWEKKIAFVPFPPT 
DEKCL 


13085 


26986 


A 


13225 


328 


22 


CLESPRDRPFQGPAFPKHPPPPLAFLSQ 
PGKVQYFFESNCKS LSSQE IKNSRSVPT 
SPPAPSPPPFPFLYFSLTAEGGEV*KET 
LDSDRAPPSSREAP I CQV 


13086 


26987 


A 


13226 


180 


5 


NSIDYKINVKKSNRPGAVAHTCNPSTLG 
SQGRRIT*AKEFETSIiGNIVLLIYTIIY 
YI 


13087 


26988 


A 


13227 


237 


528 


VGLCQDPLLLTLTDSFCSVLWGGSHLAF 
HKNLYWHIDLVMYFSRIS FYFTLSAAI 
CLSLYLGSHILSPFL*LFYCAYTTLYLC 
LITTLFFLIPKV 


13088 


26989 


A 


13228 


140 


366 


NATCLWHPMSPYAPTY*C* LS PGLLRWV 
LAITPIVLMILISSYNHVSISYGGTFRS 
NFFLDQ I I LTFL I YELHI V 


13089 


26990 


A 


13229 


257 


25 


LPAHSLGRDLSAQPYSMPRPGGELRAEG 
QSSLCSVARSSL* LHVCVCVCVCVCART 
CVNVFACVKLSTKVCLHPLPE 


13090 


26991 


A 


13230 


561 


52 


WPSCSSSGSPMLLPPVPPGSLG* *SFSP 
LPAAAALPAPCAAP RPAPLR P CG PAPTP 
AAAPAPAP PAAAS LS AARAAAGS PPGSR 
PSARGARRRPSGP CAP PEPRLG PRAPS S 
APRTRTPS * GRAR S GG S AGNAP S ARRT P 
QGPPRAACSLARSWIWPAGRGGSCGRAL 
GA 


13091 


26992 


A 


13231 


335 


103 


CQK Y S TS Y CNKWLA TLV 1 1 CS R K I * DGL 
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KGQAW WLVP V I PMLWKAEAVGLPEARS L 
SSPCNMARAPALKKIFLIGQV 


13092 


26993 


A 


13232 


452 


712 


VGLEGED*PPFSPAQSQSLSCWPDCSCP 
FSWPPDQRPGERPQPPDPPESHGAEEEG 
LPERPSDA*ARPQPFSLHPGVFPPQNAS 
TQ 


13093 


. 26994 


A 


13233 


480 


2 


SCFSEDWNPLKFKLQLSTSLSYRKSHTL 
HCAHHFRGQCHSD*AFLVFSLiTCILQVG 
SLRFTAGDVS PS YPEKLRVL * EGRVAD F 
HVMARKGQQAALAI LLQNARTG YVWWLT 
PVIPALWKAKASGSVKPRSLRQTWQNGE 
TRLLDRGESASQSAGMTGM 


13094 


26995 


A 


13234 


692 


824 


QIMALPFPSPPVPPLVK*CKGGKIPPPP 
SKNPRTCVGNTVI I KA 


13095 


26996 


A 


13235 


657 


872 


PRGPRLDVLYACEPQL ITCLELGTPLKV 
RFWLEWDHTYNPSTLGDQGGRTA*GQE 
FKTSVGNSKTLSLKKK 


13096 


26997 


A 


13236 


168 


365 


QESLCILQTCTCAHQNLK*KFGGKSRLG 
AVAHACNPSTLGGQGGRI I *GQEFETSM 
VNMVKPYLY 


13097 


26998 


A 


13237 


31 


338 


WYIVYICVCVCVCIYIYEIYMIYI*YI* 
YIYIKHTVQ*H*HLAFSTFTLLCNCHHC 
PSLELFHYPTLTYHTH*IITPHCSSC*Y 
QPLFYFHLYVFDYSRYLV 


13098 


26999 


A 


13238 


714 


1001 


SSGVS CLNRAGW I TGVNKH I HS LVLLKR 
EK* TQNI ITEVCLMFLLLPHLAIEATVT 
WPGMVAHACKPSTLGSCGRRSV*AREFE 
FSLDNIARPCL 


13099 


27000 


A 


13239 


250 


406 


NLA*HGGVHMWS SASQVAGITGTRCHAQ 
LI F I FLVKMGFHHVGQDGLNLLTL 


13100 


27001 


A 


13240 


256 


193 


WINFISIjPLSSLT*TLSLCVCACVYVCV 
CVCLR PCVH I VGGNLKRYI VFARSVTLG 
CKFTHLHI SL I INNDT 


13101 


27002 


A 


13241 


1 


257 


GMDLWQMCHSLSLPLIFFFFFFSEKTRS 
NFVPQAG FQLRALRDPPA* ASREAG I TG 
LTHHVRPGQIFKKIGTLKPTILLPLLPR 
K 


13102 


27003 


A 


13242 


319 


406 


KRGWT*WLMPVIPTLWEAEAGGSPDVRS 
L 


13103 


27004 


A 


13243 


228 


56 


KGAP PVLKPGYP KAAKNPT PFKPPGSKV 
GENPPLF*NPCPGGGNPFPTFFFFFFFF 
F 


13104 


27005 


A 


13244 


1687 


612 


I LGNQC CXFDAYNALANE STM I S I KLNE 
*VREREREKEKEKERVRERKEEREKKKL 
TSSKGTGSTATFHV 


13105 


27006 


A 


13245 


220 


1 


TKDPSSTMPPQPNPLLSFKSQFIFLIFL 
FIKKYVGWMQWLLPVI PAL*EAEAGGSL 
QPRSSRPAWATWRNPIF 


13106 


27007 


A 


13246 


204 


375 


ASWPPGLQYELRI IKCLLHCWTRAMIFR 
ERERRERERERERERERERE * VHLKRKG 
A 


13107 


27008 


A 


13247 


158 


2 


IWVFKETFFFQNFFFFFGKENNFNGFFF 
FFFFFFFFF*DRVSLTLHPSLGYR 


13108 


27009 


A 


13248 


221 


3 


NPSQGFPLLKI FFFFLRWSLALSPRLE * 
NSI * KN * KI * KI S WAWWRTPVVPGAWEA 
EPGESLEPGRQRLQNE 


13109 


27010 


A 


13249 


81 


341 


GELNDTIHVKYLG*YIKVL*RNRTNRVC 
VCVOTCTCVCVYACTQI<DLF*GTG3Ci%C 
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GALVRPKSARQASRLKTQKTWIRSLKA 
VC 


13110 


27011 


A 


13250 


299 


1 


ATARLRFLHGAWWYASVVPPTQEAEARG 
FLDPRSSRME*AMILPRCTPASVTDKIP 
SLASQIHRPSRVACVILLSSHAPTEAPS 
RWPPPSPLCPAGSRA 


13111 


27012 


A 


13251 


106 


146 


LCVLKQ VKQATWTKYS SSQIFAI LDCMT 
TLSSHPALKYCNYVLI SNQS PKHLRL* C 
SGM I SAHCNLCLPG * SNSPASASRVAVC 
FKASQASYLD 


13112 


27013 


A 


13252 


217 


158 


LFEIFCIPFFFFFFNFFFETRSHSVSQA 
GVQWCKQGSLQP* IPINFFWGKEGVIDK 
LILISYGNAQGFKVAVTPVCTTALQFGR 
QSETLSQKRN 


13113 


27014 


A 


13253 


150 


56 


TI L I LLLLYETESQS VAQDGVQ W CDLGS 
IQS P PRS S CHS P A* AG VNRLRQENGMS F 
EAEIVLSPDRTTALHPGLQIETLSHII I 
IILILSTISFHQLLH 


13114 


27015 


A 


13254 


174 


145 


PCLKMTTYDFKSALEIAFFCFTAILRYK 
LLIKVTHFKWLRVAAHDCNPNTLGSGGG 
RI TWGHE FKTSLANMA* PTTRS YSQRVG 
IMGITIPDEVWSRTQPSHIRSGPSKVYP 


13115 


27016 


A 


13255 


179 


273 


GQWLTPVI SAL*EVEVGGLLETRSLKQA 
WAT 


13116 


27017 


A 


13256 


264 


382 


GLKIGKGCS WWFTP VI PTL+ E VKARGLL 
EPRSLKSAWAK 


13117 


27018 


A 


13257 


1199 


1563 


YL WG P AVP PLDLE PG QAGATCG P VSKY 
KI FAKLWAVGS I LDLMPGPE * G S FLS VF 
TSLCFIYGVTYPFHSEQPHPASPSASGL 
LLLKGRAAWLVSTLCIGSPAPMLTGIFP 
HKVYNQLFI 


13118 


27019 


A 


13258 


194 


242 . 


ILYVETGSH YVAQAGLE LLGSGNPPAS T 
SE I AG I EA* ADE F I YVGVSKQDE C I KS M 
KQVAHASIPAISEVEAGGLPEPRSSRPA 


13119 


27020 


A 


13259 


322 


120 


GPTP VLDAFQKLCLS FTSGQQADG 1 1 PA 
FPIRKQGGTQWPKPVI PTLWEAKAGRFL 
* PRSFRPAWAT 


13120 


27021 


A 


13260 


237 


1 


TSKKISLGWWGKQGIPGTWEG*AKRPFE 
PGKPRVQWTQVPALDFSLGGKARLCLKK 
KKKKQKQKTLKLCTHSR I TYSRA 


13121 


27022 


A 


13261 


52 


318 


SAVGIHRCDDGSH* P*TPEHKQLSFLSL 
PSSWDYRGITRELFQRFPWIFLQLITAV 
I S SES TVL KNLELAAVRGSHVRV I MMAV 
PINPF 


13122 


27023 


A 


13262 


274 


1 


KKKKNSPWWWTPVIPGSPG*AGELLEP 
WRQKVQLAQWPLAKWPQTKWQGCPL 
ALQPGBHQGIFVSQKKKNPKTKRKYWAL 
FCSLPSC 


13123 


27024 


A 


13263 


717 


877 ^ 


NSTKEMAHWPGWAHTCNPSTFGG*GGW 
IT*AHEFETSLPNMVKPHLYKKYKK 


13124 


27025 


A 


13264 


288 


131 


SLYIWHSKRLITITNNKISGVWWCLPW 
SSTWEVEAGGSLEPRR + RPAWATK 


13125 


27026 


A 


13265 


80 


287 


FMNGEAS * KTS ILSQAWWCAPI VPAAQ* 
NEAGELLEPRSEWLVWATRALR I SSRGL 
RFRFRLRRFTSTR 


13126 


27027 [ 


A 

i 


13266 


3 


241 


VGLFLFFFETESCSVTQAGVQWCGLS*L 
* PPPPGIRDSPASASQVAGTTGTHHHTW 
LI FLYF* * S SGFHYVGLGRSSNS 



1342 



WO 01/64835 



PCT/US01/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
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13127 


27028 


A 


13267 


103 


359 


ICVYVYVCIYVYEMRATSKDFLNQVWFS 
FLRTVAVLFEEYLIVL*MWLSFHIHIYI 
HTHIHIFIYVYTHVLIYVYIHTYSYMCI 
HIYI 


13128 


27029 


A 


13268 


172 


12 


SARSSQAIQIQKSVTSGQVKPQW*QS*P 
TPFLPTLIALTTLLLPISPFILIIL 


13129 


27030 


A 


13269 


2 


203 


R I EN I RS Y KNLAAL FMTAKKMGT I QM PH 
NFEWINKMW YIHTLEH* SDKKKKKKKVW 
RAEIPKLIIQK 


13130 


27031 


A 


13270 


236 


365 


KIYRQGAVAHTCNPNTLGGQDGWIT*GQ 
KFETRLANMVKPHLY 


13131 


27032 


A 


13271 


371 


15 


QNQINFALILIINTLLALLLIIITF*LP 
QLNGYIEKSTPYECGFDPISPARVPFSI 
' KFFLVAI TFLLFDLE I ALLLPLP * ALQT 
TNLPLIVMSSLLLI I ILALSLAYE*LQK 
GLD*AE 


13132 


27033 


A 


13272 


2 


16 


PRVRTSSRSRAALELI FFFFLGPPNLPV 
YNGPLGRTKPGTGELDTGGS P I LCVGQG 
RHP Y WKGGAKPLAP FGKGGGP * GTRPLA 
GPIVPPKAGLQSEMPGMAPFCGSFG*PT 
RPD 


13133 


27034 


A 


13273 


329 


290 


SSALVPRLECKGI FSAHCNLCLMGSSNS 
PTSASRVAGITGVAGQATDKTPQTLS*R 
SQPSKMDPVRWSP 


13134 


27035 


A 


13274 


175 


13 


APCDKRPCPPENNPL*L*FSITFPNPIK 
RPHP YLPLLTLFSDSAHLHPGE IEQR 


13135 


27036 


A 


13275 


355 


861 


PLTTTPAAPRAPCPPSRLSGQPLTGPTE 
GSRSRLSPNISEQGEPPLALTVGHPLST 
QPGPTVPSELEPIQGPRG*GDCPTPSQS 
A*GGVLSCTPESHTEFKPPPTGGGRRWA 
RLGLNGAT* GREEPLQTRLPAEYPGPGP 
IDPLQPPP I STASMATAFSDFLLLGRDP 
A 


13136 


27037 


A 


13276 


865 


667 


KFLCILLDFLFFIF*EMRSQHRLECSCA 
1 1 AHCSLKLLASTDPPTSAS * IAGFTGM 
CHCAQLFLTF 


13137 


27038 


A 


13277 


103 


375 


WSRRLPWRRGLGYIELFQGLEIRHHFLF 
GPHYLRRTQCQGPVI PS3LDGQGWDYMS 
PGV*DQPGQHDETPSLQKI*KTSSAWWH 
APWPAT 


13138 


27039 


A 


13278 


177 


3 


QQTEGSRAHSNSHRRPGAWHASNPNTL 
GGRGG*NI * SQKFKTSLVNMVKPCLYGR 
V 


13139 


27040 


A 


13279 


379 


372 


SR*WVCMVAHACDPSILGGQGGRIT*AQ 
E FETSLRNMVRPCLCLGNTN I Y I Y I HTH 
TYIFIQN 


13140 


27041 


A 


13280 


83 


1184 


PPAHAARASPPSYTWLCYEVKIKRGRSN 
LLWDTGVFRGPVLPKRQSNHRQEVS S WE 
CRKHISKMSGGGLSTVYFFHRRFQITWF 
VSWNPCLPCWKVTKFLAEHPNVTLTIS 
AARLYYYRDRDWRWVLLRLHKAGARVKI 
MDYEGERCRGQGSMTGRNSLRDGWI CNA 
INYASLHRTLKEILR*GSPSGLIVSLLS 
PPAHPPEDSPGNESWLCFTMEVTKHHSA 
VFRKRGVFRNQVAPKS YLHPK* ELS S WE 
RRKHNT*HTNYEVTWYTSWSPCPECAGE 
VAE FLARHSNVNLTI FTARLCYFWDTDY 
QEGLCSLSQEGASVKIMGYKDFVSCWKN 
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FVYSDDBPFKPWKGLQTNFRLLKRRLRE 
ILQ 


13141 


27042 


A 


13282 


1558 


1824 


S FP YLAS FP FCLGL PLETLFHPLGL * KD 
*SDTHTHTHTHTHTHSHTHAHFPSFPDP 
LFQSSPFSSGFIDEYKYPHLWPVMSVTC 
CRFCV 


13142 


27043 


A 


13283 


1 


519 


PHFFLPQGFWGFFSPFPL*KSSSPLKAL 
IFLGGFSPFFPPPKKRFFSKIPRGVFFP 
PPKGKKFFFPPPG* IWPPQGFFLKGPPS 
SSSSSSSSSSSSSSSSSSSSSS 


13143 


27044 


A 


13284 


349 


159 


CPLEPKKNYPFCFFF*ETGSCYVA*AGL 
VLVASSNPPASASQIAGIQGTSRHAQPC 
RIFLSKT 


13144 


27045 


A 


13285 


748 


869 


WLG WAHACNP S S LGGRGGQ I V * PQEFE 
TSRGNMARPQSL 


13145 


27046 


A 


13286' 


315 


410 


VQW FTPVNS ALWESEAGGLLE IRS* RTT 
WAT 


13146 


27047 


A 


13287 


355 


499 


ARCGGMHLYSQI FGRLRWEHCIiR*KVLS 
QVWWHAP VFAD I WEVEVGAL LEARSLRL 
* CAMLLPVNSHGPPTWAMQ 


13147 


27048 


A 


13288 


880 


701 


KS YFSSHLHFEGKKCVSS I LD * S ILVLM 
CS WLMNYTHTHTY IYIYTRTH I CVHVYV 
HNF 


13148 


27049 


A 


13289 


379 


434 


KRGPPPPKKRGCFFSEKFF*GPPKTPFF 
FFFPKKKKKKKKKKKKKKKKKKGGRSRS 
RLADAWADAW 


13149 


27050 


A 


13290 


253 


344 


GLFYNLQKIKIFYVKNLFFFFSNSITEA 
GVQWPNLGSLQPPPSGSNDSPASE*RIV 
V 


13150 


27051 


A 


13291 


561 


745 


AWE PSLVGETNVNS FNQKY INWPGAVAH 
T YNAGTLGGQGG W I T* GQE FETTLANMV 
KPSPY 


13151 


27052 


A 


13292 


289 


2 


TLPQGEDFNKF VFGS I KQKLQ INL>*NTD 
FIGNVFQSWAQQYTPWPPSWVAEVEGS 
LAARS SRPLCTI I TPLNSHCSLAWATQQ 
DPAGRVGRPRV 


13152 


27053 


A 


13293 


281 


229 


LKNKWVNKEKLRQGWAHACNPS ILGGR 
GGWIT*GEM 


13153 


27054 


A 


13294 


276 


441 


GLFPKFLIQKINQNWPDAWHAYNPSTL 
GVQGRW I T * GQ * FKTS LANRGRKS ENK 


13154 


27055 


A 


13295 


258 


389 


NMVEKRLGQAHAYNLSTLRGQGERIT*A 
WEFETSLGNWRPCI 


13155 


27056 


A 


13296 


358 


353 


SFLAFFNGAFLPLRGFFWKTFFFGGVCW 
RRPPL*RKKKKKKKKKKKKKKKKKKKKE 
MFKRNIGGEERGGGG 


13156 


27057 


A 


13297 


196 


1 


LMGKAPFLGGFIPPAFFFFLKKKKGGPG 
AVAYTCNLSTLGGRGRWIT*GQEFETSL 
ANIAKPCSC 


13157 


2705S 


A 


13298 


182 


436 


G VTI LNVRHRHRKKVTL YRD L KKVRKC P 
KLHGNLRKVFQAEG IAS S KTLKGHVWWL 
MPVI PAL* EAKMEGLLEARS LRS AWATQ 


13158 


27059 


A 


13299 


334 


440 


RHYWLNF * AFYHS S LAPAPQGGGHWPPP 
GITPLNSLHVPLLNTSA*LASGV*LT*A 
HHRLVPNNRALIIQALRITRILGLYSTL 
L PA* KNFEAP FT I S DG VYGS AF F * ATGC 
HGLNVIIGSTFLTICFIRQLIFHCTSKQ 
*LWLEPPAWDW 


13159 


27060 


A 


13300 | 359 


372 


KKKKICGGKKNPPNKKKVKPRGEKTPLK 
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G=Glycine, H=Histidine, I=Iso leucine, 
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GV* PFFCLEGFFSPHKFFFFFIFSPNI C 
SPKKSFFLKFPPALFFFNLPFLKNIPSP 
PFLKKAS F* E I FFCGP P P FFF FFQNTLF 
FFPPFFFFFFFFFFFFFFLDKFAVY 


13160 


27061 


A 


13301 


233 


406 


VTVEGMEEVLAGSSEGMGSRPTS*V*FL 
KYLLGAVAHAHKPSTLGGRGGRIT*AQE 
LKTSLCNKVRPRLYKINK 


13161 


27062 


A 


13302 


92 


429 


GSHFNSELI EMKF YI IKRNI FI ILNCFY 
LIKHQHFHYTKFLLVSVINKF*liATVAH 
AYNLNTLGGQGGRSA*THEFKTSLGNTV 
RS CL Y KNLKM S WVW WCTAI W EAEVRGL P 


13162 


27063 


A 


13303 


302 


140 


GGAVUS VQS QAWWHMP WPVPWEAETGL 
LKPRSLSL*CPMIVHVNSHCTPAWAT 


13163 


27064 


A 


13304 


115 


312 


LRSPSFVLSPRLKCGGTIMVHCSFDLLG 
*RNPPALPSKWETIGLHHHALLRLLIF 
FF*KWWLFF 


13164 


27065 


A 


13305 


90 


246 


PG WAHACNP S TLGGRGGR IT* GQE F KS 
SLCKIAPPPERKEGRKEGGKKKERG 


13165 


27066 


A 


13306 


19 


310 


DSS FSPI MCFGMS VNKTGLWLGKKKKKK 
KKKKGGPP * KKPLGGPNLPGGGKKKFFP 
*RGAKKNPPGDFWKKTLFLGGEKLGPPP 
PQK*NPFFGGGKIGPTFPPKIKPPGEKK 
KF 


13166 


27067 


A 


13307 


179 


56 


W VLV YKKQS I LG * AQWVMS V I S ALWE AE 
AGGS LVSTSLRLA 


13167 


27068 


A 


13308 


425 


290 


KKN I YliAPPGYFWP PQRFFLS P PPPPNV 
VTFFFFLFFFFFLFFFFFFLIQYAEGIG 
VMRGRGEGKMGR* W * KFFGRRVLNIRS I 
LLANFKHWPYW 


13168 


27069 


A 


13309 


214 


371 


IYF* * RW I SQAWWLMPV1 PKLWEAEAGG 
*LEPMSLPGQYNETSPL 


13169 


27070 


A 


13310 


975 


572 


AWGM WGVGKHS LVS LG I EE CQASTALS 
LDKSGWWEEAARREDVLCEDRRCLLCHV 
PAGVRGSLKPELGSRKGKNGQQSGSKPS 
VPSLGPRPQGPGNPALSKGTRPN*AICF 
LCQT PADHS AKKQ APHTL I P I R 


13170 


27071 


A 


13311 


270 


10 


TPNKSL I * RLQFSN I KLANLGLGTMAHS 
CNLNTLGGQGARITSGEEFKTGLGNIDP 
I P I E SKVI KTLAGMVSPDAWVDAI KRTG 
TSG 


13171 


27072 


A 


13312 


103 


290 


LTEWSGVI L * NC I YLLPQ YTSTRVHAY 
IHTYIHTHTHTSGLSSTSVGSTNHKSKI 
FGGKKG 


13172 


27073 


A 


13313 


136 


I 


SNTSSSLVTQAGGQWCELGSL* PLPPKF 
XRFSCLSLPSSWDYRRL 


13173 


27074 


A 


13314 


88 


183 


RVSTLLKKSCFFFFFFFFFFFFFFFFFF 
FFFFS SQKKKNLKNKKRE * KKS CFFFFF 
FFFFFFFFFFFFFFFFF 


13174 


27075 


A 


13315 


481 


517 


LLSP * RPKGKPKGKKRKPNLKEKKIGEA 
RGGKERQRKTP QRTGKQKKRRKYRRIGR 
RNYR 


13175 


27076 


A 


13316 


307 


196 


GMHLYLPRTW*WVRVIPGSREIEVHAPIi 
EPGSQRLLVAEMTPMHSSLDNMNKSPFP 
QPPKTEKQTKNPPKPHI I A 


13176 


27077 


A 


13317 


2 


209 


PARALDLKGS P YME S CS DAQAGVQDS I Y 
GDHL * LRAPAV* *GQTVFVASPSKVGGI 
TGASHHPGLFFLF 


13177 


27078 


A 


13318 




178 


SFCFSV1.RE.TIGOSLIMKTLP* ITHTHT 
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ing to first 
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peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutaraine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














HTHTHTHTHTQRE IBRERDRERGESQVL 
GC 


13178 


27079 


A 


13319 


40 


382 


DSVPSLLLKLLMETGPHHVVQASTTTPS 
*FFILEGFHHVAHAGLKLPSSSRPPHLA 
SQSAGI YGCEPLRSARLI INYIGSRLWG 
LRNICLSQDHEECLLCFILQAISMYRFC 
LG 


13179 


27080 


A 


13320 


107 


1 


P PL I MANSGWAQWLT P WPTLWEAKAGG 
*SEVRSS 


13180 


27081 


A 


13321 


111 


382 


ASSQLLELAIQVFPGIWIPPTTKQKGVP 
SGKTLVTS KI I AGLKMPKDSRSRPRTVA 
HTYNS S TLRGQDRR I T * AQE FKTS LG KI 
GRPCLF 


13181 


27082 


A 


13322 


49 


108 


CGLY* FFF*ATRSWHVAQAGLEFLGLSS 
FPAS AS S VAGTTGMCHHTQLHY * F* V 


13182 


27083 


A 


13323 


1 


257 • 


ARGERERERERERERERERERERERERE 
RERERERERERERERERERERERERERE 
RERARERE*ECV+EGETVSDRERDFERA 
PPRVLGAVAIYIEWCLASLIQ 


13183 


27084 


A 


13324 


1 


248 


EFQYILCSALIFGRSKILK*ATKEVKKS 
KH I P R I S CQDTRK YNWS AKAKRRN TTGT 
GRMRHLKIVYRRFRHGLHEENLFLKH 


13184 


27085 


A 


13326 


375 


1 


PPKRRGKTHCSPPKVFPPPQNN* TTFPP 
QLI I CYL*RGGVFFSLPLI I *APPAFFF 
FFQFFFFFFFFFFFFFFLQGQYWQFRDT 
DAEGHLASSRPSGAVFPSQGMHTTGSTR 
RWGQCVPFPPTRP 


13185 


27086 


A 


13327 


382 


15 


NWPKNCPLHFQNWFLRKTFQIFFCLKNF 
FFFFPKKVFYPPKKPFLEKPWPPVLN* K 
KPPPPFFFGWGPGGPQYCPFKKAPPLFI 
RGKERFPLWGKGDFFQIPCPPGPLKKKK 
RAAARDLGTS 


13186 


27087 


A ' 


13328 


2 


220 


GRVGSSRARAVALFFFFFFFFCFFFFFF 
QNPPLKRGEKKKNTPPLKKKRPLRGGFK 
KQKEC*EKKKKLSGPK 


13187 


27088 


A 


13329 


427 


289 


IQLWSSL* K*LKTHTHTHTHTHTNTCFL 
TVMKELSTHPGNK 


13188 


27089 


A 


13330 


203 


269 


INFGPPREPGGFNQKPQFPSAPGFEPWN 
PPQGPKP * KKKKRKKPFKVWAPGGGSKV 
QNPGLRETGVFG 


13189 


27090 


A 


13331 


216 


418 


DVPVH YCRLYKPCTLRLHS I YKKQ I LLW 
PGAVAHACNPTTLGGRGLWIT*GQ3FEA 
SLANWKPHLY 


13190 


27091 


A 


13332 


160 


325 


W SRQL FK I AQSGLAR WL TP V I PALWE AE 
EGGLFEYTSLR*LWATQQDPISTKMFK 


13191 


27092 


A 


13333 1 


339 


62 


GMLPLFVPPQKRGS PP YPCYGVYNSPPL 
KKQRFFSSLGIVLPPIVFITPPPPAFFF 
FFFCFFFFFFFFFFFFFLI*RRLMRRRM 
FSCFLHCP 


13192 


27093 


A 


13334 


45 


384 


DPSVRINTLLALLLIRITF*LPQLNGYI 
EKSTP YECGFDP I S PAWPFS I KFFLGA 
ITFLLFDLEIALLLPLP*ALQTTNLPLI 
GMASLLLIIILALSLAYE*LQKGLH*AE 


13193 


27094 


A 


13335 


270 


660 


AGSRRPLRVPGFSLSMMSPSLCRPVCVA 
QC I RVS FLLTVGAVLRPGFQCLDWP CLV 
YARLSGWTGFPFCRYGKGCCRYDEGCCR 
FGEGCCRCDDRCCRCGEGCCRCDDGCCR 
YDEG * CRCDDGCCH YGE 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 
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od 


SEQJOD 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIa!anine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, LHLeucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


13194 


27095 


A 


13336 


298 


349 


KTTPFYGLRGSKIGTMCSGVI I AH* SL 
KLLGS SDPLS LVAQAARTTGACHHDQLT 
NAFFFFPLERVPTPLKNNPFLWVKGVKN 
WBLD 


13195 


27096 


A 


13337 


865 


518 


KSATMRMVEDTHKMPAWGGYSGSLQQER 
GELSSLPCP*PESPPPIjLIFLKLPSNPL 
PCSPLPCTQGPGQPAPYVSIHTSLKFPN 
AKLSVCLSRAIEFS 


13196 


27097 


A 


13338 


154 


354 


FNLQVKPDGYFVLFICHILPTDSKSGKA 
PGAPCHSRNFKIRKTLGWVQWLTPVILA 
LWET* FSHQENSGLGAVAHTCNPSTLGN 
QGGWIA 


13197 


27098 


A 


13339 


262 


11 


QFHFSRKLTP * KM* LSAHSSNTHEAYVA 
FPKAHKPGLNS KTAF * I/3TE AHACNPN I 
LGGQGGQ I TRGQA FKTSLAN MA KPR L Y 


13198 


27099 


A 


13340 


358 


164 


TKKGKPRFFLKNSKISRGWWWAPVISGP 
WEGEGGAFV* TGRPKI WLTEVG PLP FNL 
GKKKENPF 


13199 


27100 


A 


13341 


147 


2 


FRPQKVS VRVG VLAHACN PT I LGAQGGH 
II*GQEF*ASMANMVKPSSC 


13200 


27101 


A 


13342 


367 


17 


TSRS WNL FHVLVR FPTADKD I RE I GS FT 
KKRSLIDLTIPYAWASFTLMMEGKKEQV 
TSSINGSRQKVRSQSGELLSLTPSDFSW 
AHWLMPVFPALW* SETGSLFEVRPSRPA 
WPTW 


13201 


27102 


A 


13343 


166 


492 


EGTQETLCGCI ICLVRGDALNLFHLKCS 
WVGWRGAICCMBLRETAKQKLSVWKLYP 
FEIVFSFSNNNPRPGAVAHI CNPNTLGG 
*GGRIARAQEFETNLCNIVRPHLFR 


13202 


27103 


A 


13344 


804 


1124 


TFFFCFFL*DRWDVAPGWBSSDMIMGP 
LQALTSWGSTDPPTLGLQSGLGDLQDTW 
PPYPTSFYYFFLQGRGLTMLPRLVLNSC 
TQAILPPQPGQRSKTPISLKTN 


13203 


27104 


A 


13345 


280 


448 


GGGGKKKKTPKPPLEKNNFSPPP* FFPP 
KKQIKPPPPF*GGGGKKKKTPKPPLEKN 
NFSPPPFKPRKGIFFLFPPLSWVKKKGD 
PPGGTRPLAPL 


13204 


27105 


A 


13346 


232 


400 


AELLRYCSCGHFL* SMARYKTKPGVRWL 
TPV I P VFWEAEAR * LHEPRS S R PAS ATQ 


13205 


27106 


A 


13347 


138 


332 


RAWKPS FE I F Y FFF * RAHT F * * S C VCL 
CVC VCVCV CV* THTHTHTHTH KHTHDY 


13206 


27107 


A 


13348 


314 


395 


WLGIVAHAYNPSTLGT*GGWIT*GPEV | 


13207 


27108 


A 


13349 


388 


371 


IMNGWQDKELVSRVIQTGIKK*KPRNRP 
EFRAPP* WHERNGRKERERMEGRKERER 
KKGKGKERKRKKEACI YKVKI E I SNNLA 
NLIINVER*DRRRPDIL3WVSLSSRVSS 
VFIALVTCLTSPQFFLSLHLNQILFPLG 
KNL 


13208 


27109 


A 


13350 


412 


40 


LVFRFWMCLFSRELFGCFSHLLTKSMSQ 
M*WAFGDIMYTFDLLFIKKTKNNCKLW 
QGCKEEGSLIHCWRECXLVQPL * RP I *R 
SLKKLQMGLPYDPAISLLETYPKERKSV 
Y * RN I CTSGR VG 


13209 


27110 


A 


1335! 


372 


148 


FFFFFFFFFFFFFFFFFFATGSCRVA*P 
GVKWLISGTVPLLISTGVLTCSISDLGQ 
PVHTSLGNLWPYSQELPY 


13210 


27111 


A 


13352 


209 


206 


KKKKKNFFSFHGKNPGNGGPFGGPPPPP 
F*TFFKKGGGr»PKGPPPKGFFWNPPQNG 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N— Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














GGGP* PGGEKKTPKPSIiGVLFFFFFKFF 
KKR PTAF FFLKKKX FF 


13211 


27112 


A 


13353 


43 


334 


KKKKKKKKKKKKKKKKKRGGG P F I K I S G 
GAQFFRGEKKFFFFFLGGGKKTPRGFFE 
KKPFFWGGNFWPPPPQKNFPFGEKKKFL 
GGGGGKNRVFFWGEKIFSLGFFLKKFF* 
KKPEGKNFFPPKKNPVFSPPPP+KFFFF 
PKGE I FLGGGGPKI P PPKKRFFFKKPPG 
GFFS PP * KKKKKFFLPPEKLGPPRNF YK 
RPPPPFFFFFFFFFFFFFFFFF 


13212 


27113 


A 


13354 


264 


496 


KPTI LRVTHFLEGCEKYEFLLPVAFSNF 
SEPLKRYYYWLGAEAHVCNPSTLEGRGG 
QIT*GQEFETSLVNMVKTCLY 


132B 


27114 


A 


13355 


183 


9 


I RS PKI PE IRVFFFSRWS YALVAQVGVQ 
WGNLS*f4HPPRARFQPLSSLSVPST*KY 
R 


13214 


27115 . 


A 


13356 


208 


378 


EICLEFKIIDEMDKCGKLWLGTGAHT*N 
PITLrGGQGGQTT * GRAFETS LDIMEKRY 
L 


13215 


27116 


A 


13357 


256 


341 


KGQTQWLTPVIPTLWEA*AGGLLESRSS 


13216 


27117 


A 


13358 


78 


344 


I VGLFES ESKKGQH I VSGW I YLRLLI YR 
FLFPLFSCLCKFFFFFFSPETEFCFVPQ 
AGVQ * HNLG* LKP P P PQLKQFS CLTLP S 
SWNYR 


13217 


27118 


A 


13359 


259 


291 


AQ*LMPWPALYEAEAGGSSLEPRSLLS 
VWAT 


13218 


27119 


A 


13360 


448 


121 


RLFDLGNKKTSSFVQNSENAKYEDSLQF 
YCRIYIYVYIYVYICIHICVYM*TYMCM 
YICI YMYTYTHVYTHVYMCI HTYMCI * A 
YICVYIYIF*KACLPLAISTLSSGH 


13219 


27120 


A 


1336! 


125 


262 


FS FQASVEFTSKTVWSWVQWLTP IISTT 
* EAQAGGSLEARNSRPD 


13220 


27121 


A 


13362 


362 


46 


ARAKGPKKIGFSGKMGPP*GAPPPKMGK 
KI*ITPPPKFLFFFLGKTKIKNPPWGFW 
PFGFPKKKRGGGARR*KTPSPFREKPPP 
QKLKRFKTPFPPLFFFKNPRP 


13221 


27122 


A 


13363 


378 


48 


FKKAAREKVSTRKFRGFVCLFWDFFWRQ 
SHSVTQAGVQ* CDLNLHPPGS SDS PAAA 
SQVAWTTGTHHHTQLI FIFI FCRNKIS P 
SLLKKYKKNLPGWAGACNPSHLGG 


13222 


27123 


A 


13364 


174 


396 


SLIFILEKRECYYPRSLFTVANFTVAKR 
*QTLNCPSTDKWINKMCHIHTVEYYSDI 
KRNE I PMRATCRQTLKA 


13223 


27124 


A 


13365 


76 


280 


PLLPFKAWQWVGCHYW*KLD* INCYFR 
ALLAFSEYMYIYIYIRTHTHT* IYIYIY 
IYKLTYSHLKVG 


13224 


27125 


A 


13366 


133 


324 


YLiL* ILVYCSHLIFSLLCLEGIIbWVFI 
IATL ITLNTHSLLINI VP I AI LDLAAGQ 
AAVGLALLVSKKKKKGPPLKKP PLGAQ I 
S P A1NTQR KK FP P KRK P I KTRRGTF 


13225 


27126 


A 


13367 


488 


190 


FSYAACFLPQCVFVCTFIYLHADGFLLL 
I IFFENSVFILCHSCWVFCFLLQWFLLM 
NLFSLFLLMLMHPVS * MECS I YS FS SL I 
I FHFS LMLLFFC FM I 


13226 


27127 


A 


13368 


229 


372 


YYATKDFSRPRAVAHTCNPSTLGGRGRQ 
IT*GRQFETSLANWXPQLY 


13227 


27128 


A 


13369 


122 


2 


KEECVGSGTWWVMPVI PAL*EAEGRGSL 
EPRSLRPAWTKK 
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peptide 
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Predict- 
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correspon 
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LJ U t< !■ 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=PhenylaIanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Pro)ine, 
Q^Glutamine, R = Arginine, S^Serine, 
T=Threonine, V=Valine, > 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 

nnclpnfifip insertion 

llUVItUliUt I II Owl Ill/It 


13228 


27129 


A 


13370 


43 


412 


RPTRPKRQNYGDSENISGCQGLGKKGMN 
GKSTEDFEGSETTLYDAITVGTCHNKFI 
RSH I VYNTKSGP * HKLWTWGGHDVS V * V 
PQLRKTYHPGERCL* *GRLSMYRGREYI 
GRLC I FCSLLL 


13229 


27130 


A 


13371 


136 


1 


KS PT WPGAMAHTCNLSTLEGQGG WI T * G 
QGFETSLANMVKPSDAW I 


13230 


27131 


A 


13372 


228 


385 


GLL IHKLEFKHFWLGTVAHTCN PSTIiGG 
QGRH IT* GQGFETS LANMAE P CL Y 


13231 


27132 


A 


13373 


277 


429 


LETTTKAGLPTI I FTSSGQMS I WFFQNK 
P WDS KKLFNLLS I S S P* * S S KNY * P WQS 
TVAHTCNPSTLGGQGRWIT *GQE FEISL 
ANMVKPARVGRHVIRGLQVS 


13232 


27133 


A 


13374 


194 


3 


NQEN Y I Y I YTHTHTYTHTYI Y I L* LLLQ 
WVYLAYFLCLS YPS LFAQGFTHTT YTHT 
HMHTLIL 


13233 


27134 


A 


13375 


377 


228 


DRVLLLLPRLECRG I IMAHCRLPRLVSN 
S*APALSFQSAESTGVPNVPS 


13234 


27135 


A 


13376 


255 


465 


NAWKCPFI YKITFVI FLIHVSCKKFRNY 
RQREWKLPTVRPLS PS *AI I FPVTCTYT 
SRWPEATKDPQKK 


mis 


z. / lju 


A 


13377 


H6 


JO 


VWWCT P WPATQEAE VGG S LE S GRLRDQ 
*AVITLVNEHRESALASRRGPEETSSVK 
PPLPTILAHTCFSLPRTGQDITSRFLAQ 
RNTEENLELQMEARA 


13236 


27137 


A 


13378 


376 


293 


FFFFFFFFYYFFFFK*KFLTKKKIL>SSQ 
YI 


13237 


27138 


A 


13379 


215 


16 


HLT WS FTTATEGS KTVTQHS V YRKAKLG 
LGAVTHTCNPSTLGGLGRW I T* GQEYKG 
IPPHGLEDVQ 


13238 


27139 


A 


13380 


184 


64 


VDESLEGWMMDEWMSGWRGGCINRYMHA 
WMDG*GDGWIGG 


13239 


27140 


A 


13381 


233 


417 


LKPITKGRKPRGFFLPFKPKQKKYFWGF 
KKKKKKKKKKKKKKKKKKASRAPF* KKG 
PQKTP 


13240 


27141 


A 


13382 


34 


360 


RWNTTNADHDLKDNI LSPPQINLYIRQN 
SSRLFCRH*QMDSKSPVKIPAGFVLYI * 
I YRYTHTHTHTHTHTDSELYMETHTRMA 
NTIVRKNNSLEHSYYLMLRMTIKLP 


13241 


27142 


A 


13383 


214 


71 


QTLNTDSGPGWIHACNPSNLGGHGGKI 
I *GQG FETSLANMVKRCLY 


13242 


27143 


A 


13384 


352 


312 


DKQLTLHRTDSLY I F YTY * PPQSCE PIS 
YNNFP CLS Q YLYLC P S LHTHTHTHTHTH 
THTHTHTHTHTHVSVGRRSLFFSGPHTB 
APRSRVSVYT 


13243 


27144 


A : 


13385 


393 


271 


IEGQIQYTSTIGNKFKDFYLILCKEGIM 
SREI S PPS SCHLRQQ *RVRLRERDRERK 
RQRERQRERE *GRSVLHPHVT 


13244 


27145 


A 


13386 


180 


440 


PVEERTLCEDILCFPFSVLLCIQFHLLI 
QHACFKYPNPNSRFGSWPGAVAHI CNPS 
TLGGGGRWIT*NQEFEARLSNMVKPRLY 
KNI 


13245 


27146 


A 


13387 


190 


47 


EGEKGVPSTILKMETLL3TVAYPCHPST 
LGGQGGR I AEAQEF * DHLE 


13246 


27147 


A | 


13388 


188 


470 


ARPPCKGRDSSAEGPPGPPFPSWSSLGC 
WTREP PGRGEP I QVAVRREESAQDWARP 
ELI IKEWWPGLVAHTCNPSTLGGRGGW I 
A* A* 5 FENS Q * 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
IJSSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cy stein e, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possib!e 
nucleotide deletion, \=possible 
nucleotide insertion 


13247 


27148 


A 


13389 


454 


367 


NKKKPPPRPP*PPPPPLGKTPFFKKKNP 
NFP RGGGNS PGS P S FGGGKG KD P P PGG F 
FPPPPYPRGEGGNPFFKKKKKKKRASLD 
P * WS SQPC PAYE KELNFAP VLATVTLPP 
GHSS , 


13248 


27149 


A 


13390 


34 


354 


HTLLALLLIIITF*LPQLNGYIEKSTPY 
ECGFDPI S PARVP FS I KFFLVAI TFLLF 
ELEI ALLLPLP * ALQTTNLPLI VMSSLL 
LI 1 1 LALS LAYE * LQKGLD * JE Y 


13249 


27150 


A 


13391 


1089 


792 


THGFGPQVGSPLDLKQPQKAMPRGPSLG 
NPPPGCQQHSHGAGGP*VPPGPRQPVMA 
SRNLCLPLALPRTHPPSPGQTRDKPRTQ 
DRSCWRDPRNGPGV 


13250 


27151 


A 


13392 


299 


428 


W I R CRGRGRS CL P VPH SAP ALLS P LWD 
GTRRREAGGGTCQGGSGCMGAHWGLGGG 
EGSGCRSQALPREAAEARREFKHGGIPR 
H* LTESLVSWEFRHA 


13251 


27152 


A 


13393 


382 


413 


HGKTHLYKKFKNKKFKATMPA+ WLTPVI 
PTLQEAEAEASLKPKSSRPAWVTWQDPS 
LQKI 


13252 


27153 


A 


13394 


64 


401 


GGIPKEDSQAGAFTGIGERQQGLVIPLL 
KNGQLSTDACVPPWGRVGVERAGPPNQP 
AGEGGGQGQECS LGLLPEANRCH P CNPS 
TLGGRSGQIA*GQEFEISLGNTVRSCPY 


13253 


27154 


A 


13395 


146 


18 


FSPGWAHACNPSTLGGQGGQIT*GQAF 
KTSLANTVKPKRPQ 


13254 


27155 


A 


13396 


1056 


679 


DIQVPERPL*NVPEPEAKGEPPDRAVGE 
H IDRDCRSDPAQQKRKI FTNKCERAGCR 
QREMMKLTCERCSRNFCIKHRHPLDHDC 
SGEGHPTSRAGLAAISRAQAVASTSTVP 
SPSQTMPSCTSPSR 


13255 


27156 


A 


13397 


440 


423 


IHSPPTQRRV*QRERERERERERERERH 
ALAERNRTREG I SGTTGERGNLMLVGHH 
LPGPLSSRRLGFCPDGFGGQHCLAQGDQ 
VRL 


13256 


27157 


A 


13398 


2 


256 


I ETLG S AVEFI P YENT YQTYLKKKNRNS 
HMQSQPNRKGHI RLKY I AAWA* WLT P VI 
PALWEAEAGRSLESRSRNPVSTKKKIQK 


13257 


27158 


A 


13399 


102 


922 


LSFFLFSETGSCSVAQPEVQWCNHSSLQ 
P * TPGGPSMQFQLPQEQGQRTAVWHGGR 
A 


13258 


27159 


A 


13400 


198 


55 


KDYSAWHTCNPSTLGGGGGWIT*GQEL 
KTSQANMVKPHL YKNPKI EF 


13259 


27160 


A 


13401 


19 


447 


ES ALNLPSAGI IGVSHRTQP I FCI LNAL 
ALG VL I LERS PLQS PQVPPSKSHTKPGS 
LPSVTPGRGPGRPRAAEPTAQGRRYNSN 
PVLISPGSVHPASFALPAEPP*TGAPST 
PRPCVLRPLLLGNAMDLLCTKLSSRLPP 
YRT 


13260 


27161 


A 


13402 


93 


410 


DSNLNYSLFFHGEADLGTNQVLTHPSTT 
AMY FEK Y CQP P * IVHGT I NT * P P WHKN 
P IHI ITPSPCLRASTAINLLLSH INCYS 
*ATPHPLGYQQTYLPLTVHST 


13261 


27162 


A 


13403 


55 


387 


SNSHT YSLKKS AG I TKFQNWHI VNC I CI 
CQVWPLEMVNRHTVILCQYPVKPRILY 
QHHTAILVTILTFTLRPGWSHACNPST 
LGGQGGR I I * AQEFKI SLGNIVRPCLY 


13262 1 27163 


A 


13404 


39 


365 


SGDRRVRLLLKIITF*LSQLNGYIEKST 



1350 



WO 01/64835 



PCT/US01/04927 



SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 

NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

seouence 


Predict- 
ed end 
nucle- 
otide 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=G)utamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, IHLeucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine T S = Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X— Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














RYECGFDPISPARVPFSIKFFLAAITFL 
LFDLEIALLLPI»P*ALQTTNLPLIGMSS 
LLL 1 1 IIiALSLAYH * LQKGLD * AE 


13263 


27164 


A 


13405 


32 


350 


HEFATSTSEMTKGTS * I VKRRI KTHTLC 
RRCGSKAYHLQKSTCGKCGYPAKRKRKY 
NWS ATAKRRNTTGTGRMRHLKI VYRRFR 
HGFREGTTP KP KRAAVAAS S SS 


13264 


27165 


A 


13406 


326 


668 


LQFKIFCMPGIFTISEVSSMSN*RTAYG 
SQS FHKIVLI * LATFGQALSC* IL* IHL 
CYLKNKTKHRALVAHACNPNTLGSPRGR 
IA*GQE FETSLGS I VRLLLSTKNLKKKK 
NM 


13265 


27166 


A 


13407 


170 


472 


KLISWSEESSYCVEHGEPVVPQEKEYS 
WLE IGKAEMERSTLESLLGATLEALSSN 
PTRKMQGLKLKLQRASAFKEEPWLGWA 
HACNPSTLiGG*GGRIA 


13766 


97167 


A 


13408 


178 


422 


FYNRYF*YTKTYQTENIYCKIMSFRIiLV 
*KI * SLPGAMAHACNPSTLGGRGGRI I * 
GREFETRNS 


13267 


z / jyo 


A 


13409 


274 


444 


LFSQYIVFVHYSSFVLILFV* *LFSQYI 
VFVH YS S FVL I L FVI LHNYT Y I Y IHTHT 
HT YTL I FLL I FSQ I SLGWMKL I L 


13268 


27169 


A 


13410 


205 


23 


KE FLKFHRKC I CQQVKKI WPRAVAHACN 
PSTLGGQGKRIT*GQELETSIiANVEKPH 
FSNS 


13269 


27170 


A 


13411 


25 


426 


SVWWNSLiETRSSRLRLLKKI *NLQNKKR 
KFEHRHAQRKYDLETQGEDAI YKLRRER 
RKEEE *EKEEKKRRRRKEKEKKKKKPTL 
LCLKKQRNQHLDLRLPASRTVKKLISW 
+ ATHS WPCYGS PSKLI QQS I H 


13270 


27171 


A 


13412 


227 


52 


EIGSHS CHPGWS TVAQS * LTLTSNS * TQ 
LILPPQPPRELHYNCVPPCPAFCRQALA 
MF 


13271 


27172 


A 


13413 


345 


67 


SKCCYWAHDC I LLDL I CLYFLELVGCV 
DCFSSSIiGHY* PLFV* I FFLS PSLLFWY 
3HYEHICVPKFSKAPFIFLHFFPSPEFP 
FLCIEMVYV 


13272 


27173 


A 


13414 


345 


33 


KVFFFLI I C KMRMQNKC KNKHG I CLLNS 
AMNTCISFPLRQQIMTTWWLEVTQIWGV 
LFYLFHFIEMGSCPGWSAVA*SQPSATP 
VSWSQVILLPQPPEDLGNS 


13273 


27174 


A 


13415 


187 


415 


FNQGVJAFLFPFCVI FS YPWKDRNRTI FS 
I K ITS FQSSVLKLFCI FEVS * I LKIGLG 
QVRWLTPVI PALRRPRQADH 


13274 


27175 


A 


13416 


143 


50 


CMKLQSSIVMLGWAHACNPSTLGGQGG 
WIT*GQGGWIT 


13275 


27176 


A 


13417 


170 


17 


KQMREKEGNGLLT I S KVWKQPKS PSQS * 
* IKKM*YICTMECYSALRPEFPE 


13276 


27177 


A 


13418 


7 


163 


IK*IWYI*TMEYYSAMKRKKIMSSAAIW 
MKXjEAI ILGEVMQEWKTKNLMFSFISGS 


13277 


27178 


A 


13419 


901 


709 


3 GRKTSLGN IARS CL YTKS FK IHCLP W 
PATQEAEAGGSLKPRSLRLQ*ARSHHCS 
PVWVTTRP 


13278 


27179 


A 


13420 


236 


4 


VIRAKTGSHNSLFPPHTFFFQGKNLCFF 
QKKKKKKIYIQSKPGAMAHACNPSTLGG 
QIT*DQEFETSIiATMVKPNLY 


13279 


27180 


A 


13421 


46 


141 


NSVISAHCNLCLPGSSDSPSSASRVAGI 
TGWC*HCNLCLPGSSOSPSSA.9RVAGIT 
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Amino acid sequence (A=Alamne 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenylaIanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, IMProline, 
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X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














GWC 


13280 


27181 


A 


13422 


604 


1121 


LGSGDLP* EINPLSSCSLLREKDPPTTS 
GPQTDQPKKHLTNFKSAPYKTITDAELR 
VTLTVEAHLHPGE INSHVAHTKP VWWSL 
HTDAHEI WCRDSDLKLVPTLPLI PLEAA 
LRNITHSLSIPPPKNFRRPNTSTLFCVI 
FLINIRRQEYQASEPKPSHRI PCDLHVY 
AQMA 


13281 


27182 


A 


13423 


214 


22 


IRESSSTPSSPTLPESCNSIPFPF*KFI 
WSGVVAHAYNLNTLGG*GRKIARAQEFE ' 
NSTRAGLW 


13282 


27183 


A 


13424 


397 


330 


SEYNSECS * VQWHTLWSATQEAEAEGS 
LEARSSRLLCALI I PVNCHCTPAWAFPM 


13283 


27184 


A 


13425 


729 


885 


AKCYPVRNISLWPGAVAHACNPSTLGGQ 
GGR IT* GQEFRTSLANMVKPHLY 


13284 


27185 


A 


13426 


231 


2 


ASVFFKATLVDLHW* IYIYTYI YTYIHI 
YTHTYTHIYIYIHTYIYLNVNQYICLYV 
HINANQQGYIYTGRTPXCFF 


13285 


27186 


A 


13427 


184 


171 


LS * Y I Y I YTHTHTHTHTHTY I L Y I YTYE 
QPSSQSS 


13286 


27187 


A 


13428 


372 


166 


CKFRPRKINTTFSHICGS*KVDIIEAA* 
HWLGTVAHACNPSTLGGQGRWTI * VWEF 
KASLANLLTPLLQ 


13287 


27188 


A 


13429 


391 


437 


MYSLGVGFFHSTI *LLKDIFWSGWAHT 
CNF S TL * GQDGR IT* GRKLETR P GNRAK 
LYLYKKNSN 


13288 


27189 


A 


13430 


39 


254 


EFIPRAQDLETSQGNRVRPRLYRKFKNN 
*VWWCASWPAMWEAEAGGLSEPGRLRL 
HSNMDNGERSCLKKK | 


13289 


27190 


A 


13431 


336 


488 


DIWPGMVAHACNPSTLGGQGGRIA* A* E 
FETS LSNMARPWLKXKLARHSGA 


13290 


27191 


A 


13432 


154 


481 


FFFFFFFQRQKNGFF*KGIFFFFKLEGR 
GAI FV* PNIiTLPG*GKSPASPL* KGGKK 
GGAPPP * LI FFFLKKKKLPLLGQGGLKL 
RALGKPPPFPSQRGGINGVKTPFGL 


13291 


27192 


A 


13433 


388 


280 


KSKLKNGCCVALCLLGLI * SVLLA* FKT 
*LFCVSLDMYIDTTCSLSISIYLSIYLS 
I YLS I YLS IV* SVS 


13292 


27193 


A 


13434 


187 


48 


S I SS TKG PGAVAHAYM PNTWGGGGGR I T 
*GQEFETSLANMVKPRNS 


13293 


27194 


A 


13435 


71 


309 


DSVSEEEEEEEVELAHPLAAERYHCERA 
ET*VKALLWLELCDERVSSRSHTEEDCT 
EELFDFLHARDHCVDHKLFSNLK 


13294 


27195 


A 


13436 


18 


375 


RPAVPGRPTRPINTLLGLLLI I ITF*LP 
QLSGYIEKSTPYECGFDPISPARVPFSI 
KFCLLAMTFLLFDLEIALLLPLP*ALQT 
TNLPLIVMASLLLI I ILALSLAYE*LQK 
GLD*AE 


13295 


27196 


A 


13437 


373 


27 


THPYYSHQEYQSP* PLTGALSALLMTSG 
LAM * FHFHS ITLL I LGLLTNTLT I YQ * W 
RDVTRESTYQGHHTPPVQKGLRYGI ILF 
ITSEVFFFAGFF*AFDHSSLAPTPQLGG 
HWP PTG I TPLNPLEVPLLNTS VLLASGV 
SIT*AHDP 


13296 


27197 


A 


13438 


575 


699 


LGT VAHTCNSS SLGGRGGW IT* GQEFE T 
SLANMVKTCPQKK 


13297 


27198 


A 


13439 


247 


168 


LENL I YTR VLERHRE AKVH FPFSNISYS 
DKRTDTFVLTKTH*HTHTHTHTHTHTHT 
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C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=PhenylaIanine, 
G=Glycine, H=Histidine, I=Iso1eucine, 
K=Lysine, L= Leu cine, M=Methionine, 
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T=Threonine, V=Valine, 
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X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














HTRGSHFFY * HTHTHTHTHTHTHTHTRG 
SHFFY 


13298 


27199 


A 


13440 


391 


168 


TKKKKHPS PRGAREVFF * QPPP PPFCVL 
*FFIVFFLSQGGGGGGGEFPPFFFPPPT 
SPPSPPPRGVWKKTPPF3 


13299 


27200 


A 


13441 


403 


246 


LFPTAI IGGPPGFPPPPF*NPAPEFFFW 
GPKKKKKLS PPPAI KMVFFKGP PP 


13300 


27201 


A 


13442 


305 


414 


KNNFHPSILGG*GGWIT*GQALETTSLG 
NMVKPHIY 


13301 


27202 


A 


13443 


269 


409 


PGTVAHAYNLSTLEG*GTMITQGQEF* I 
SLVNTVI PHLYQKKKKKGR 


13302 


27203 


A 


13444 


295 


37 


DPSYLAWHMVTPHQVSTQHTLPRPRPCL 
PEMNFLVSLKIWQCWPGTVAKACNPSTL 
RGRGGW IT* GQE FE TS LANMTKTRTRGS 
AR 


13303 


27204 


A 


13445 


256 


388 


FCGFWPGAEAHACCPCTLGGRGGW IT*G 
QXFRTSLAHMVKPRVC 


13304 


27205 


A 


13446 


277 


374 


SRLTATS S RGGW IT* GQEFVTTLANMVK 
PLLY 


13305 


27206 


A 


13447 


14 


387 


PQVIHSPWPPKVLGLQA*nSQAWWLTP 
VI PALWEAKANGSLEVRSS 


13306 


27207 


A 


13448 


185 


379 


HFGRPRPADHLRPGATNHPGPQGETPSL 
L*NTKKKKRGRPFKKKKLQSLVWQNKIF 
FLAELKLLW 


13307 


27208 


A 


13449 


168 


346 


ISILRTNNMISIKINLRLFIDELKKRDP 
FIFPYFFFFFFFFFFGGNHSL* SLIVIF 
LIC 


13308 


27209 


A 


13450 


57 


176 


VAESRPGPGGTATELVPPSTRIjIiTRAPR 
DLLTGKRKPPPLGRMSRARVQWHNHF* T 
ESCSVAQARVQQRNLGSL*TLPPKIKQF 
SCLSLLSSWEDCLSQRVRDCSEK 


13309 


27210 


A 


13451 


411 


241 


?PLFFFFFFKQKFPSVP*GGGQGWDFGS 
LQPPPPRVKQIFCPKIPPPWPPKKGGVP 
G 


13310 


27211 


A 


13452 


307 


298 


PTHSIVRNNKSL* IN*TSSMCSGLKVMR 
R * SQKQEDP * SGWAHACNLSTLGGRGG 
WIT*GQEFSTFLCPSA 


13311 


27212 


A 


13454 


191 


108 


GWSQTPGLGRSSLLSLPQWWDYRMQCRG 
VILAPHCGLDLLGSGDPPASASQVAGTA 
AFENCATS FWLP * FLNTQCFKI YKTKN 


13312 


27213 


A 


13455 


242 


400 


I ATLLDYLQ I PNTGPGAVAHACNPS TLG 
GQGRWI I *GQEFENSLANTVKSCL 


13313 


27214 


A 


13456 


1 


346 


YVTTARCSCWAGTESHMGAKDFYRQEKH 
PFHADSGIGWA*DRLRQRYGRFGRRM*F 
SWARGAGEKHSLA*AQWLMPWTTFWEV 
KEGRSLEPRSLRPAWAT/IRNVS TKI KKK 
KKG 


13314 


27215 


A 


13457 


166 


289 


VRSSKLNTWVRWLVP 1 1 LTLWEA* AGGF 
LEPR I SRLWATQ 


13315 


27216 


A 


13458 


220 

<•> 


76 


TASLKLSKSWLDWAHACNPSTLGGPGR 
WIT*GQEIKTSLANMVXPSP 


13316 


27217 


A 


13459 


219 


46 


SPPLKKKNNFFPPGVMGAPPRFFLKGPP 
QNFFFFFFFFFFFFFFFFFLKQ * EQTFL 
LIY 


13317 


27218 


A 


13460 


126 


1 


WSLGAVARS CSPS TLGVQGG* IT*SQE 
FETSLANMVRTLTS 


13318 


27219 


A 


13461 


330 


420 


NICSWAQWI/TPVTPAL* EAEAGGSPEVT 
SS 
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otide 
location 
correspon 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 


13319 


27220 


A 


13462 


577 


766 


AQWHTS I VLAA* EAAAAGLLEPRS * SRL 
KCTMI I P INSHCT PAWQHMRPCLLKKNF 
SNSFISK 


13320 


27221 


A 


13463 


206 


387 


VTVGETEAEAGRGHGLLTCCVLKTQKWL 
GTVAHAYNP S TLGG * GGR IT* DQE FETS 
FLGS 


13321 


27222 


A 


13464 


167 


41 


DFWLGAVTHACNPNS VGG * GGQ I T * GQE 
FETSLANMVKPWNS 


13322 


27223 


A 


13465 


480 


79 


SRVISPLLILNSVISAKSLLPVGSQGKS 
KGQVWSTQEERLEPGVMGEFTPGPCPCL 
VSQGPSLCPSPPPYSFRWSRLSQAPGSI 
RVLI *RPAAATSAPPPAQLGLPDTAFPP 
PLLSRDGQWAAGRQGDRDKGLP \ 


13323 


27224 


A 


13466 


3 


373 


DAWVAHASAHASVCLI IDTLLALLL III 
TF*LPQLNGYIEESTPYSCGLEPISPAR 
VPLYIKFFLVAITFLLLDLERALLLPLP 
* ALLTTNLPLI VMSSLLLI I ILALSLGY 
E*LRKGLD*AE 


13324 


27225 


A 


13467 


214 


55 


TRQKI FNLFS S * KKFL I PPARAKDFLFF 
I FFFFFFFFFFFFFFFSCSRILLHR 


13325 


27226 


A 


13468 


111 


358 


VMKVF Y 1*1 GELVTQAY I F VKSHQ I AH I 
LFHSMYVLP* IKWFWPGTVAYTYNPSTL 
GGHSRR I S * AQEFETS LGNIVS PHLY 


13326 


27227 


A 


13469 


199 


607 


R I RGT S QKLLLLKKL ISLIPKSQP TRGT 
P*TTFPPPNTTNFPPPPVAATAPDPSPA 
HFVSSPYNPDLGSP*PECPSPGRLQREI 
EQCKKDIQNFPFPTTSRICSNDLSLKGS 
AS RRRGHLFCERP VNQ FRS PKP KK 


13327 


27228 


A 


13470 


128 


256 


EDWMWWLTP VI PTPQE VE VRGSLE PRSS 
ELQ*AVIVPCTPSE j 


13328 


27229 


A 


13471 


94 


388 


KGEGKTE E LW S WR SHRGTHLCLLAD I S A 
LPLHYYYYYYYFWQRWSLPDQPGQHGET 
PCIjPKIQI INRAYWHVPWPTDQEADAR 
* LPEPNRQRSQRPE 


13329 


27230 


A 


13472 


359 


73 


ATEPGQLFYF*ESESHNVTQAGVQWLDH 
GSLQPPTPGLKPSFPLIHTGITAASHYT 
WLNQLLPMNVSLLTFYESPQLKIQRSHI 
LCNSIKIKFLE 


13330 


27231 


A 


13473 


332 


10 


I LSKRGFPKPGRYCEVNPFLCVQGAK* V 
RP* EDTSLHCEIYHTHTHTHTHTHTHTN 
LK?*TSKELIPQKTESLFFSLCIKKSLL 
R FQTFPDAGSTGNS VGCACF I S K 


13331 


27232 


A 


13474 


355 


385 


IMVCIKQSCVH*KNN* FWPGTVARVCNP 
RTLGGQDRQTA*AQELKTSLDNMAKPCL 
YKKFF 


13332 


27233 


A 


13475 


297 


425 


R KS AW WLM P VVSPL * E AEAGGFLE PGSL 
RSAWATWGDPHLYKK 


13333 


27234 


A 


13476 


160 


1 


KSTGRLGMVAHICI PSTLGSQGGWIA* A 
QEFETSLGKILSLPEIQKSASLITW 


13334 


27235 


A 


13477 


334 


79 


GMKEQ INFLYQKRG I KPNKLQAENWLI 
HYN * KLLSNS FK * LQDTAS I * YFFGNYL 
NRPGMVAHACNPSTLGGRGGWIT*GQEF 
L 


13335 


27236 


A 


13478 


234 


385 


YAKFHVS INEKKPDVAHACNPGTLGGQG 
KWIT*GQEFKTSLANMTXPRIY 


13336 


27237 


A 


13479 


408 


129 


KRS INRKKGGELS S SHFLLLLLPPLDEE 
PSPPPFCPPLPPFPPPCPRLPQSG*GAC 
GRLHPC* MDLGPCKCPARKVFSHLSCSL 
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acid 

residue of 

peptide 
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nucleotide insertion 














VLQVCRSP 


13337 


27238 


A 


13480 


251 


384 


TITPWRGAVAHVCNPSTLGGQSRQMT*G 
QEFKTSLSNMVKPRLY 


13338 


27239 


A 


13481 


164 


300 


AGDD S DNS MGL LGR I HE I K FFKCLG P CM 
TTKF I VPALWEAE VGGWLE PTS LRLR * A 
LIALLQCTPA* GTVYDYKVYRPSTLGSR 
GRRMA 


13339 


27240 


A 


13482 


2 


30 


TGCHQFRFLNNPFSPHFVLQLVSPGPDC 
LIFRYFKF*LHQLM* INPNANFCGARSF 
LCKFRVWIS*LRYPSLVALSVLFVFK*S 
VQI * VKKKKKKKKKKKKKKKKKKK*K 


13340 


27241 


A 


13483 


39 


199 


LDVGTCSPIVPASQEAEAGRLHEPTSLR 
LW*AMIVQLYYSLGDRARPCLYLKK 


13341 


27242 


A 


13484 


18 


72 


TRTRSYPGSTGCQLHTFGLVSLYNCVSQ 
FLITNLYISIYRDIY*YIYIYLYREIYL 
YIYIEIYTYIERYVYIHI+ILGLQVANC 
IPLD 


13342 


27243 


A 


13485 


266 


496 


LDSASDITQAGVQWCNHSFL* PQTPGLK 
*ASTSAS*VAEITGLSQHVWVKIIMSVS 
LSSYTFIQLPS ILLYGNTAP 


13343 


27244 


A 


13486 


286 


72 


HILSHCTVLWFLTPHHGTTHYKNTTGQP 
S I VAHTHNLS TLGG * GRWTT * GQEFKT S 
LANT VKCS DAWVDRD 


13344 


27245 


A 


13487 


309 


6 


RFFFFFPPGKKGVFFQRFFFGFPRGFPP 
PRFFKTGPEIFFFGPLKKKKNFPPPGGK 
IVFF*RGAPPFFFFFFFFLFFFFFFFLK 
KNYLLLEKVQSPEKEG 


13345 


27246 


A 


13488 


183 


459 


RRVKKI CKCI I WQNNYL * KG INI CSMPQ 
KKKKKKKKKKKKKGAPFKKKKKKKIS *G 
GGGPFF*KTKIKPPGGRGFFFFFGEKIM 
LFPAVFI 


13346 


27247 


A 


13489 


348 


1 


CEGKDGLPIQFASWFKYAGFHSLESIFQ 
S FCQTKKLEAPKELS PCS QLYRYNWQLT 
CRRMKLDPHLLPYTKINSRPGAAAHA*N 
PSTLGG* GGW 1 1 *GQEFETSLTNMGETP 
PTRP 


13347 


27248 


A 


13490 


313 


320 


KQGG PPLAQARVQW YNLS SGETS LPIiPG 
SSNP P PS ATGVAGTTGS P PLA* LI FFFL 
* DRENGDQRKL 


13348 


27249 


A 


13491 


187 


350 


GNPVKK* IKNTTSSSGWVWWLMPIIPAL 
WDAEVGGLLEPRSLRPAWATERDSVS 


13349 


27250 


A 


13492 


211 


83 


E YAKKNGKLLS WRKYL* S VCVCVCVTTC 
VCVCVCVTTCIKNI 


13350 


27251 


A 


13493 


286 


407 


LGVEAYACNLNTLGG * GGRIT* GQEFET 
GLGNTARPCLYF 


13351 


27252 


A 


13494 


897 


1033 


KFIFKQMQWDDHSSLQP*TPELKQSSCL 
RLPSTWDYRREPLHLAM 


13352 


27253 


A ! 


13495 


265 


3 


HLI LLEEDLTWQRGKWLKGRVS LCSLG S 
* TQSLTVIjKKGRGLG WAVAPSTLGGRG 
RWVALAQELETSLGNMVKLSLSKIQKLA 
GMV 


13353 


27254 


A 


13496 


319 


1 


CFKLWDTCAQCAGLLHRYTCVMWCYTH 
HPEFCYNIMGLPKYMWSYLWPKCCHAVY 
HFRYIKNLLPGCVAHACNPSTLGGRGGW 
IT*GKEFKRRMKTKV* IGI YFI 


13354 


27255 


A 


13497 


265 


n 


VLGEVSNQHLLGAPSNRTPSEVPLAQAA 
QGRPSLVPPSAIVFPSYFTICYAFLSIS 
M P 1 1 P * VWE AE V GELLEPRS LRQA * ATW 
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13355 


27256 


A 


13498 


164 


22 


G I KR KLI KS G WAHARNS S S LAGRGGWL 
T*GQEFETSLAKMMEIPSV 


13356 


27257 


A 


13499 


277 


363 


TYLCYIFNI CLFVGE IEKQR* LI * FHRS 
TLNGH I LYVRP VS PNPSQAVASPVTCT Y 
MPTWPEWEESQKK 


13357 


27258 


A 


13500 


1 


270 


YTCS VLLLVTVNLIjLCIj I YKLNF 1 1 GML 
RKKTVYIYIERIQYYLKFQASTGCLGVY 
PTV*PYMYTYTYMYMYMYIYIYIYIYIL 
CLCLDG 


13358 


27259 


A 


13501 


170 


383 


ILYSSKLTYPSPQIVIFFCGKST*GWVW 
WLMPVI PAI * EAEVGGSLESTSLGQAWA 
TGRKAFFVFYKEYKN 


13359 


27260 


A 


13502 


31 


402 


GDYLYLREGWQEPARREQHAQNLEPNRT 
FTPHli I CLTNYRLFSLAWS ECEERRNET 
TDKNQTVKDLALDMKRVLP FS * GHQKVI 
KVG WAYACNPS TLGGQGQW I TRGBEFK 
TSLINMAKPYLY 


13360 


27261 


A 


13503 


232 


446 


RNRPGAVAHACNPSTFRGRGGRIMRLGD 
*RS 


13361 


27262 


A 


13504 


196 


390 


VNKAEKKHL YFVARKE3 LRPG WVHTCN 
PSTLGGRGGWI I *GQEFETSLPLPKTQK 
LAECGRGCL 


13362 


27263 


A 


13505 


146 


7 


LKYVLYWLGAMAYTCNTSTLGGQGGWIS 
*GQEFETSIiANMVKPCLF 


13363 


27264 


A 


13506 


177 


18 


SQNFGRPRQQDHFRLDGRHSETPSLQEK 
VKKLTRHGGACVQSQLLERLRREDHLGP 
GGGGCSVP*SVNII 


13364 


27265 


A 


13507 


117 


338 


NKILKKKKGGGRFKES KFTS PGLQGNS F 
FMGPPKLNSRAGV*QRREGKNLGVPQLK 
PFEANPLFARGPNTKNP 


13365 


27266 


A 


13508 


306 


443 


LARYE PAVRTRAC * AQ WLTAV I P VL * EA 
EAGGPiiEARNLRPAWAT 


13366 


27267 


A 


13509 


146 


390 


KMFKGHEQAAHRKKKKRGGRFKGSKFTS 
ACLQRNIFFLGPPKLNSRAGV*QRGDWK 
NPGVTQFNRFEENPLFARGPNTQKP 


13367 


27268 


A 


13510 


119 


356 


NEDRNLRGGCPGR* LLRTEGVCSNPAGW 
SGIRWCEESGGLFWRWRGTESVLFSGL 
FVCCVFAQE KAKATGRAE VS LCP 


13368 


27269 


A 


13511 


221 


21 


EDLQRDKPLGS CYS TCGWAEQ WYLQHPG 
GAGSKCR IWLGAVAHAYNPS TLGGQGRW 
IT*GQEFLSL 


13369 


27270 


A 


13512 


163 


418 


TFPDDQCLMLQDHACVKRSIQSA*YLTP 
AVLAHWEAEAGGLLEPTSLRPAWAT 


13370 


27271 


A 


13513 


239 


21 


FGNLGGPGGRTACIQFSLGNIVRPYFYK 
TKPKPKPNQNKTKI S *AWWCMPIVPATR 
KABVGESLEPRRSRLQ 


13371 


27272 


A 


13514 


135 


11 


KFFFWPGVLAHAYNPSSLGAQGGRIT*G 
QEYETSLANMVKS 


13372 


27273 


A 


13515 


162 


57 


EGTLRSRRPPLGGWVT*GQEFETSPANM 
VKRCLY 


13373 


27274 


A 


13516 


424 


54 


PKRGFFPTPFIWVPPVFPLPPFFKPPPR 
IFFLGPPKKKFFS PPPGLKI FFF* KGPP 
PFFFFFFFFLFFFFKKKKKRLGGFFFFF 
FWARPFSFLSLFFFKETIPLYYNIWMHK 
DSCKAVIHHYH 


13374 


27275 


A 


13517 


310 


349 


GGGPLKKKLFFSRGGERFFFF*GAPPFF 
FFFFL*SSKIFIFFLKSFFFFFFFFFFC 
FFFFFSQILFFJSSPCFVFFFLNYTSRT 
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CS YVCAL I LFKGNT YSRKARHI LW 


13375 


27276 


A 


13518 


273 


393 


IKCLS YRKKATTLGGQGRW IT* SQEFET 
SLSNM*KPQLY 


13376 


27277 


A 


13519 


450 


139 


I PP IALVS PFAKHTWQQQMGLVRRSS KG 
NNMSKGWEIIHPSRIFCSNHTLLRIALW 
LGAWHTCNPSTLVGRGG W IT* GWE FKP 
SKTSMERFCILSRHKRTRGS 


13377 


27278 


A 


13520 


47 


427 


KMKQLFEMLVFQNWTI FLSHS I FRTIR* 
LTLEPLMFRYYFKQ*RTLLPPKF*DRVW 
LCHAGGCL*CSGTVSAHCNLRLLGSSDP 
STSAS * VAGTVGAGHHI WL I LNNRELP K 
QKFFINSSRNLFFT 


13378 


27279 


A 


13521 


199 


98 


SLP SVKLS I CCE FPEDIVSKDP I P WLP 
WKRESCSLCALPNLLPK*EKTNIQVP* 
VFQSTI KAKAQLGMVAHACNPSTLGGRG 
GWIT*GLR 


13379 


27280 


A 


13522 


314 


18 


QTLNFYILKIYNIISLVHC+IFEARFQI 
LV* V* *NYPF* SLICFTFSWHETQNT* * 
KNL 1 1 WAGAVAHTCNCNTLGDQGRRI AL 
AQEIEPSLGNIAIQ 


13380 


27281 


A 


13523 


18 


231 


CPP AVFGTS I E Q I QLK YD I KD * K I GQAW 
WCAPVI PGTQAE VGGSLE IRSSRL* CAM 
STSMNRHCVPAWAT 


13381 


27282 


A 


13524 


239 


465 


RVQVS ECLLHRE KCLYFQLGLS * L I FQT 
GWLKQQKLPGTVAHTCNPGTLRGRGKRI 
T* GQE FETSLAN I VKPRLY 


13382 


27283 


A 


13525 


388 


468 


SLRLGT VAHTCtt PSTLGG * GGW I TRSG 


13383 


27284 


A 


13526 


342 


369 


CTLNTLLLYS ISDFGVSAFLATGGDITR 
NKVRKT* LRLGTVAHTYNPNTLGGQGRR 
LT*AQEFKNSLSNIGRPHLYKTKKTKKT 


13384 


27285 


A 


13527 


241 


231 


EYVCIWSHSSLYFSSSLYIHM*VCVCVY 
I YVHTNSH I YTEMY I HTHTHTH I CI YKL 
EEKYKEL 


13385 


27286 


A 


13528 


241 


231 


EYVCIWSHSSLYFSSSLYIHM*VCVCVY 
IYVHTNSHIYTHMYIHTHTHTHICIYKL 
EEKYKEL 


13386 


27287 


A 


13529 


34 


299 


SAP AS AS PWGIGVLYRPLACPEDRFSPG 
PEANCGEIETTELRVTLSPRLECSGTIT 
APCSLDLQGSRDPPPSAS*VSETTGASH 
HAQL 


13387 


27288 


A 


13530 


158 


45 


SGHRSRARWL* PVI S ALWEASAGGS PE I 
KSLRNRRPG 


13388 


27289 


A 


13531 


342 


1 


C F FLKGAP PFFFFYFFNFFFF F FLVGTD 
KLILKVTWKWKGPKIDKAVPKKNNQARG 
LP*PDIKI F YKVWLGAVANA YN P SNLGG 
* GGW I A* E * E FETSLANMTKPQ YKKYKK 
LA 


13389 


27290 


A 


13532 


589 


742 


RIMKMLRIKICGTGPGMVAHAVNPSSLG 
DGWI T * DQE FKAS LANMI KPHL Y 


13390 


27291 


A 


13533 


74 


446 


HVGIPSRLTSVSHPRHCKHPRTESEHVY 
CAIEARHYQCKVSRPSDAAGSEGRPRDM 
TQDTEVADTPPPNI SSI TATRTMAT IGV 
GVVAHAYNPNTLGGQGRRI A* AQE FKTS 
LGNLAKPCLYKH 


13391 


27292 


A 


13534 


1941 


1040 


AFHLLPYLSSGFYCSLGPCFGLGSSHMA 
AWHKE PLHNACSDSRR S AP I RQGLGS PS 
ATHDTHTRTLAH I CKE I FKERLHE I KER 
EIBS*RG*NTOR*VEREEAENKYKPTEI 
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G=Glycine, H=Histidine, I=Isoleucine, 
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nucleotide deletion, \=possible 
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NTEIENHKQQEMLRHAGEERENSPREAE 
RKCRNEKCEIKSRNSRKEARLRKKVKFK 
TNE KG KG * QI I T VHKE KRE VGRERRGRQ 
NKKRTKSTQHVTQKGDMR * S VKEEPHLR 
RAKRNSNKMKNERYVLRQDTDHSCEQKE 
KMRQRHTRKKLARETWKWTOHTLNREQR 
AKPRKKETKEIHIHSAILKC 


13392 


27293 


A 


13535 


1192 


1476 


E I KRKWGPGAS CHACKTPI TGRPRRVDHL 
RSGVQGQPGQRGETPSLLKNIQISWAWW 
CAPVI * E I KRKWG PGAS CHACN P I TGR P 
RRVDHLRSGVQGQPGQRGETPSLLKNIQ 
I SWAWWCAPVI PATWEAEAGGS LETGS G 
GCTEPRSHHCTQAWVTE 


13393 


27294 


A 


13536 


55 


132 


RAESCSLAEDGVQ*GNLSSLQPPPPK 


13394 


27295 


A 


13537 


7 . 


325 


S P VEFRLGE PTKGTS S FGKRRNKTHTLC 
RRCX5SKAYHLQKSTCGK*GYPAKRKRKY 
NWSAKAKRRJSPTTGTGRMRHLKIVYRTFK 
HGFREGTTPKPKRAAVAASSSS 


13395 


27296 


A 


13538 


259 


274 


KFEAAVCYHCTTLSL* *CHTVCFLSFFL 
LEIGYHYITQAQSGAMIAHCSLKLLGSS 
NP P VPAF * VAETRS AC YNAWL VFVKFLM 
GL*ELWKDMS 


13396 


27297 


A 


13539 


j 


68 


QASVLPQKQLMSWQKLSESTFVKFWNLV 
KTGRLGEE I SCCLWRE YYS I S DCLAT I 
KL PASHLS MRKPRHKDFKST FL I S TKNQ 
GNHGHGTKGNEVNNFQQRIS V* KTYFKK 
YNMRPGAVAHTCNPSTLGG * DWWI T*GK 
NCQNQLS 


13397 


27298 


A 


13540 


268 


468 


YRRLISKKFFKHRWKSVSFFKFSFFWLG 
EVAHACNPSTLGGQGGKIA*AQEFLTSL 
GNIVGPCLYKK 


13398 


27299 


A 


13541 


308 


437 


FSWAFEIGSQAEVQWHDLHSL* PLPPRF 
KRFSHLSLPSSWDYR 


13399 


27300 


A 


13542 


7 


244 


AKTAPLFF* FET VS CFVAQARGQ WHDPG 
SLQPR PPGQETSM I KTS3DP PPPASQVA 
GATGMGHHAQKI * FLVETGSPHVSQGGL 
ALLK 


13400 


27301 


A 


13543 


50 


175 


ALPAIHTAVGQCSVEGFCVLSDPEGDGA 
ARMKLVRFLMKLSHETVTIELKNGTQVH 
GTITGVDVSMNTHLKAVKMTLKNREPVQ 
LETLS I LGNNIRYFI LPDSLPLDTLLAD 
AE PNVKSMNREAVAG * R * G* ARE I FDE I 
ES 


13401 


27302 


A 


13544 


47 


362 


VAFQGLQVPATNPANFFFFWKGGFFFVP 
QGGRQGEKIGLRDPWPPGVRKI PGPTLS 
GTREKGAPPPPPIYF* FFWKKGGQKGGP 
GGF*TWDPKGPPSPTPPXGGD 


13402 


27303 


A 


13545 


97 


351 


NKKKKG PP P F F * KKRVGKKKKKGARAGG 
PPPKPPPFGGPRGGGPRGQNSKTPRPKG 
GNPPFKKKKKKRNI F PAG PMGLKGGGHL 


13403 


27304 


A 


13546 


214 


32 


NYIKIS SVDGGAKI K* WT * NLKCLVLS V 
KNITRWVDCLSSGVQDQPGQHGETPSLL 
KTRKTS WA* WRLGLI QR WRKMP WPSR 


13404 


27305 


A 


13547 


350 


159 


QTGTLSQKTKQKKQAQWLMTVILGL* EA 
KAGGLLEPKSLRPAWATWQDP I YKKKFF 
LERKTWR 


13405 


27306 


A 


13548 


367 


149 


FFFFFFFF* TGSH YVAQNW TQ I PGLNLP 
ASASOVTATTGMHHHTRHTPFPSLTSHT 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
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QSVSTASWTYLQNSGRH 


13406 


27307 


A 


13549 


321 


2 


GL VKVTHHG Y DRNLGY PD S QAS VLS S FY 
TA*MVQ*LQLPDKVQIYPAHPLFINDMP 
P*WLLKIPSFGPGTVAHPYNPNTLGGRG 
RRTTRDREFETSLANWKPHLH • « 


13407 


27308 


A 


13550 


216 


13 


GYSGKETVFSMRKGQQSGPCLQGPS I PM 
GKAG INQI I PKTTLGGRGGWIA* GQEFK 
TSLANTVKFCLY 


13408 


27309 


A 


13551 


199 


391 


KDFS I FI ACFKRPFITGSQGQEQWLI FP 
HAYNPS ILGGRGRQIT*GQEFLTPARLG 
NMVKPCLY 


13409 


27310 


A 


13552 


70 

i 


239 


ATCVKN * RTS WAWWCVP WPATHEAAAG 
GLLEPRSSRLQCSVIASVERHRTSVWTT 


13410 


27311 


A 


13553 


105 


257 


GQAQWLMPVI PALWEAEAGGSFAPRSLR 
LA*ATVPGPNNFLKRAFHSLLNLFLP 


13411 


27312 


A 


13554 


499 


165 


Y*MESHFVTQGGLQWCSLGSLQAPPPEV 
GGWLELGRQRLPMKPKIAPIWTPSPG 


13412 


27313 


A 


13555 


415 


719 


YTDKSIRSISLLPVKGSTHSMCPVKFIQ 
LQE AGG I HMAKLSGQRVDRE WRLGTVAH 
TCNRSHLEGKGGQIT*AQVFKTSLGNMA 
KSCLCKKYQKLSWIWQH 


13413 


27314 


A 


13556 


224 


523 


DKVSLCHPGWECGWVQSQLTTASISQG* 
TILFPQSPE 


13414 


27315 


A 


13558 


355 


1 


QVSLQLPYCVIiFQFHIYNKQAALLQRCY 
VSATNLLINAI WQYVNSLKMCEENQNVS 
LQNMPF*HRNYFGEAGHGGSCL*SQLLG 
RLRH*NCLNLGGGGCSELRLRHCTPAWA 
TKASPS 


13415 


27316 


A 


13559 


73 


259 


KYVPHKGKI SERRLCSVAQAGVQWCDHG 
SRS LE VLGSNDL1AS AS * VAG I TGCWGS 
SDPHTT 


13416 


27317 


A 


13560 


205 


21 


WK I SEDFLLGDAKKWAYQED I KRLRLG 
RGAVSHTCNPS TWRGRGG W IT* DQE FET 
SLPKC 


13417 


27318 


A 


13561 


254 


481 


LMALLPGSSDVDLSEYGWEHFNFVILIS 
KEIFILLFF*DGVSHLLPRLECSGMISA 
YCKLCLQGSSDRRDFLKKQ 


13418 


27319 


A 


13562 


174 


369 


TFFSPSMFVEPGPYYIAQAGVRWLFTGA 
I IVHCSLKLLALSDS PTSAS * VTATTGM 
PPHLAFTVK 


13419 


27320 


A 


13563 


371 


109 


LFSKAGRCILQNLLCCMVSGKCWPMNGI 
DVTREYPTFIAHRVLrYAATSGSDFLGQ 
AQWLMPVIPAI *EANAGESLEPKSLRIiA 
*ATVPGPRSLSQTLLHRSELCGIi 


13420 


27321 


A 


13564 


206 


486 


RQLAAVHI LVTPLPDHVTWANYVTSLSL 
LSSS ENGVNNDNTHFIGLLLRDYK*GWV 
* WLTPVI PALWEAKVGRIARGQELENSL 
GSKARPRFV 


13421 


27322 


A 


13565 


212 


398 


SRVRGCFPFNLPKSSCI*ECGQGAVAHA 
CNPSTLGGRGGQIA*AQEFKTSQGNVAK 
PHLYKI 


13422 


27323 


A 


13566 


26 


395 


YSPVHTDKCSGVRKLGLFFFFFFEKEFH 
FFSPAWKARAKIGLNGTPLSQGKGNFRP 
KPPKKRGKRGRAPQPGKILNFKKKRGFS 
MGAKVWPNFGPKGNPPP* PPKGAGKKGG 
TPRPGPFFFFF 


13423 


27324 


A 


13567 


400 


195 


ATRVSLAPTKKKINWGWWPTPVI PPPQE 
GEAGEFL* PQKWRVRLNKI VPLPSRPRI) 
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nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L= Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GJutamine, R=Arginine, S=Serine, 
T— Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon,/=possible 
nucleotide deletion, V=possibIe 
nucleotide insertion 














KTRLCFKKKKRN 


13424 


27325 


A 


13568 


316 


154 


SQPFPGLRWEDRLSRGVQNQPKQ*LMPV 
IPAFSGAEVGGSLEPRSSKPA*ATNNIjK 
SKVIIGGQICDKASKVSIVESSWWGWGC 


13425 


27326 


A 


13569 


297 


475 


FDYFF I Y* ETKSHS AAQAGVQ* CKHS SL 
QP* PPGLKQSLLPWPPKSAGLTGVSYGT 
WPI 


13426 


27327 


A 


13570 


116 


332 


NRVLLLPKGEYRGTISAECSLKLLGSKR 
PPTSAA*GDDETTGVCHKTPV* FLKTFF 
IEIRPGYVPRVGHWRF 


13427 


27328 


A 


13571 


233 


405 


TVP E LRAS CRLKKKKKKKKG E KNLKK I K 
GFPGPGPRV*GPEAKNPGGGG PG I PKG P 
G 


13428 


27329 


A 


13572 


408 


3 


TFCDYFKTPP PGKFFF ING P PS F FLNRG 
PSFFFFPGFPPPFLKNFFFFPLWFKGG 
GYKRGVFPPKKKGPPQPLFFFFFFFLI * 
KKTRPGTVAHPCNPSTLGGQGRWIT*GQ 
EFLSSLAKMAGRWRPGNCESV 


13429 


27330 


A 


13573 


273 


406 


I KEFKAERGGHS *NTSTFGGQGR WIA* S 
QEFETSLANMVKPHLY 


13430 


27331 


A 


13574 • 


385 


38 


KKNSPPPPGKNFFFF*GPPPSPFFFFFF 
FFFFFFFLGGKKTFFFPPPSRFFFFLKP 
FPPKNQGPKNFFPPKKKIGD*ARPPPLT 
GGAKGAPLFKKKKKKKKLKVRPGAVAHT 
CNS 


13431 


27332 


A 


13575 


230 


53 


HWKVLKENKQI FTKIS FKMNAI KDSVGQ 
AQWLMPVIPAL*EPDMGGPVEPKSLRPA 
WAT 


13432 


27333 


A 


13576 


406 


33 


VNSIVWVSSPFRVSTFISFFELKSCSVT 
QAGV*WHDLSLLQLQTPGLKRSPAPSRG 
LSFYLSSIFYPHSSQTELFVGTLNAASF 
TNSRLLYLSLLLCGKYHPLVPSLPLSTI 
WYFSLRVSHHFP 


13433 


27334 


A 


13577 


426 


294 


GGFPSPPVFKPPPQFFFrGPLKKKISPP 
PPGEKKIFF*KAPPP 


13434 


27335 


A 


13578 


268 


3 


LQAKGPAMGEAGARCS SEVWGWPRKGFD 
L*VSR*PSFDSHVRIGRVQRLMPWPVL 
WETEADISFEPCSSRPAWASWRKTYIQK 
PQRI 


13435 


27336 


A 


13579 


395 


49 


EKKKFFVCFPGFPGNPPNPSPQFF*KGL 
IS PSGSLRTRRGFFP * RVFFFFLKKQFP 
LSPRVECNGI I PDP * PPFGEVTSTPQVA 
GAIGPPPPPRVNFYFFVKKGFLNVGPMF 
FFF 


13436 


27337 


A 


13580 


68 


466 


GAS PAQGSTLHIiVI LPRYFFKI PTVRTE 
SFSGSLVTSPPPLHFLPLNKERGGPGIP 
LPCPQLLQVLGGTSLPPVPSLPDTSQDK 
WPLHGVPPGHVCS* PLAGDGAWPPSPHW 
IPLNPGTSKSLQPAPPWNSA 


13437 


27338 


A 


13581 


67 


249 


ATAPGLLCS YKI FSCQLQWCVPVTQIiAW 
ETEAGGSLEARSSRL* *TMITPVNRHCI 
LAWAT 


13438 


27339 


A 


13582 


359 


3 


KQDSQWVAAAPASVAVSTCWRGLPLPWS 
GGQNRGKGSLPLQVHRGA* GTENKNQGG 
T PRPGGGPGPS APRGS S LGAHRKLPVHH 
TTLSSSRSALPPTPRHPAPSSPPCTEEF 
HRTRPI 


13439 


27340 


A 


13583 


174 


1 


FF FFFVETESHS VAQ ARVLE YSGAI PAQ 
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eth 

od 
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Predicted 
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location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=lVIethionine, 
N=Asparagine, P=Proline, 
Q^GIutamine, R— Arginine, S == Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, . 
X=Un known, *=Stop cod on, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














CNLDNLS S S ELPASAS * AS * VAAXTGT * 
KR 


13440 


27341 


A 


13584 


192 


206 


LFFLEVGSCYVAQAGFEFPGSGHLPLSA 
S * WG STGVPHHTQLV* SFSSLYNLERN 
C 


13441 


27342 


A 


13585 


244 


28 


KKPENQEINYIFYHLFNVHEQIRISLGP 
GAVAYNCNP STLRD * GGRI T * AQEFKTN 
LGNMARPRTRGSTRTF 


13442 


27343 


A 


13586 


312 


701 


RGPEET* PRTRKSS PRPTRSCESVSKLA 
RPPRR CS AAP AQVPRLSLRS PKPDPPRV 
SSAEKCAPLPLLPECSETGALPRNSFLC 
• QNASSPLLSLGIiPPSPTVQALQPRALHQ 
HIjGS TNKEDAHVAP AKKK 


13443 


27344 


A 


13587 


410 


292 


GYMIKQQTVYHECRM*ANSHLHLPPEGL 
LRAVTLWRRAP 


13444 


27345 


A 


13588 


428 


36 


TPPPFFFFWAQKKKKI FPPPGGKI FFFL 
KGPPPLFFFFFFFFFFFFFFLKKFFGLK 
GPFFFVQD*QGLENPKEGSSLGWEKLGR 
ALLPQQRS FVPGLGPTRLFWGLVFWFFG 
FFFLPFGLCFFFALIFNC 


13445 


27346 


A 


13589 


3 


364 


TAPDDQGRPYQAGEPAAHVRADSTHGLG 
A* RRVAGHL P PRVGAPHHPGAS S REGAP 
PLHP P PNL * HRR PNGS CRFLPGPAAPPQ 
GAEGGRGRDRRGTHSVAQTGGPGGFGSG 
VTPSWRSS 


13446 


27347 


A 


13590 


778 


910 


DRIRRSGAVAHTCNPSTLGGQGERTA*G 
QSFKTSLGDMVRFCLY 


H447 




A 


13591 


230 


3 


KLVNI LVEKSLVAMKSTVSEGKMVIRS P 
ASLGMVAHTHSPSTLGGQGRQIA*AQEF 
ETILGWMAEPCLYKKHSVY 




Z. / Jf/ 


A 






j~t\j 


LTPIIPEPWES E WGS kE * LTPGAVTHA 
YNPRTLGIRSGRIT*GHEFQTTLSHTMK 
TRLYKELL 


13449 


27350 


A 


13593 


35 


220 


DRASLSPRLECSGMIVAHCS INLSGSSD 
PLTPAS * GDGTTDTCHHARLI FTGADFC 
IDASS 


13450 


27351 


A 


13594 


87 


369 


NVQKTECEI SGKMQIWQKGSVKTKCSNG 
LFNFPIFSKKIASCKVKKLTRE* KYKSQ 
ARWCTPVT PATEAEVRGQLEPRS LRPAW 
ATYQPHFKS 


H4S1 


27352 


A 


13595 


261 


489 


SWVNERMWG* *KHETSQAQWLMPI I PA 
LCEAEVGGSLEARSWETSLCNIVRLHLS 
KKKRGKGRKRGGNQIAPSRE 


13452 


27353 


A 


13596 


238 


2 


NDCLWWLFRI PANVSTFGLLHMSLKVNT 
PGNNRFKSELGTRCL IHTCNPS I LGGQG 
GW I T* GQE FVTSMQKGSAVKNQ 


13453 


27354 


A 


13597 


36 


687 


RDVHRSTYQAGSKQDWGPGEAERLSSSR 
RGAYS CP VP I TFAEGKTRMGRDKMRLI L 
GLS WG P S LTLLIiP I TNAPRR PGM * E P AL 
PGNSTS* SGAVRFPGGGRK*EDSAESWS 
CFCSHPSPSSGPNPPSPPSAVCMLPGLS 
GLQR*PKQLSPPPALSLPVSSLLVRLSP 
WPPTSSHLLPQALPQSLHPQGSHRAVND 
SFSM*GMVLGSRRNRGCMEA 


13454 


27355 


A 


13598 


367 


468 


KKQRRGREP WLTPAI PALWKAKAGGS P+ 
VQSSRP 


13455 


27356 


A 


13599 


183 


380 


LTSMLAVDNSS VEKTCPQAMRI SRPGAV 
AHTCMPS I LGGQGGL I A* VQEYETMLGS 
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peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 
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a m inn arid 
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peptide 
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Predict- 
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nucle- 
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last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
G=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leu cine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P^Proline, 
0=GliitJiminp R^Arcininp .S=Sprinf* 

\£ \JIUiMlJflJIVy AX iKM £\ 1111 11 1*, U Uwl IIJW, 

T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unkno\vn, *— Stop cod on, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














MAKTHLYKN 


13456 


27^*57 

Z. 1 J J 1 


A 


13600 


161 


6S7 


LIQGCVJCSLFFSTRVFLAAIGMKSQWCR 
PVAMDLG VYQLRHFS I S FLS S LLGTENA 
SVRLDNSS SGAS WAIDNKI EQAMDLVK 
SHLMYAV* EEVBVLKEQIKELIEKNSQL 
EQENNLLKTLAS PEQLAQFQAQLQTGS P 
PATTQPQGTTQPPAQPASQGSGPTA 


13457 


2735S 


A 


13602 


121 


423 


MIKVNSFGLYLYSQVIYIYIMKYYTAIY 
LKDFFFKDGVLTRHPRWNTVTTQCSLE I 
LGSTDPPAS AS * AAGTTNTSHHGQLYS Y 
FKKRPTYASMEQYKIKK 


13458 


77S59 
£. i jjy 


A 


IJUVJ 


1X89 

1 007 


1717 


VPRIRPLSWTPPSSI*RPKPPSSTFSHP 
GKPSMGPPGLNRCPVQRALLPGWYQHCG 
R 


13459 


27360 


A 


13604 


81 


267 


ETCC I KNNTLK* I * S QMLWCAP I VPATW 
QAEVGGS LD PRS S S L * CAM I TS VNNHCT 
PAWAR 


13460 


27361 


A 


13605 


267 


418 


THASGMVTHTCNPGTLRVSAGRIT*GQE 
^KTSLGN I ARPKL YK 1 1 INKY I 


13461 


27362 


A 


13606 


428 


24 


CPESNPRKREPTLKSEPILTLPITGLIL 
DDGNS PTAAAGS VFAETLQPQCS SLCC * 
VKSSPWPTASLPQPFGSAPQTLPVRYAL 
IPGPSLVSTSLGPRLAPGLPGSPSPPSP 
QVTLDSERYPSPTPCIAPALED 


13462 


27363 


A 


13607 


267 


442 


RPPPTLKVPWLGVEVHACNPRTLGGCGG 
SIS * AOKLENSLSNI LRPHLYOKOOKOO 
KK 


13463 


27364 


A 


13608 


3 


280 


YRLSVICEDPMREREQTCFPPPPSFPPN 
RYCSTRDGGMEGSQGDFRKGFPA*APCR 
SGGGWRGCCSPAGAPGSPLCGSSQGGPV 
SPWGQKRS 




Z/jOj 


A 








W S LAGVKGDGGR * GSGAGGS PKLG * AV* 
LDPEGHAGQAVFCGEAGARTQIGGKG*R 
RDQAPLPLSPNLCSGSCLPTKHSLARMP 
LRVKSYSSAQLGRTTCTRPSPPTITFDS 
S 


13465 


27366 


A 


13610 


267 


3 


HTDLL YNTPTHP I S PRCDPKHTAI PDKQ 
SLLFFFFEMSSCSVARAGVRWHDLRSLR 
PLPPRF* KFFCCS I PSLYCGRSRGSSQT 
YTS 
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NO: of 
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SEQ ID 
NO: of 
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sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino viriii 

residue of 
peptide 

JLUUCIJLC 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

1 wlUUv V/ J 

peptide 


Amino acid sequence <A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, LHLeucine, M=Methionine, 
N=Asparagine, P=Proline, 
O^GIutaminp R=Arpinine S=Serine 
T=Threonine, V= Valine, 
W=Tryptophan, Y=Tyrosine, 
X=l In known *=Ston codon /—nossible 
nucleotide deletion, \=possible 

n iirlpotidp in^prtinn 

IIULIWullVIV lil^^fl lll/ll 


13470 


27371 


A 


13615 


197 


428 


KTMVGLKTSASFHQDFIQI PLNHNLSVP 
*TMLGSQPRFLFPPGTVSHICNPSTLGG 
RGGW IT* GQE FETSLATMVKP 


13471 


27372 


A 


13616 


445 


550 


I S I TRGRARWLTP VILVLWEAEAGGS P * 
GRSSRL? 


1 JH 1 J. 


Z I J ID 




1 OOl 1 


j 




TMVLS PADKTNVKAAWRKVGAHAGE YGA 
EALERMFLSFPTTKTYFPHFDLSHGSAQ 
VKGHGKKEADALTNAVAHVDDMPNALSV 
LSDLHAHKLRVDPVNFKLLSHCLLVTLD 
RHLPAE FTPAVHAS LDKFLAS VSTVLTS 
KYR 


13473 


21314 


A 


13618 


719 


918 


CEGRREKWKIGRERKGRGRRKGEQMGRE 
GKGREDGGRKRPCTSRPRSSSRDRSNI I 
ISEHASAVEA 


13474 


27375 


A 


13619 


847 


938 


WLMPAVPALLEAEVGISLEAQSCRPPWA 
TQ 


13475 


27376 


A 


13620 


349 


409 


NLV7S W PGAVAHVCN PS TLGG 


13476 


27377 


B 


13621 


25 


189 


MVLS PADETNVKAAWGKVGAHAGEYGAE 
ALERMFLCFPTTMTYFPDFDLIHGSAQ* 


13477 


27378 


A 


13622 


142 


356 


RIVENEKINAEKSSKQKVDLQSLPTRAY 
LDQTW? I LLQGLAVIAKERP PNP IEFL 
ASYLLKNKAQFEDRN 


13478 


27379 


A ' 


13623 


18 


1353 


AGAAQCEWSAGEAGARTMSEADGLRQR 
RPLRPQWTDDDGQAPEAKDGSSFSGRV 
FRVTFLMLAVSLTVPLLGAMMLLESPID 
PQPLRQISGIALFCSFKEPPLLIiGVLHP 
NT KLRQ AE RL FENQ LVGP E S IAHIGDVM 
FTGTADGRWKLENGEIETIARFGSGPC 
KTRDDE P VCGR PLG I RAG PNGTLFVADA 
YKGLFEVNPWKREVKLLLSSETPIEGKN 
MSFVNDLTVTQDGRKIYFTDSSSKWQRR 
DYLLLVMEGTDDGRLLEYDTVTREVKVL 
T.nOT.RFPNGVOT.^PAFTjFVT iVAETTMAR 
IRRVYVSGLMKGGADLFVENMPGFPDNI 
RPSSSGGYWVGMSTIRPNPGFSMLDFLS 
ERPWIKRMIFKLFSQETVNKFVPRYSLV 
LELSDSGAFRRSLHDPDGLVATYISEVH 
EHDGHLYLGSFRSPFLCRLSLQAV 


13479 


27380 


A 


13624 


18 


1353 


AGAAQCEWSAGEAGARTMSEADGLRQR 
RPLRPQWTDDDGQAPEAKDGSSFSGRV 
FRVTFLMLAVS LTVPLLGAMMLLES P I D 
PQPLRQISGIALFCSFKEPPLLLGVLHP 
NTKLRQAERLFENQLVGPES IAHIGDVM 
FTGTADGRWKLENGEIETIARFGSGPC 
KTRDDE PVCGRPLG I RAG PNGTLFVADA 
YKGLFEVNPWKREVKLLLSSETPIEGKN 
MSFVNDLTVTQDGRKIYFTDSSSKWQRR 
DYLLLVMEGTDDGRLLEYDTVTREVKVL 
LDQLRFPNGVQLS PAEDF VLVAETTMAR 
IRRVYVSGLMKGGADLFVENMPGFPDNI 
RPSSSGGYWVGMSTIRPNPGFSMLDFLS 
ERPWI KRMI FKLFSQETVMKFVPRYSLV 
LELSDSGAFRRSLHDPDGLVATYISEVH 
EHDGHLYLGSFRSPFLCRLSLQAV 


13480 


27381 


A 


13625 


1 


384 


QSFRGTGRKRERERKRMSLSDWHLAAKL 
ADQPLTPKSILRLPETELGEYSLGGYS I 
SFLKQLIAGKLQESVPDFELIDLIYCGR 
KLLDDQTLDFYGIQPGSTVHVLRKSWPE 
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NO: of 
peptide 


eth 
od 
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ed end 
nucle- 


C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 


sequence 


sequence 




09/515,1 
26 


location 
correspond 
ing to first 

n minn nriri 

411111111/ ui.111 

residue of 

peptide 

sequence 


otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysinc, LHLeucine, M=Methionine, 
N=Asparagine, P=Proline, 

0=Glu til m inp iR— Aroininp 5i=.Sprinp 

V >U Ill V, lx f\t gill MIC, VJ Owl lilt, 

T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X— Unknown, *— Stop codon,/ = possible 
nucleotide deletion, \=possibJe 
nucleotide insertion 














PDQKPEPVBKEAAMRD 


13481 


27382 


A 


13626 


980 


1089 


KRIRIQLTGGLYPVPPPHPPPQSPPIFP 
RPTSPTRT 


1 j'toZ 


Z / JOJ 




1 JOZ/ 


1 16 

1 LO 


Do / 


VCGELRADSWPVPSOPEOASGPOKOAFI 
WPEAPSRSARLPITYTDYWDSRLQTQES 
QMLGSMARKKPRNTSRLPLALNPLKSKD 
VLAVLAERNEAIVPVGAWVEPASPGSSE 
I PAYTSAYLI EEELKEOLRKKOEALKH F 
QKQVKYRVNQQ I TLRKQ 


13483 


27384 


A 


13628 


150 


484 


VAFP QAS RGVRKTE VGEQQGQGTACGGC 
GHQ CPP PTWH IQRAHCVPSTCGAGGWHL 
QGRAS CRRPPSQS PQVYQTQVGRQDPHF 
GVGRDSRGELAWS S F IHP YLLS VCNPL 


13484 


27385 


A 


13629 


330 


397 


ARCEWLTPVI PALWEAEAGGSH 


1 Old J 


Z IjoO 






10 


i j / 


S FTGAV I L I I AHGLTS S LL F CLANSNYE 
RTHNLFFLNDFFFS 


13486 


27387 


A 


13631 


380 


98 


FQWLVS FT ATLW FE ERHR KNK I S RE ERR 
AKGERGDRKEERREEGGEERGRRGGERS 
DKREPKKKSKEESNHPKWKFSFICSFC 
FLTPFFPVFF 


13487 


27388 


A 


13632 


56 


827 


PLFEAFTACGFVHDCGLLIHPEETCGLQ 
PISSDYIEAILQSELKRCPSGDMKGQWI 
VPCLSCSDNRTCDWREITOQPHNCQYGV 
LTKPQLQQCLGGRKILFIGDSTNRGIMY 

VT .T TTPT .KFPTT AT^ Ta7 O V~\ TT-Tn TVI? VT-TTCn HydC 
X LtJL CfXvJLuMCi X XJVifW yiVviiVj 1 J\.r IrXlNVlNVjlj 

KTL I S YS YYPQFWI SPSLRPTFENALEH 
LLQRSRPLENTGQTVLWGGVQWLNSNH 
LQI IHKVLKSPFTTLNQPVTKSCLQAI Y 
FPRLSPTLHSNCLDLVYSFTKSFNIYFV 
VQFLN 


13488 


27389 


A 


13634 


3 


2718 


SGPCRTTVAPLLRAAPVEHCVAALRPTD 
S TMLKKFDKKDE E SG GG SN P FQHLE KS A 
VLQEARVFNETP INPRKCAH ILTKI LYL 
I NQGEHLGTTEATE AFFAMTKL FQSNDP 
TLRRMCYLTIKEMSCIAEDVI I VTSSLT 
KDMTGKEDNYRGPAVRALCQITDSTMLQ 
AIERYMKQAIVDKVPSVSSSALVSSLHL 
LKCSFDWKROTNEAQEAASSDNIMVQY 
HALGLL YH VRKNDRLAVNKM I S KVTRHG 
LKSPFAYCMMIRVASKQLEEEDGSRDSP 
LFDFIESCLRNKHEMWYEAASAIVNLP 
GCSAKELAPAVSVLQLFCSSPKAALRYA 
AVRTLNKVAMKHPSAVTACNLDLENLVT 
DSNRSIATLAITTLLKTGSESSIDRLMK 
QISSF MSEISDEFKVVVVQAI SAJbCQKY 
PRKHAVLMNFLFTMLREEGGFEYKRAIV 
DCI I S I IEENSESKETGLSHLCEFIEDC 
EFTVIiATRILHLLGQEGPKTTNPSKYIR 
FIYKRWIiEHEEVRAGAVSALAKFGAQN 
EEMLPS I LVLLKRCVMDDDNEVRDRATF 














YLNVLEQKQKALNAGYILNGLTVSIPGL 
ERALQQYTLEPSEKPFDLKSVPLATAPM 
AEQRTESTPITAVKQPEKVAATRQEIFQ 
EQLAAVPEFRGLGPLFKSSPEPVALTES 
E TEYV I R CTKHT FTNHMV FQ FDCTNTLN 
DQTLENVTVQME PTEAYEVLC YVPARS L 
PYNQPGTCYTLVALPKEDPTAVACTFSC 
MMKFTVKDCDPTTGETDDEGYEDEYVLE 
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nucleotide 
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correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Asparlic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=lso!eucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














DLE VTVADH IQKVMKLNFEAAWDEVGDE 
FEKEETFTLSTIKTLEEAVGNIVKFLGM 
HPCBRSDKVPDNKNTHTLLLAGVFRGGH 
DILVRSRLLLLDTVTMQVTARSLEELPV 
DIILASVG 


13489 


27390 


A 


13635 


118 


629 


LWALQLHPEPELPSRRGTGAAGVWTGVA 
MIRFILI QNRAGKTRLAKWYMQ FDDDEK 
QKLIEEVHAWTVRDAKHTNFVEFRNFK 
1 1 YRRYAGLYFCI CVDVNDNNLAYLEAI 
HNFVEVLNE YFHNVCELDLVFN F YKV YT 
WDEMFLAGEIRETSQTKVLKQLLMLQS 
LE 


13490 


27391 


A 


13636 


57 


221 


LHHCTPPWAEVEETLKRLQSQKGVQGI I 
WNTEGGWEPLGHCGDRSRPPAQGCP 


13491 


27392 


A 


13639 


344 


544 


LSGGHAGALSS L VJVHL YCLL S S QQ LLGN 
VLVTVLAIHFGKEFTPEVQASWQKMVTG 
VCSALCFRYH 


13492 


27393 


A 


13640 


319 


623 


DMEEASEGGGNDRVRNLQSEVEGVKNIM 
TQOTERILARGENLEHLRNKTEDLEATS 
EHFKTTS QKVARKFWW KNVKM I VL, I CV I 
VFI I ILFI VLFATGAFS 


13493 


27394 


A 


13641 


2099 


769 

i 


TRLAGRVSVASRPCRGPAVGGLLVERSK 
ARRPLLESRVAMAAVPELLQQQEEDRSK 
LRSVSVDLNVDPSLQIDI PDALSERDKV 
KFTVHTKTTLPTFQS PEFSVTRQHEDFV 
WLHDTLIETTDYAGLI IPPAPTKPDFDG 
PREKMQKLGEGEGSMTKEEFAKMKQELE 
AE YLAVFKKTVS SHE VFLQRLS SHPVLS 
KDRNFHVFLEYDQDLSVRRKNTKEMFGG 
F FKS WKS ADE VL FTG V KE VDD F FEQE K 
NFL INYYNR I KDS CVKADKMTRS H KNVA 
DDYIHTAACLHSLALEEPTVI KKYLLKV 
AELFEKLRKVEGRVSSDEDLKLTELLRY 
YMLNI EAAKDLLYRRTKALIDYENSNKA 
LDKARLKSKDVKLAEAHQQECCQKFEQL 
SESAKEELINFKRKRVAAFRKNLIEMSE 
LEIKHARNNVSLLQSCIDLFKNN 


13494 


27395 


A 


13642 


210 


772 


S VKMVRYSLDPENPTKS CKSRGSNLRVH 
FKNTRETAQAI KGMH I RKATKYLKDVTL 
QKQCVPFRRYNGGVGRCAQAKQWGWTQG 
RWPKKSAEFLLHMLKNAESNAELKGLDV 
DSLVIEHIQVNKAPKMRRRTYRAHGRIN 
PYMSSPCHIEMILTEKEQIVPKPEEEVA 
QKKKI SQKKLKKQKLMARE 


13495 


27396 


A 


13643 


168 


2172 


SPLCEVSVPSFCFRVFCKKHKKHKSDKH 
LYEEYVEKPLKLVLKVGGNEVTELSTGS 
SGHDSSLFEDKNDHDKHKDRKRKKRKKG 
EKQIPGEEKGRKRRRVKEDKKKRDRDRV 
ENEAEKDLQCHAPVRLDLPPEKPLTSSL 
AKQEEVEQTPLQEALNQLMRQLQRKDPS 
AFFSFPVTDFIAPGYSMIIKHPMDFSTM 
KEKIKNNDYQSIEELKDNFKLMCTNAMI 
YNKPETIYYKAAKKLLHSGMKILSQERI 
QSLKQS IDFMADLQKTRKQKDGTDTSQS 
GEDGGCWQREREDSGDAEAHAFKS PSKE 
NKK30DKDMLEDKFKSNNLEREQEQLDRI 
VKESGGKLTRRLVNSQCEFERRKPDGTT 
TLGLLHPVDPIVGEPGYCPVRLGMTTGR 



1365 



WO 01/64835 



PCT/US01/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
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X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LQ S G VNTLQG F KE D KRNKVTP VL YLN YG 
PYSSYAPHYDSTFANISKDDSDLIYSTY 
GEDSDLPSDFS IHEFLATCQDYPYVMAD 
SLLDVLTKGGHSRTLQEMEMSLPEDEGH 
TRTLDTAKEMEQI TEVEPPGRLDSSTQD 
RLIALKAVTNFGVPVEVFDSEEAEIFQK 
KLDETTRLLRELQEAQNBRLSTRPPPNM 
ICLLGPSYREMHLAEQVTNNLKELAQQV 
TPGD I VST YG VRKAMG I S I P S P VMENNF 
VDLTEDTEEPKKTDVAECGPGGS 


13496 


27397 


A 


13644 


1032 


207 


PAD VT P KPATEAVQ S EHS DAS PMS I N E V 
ILSASGACKLIDSLHSYCFSSRQNKSQV 
CCLREQVEKKNGELKSLRQRVSRSDSQV 
RKLQEKLDELRRVSVPYPSSLLSPSREP 
PKMNPWEPLSWMLGTWLSDPPGAGTYP 
TLQP FQYLEEVHI SHVGQPMLNFS FNS F 
HPDTRKPMHRECGFIRLKPDTNKVAFVS 
AQNTGWEVEEGEVNGQELCIASHSIAR 
ISFAKEPHVEQITRKFRLNSEGKLEQTV 
SMATTTQPMTQHLHVTYKKVTP 


13497 


27398 


A 


13646 


148 


380 


RGSWREVPESASLPSRGAKGKWRGLCCC 
CCCCCCCCCCCCYHCHQEQGDLKHQADL 
WRSGRTQNQ AG I WQEHQTLEG 


13498 


27399 


A 


13647 


2099 


769 


TRLAGRVSVASRPCRGFAVGGLLVERSK 
ARRPLLESRVAMAAVPELLQQQEEDRSK 
LRSVSVDLNVDPSLQIDIPDALSERDKV 
KFTVHTKTTLPTFQSPEFSVTRQHEDFV 
WLHDTL I ETTDYAGLI I PPAPTKPDFDG 
PREKMQKLGEGEGSMTKEEFAKMKQELE 
AE YLAVFKKTVSSHEVFLQRLS SHPVLS 
KDRWFHVFLEYDQDLSVRRKNTKEMFGG 
FFKSWKSADEVLFTGVKEVDDFFEQEK 
NFLINYYNRI KDSCVKADKMTRSHKNVA 
DD Y I HTAACLHS LALEEPTVI KKYLLKV 
AELFEKLRKVEGRVSSDEDLKLTELLRY 
YMLNI EAAKDLLYRRTKALIDYENSNKA 
LDKARLKSKDVKLAEAHQQECCQKFEQL 
SESAKEELINFKRKRVAAFRKNLIEMSE 
LEIKHARNNVSLLQSCIDLFKNN 


13499 


27400 


A 


13648 


1 


1206 


MSTSQSPCES I CDYVTSHDKSNFTDMI K 
LNILRCEVILDYPAPGGGSLGAKHCCSC 
YT VS SGVTEGERNAGEKGVKLNADGAR I 
RGTPGRGRRAEAEASS PAPAAVAAACW 
AAAAASRQLASGNRTRVSSGVPAPAFLG 
TMNPNCARCGKIVYPTEKVNCLDKFWHK 
ACFHCETCKMTLNMKNYKGYEKKPYCNA 
HYPKQS FTMVADTPENLRLKQQS ELQSQ 
VRYKEEFEKNKGKGFSWADTPELQRIK 
KTQDQISNIKYHEEFEKSRMGPSGGEGM 
EPERRDSQDGSSYRRPLEQQQPHHIPTS 
APVYQQPQQQPVAQS YGGYKEPAAPVS I 
QRSAPGGGGKRYRAVYDYS AADEDE VS F 
QDGDTIVNVQQIDDGWMYGTVERTGDTG 
MLPANYVEAI 


13500 


27401 


A 


13649 


3 


394 


GDGGGHLGSGRNGGGSMNAPPAFESFLL 
FEGBKITINKDTKVPNACLFTMNKEDHT 
LGNI IKSQLLKDPQVLFAGYKVPHPLEH 
RII IRVQTTPDYSPQEAFTNAITDLISE 
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N=Asparagine, PHProline, 
Q— Glutamine, R— Arginine, S = Serine, 
T=Threonine, V=Valine, 
VV=Tryptophan, V=Tyrosine, 
X=Unkno\vn, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
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LSLLEERFRVRAGPGGAD 


1 JJU I 


77409 




UUJU 


j 


471 
** / 1 


SRPTGLREAAGSGPREAPRRSGCKSPGL 
GTVGMLRPKALTQVLSQANTGGVQSTLL 
LNKEGSLLAYSGYGDTDARVTAAIASNI 
WAAYDRNGNQAFNEDNLKFILMDCMEGR 
VAI TRVANLLLCMYAKETVGFGMLKAKA 
QALVQYLEEPLTQVAAS 


13502 


27403 


A 


13651 


1334 


82 


CYTGGTQSLWPGSSCASSVARPSSLFRS 
AWSCEWSVRCARACTMSVPAFIDISEED 
QAAELRAYLKSKGAEISEENSEGGLHVD 
LAQI IEACDVCLKEDDKDVESVMNSWS 
LLL I LE PDKQEALI ESLCEKLVKFREGE 
RPSLRLQLLSNLFHGMDKNTPVRYTVYC 
SLIKVAASCGAIQYIPTELDQVRKWISD 
WNLTTEKKHTLLRLLYE7ALVDCKKSD7AA 
SKVMVELLGSYTEDNASQARVDAHRCIV 
RALKDPNAFLFDHLLTLKPVKFLEGEL I 
HDLLTIFVSAKLASYVKFYQNNKDFIDS 
LGLLHEQNMAKMRLLTFKGMAVENKEIS 
FDTMQQELQIGADDVEAFVIDAVRTKMV 
YCKIDQTQRKVWSHSTHRTFGKQQWQQ 
LYDTLNAWKQNLNKVKNSLLSLSDT 


13503 


27404 


A 


13652 


1 


377 


TTASGRSGVKGSTMSAEVPEAASAEEQK 
EMEDKVTSPEKAEEAKLKARYPHLGQKP 
GGSDFLRKRLQKGQKYFDSGDYNMAKAK 
MKNKQLPTAAPDKTEVTGDHI PTPQDLP 
QRKPSLVASKLAG 


13504 


27405 


A 


13653 


424 


596 


SLKNI YGLSCRKKKKGAVKKI ILVQAWW 
LMPVI TVXitSTEAE VGGLLEARGLR PTR7AT 
W 


13505 


27406 


A 


13654 


190 


448 


LRS YPAPHLGS PELRI RKGRGHSHCLAG 
AAGPQRTALCGLSAPLCPPSPTPPGAGA 
PRYCSGSDAPPCLLRGAGPPIPGMGDPE 
TS 


13506 


27407 


A 


13657 


1612 


563 


SMPGWRLLTQVGAQVLGRLGDGLGAALG 
PGNRTH I WLFVRGLHGKSGTVIWDEHLSE 
ENVPFIKQLVSDEDKAQLASKLCPLKDE 
P WP IHPWE PGS FR VGLI ALKLGMMPLWT 
KDGQKHWTLLQVQDCHVLKYTSKENCN 
GKMATLSVGGKTVSRFRKATSILEFYRE 
LGL.PPKQTVKI FN I TDNAAI KPGTPLYA 
AHFRPGQ YVDVTAKT IGKGFQGVMKRWG 
FKGQPATHGQTKTHRRPGAVATGDIGRV 
WPGTKMPGKMGNIYRTEYGLKVWRINTK 
HNI I YVNGS VPGHKNCLVKVKDS KLPAY 
KDLG KNL P FPT Y F PDGDEE E L P EDLYDE 
NVCQPGAPS ITFA 


13507 


27408 


A 


13658 


128 


2626 


NSHRWVYVRARRWRRRGKQREQ PEDRGV 
PMKR7AAMALHS PQ Y I FGDFS PDEFNQFF 
VTPRSSVELPPYSGTVLCGTQAVDKLPD 
GQEYQRIEFGVDEVIEPSDTLPRTPSYS 
ISSTLNPQAPEFILGCTASKITPDGITK 
EASYGS I DCQYPGSALAIjDGS SNVEAEV 
LENDGVSGGLGQRERKKKKKRPPGYYSY 
LKDGGDDS I STEAL VNGHANSAVPNSVS 
AEDAEFMGDMPPSVTPRTCNSPQNSTDS 
VSDIVPDSPFPGALGSDTRTAGQPEGGP 
GADFGQSCFPAEAGRDTLSRTAGAQPCV 
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W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














GTDTTENLGVANGQILESSGEGTATNGV 
ELHTTES I DLDPTKPESAS PPADGTGSA 
SGTLPVSQPKSWASLFHDSKPSSSSPVA 
YVETKYS PPAI S PLVSEKQVEVKEGLVP 
VSED P VAI KIAELLENVTL IHKPVSLQP 
RGLINKGNWCY INATLQALVACPPMYHL 
MKFIPLYSKVQRPCTSTPMIDSFVRLMN 
EFTNMPVPPKPRQALGDKIVRDIRPGAA 
FEPT Y I YRLLT VNKS S LS E KGRQEDAE E 
YLGFIIaMGLHEEMLNLKKLLSPSNEKLT 
I SNGPKNHS VNEEEQEEQGEGSEDEWEQ 
VGPRNKTSVTRQADFVQTP ITGIFGGHI 
RSWYQQSSKESATLQPFFTLQLDIQSD 
KI RTVQDAIiES L VARES VQG YTT KTKQE 
VE I S RR VTLE KL P P VLVLHL KRFVYEKT 
GGCQKLIKNIEYPVDLEISKELLSPGVK 
NKNFKCHRTYRLFAWYHHGNSATGGHY 
TTDVFQ IGLNGWLR IDDQTVKVINQYQV 
VKPTAERTAYLLYYRRVDLL 


13508 


27409 


A 


13659 


42 


382 


TLWLKTIQIYLTISLGL3SDYGLAGFSA 
LGCEQAAVKMLTAF I L»I QGLDLGRSHFQ 
AHSGCWQDS FPCDS R I HGGLLLHVQQES 
PWFQSAERVSCITKCNHRSDTHHLCSIL 
L 


13509 


27410 


A 


13660 


178 


349 


DMGPCYVAQTVLKLLGSSDPPTSASQEA 
G TTCHHAHLLS K P FLM I YRN FS F I QCLE 
T 


13510 


27411 


A 


13661 


268 


2 


RQQKVILSSSGSLMSEMGRTVLPLKSPG 
MNPSSPLLASGGCWKSLACGHIIPFSSF 
ILMWPSPLCVSVSSSLPMRKPLLLHLRS 
TLFQ 


13511 


27412 


A 


13662 


112 


483 


AGVGALRMVQRLTYRRRLSYNTASNKTR 
LSRTPGNRIVYLYTKKVGKAPKSACGVC 
PGRLRGVRAVR P KVLMRLS KT KKHVS RA 
YGGSMCAKCVRDRIKRAFLIEEQKIWK 
VLKAQAQSQKAK 


13512 


27413 


A 


13663 


2 


873 


S VE E FDRG C TGRGCGADARAGAAMVK I S 
FQPAVAGI XGDKADKASASAPAPAS ATE 
ILLTPAREEQPPQHRSKRGSSVGGVCYL 
SMGMWLLMGLVFASVYIYRYFFLAQLA 
RDNFFRCGVLYEDSLSSQVRTQMELEED 
VKIYLDENYERINVPVPQFGGGDPADII 
HDFQRGLTAYHD I S LDKCYV I ELWTTI V 
LPPRNFWELLMNVKRGTYLPQTY I IQEE 
MWTEHVSDKEALGSFIYHLCNGKDTYR 
1.RRRATRRRINKRGAKNCNAIRHFENTF 
WETLICGW 


13513 


27414 


A 


13664 


118 


3 


AWSLI PVI PWREAKAGGLLEPRSLRPT 
WATWQDPVST 


13514 


27415 


A 


13665 


1 


2876 


IRQRINFSRKNKWDSRKLDQAGVSELAT 
NQKL I LVCGR YEG I DERVI QTE I DEE WS 
IGDYVLSGGELPAMTLIDSVSRFIPGVL 
GHEASATEDSFAEGLLDCPHYTRPEVLE 
GMEVPPVLLSGNHAEIRRWRLKQSLGRT 
WLRRPELLENLALTEEQARLLAEFKTEH 
AQQQHKHDGMAADEAGRTFLRADFNMIE 
EGDR I MVCLSGGKDS YTMLE I LRNLQQS 
APINFSLVAVNLDQKQPGFPEHVLPEYL 
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E-Glutamic Acid, F=Phenylalanine, 
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K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q == Glutamine, R. = Arginine, S^Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /= possible 
nucleotide deletion, ^possible 
nucleotide insertion 














EKLGVEY KI VEENTYG I VKEKI PEGKTT 
CS LiCSRLRRGI LYRTATELGATKI ALGH 
HR DDI LQTL FLNM F YGGKMKGM P PKLMS 
DDGKHIVIRPLAYCREKDIQRFADAKAF 
PIIPCNLCGSQPNLQRQVIADMLRDWDK 
RYPGRI ETMFSAMQNWPSHLCDTNLFD 
FKGITHGS EWNGGDLAFDREE I PLQPA 
CWQPEEDENQLDELRLNWEPDGPRHRH 
APDARRFCQYLRHQARYLLSWPNLDVA 
VTE PLANGDGLNVLI KREWGFRANTVE 
KTGENQYRVWPNEMPADLHKIRPHHPLN 
RNLDHFFPLDLTNSTFSGGYVHVLKGVL 
SDDLLLKS FQKMG YVRRDSHRLMVTALP 
PACQLVQVALGC FALRLECE I LGEVLAQ 
LGTSVLPAEELLQARRASGDVASCQRLA 
QDSEPPPLPPRGSPAAYRAPLDLYRDLQ 
ED EGSE DAS L YGE PS PG P DS P P AELAYR 
PPLWEQSAKLWGTGGRAWEPPAEELPQA 
SSPPYGALEEGLEPEPSAFSFLSLRREL 
SRPGDLATPESSAAAS PRRIRAEGVPAS 
AYRSV^ EP PG YOGTOT.PVPWRPAHPLLR 
HLS PAACCPLCSPARLP3RPLAACAAWR 
RPAALVATACTDGHSAXiQQPRGPALGQL 
GPRSRALGROLPWCFSLLGRGLGLWIiPC 
PWGSPGHSRGLES PGQPLLEVGLSVYGR 
LQGQQA 


13515 


27416 


A 


13667 


76 


289 


SGTPASPCLEMDPNCSCSPVGSCACAGS 
CKCKECKCTSCKKSCCSCCPVGCAKCAQ 
GCICKGTSDKCSCCA 


13516 


27417 


A 


13669 


2 


667 


GRVDAEQSRLGATERAAAAAMNPEYDYL 
FKLLL I GDSGVGKS CLLLRFADDTYTES 
Y I S TIGVDFKI RT I ELDGKT I KLO I WDT 
AGQERFRTITSSYYRGAHGIIWYDVTD 
QES YAKVKQWLQE IDRYASENVNKLLVG 
NKSDLTTKKWDNTTAKEFADSLGI PFL 
ETSAKNATNVSQAFMTMAAEIKKRMGPG 
AASGGERPNLKIDSTPVKPAGGGCC 


13517 


27418 


A 


13670 


176 


398 


RI LKTQLQENNQPTTTTKNRQKTRTDTL 
PKG I YRRH I SL WKLGTWLTPV I PALCEA 
NAGGRLESRSSRPVWATY 


13518 


27419 


A 


13671 


149 


247 


RKGLALS PRLDCSGAI TVNCSLNLQGLS 
NIPI 


13519 


27420 


A 


13672 


257 


359 


VASLHGWAQWLTPVI PTLWKTKVGEFFE 
PRCSRP 


13520 


27421 


A 


13673 


142 


1 


HLKGSGLAKHQ WLMP V I PALWEAEVGGS 
LEPRSSRP VWATLQDP I SS 


I J JZ 1 


774.99 


A 


I JO / 1 


U17 


1 uo 


MF I ESCCTPKI YTS FVLRQSHSFAQAGV 
QWLDLGSLQPLSLGSSNSPASASCIQVL 
F V I YTSVKLGVGGR LGN KKVMS S NKK FV 
CECRGWI I LP ?S VPSLRFQDGGVJDHGVS 
S AE KALQAGRTAGHRVG VPPLGAPCGGA 
PCTTPPAWRS 


13522 


27423 


A 


13675 


3 


370 


ARALLHPRLHQWQCLCHXjQGTGGCPQQS 
PSSEGRADLLSPALEEGYPPPASGCFIP 
HFLCGCSIFNCLPPIMRHAHKSPDALLE 
TLGPCRAWRLMPVI PTLWEAKVDRLLES 
RSSRPSWPTW 


13523 


27424 


A 


13676 


3 


281 


IGLRPELSVSLGRQRLTAIRLQPLLPAR 
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nucleotide insertion 














TPCG FLRARVALALMADETDLPLLPRLE 
YSGM I TAYCSLDFLGSSDP PTLGSGVAG 
TTENCCED 


13524 


27425 


A 


13677 


272 


389 


YCRQ I EKS KK KRCWLGTVAHACNPSTWG 
GRGGRITRSRD 


13525 


27426 


A 


13678 


261 


363 


NSSWGHARWLTSVI PALWE AE VGGS PE V 
RSLRPV 


13526 


27427 


A 


13679 


149 


44 


S CVMS VS VLKKM YWIjGAVAHACNPSTLG 
GQGGRTA 


13527 


27428 


C 


13680 


118 


267 


MSHCTWLMCVCVCVCVCGVYAYMYTYSF 
LRWDLPMLSRLECSG YSQAQ * * 


13528 


27429 


A 


13682 


16 


346 


NHHLTQPHPHLDINDFFFFFFFFFFFFF 
FFFFFFFFFFFLKKS PRPAGKKKKKKKK 
KKKPPRFFFLGGAPPPLFFFLINFFFFF 
LCKI ISLVYNRRGGPCLSKRFFLKERKS 
PL 


13529 


27430 


A 


13683 


244 


140 


GR VD VFHHVAQ AGL KLLS S GDL PAW P P K 
SLRLQV 


13530 


27431 


A 


13684 


191 


777 


NSDEHVYRCLYGHKLSFLLSKYLKSGLA 
GVLLPLDAAVDMEK I EEQFANLHI VKCS 
LGTKE PTYIiLG I DT S KTVQAG KENLVAV 
LCSNGSIRIYDKERUSTVLREFSGYPGLL 
NGVRFANSCDSVYSACTDGTVKCWDARV 
AREKPVQLFKGYPSNIFISFDINCNDHI 
I CAGTEKVDDDALFGGFGDARMS FLQN 


13531 


27432 


A 


13685 


150 


253 


VGWGLWLVPVIPAFWEAKMGESLDPRNL 
LPCWAT 


13532 


27433 


A 


13686 


500 


321 


IEMGSHYVAQAGLELLSSSNPPTSVSQS 
AGIVGMSHHALAAISKSSDAS ITSHKPM 
TTP 


13533 


27434 


A 


13687 


6 


424 


MSLLQCGGIRCFKMPEPAKSAPAPKKGS 
KKAVTKAQKKDGKKRKRSRKESYSVYVY 
KVLKQVHPDTGISSKAMGIMNSFVNDIF 
ERIAGEASRLAHYNKRSTITSREIQTAV ~ 
RLLLPGEIiAKHAVSEGTKAVTKYTSSK 


13534 


27435 


A 


13688 


178 


293 


TGYS SQAQWLMTVLTATWEAEAGGSLQ P 
RSLRPAWATQ 


13535 


27436 


A 


13689 


663 


144 


KELSAVSAGIPHSCGSQGCGGGSVAACV 
PAAPAAAGLCSGRAQKVPPPPSLAGWPP 
GVNAPPPPVCSSVRLHVCQSDRLWVRIiA 
ARRGILALLRSALKAATLAGCQSVRWSV 
RPSESLRPTSNAASLFRSSVPTVLSKSV 
PLAASLGKRRACGGREHASVAVYLSVCL 
SLPT 


13536 


27437 


A 


13690 


125 


3 


FAHQGHAPGQAWWLRP VI P VLREAELGG 
RPEPRSSRTARA 


13537 


27438 


A 


13691 


136 


24 


GGPPFFFFFFFFFFFFFFFFFFFFFFFF 
FFPGKLQQDFQYYVK 


13538 


27439 


A 


13692 


144 


1 


MLKIVQSGECLTLKFKNFCLLSLCTLFP 
TLIALTTLLLP I S PF I LI I L 


13539 


27440 


A 


13693 


1476 


380 


NSWSTLASELTLWAATSADPERKSQAAS 
AAMWATLPLLCAGAWLLGVPVCGAAELC 
VNSLEKFHFKSWMSKHRKTYSTSEYEHR 
LQTFASNWRKINAHNNGNHTFKMALNQF 
SDMSFAEIKHKYLWSEPQNCSATKSKYL 
RGTGPYPPSVDWRKKGNFVS PVKNQGAC 
GSCWTFSTTGALESAIAIATGKMLSLAE 
QQLVDCAQDFNNHGCOGGLPSQAFEYIIj 
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nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, ]=IsoIeucine, 
K=Lysine, L^Leucine, M=Methionine, 
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W^ryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible , 
nucleotide deletion, \=possib!e 
nucleotide insertion 














YNKGIMGEDTYPYQGKDGYCKFQPGKAI 
GFVKDVANI T I YDEEAMVEAVALYNPVS 
FAFEVTQDFMMYRTGIYSStSCHKTPDK 
VNHAVLAVGYGEKNG I P YW I VKNSWGPQ 
WGMNG YFLIERGKNMCGLAACAS YP I PL 
V 


13540 


27441 


A 


13694 


87 


2 


PGWAWWFTPVILALWETBAGGSPEFKSL 


13541 


27442 


A 


13696 


130 


2 


CNILI I I IQVCWAWWLMPVI PALWHAEP 
GGSPEVESSRPALV 


13542 


27443 


A 


13697 


282 


107 


GLTKCPGTVCRSLKVVIS YQLS 1 1 CLSM 
MEHSGTILAHCSLKLLGSSDPPTSVSRV 
AS 


13543 


27444 


A 


13698 


198 


3 


CS I T VNKVKVMTLFRS S MSLE I VCD I DL 
AVLRRDLGSLQDPPPRFTPLSCLSLPSS 
WDYRRPLV 


13544 


27445 


A 


13699 


2 


120 


ARGDGVSLWLPRLKCNGAI SAHCNLRPT 
GSSDKNFKTRF 


13545 


27446 


A 


13700 


60 


341 


PDMGLEDERKMLTESGDPEEEEEEQEEL 
VDPLTTVREQCEQLQKCVKARERLELCD 
ERVSSRSHTDDDCPEELFDFLHAKDHCV 
ALKLFNNLQ 


13546 


27447 


A 


13701 


6 


540 


KNSRTLGQCGG I RGSLCR PR KPGVGGTQ 
TRAVRPAVCSADSARPHLPLRRADMKDS 
L VLLGRVPAH PDS RCW FLAWN PAGTLLA 
SCGGDRRIRIWGTEGDSWI CKS VLSEGH 
QRTVRKVAWSPCGNYLASAS FDATTCIW 
KKLTLR I YN I LR KLEAC I KP LCCALK YG 
CLEEKQLHS 


13547 


27448 


A 


13702 


239 


346 


AKRGWLMPV I PALWEAKVGKSLEPR I SR 
PLWATWQN 


13548 


27449 


A 


13703 


210 


300 


LMPVI SALWEAKARGLLE PRTLRPAWAT 
W 


13549 


27450 


A 


13704 


141 


440 


PSAFEHFEKINLYFLKFCISQGFPERQN 
NRDRE IHRERYERDRERHRERYERGLRE 
LAHVIVEAEKPHHRPFITWETLGCWWSG 
S VQ VQKP WE PGKLM V 


13550 


27451 


A 


13705 


204 


438 


LLNVLLTQLFFLFLSLRQSCSVAQARGQ 
WYNHGSLQFSTHGASNPPTSASQSVGTT 
GMSHDHGLFLHFSTLDI FSSL 


13551 


27452 


A 


13706 


1707 


1821 


AKAGGSQHLEILANAVKPCLYWECKMAG 
PWWCAPVGG 


13552 


27453 


A 


13707 


167 


334 


IRRANFKILTEIGWTQWLTPVI PALWEA 
EGGGLLEPGSPSLVFLFFILTTILMRKK 


13553 


27454 • 


A 


13708 


224 


344 


KDTAMEE E I KDTE KTGKPWLGAVAHACN 
PSTLGDQGRQIA 


13554 


27455 


A 


13709 


1476 


380 


NSWSTLASELTLWAATSADPERKSQAAS 
AAMWATLPLLCAGAWLLGVPVCGAAELC 
VNSLEKFHFKSWMSKHRKTYSTEEYHHR 
LQTFASNWRKINAHNNGNHTFKMALNQF 
SDMSFABIKHKYLWSEPQNCSATKSNYL 
RGTGP YP PSVDWRKKGNFVS PVKNQGAC 
GSCWTFSTTGALESAIAIATGKMLSLAE 
QQLVDCAQDFNNHGCQGGLPSQAFEYIL 
YNKGIMGEDTYPYQGKDGYCKFQPGKAI 
GFVKDVANIT I YDEEAMVEAVALYNPVS 
FAFEVTQDFMMYRTGI YSSTS CHKTPDK 
VNHAVLAVGYGEKNGIPYWIVKNSWGPQ 
WGMNGYFLIERGKNMCGLAACASYPI PL 
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C=Cysteine, D=Aspartic Acid, 
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G=Glycine, H=Histidine, I=lso!eucine, 
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T=Threonine, V= Valine, 
W=Tryptophan, Y=Tyrosine, 
X^Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














V 


13555 


27456 


A 


13710 


763 


257 


YEKILKLTADAKFESGDVKATVAVLSFI 
LSSAAKHSVDGESLSSELQQLGLPKEHA 
ASLCRCYEEKQSPLQKHLRVCSLRMNRL 
AGVGWRVD YTLS S S LLQIRGKS P WCHCG 
WKVATCPQGPQAPACLPCPFS SKTKFPR 
S SLARTESRPKTLMKLPWAEEKGVPRPV 


13556 


27457 


A 


13711 


133 


401 


VLSKSGNPYWGLSTNHEIKGQEGCAWQL 
T P V I PAL W E AE AGGS L E PRS LR PAW ATS 
WLPRE I KXiDLL KNGCGPS K I TQGGPMVA 
GSLKL 


13557 


27458 


A 


13712 


212 


74 


RRINLAPPRVFLKGPPFFFFFFFFFFFF 
FFAWGNPLFLFNSTFGRT 


13558 


27459 


A 


13713 


324 


141 


DRVLPRLECSGVITARYRLNLLGSRDPP 
TSASLIAGSTGTCHHAWLSFSYLFTYVF. 
FIIL 


13559 


27460 


A 


13714 


299 


431 


I RL I WLS AVDHACNS STIiGGHGVQMTKG 
QDFETNIiANTANPHPY 


13560 


27461 


A 


13715 


133 


1 


I LVWAWWDMPAI PVLWEAETEGLLEPRS 
LRPAWAT WKNT I S TKN 


13561 


27462 


A 


13716 


18 


165 


I PLHTHTHTHTKTHTHGYLLALSGTEFL 
SVLYMASERECRTYNLMLNK 


13562 


27463 


A 


13717 


119 


1 


KPNVSQAQWLAPVI PALWEARVGVLFKP 
RSLRPTWATQQ 


13563 


27464 


A 


13718 


271 


64 


VLRS I STLNEILAKTRNPPLQI LMKPRK 
RWGTVAHLRAPVGDQPDQHSETPSLLKI 
TKISWVWWRTPVI 


13564 


27465 


A 


13719 


21 


209 


KDGPKGTGGSETNESVEMTECCSVTQAG 
VQWHDLGSLQPPPPGFKRFSCLGLiPSSW 
DYSRND 


13565 


27466 


A 


13720 


391 


151 


RGFFLKGEKKKNFFPIILGKKILGSPGK 
KGEKKKKKRGKI FFFLKNPLGFFPKKKV 
LGEKKNPYSGVWGGKKKRGPPPHP 


13566 


27467 


A 


13721 


109 


2 


RRLGWAHACNPSTLGGEVGGSQGPGNC 
GQVTKVS 


13567 


27468 


A 


13722 


69 


369 


RDILQADGAHSLFPTTPSLNAYIIFSSP 
I GPHTHRP YHAT PT YLP FYTNNL I KKKK 
KFCKKKKKKKKKKKKKKKKKKKKKGGGPF 
KKTPGGAPI FGGGGK 


13568 


27469 


A 


13723 


58 


1208 


FWNENSPASELAPNGGGSVTSVPRLEDY 
LTRPQLYKLTGVEGPSRANSRDS FHDRK 
TYKTPSANMMVLKVEELVTGKKNGNGEA 
GEFLPEDFRDGEYEAAVTLEKQEDLKTL 
LAHPVTLGEQQWKSEKQREAELKKKKLE 
QRSKLENLEDLEI I IQLKKRKKYRKTKV 
PWKEPE PE I ITEPVDVPTFLKAALENK 
LPWEKFLSDKNNPDVCDEYKRTALHRA 
CLEGHLAIVEKLMEAGAQIEFRDMLEST 
AIHWAS RGGNLDVLKLLLNKGAKI SARD 
KLLSTALHVAVRTGH YE CAEKL I ACEAD 
LNAKDREGDTPLHDAVRLNRYKMIRLLI 
M YGADLNI KNCAGKTPMDLVLHWQNGTK 
AI FDSLRENS YKTSR IATF 


13569 


27470 


A 


13724 


400 


135 


KWNGKBLNGVDG I GWDWNEMYWKGLEFH 
ETEWNELERNGLEWNGMSWDGIKCNGIK 
WNGI ECSGMLWNGMECIGMECPLMEW I R 
VEWN 


13570 


27471 


A 


13726 


186 


344 


SASLGLWRCRDCRRSLVHSVLNVAQAGV 
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E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Iso)eucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 














QWRDLSSLQPPPPSRLPWPPKVIiR 


13571 


27472 


A 


13727 


380 


168 


APPQKKVFSSKFPQGFFFAPPNRKKLFF 
SPPRKIVPPPKFFLWAPPLFFFFFFFFF 
FFFFFFWYSRTDYK 


13572 


27473 


A 


13728 


324 


219 


RIMFFIFFYFFFFFFFFVFFLFFFFFKR 
PPPFFY 


13573 


27474 


A 


13729 


2 


270 


RLQEASLSPVPE I LGGALPSAKRTSYLL 
LSTLCLLLSETASRGNSLTGLGHRSDHY 
NSVRSGGQSLYSACPIVTTIHGTCYGGK 
AKCCK 


13574 


27475 


A 


13730 


134 


3 


AVHRCKKWGRGWGRWLTPVI PTLWEAEV 
GGPLEPRSLRPAliAT 


13575 


27476 


A 


13731 


429 


641 


RKKAVCFMNDLI CFLDNTFKNNVLSQAW 
WCVHLVPTIWEAEAGGSLEPRSLKLQCP 
WAPVNN CTPAWAT 


13576 


27477 


A 


13732 


188 


32 


I FFFFLFFYFLFFFFFFFPRSRHCTLQP 
GQQSETRSQNKKQKTKNKQTKNLP 


13577 


27478 


A 


13733 


341 


3 


GTVTPEEPKKMGDQTPRGLPRPERTLGR 
liSTSESRVDPTPRVGGGSSGGRTAPWGP 
P RERGRVAGS S LHRACNFS TSNT I ISVG 
LWLTPVI PTLGEAEAGGSPEVRSSTHAS 


13578 


27479 


A 


13734 


208 


2 


KLYKNFFFRFTKNPPPLLKKGFYLYNFF 
FKKKKKIVKKKKKKKKKKKKQKKKKKKK 
GRPLAIKNYPRV 


13579 


27480 


A 


13735 


16 


212 


EGVS LLLPRLECRGT I S AHCNLRHRGRR 
CTPASASQILRRLKQENHLNLGGGGYRD 
PIKLQRTAS 


13580 


27481 


A 


13736 


22 


214 


PDRPALPTRPGGW ITWGREVRDQPGQCG 
ETPSLLKIEKLAGCDGTCLEQNCLQNNN 
SNNKVGIR 


13581 


27482 


A 


13737 


386 


51 


WGNS ITQQALFEHLCFSGHTSLPAVPCP 
HQACFPTLPLICRGIIIAHCNLGLLGSS 
DS P I SAS P I DGNTGLHPHARL FS FLL WY 
RWGLTMMPKRSLS PGLKPSSRLADAWV 


13582 


27483 


A 


13738 


204 


431 


SQHFGISRRADHLSSNV3DQPEQHGKNS 
LYKYKKINQGWAWWLTPVIPPLWEGQVG 
GS PEGKELKTPLRNLGKPHL 


13583 


27484 


A 


13739 


69 


159 


I KKEALSWAWWLTPVT PALWKANAGGSP 
E 


13584 


27485 


A 


13740 


32 


285 


FFLS FFLFFFFFFGGGGKKKTKKFFSQD 
FLGPPKILGGGGFFSFFFGGPFFLKKGK 
KSPQNLFFLKKNPPPFFFPPPKKPLWGF 


13585 


27486 


A 


13741 


372 


213 


YRSCVLLQQNPPLFYQSWWNFSLCVCVC 
VCVCVCVCETGSLKKFLLTQKKKR 


13586 


27487 


A 


13742 


29 


237 


' T KRKG YKTNKG YWAWAW WLT P V I PALFI 
AEGNLILQMQKQAQPEAGCGCVCVCVCA 
CMCVCVCVFKVFF 


13587 


27488 


A 


13743 


322 


183 


HTHTHTHTHTHTHTHTNPIHVFWLUjKY 
PKIQSHQFHISAFQDPLL 


13588 


27489 


A 


13744 


305 


373 


VGWGW WLTPVI PALWEAKVGGS 


13589 


27490 


A 


13745 


172 


1 


KNKIEEQMKWEMRQEAVAHSCNPSTLGG 
QGEFETSLVNMMKPCLYKKMQKISQVCS 
C 


13590 


27491 


A 


13746 


285 


1045 


FQHPFGLSQSEMAAVKA3TSKATRPWYS 
HPVYARYWQHYHQAMAWMQSKHNAYRKA 
VESCFNLPWYLPSALLPQSSYDNEAAYP 
QSFYDHHVAWQDYPCSSSHFRRSGQHPR 
YSSRI QAS TKEDQALS KEEE METE S DAE 
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C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine, 
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T=Tbreonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














VE CDLSNME I TE ELRQY FAETERHREER 
RRQQQLDAERLDSYVNADHDLYCNTRRS 
VEAP T ER PG E RRQAEM KRL YGD S AAK I Q 
AMEAAVQLSFDKHCDRKQPKYWPVIPLK 
F 


13591 


27492 


A 


13747 


2 


305 


GRVGSFSVRDVELSDPARERGEMPVAVG 
PYGQSQPSCFDRVKMGFVMGCAVGMAAG 
ALFGTFS CLR I GMRGRELMGG I GKTMMQ 
SGGTFGTFMAIGMGIRC 


13592 


27493 


A 


13748 


2 


305 


GRVGS FS VRDVELSDPARERGEMPVAVG 
PYGQSQPSCFDRVKMGFVMGCAVGMAAG 
AL FGT FS CLR IGMRGRELMGG I GKTMMQ 
SGGTFGTFMAIGMGIRC 


13593 


27494 


A 


13750 


238 


423 


AVSWDQLTWGTGVQEKKIQAQAWGLMPV 
IPTLWEGEVGGSPEVKSSRPACPHGKTP 
FLLKMQ 


13594 


27495 


A 


13751 


455 


248 


ISVGPGI FQLNFCFLEPYGTFLSEKKFF 
IMETGSRFVIQAGVQWGSYSSPQPQFFG 
FESSLKPPHSGVK 


13595 


27496 


A 


13752 


2 


98 


I PTPT I HTHTESHTHTHTHTHTS FLYMP 
PDLK 


13596 


27497 


A 


13753 


196 


2 


I WAPPKI FLKKPPLFFFFFFFFFFFFFF 
FFFFPFFIISFI PWPLTRKQKLERS WVF 
FKDSACSA 


13597 


27498 


A 


13754 


917 


379 


KYKKCSLQKNLLLVGCKKYSLCYRRHKL 
HLVTHGE RKKPAAVNS FFLFFS FFFFLN 
LVRNTEITKKRVNLHETKADAESCNQDC 
TTSSETASELEQIRSGKHNSGKWAGEGA 
AGGRREGERMEWTEMRRGARGRRGGRER 
EMERARVRGGEEREKEIDLYKKVTSKIE 
ETKLGNLLKLG 


13598 


27499 


A 


13755 


175 


21 


KKKNFFFPPRVILGPPKVFFKRAPLFFF 
FFFFFFFFFFFFFLXKSWRPLAI 


13599 


27500 


A 


13757 


234 


341 


EIGW VQWLMpvi PAVWEAEVGGIiLEPRS 
LRAARAI 


13600 


27501 


A 


13758 


1 


365 


PAPNRRGGHIQDRATNSTELGANQCFFF 
S PRP PS LE KKT PE I NKE PR PAP QPSNPG 
NLGTREGGDSWAGTTRCLRRTDEGDTYR 
TEPPTALSWGQTRAFFFPPALPAGKKRH 
RNLLKTQFF 


13601 


27502 


A 


13759 


87 


181 


SHTHTHTHTHTHTHTF YLHWIHFDME I 
LGL 


13602 


27503 


A 


13760 


1 


228 


ARGERERERERERERERERERERERERE 
RYREGGRLIiLTMGEGETERATDLYHTPP 
PS I KAWRLCEPRPRAGI LCGRNI F 


13603 


27504 


A 


13761 


383 


197 


RCBPPAWVSQSAR ITGVSYRAQPATSVL 
MGEEDFLEDPVIAFVRLAPAVLLSKLSE 
VSVAMT 


13604 


27505 


A 


13762 


276 


57 


YAVLGGGGGRKNLFVFVFFLFFFFFFFF 
FKKQGFGLMPLGEDRGPNMGFFYFPMVG 
DKTKL P FKKKKKKR PR 


13605 


27506 


A 


13763 


377 


209 


PRPAYGPAFLFTEGFSWNPFFFFQMESR 
SVAQAGVQWCYLGSLQPPPPWFPGTLLN 


13606 


27507 


A 


13764 


160 


202 


MEKYNVHPHSGILHSHEKEQAALFTIAK 
RWRQPSYPSIDEWKNIMSIHTVEYYTAM 
KRNKQLYSQ 


13607 


27508 


A 


13765 


390 


284 


ESGGHFLSLSLSIYIYIHIYIYVCVYIY 
THTSHTI 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenylaIanine, 
G=Glycine, H=Histidine, I=Iso leu cine, 
K=Lysine, L=Leucine, ^Methionine, 
N= Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possib!e 
nucleotide deletion, ^possible 
nucleotide insertion 


13608 


27509 


A 


13766 


350 


41 


YWILQENRNTQNRKLLPKSTELCMLLQ 
MLLFILNTTNYPLPLWQCYHRFCSFFLK 
TGS FYVTQAKVRWLFTGM I TVHCSLKLL 
DSSHPPKWLVLQELATVAS 


13609 


27510 


A 


13767 


330 


156 


KITQAWWCVHWPDNREAEVTGVIRLGR 
MRLPLEGACTNCVPSWEKEQDPVSHNEK 
K 


13610 


27511 


A 


13768 


25 


292 


KCFFLSWRGGSRLQSQHFGRPRRADHLR 
PGVPDQPGQHGFLLVKMSDKPDLSEVEK 
FDRSKLKKTNTEEKNTLPSKEIFFSLVG 
VNIQD 


13611 


27512 


A 


13769 


585 


680 


KCLGSRTRWLTPVI PTLWEAEAGGSLRP 
RSS 


13612 


27513 


A 


13770 


488 


359 


PSPRERKFLPFFKKKKGKGPPPPPKKNR 
GRGPQQKEGPQKPKP 


13613 


27514 


A 


13771 


794 


515 


PDMGLEDEQKMLTESGDPEEEEEEEEEL 
VDPLTTVREQCEQLEKCVKARERLELCD 
ERVSSRSHTEEDCTEELFDFLHARDHCV 
AHKLFNNLK 


13614 


27515 


A 


13772 


130 


397 


WGLTPLCFCMRASLPRHSKPQARKTKK 
KKKKKKKKKKKKKGGSLKKKI FGGAKKS 
GATKKKNFPKKRGQNKKNPGFFKNEIFF 
GGGAI 


13615 


27516 


A 


13773 


376 


145 


TRVGGGCSEQRLCHCTPSWATEPNPVSN 
NNKIPLELFPSNQTFFHFVRNWKKRGNN 
WLSNYSISSTVTRHFPTLFQF 


13616 


27517 


A 


13774 


1 


247 


GLQSLADLLSGLLQKVCLPLSNGIMDLY 
LL Y Y L FT FETG S CS VAQAR VQW C EHG S L 
QPHTSASSDPPTSATTTCHHARLX IW 


13617 


27518 


A 


13775 


376 


145 


TRVGGGCSEQRLCHCTPSWATEPNPVSN 
NNKI PLELFPSNQTFFHFVRNWKKRGNN 
WLSNYSISSTVTRHFPTLFQF 


13618 


27519 


A 


13776 


107 


468 


EKARSPERARDQEGGTERDRNSEREKIL 
P K LR E EL P WVSGG WRCPWP HQGWS HWE D 
NPGWGIPTGPSVGWGEKKGPGEGRSHKY 
GTGRTKCELGVS I GNSAFTFLLHF YFKH 
RKRERKQI 


13619 


27520 


A 


13777 


125 


2 


NIFFPLFELYYFKCHWARWLTPVIPALW 
EAKEGGSPEVRSS 


13620 


27521 


A 


13778 


166 


74 


GR I CYSHSLSKHTHTHTHTHTHTSHIS F 
IP 


13621 


27522 


A 


13779 


] 


341 


ARGERERERERERERERERERERERERE 
RERERERERERERERERERERERERERE 
RERERLS FSLGGGGALKKKR I FLCVTLE 
GKHLPPLTHTGFFGENTHTLSMKTREGG 
FICAGGDSRRSASVMRAYIYRECSPHTH 
V 


13622 


27523 


A 


13780 


'614 


337 


RRCSSALCYRRHGNHKVKIRSKQGASVS 
PHEKLRILSLLEVRNVGLGHS I FPHSLR 
MYVCMYVCMYVCISIYLSIYLSIYLSIY 
LSISISSR 


13623 


27524 


A 


13781 


390 


158 


VLIRLS WGRI CFQAHSGCWQNSGS CSHK 
TEGLCLLLAGSCSWLLG3ALKGWASKEE 
FYTNQHSCQSDTLSPLPYSIH 


13624 


27525 


A 


13782 


308 


163 


TKFYLLFFFFFFFLFIFFFFFFFLKKKF 
L F F I QGGG KI KT PGTLTFLG 


13625 


27526 


A 


13783 


45 


277 


IASGRPFFFFLHLPPFQKAFVFGGGGGT 
PFEYQNF VAYI KGKQGNPFLFGCGDLFN 
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SEQ ID 


SEQ ID 


M 


SEQ ID 


Predicted 


Predict- 


Amino acid sequence (A=Alanine 


NO: of 


NO: of 


eth 


NO: in 


beginning 


ed end 


OCysteine, D=Aspartic Acid, 


nucleotide 


peptide 


od 


USSN 


nucleotide 


nucle- 


E?=Glutamic Acid, F=Phenylalanine, 


sequence 


sequence 




09/515,1 


location 


otide 


G=GIycine, H=Histidine, I-Isoleucine, 








26 


correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R^Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














APQLI KQLSQLGQKYPKKPLL 


13626 


27527 


A 


13784 


413 


218 


PNFPPFLFFSGKFFFFAQNLPFGFFPPP 
GEGKAFPGQGCPVGGGGFAGPTFFFGPP 
AFPEILYFF 


13627 


27528 


A 


13785 


1011 


1368 


I DWENDLTPMVLCHG PS I YNKYS FSQNI 
FP I S ANF FFIAHINLTLLQLVS E LSHKI 
ILRHFY IKPIKDI LNLYL I LPGVWLFDL 
CKSTAICRYLCVCIYIHLYIHTHTHIHI 
YTHIRIi 


13628 


27529 


A 


13786 


161 


425 


RWGSCYCPGYPHLKI FKLLLLLVLVLEM 
G VL PLL PRLE C SGM I T VHCNL E F LGTS Y 
PPISVSQTKKKKRPFTLPPLPNLQIGLP 
LRGE 


13629 


27530 


A 


13787 


70 


311 


DLISITALVISPNKFYIYLSIYLSIYLS 
I YLS 1 1 YL I F I STNIiLWVLS LWSTLNKT 
RLLNLFLTIQNN I WNPKLLVS ALI 


13630 


27531 


A 


13788 


25 


219 


RNMAAATLTSKLYSLLFRRTSTFALTI I 
VGVMFFERAFDQGADAIYDHVNEGKLWK 
HIKHKYENK 


13631 


27532 


A 


13789 


279 


138 


RRSLALS PRWDCGLQWRNLGSLQALLPG 
FTPFSCIiSLPSSWDYRPF 


13632 


27533 


A 


13790 


138 


28 


GGWVRWIiTPVI PALWEARVGGLLEPGSS 
KKERGKKEV 


13633 


27534 


A 


13791 


133 


7 


YSCQRCLTHGQAQWLTTVI SAP WEAKAG 
RLLE PR S VRS AV7AT 


13634 


27535 


A 


13792 


86 


290 


EHVHRTLIEAS VSGWGWWLTPVI SAAWE 
AEVGGSLEGQEFEISLGKCSETPSLTTT 
TTESLGSFAATS 


13635 


27536 


A 


13793 


150 


37 


KI I YPSQARWLMPVI PALCEAEVGRLLE 
PSSSRLAWAV 


13636 


27537 


A 


13794 


216 


77 


PTQHPTEGYKIKQQTVYKECRr-lWANSHL 
CLLPEGLLRAVTPWCHAP 


13637 


27538 


A 


13795 


251 


347 


VARAWGLAPVI PALWEAEVGGSPEGRSL 
KPAL 


13638 


27539 


A 


13797 


177 


274 


VQWHMSWPAIQEAEVGGSPEPRSLRPA 
WATY 


13639 


27540 


A 


13798 


307 


406 


KF KKG WAWWLM P V I ? ALWE AKAG GS PES 
RSLRP 


13640 


27541 


A 


13799 


194 


3 


FFFFFFFFFEMGSHYIVQARVQWLFTDA 
NIVHCSLQLLASSDPPVSTSQVGLQACA 
DDAQNPE 


13641 


27542 


A 


13800 


333 


198 


LPSVFFFFFQTESCSVTQAGVEWCDLGS 
LHS SLGNRARLCLKI K 


13642 


27543 


A 


13801 


1 


335 


ETERGRERETERGRERETERGRERETER 
GRERERERERERERSRDREEGRERQRGE 
GKKNRDREENRRRETERVGDWCVCPHP 
NLI LNCS P Y I SHVLREGLG KT YLNYGGS 
FPTLCSG 


13643 


27544 


A 


13803 


14 


454 


R V F FDR S R YR S RTLGSTHAS ALLG I LV Y 
RSHL I S SLLCLEG I I LSLF I I ATL I TLN 
THSLLAMIVPIAILVFAACEAAVGLALL 
VS KKKKKGAPVLKNPWGAQSLRG QARKY 
FFPYREPNKNLPGNLGKEPFLGGGDILG 
QPPYKN 


13644 


27545 


A 


13804 


1474 


1661 


TDFHNSHLLKCCKNCIFSTLNYY I RREH 












FSIVFIFCYICWKVHYIRRELFRCLYF 
LLYMCC 


13645 


21546 


A 


13805 


360 


■j 


L WG KRGG P FS P P P FF FFLE L F F FLKKMC 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine t L=Leucine, iVNMethionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














GGGFFGGFPWGPPKGEI FFFVFFWAPQK 
KKKKKPPPFIFWGGGPFFFFFFPPPKQK 
KKKN KKKKKKKQKKKKKKKK KRAAR W PA 
RSRSSA 


13646 


27547 


A 


13806 


1 


425 


RLGGVALRSAADGAFVSGEFCGGKLLRW 
CLVTDFPPDSCTTCSYSRRSTPGCSPGG 
SRGLSEGEGSSVSLQRSRVLSAMKHVLN 
LYLLG WLTLLS I FVRVMESLSGLLES P 
SPGTSWTTRSQLANTEPTKGLPDHPSRS 
M 


13647 


27548 


A 


13807 


5 


313 


EKPYLQGITFTSKTTHLIYEFTPYTIIV 
KEYSTNYVLLIGNTQITKIYHIVFRKFS 
MTPI YKFQDLAQWLTPVI PTLWEAKPGG 
LLEPRSLQSSYSLIILIL 


13648 


27549 


A 


13808 


182 


314 


VITQLDKTERAQWLMPVI LVL YE AEVGG 
LLEAMSSRPAWATKTW 


13649 


27550 


A 


13809 


219 


1812 


LPPPESGAMSGFNFGGTGAPTGGFTFGT 
AKTATTTPATGFSFSTSGTGGFNFGAPF 
QPATSTPSTGLFSLATQTPATQTTGFTF 
GTATLASGGTGFFLGIGASKLNLSNTAA 
TPAMANPSGFGLGSSNLTNAI SSTVTSS 
QGTAPTGFV FG P S TTS VAPATTSGG FS F 
TGGSTAQPSGFNIGSAGNSAQPTAPATL 
PLT PAT PAATTAG ATQ PAAPTPTAT I TS 
TGPSLFASIATAPTSSATTGLSLCTPVT 
TAGAPTAGTQGFSLKAPGAASGTSTTTS 
TAATAT ATTTTS S S TTG FALNL KPLAPA 
G I P SNTAAAVTAP PGPGAAAGAAAS SAM 
TYAQLESLINKWSLELEDQERHFLQQAT 
QVNAWDRTLIENG2KITSLHREVEKVKL 
DQKRLDQELDFILSQQKELEDLLSPLEE 
L VKEQ S GTI YLQHADEERE KT YKLAEN I 
DAQLKRMAQDLKDI IEHLNTSGAPADTS 
DPLQQ I CKI LNAHMDSLQW I DQNS ALLQ 
RKVEEVTKVCEGRRKEQERSFRITFD 


13650 


27551 


A 


13810 


134 


263 


KNSLFKKKNKGRPWPLMPVIPALWEAEA 
GGLLEPRGLRPTWVT 


13651 


27552 


A 


13811 


210 


51 


TLSHRKP I S FE AYFARVRGF I LEVS ETR 
NPPIGWTRWLTPVI PALWEAEAGG 


13652 


27553 


A 


13812 


241 


139 


LHBAGLAPM I PALWEVKVGGLLEPRS PR 
PAWASW 


13653 


27554 


A 


13813 


539 


256 


RTTQMSTAAGFTEAPNWKQPRCPWTEEQ 
TNKMWS LHAMECAS AMKWNE VL I QPAVR 
MARENSRRKPGDMHDIWSVWSAGEWLPG 
LGRSTGKGSD 


13654 


27555 


A 


13815 


63 


369 


VRETPLKTHYLEEISSPASPTAIPQSLL 
FSFFISPPSSLATGSGHSGHPVHSLHHP 
PETEPSPSVCLWAGPKVPPGAAGKGSPD 
SNPLVI RSLLAPPPASL 


13655 


27556 


A 


13817 


258. 


3 


AESAPPFSTKSLFFTHTKTHTHTHTHTH 
THSLRTTRWAPSQVCRPQTSQRGGECVG 
PTAFAPSPTLLKPHHPSSHVHLPSQPRR 


13656 


27557 


A 


13818 


516 


665 


WFKSGS FWLGMVAHTCNPSTLGGQGGW I 
TSGRSRTSVTTTTTSQTCAPA 


13657 


27558 


A 


13819 


295 


361 


WLTPVI PALLEAVTGGSLEPRS 


13658 


27559 


A 


13820 


166 


368 


GWWATQHSAQYWSSSSIGALCLGESCS 
VTQAGVQWCDLSSPQPSEPGFKRFLCLS 
LPS S WDHRGDL 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26* 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=GIycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M~ Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possibie 
nucleotide deletion, ^possible 
nucleotide insertion 


13659 


27560 


A 


13821 


133 


530 


RKYCGQCLNMALNNVSLS SGDQRSRVAY 
RSS HGDLRPRAS ALAMVS GDG FLVS R P E 
AIHLGPRQAVRPSVRAESRRVDGGGRSP 
REPDGRGRSRQARFSPYPIPAVEPDLLR 
SVLQQRLIALGGVIAARISV 


13660 


27561 


A 


13822 


76 


1958 


RQELI WPLCS PPQGDRFLQKS W I FFRPV 
MADKLTRIAIVNHDKCKPKKCRQECKKS 
CPWRMGKLCIEVTPQSKIAWISETLCI 
GCG I C I KKC PFGALS IVNL P SNLEKETT 
HR YCANAFKLHRL P I PR PG E VLGLVGTN 
GIGKSTALKILAGKQKPNLGKYDDPPDW 
QEILTYFRGSELQNYFTKILEDDLKAII 
KPQYVDQ I PKAAKGTVGS I LDRKDETKT 
QAIVCQQLDLTHLKERNVEDLSGGELQR 
FACAWC I QKAD I FMFDE PS S YLDVKQR 
LKAAITIRSLINPDRYIIWEHDLSVLD 
YLSDF I CCL YGVPSAYGWTMP FS VR EG 
INIFLDGYVPTENLRFRDASLVFKVAET 
ANEEEVKKMCMYKYPGMKKKMGEFELAI 
VAG E F TD S E I MVMLGENGTG KTTF I RML 
AGRLKPDEGGEVPVLNVSYKPQKISPKS 
TGSVRQLLHEKIRDAYTKPQFVTDVMKP 
LQIENI IDQEVQTLSGGELQRVALALCL 
GKPADVYLIDEPSAYLDSEQRLMAARW 
KR F I LHAKXTAFWEHDFI MAT Y LADRV 
IVFDGVPSKNTVANSPQTLLAGMNKFLS 
QLE ITFRRDPNNYRPRINKLNS I KDVEQ 
KKSGNYFFLDD 


13661 


27562 


A 


13823 


3 


292 


KCWDYVHEPPRPAPMQTYQVDLRCCFVS 
TINLLVCGERASLPSVESLVFSGLAEVW 
MKAVLAPSRESGKLVLQQVPEYAEAIWG 
ARPACRGLVFSA 


13662 


27563 


A 


13824 


245 


91 


YKVNFGPPRGSFFRPPPFFFFEVFHFFF 
IIFFFFFFFFFFFSFFLGCFMYV 


13663 


27564 


A 


13825 


276 


100 


PGQKKKTLS P KKKKS I Y L Y I HTH I Y I Y T 
HTHYIYKYIFVCVYICVSIYIHTHIVPR 
IFS 


13664 


27565 


A 


13826 


202 


375 


IYNTYIHTHTHTHTHTHNVYLYIQSGDC 
IYAVLLGELSFIYIFVFLSSLISILFHL 
TY 


13665 


27566 


A 


13827 


278 


34 


KPRGLSPAGVSAQRAWGPHSCPRLPDPR 
TELQVRQI PALVGS SAFQGRRGR KGPVG 
GSGPSRPRAPPQPRPGCCQCTLLQI 


13666 


27567 


A 


13828 


62 


307 


GGARAI FLNVPHLKRPMFFLELCLSTKC 
PVGTEFALDSLLRLYNSNTFCWVQWLMP 
TI PALCKAEMGGLLEARSL I PAWAT 


13667 


27568 


A 


13829 


386 


478 


NRIGMVSHACNPSTLGGQGRRIMRSGDQ 
DHS 


13668 


27569 


A 


13830 


364 


484 


NFNI WLGWAWWLMP VI PALWEAKAGGS L 
EVRGSGPAANMV 


13669 


27570 


A 


13831 


33 


486 


PARSAEFGTRRERERERERERERERERE 
RERERERERERERERERERERERERERG 
VICVCPCALLCVRESDPLSQKKYCVCVF 
WRYTESLALSPFCLGRCIiALLI FLEGM 
WRKIWAPSFLSLCVGALFFSQHPPPFWC 
LCVLSLVRPLPLSLSGGVFVSQRALI 


13670 


27571 


A 


13832 


6 


718 


Y SAVE FAMAGVGAG P LRAMGRQALLLLA 
LCATGAQGLYFHIGETEKRCFIEEIPDE 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino arid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 
C=Cysteine, D=A spa rtic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=G)ycine, H=Histidine, I=Isoleucine, 
K=Lysine, Lf=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R = Arginine, S = Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, * 
X=Unknovvn, *=Stop codon, /^possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














TMVIGNYRTQMWDKQKEVFLPSTPGLGM 
HVEVKDPDGKMLQWLSRQYGSEGRFTF 
TSHTPGDHQI CLHSNSTRMALFAGGKLR 
VHLD I QVGEHANNYPEI AAKDKLTELQL 
RARQLLDQVEQIQKEQDYQRYREERFRL 
TSESTNQRVLWWS lAQTVILILTGIWQM 
RHLKS FFEAKKLV 


13671 


27572 


A 


13833 


17 


130 


RLQEFGTRRERERERERERERERERERE 
RDTHSFLHG 


13672 


27573 


A 


13834 


283 


460 


LWRLAI CKNYSLKRETEPGSVAHVYNP 
STLGGQGGRTARGQEFKTSLDNIARPPS 
LQK 


J jO/j 


Z / J /** 


j± 


JjojJ 


3 


fjj 


WPRFCTAIjQEFGTRRERERERERERERE 
RERERERERERERERERATSLSRARPSF 
LFFSACVSHTPYVGERENLSLHTLKHTH 
TREKHSLSYTNASARDTRARFSEGRAPP 
IYSVYLTHGVFFFFFFSLCLSVRHAEGL 
AHV 


13674 


27575 


A 


13837 


351 


57 


RTLVH I S KFTGNLS DL YYVQSE WQDLGT 
ADLSIMTSVXiHCLPSLES I KNTDS WPSS 
VAHTCNPSTLGGRGRCRDQHGQHSKQPS 
LL.KI QKI ARRCGG 


13675 


27576 


A 


13838 


336 


150 


T FC YKKYLWGAHCF I FRTTGLP I KNF Y F 
G QAWCLL P VI S ARWE AETGG S LE PRS LR 
VAWAS 


13676 


27577 


A 


13839 


310 


454 


HCSLGLPQVLLI ILYKILLGWARWLTPV 
I PALWEVEAGGLLEARSSRP 


11677 
IJVt I 




A 




i yy j 


?97 


I IFFLLDLKLQGAMTI YVS I YLSI YLS I 
YLS I YLYTHTL WNTLQL 


13678 


27579 


A 


13841 


33 


316 


LDQHPTPRSPLLCHSLRKTSSSQGGKSE 
LVKQSLKKPKLPEGRFDAPEDSHLEKEP 
LEK F P DD VNPVT KE KGGPRG P E PTR YGD 
WERKGRCIDF 


13679 


27580 


A 


13842 


4474 


2586 


DGGSGCVKMEFPGGNDNYLTI TGPSHPF 
LSGAETFHTPSLGDEEFE I PP I SLDSDP 
S LAVS D WGHFDDLADPSS SQDGS FS AQ 
YGVQTLDMPVGMTHGLMEQGGGLLSGGL 
TMDLDHS IGTQYS ANPPVTIDVPMTDMT 
SGLMGHSQLTTIDQSELSSQLGLSLGGG 
TILPPAQSPEDRLSTTPSPTSSLHEDGV 
EDFRRQLPSQKTWVEAGKKQKAPKKRK 
KKDPNEPQKPVSAYALFFRDTQAAIKGQ 
NPNATFGEVSKI VAS MWDSLGEEQKQ I Y 
KRKTEAAKKEYLKALAAYKDNQECQATV 
ETVELDPAPPSQTPSPPPMATVDPASPA 
PAS I EPPALSPS I WNSTLS S YVANQAS 
SGAGGQPNITKLIITKQMLPSSITMSQG 
GMVTVIPATWTSRGLQLGQTSTATIQP 
SQQAQIVTRSVLQAAAAAAAAASMQLPP 
PRLQPPPLQQMPQPPTQQQVTILQQPPP 
LOAMOOPPPOKVRINLOOOPPPLOIKSV 
PLPTLKMQTTLVPPTVESSPERPMNNSP 
EAHTVEAPSPETICEMITDWPEVESPS 
QMDVELVSGSPVALSPQPRCVRSGCENP 
PIVSKDWDNEYCSNECWKHCRDVFIAW 
VASRNSNTWFVK 


13680 


27581 


A 


13843 


54 


251 


EFYRMNSPSPLWLCVCVCVCACLFMCLC 
ALTCMCVWYKSLLHSNLTOVLSSKFYDS 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A^AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Lcucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S^Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














ISPAENPGK 


13681 


27582 


A 


13844 


154 


26 


DPHVPPTERSEGSQARWLPPVIPTLjWEA 
EACGSPEVKSSNST 


13682 


27583 


A 


13845 


475 


225 


GHTELYGGALFQLLQAAI PEGGGTSCLF 
NPSSDGGSSAAGGWGGAGDLARSSRVP 
SPTSSRPVPSPDPSPPSLYLTPWKETH 


13683 


27584 


A 


13846 


479 


319 


RDCRRSLTHSVLSGAQAGVQWRDLGSLQ 
PPPPSRLPWPPKAPRWQPLPGHHPI 


13684 


27585 


A 


13847 


182 


499 


LLCVKLCDS CNENKALREQTRAQVQGGH 
SRAQVQQVQGGHSAHTWQIRSAASRRK 
AF STCS SHLGM VLL F YGTG S S T YMRPTT 
RYSPLEGRLAAVFYS I LI PTLN 


13685 


27586 


A 


13848 


388 


494 


RDSWERWLEPLIPVLWEAQAGRLLEPRS 
LRLAWAT 


13686 


27587 


A 


13849 


198 


63 


HHATTWMNLEDI MFRKVSQTQKNKYYM I 
PLIWKTKKVLSHREQK 


13687 


27588 


A 


13850 


555 


353 


RCPPGAQAPLPAPSPAPPVHIALSPLSC 
GVSWPRELTAHPPEPPPPPFFKIHPHPQ 
LSVGKRGQMTP 


13688 


27589 


A 


13851 


169 


66 


L LAGS TG Q ARW LTP V I PELWKAEVGGSL 
EIRSSR 


13689 


27590 


A 


13853 


12 


105 


I ASGLHDFFKKKKI KKKKKKKKKKKKGG 
AL 


13690 


27591 


A 


13854 


256 


363 


I CLF I KDSDRGQTQWLTPV I LTLWKAEA 
GGSLEPKS 


136Q1 


97 SO? 


A 


URSA 




436 


I DRLSDSHEERERERERERERERERERE 
R3RERERERERERERERERERERERERE 
RERESSLLCVRKTCFAPPTYIFLWETLR 
VCPTPCVWGALSVFSHTERARIFFYTCG 
GS VARAPVCAHTRI FSSRARARALCARP 
PLSRERRAHTRVFFI 


13692 


27593 


A 


13857 


91 


428 


PDPDTSPLTTDRGETPLGATLPSCYCGG 
QQSFEPKTKKTTNQKKKKKGKKFTPQKK 
NPKIFPPKPFCPKGPRGENQPKKTQNQP 
KK KKK I LGGG PAPNS P LGGG KKNS WAG F 


13693 

IJU7J 




A 


13859 


465 


43 


KRVTTENPQHICSAETAIPWSLSQNCNF 
HILWWGHPNHPIYITTSIYIYIYIPHIY 
GMVY I YTR I CMCV C I YVHI C IHTHTHI Y 
I Y I HTHTH I YMERERET VS CSVAQAGVQ 
WCDLGSLQPPNKRLKRFSCLSLPSSWDY 
W 


13694 


27595 


A 


13860 


290 


52 


WYKYLPGTRGFLIWLNYTiKCIGRIELLV 
GYFSFI YLS I YLS I YLS I YLSSRS I AQP 
GVKRCNHSSLQPRTPGLMHLSNS 


13695 


27596 


A 


13861 


108 


22 


IFFLFSAVKTRFSYAFPKEFPYRMNHVS 
I 


13696 


27597 


A 


13862 


167 


33 


VFTVPGTYMCVSVRDLGQAWWLTPVIPA 
LWEAEAGGSLEEFLRT 


13697 


27598 


A 


13863 


448 


355 


FFFFFFFFFFFI FFFFFYFFFFFFFAKN 
PLN 


1 36QS 




A 


1 JOUH 


j«- i 


44S 


SQSTQAWCLMPVI PALWETKSSESFEPR 
SLRPAWATGHNPI 


13699 


27600 


A 


13865 


380 


80 


MILSPSPKFPSFLFSLFSALLLWHCLEL 
PFLSNVGKGMQNDPVGIiSPTSTPSDLSR 
I YLFFCFFVCLFVCFETGFPS VAQAGVQ 
WCDPLGLKWFLPPQSP 


13700 


27601 


A 


13866 


372 


264 


KMENPEDKNFFIFFMFFIVFLLFFFFFL 
FCLTQFF 
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SEQ ID 
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M 
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od 
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26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid- 
residue of 
peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A~Alanine 
C=Cysteine, D=Aspartic Acid, 
E=G)utamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, Wpossible 
nucleotide insertion 


13701 


27602 


A 


13867 


161 


304 


LFIAEMTSHYIAQAGLELLASSNPPTSA 
S ERTGI TDGATE I LPQLRL 


13702 


27603 


A 


13868 


230 


348 


VSFIQNLVFCVERVYRVPDFGVWERGSK 
YNNGSTELHSK 


13703 


27604 


A 


13869 


168 


260 


EVIAECYNGCFQTLSLCVCVCVCVCVCV 
CY 


13704 


27605 


A 


13870 


140 


16 


LKYGQWRDLGSLQPP PPRFKQFSCLSLL 
SSRNCTTMTRDHE 


13705 


27606 


A 


13871 


143 


30 


KILKVWSQAWWLIPVIPALWEAEVGGS 
LEPRISRLA 


13706 


27607 


A 


13872 


158 


59 


HLHGCIHLLKLRLDAVSHACNSSTLGGQ 
GGRIA 


13707 


27608 


A 


13873 


232 


115 


APFFFKGAP I FFFFFFFFFFFI FFFLI R 
GKPGNDVGVEG 


13708 


27609 


A 


13874 


1619 


1375 


KTTWRRHLEKLQDMSLLRHRVHVDSSQG 
LFPFFQRQGLLPRLKCNGDI IPYCNLKL 
LGWSNSPASAFRVARTTSLCRHTWL 


13709 


27610 


A 


13875 


219 


84 


KPLQSTST I CWRAWDRWCLWSQLLRRL 
RWEDHLSFGVPSCSAL 


13710 


27611 


A 


13876 


166 


496 


KKKKKKKKKGGGPUCKTPGGAKINRGRK 
KKI FFQKGGQKKTPGGILKKXPFLGGGK 
KGPNPPKKI KGLREKKKFKRGKGAKPAQ 
NPWGKKISPPGFFLKKFFPRGRGFFI 


13711 


27612 


A 


13877 


132 


348 


PSKKKKRGGPFKRTPGGPKFNGGGKGNS 
FPLMGGGIKPNLGILGKNPYLGGGTNGN 
NPPTEIKGFGEKKKF 


13712 


27613 


A 


13878 


287 


134 . 


QQGPPISPGPKEENRMGPPSPGWGNPGI 
R£ P PGAGPNPGAFQDGM FPS KKQ 


13713 


27614 


A 


13879 


333 


480 


VKPLRSAEGPRPESNMLHLTLKSKQQWL 
TPVI PALWEAKAGGSQGQE I E 


13714 


27615 


A 


13880 


1 


227 


PKI HCK I QH I WLVS FFP S FFLS FFQGE 
SCSAAQAGVQWHDLGSLQAPPPGFKWFR 
FIELLGLGFHSRRMRGKDS 


13715 


27616 


A 


13881 


276 


378 


GWPWWLTPVI PALLEAEVGGLLEPRSLR 
LALAT 


13716 


27617 


A 


13882 


126 


389 


GSVLGTGCCGSVLGTVCRGSVLGTVLIA 
SATPDRYLRLSSGSSLTRITNSDLVPWL 
TPVI PALWEAEAGVSLEPRIWRPARETW 
EDPH 


13717 


27618 


A 


13883 


414 


150 


ARQAPKWGNPTGSPPPGFLNPPPKNFFL 
GPQKKKI FFPTPPPPHFFFFKGPPPFFF 
FFFFFFFSPHASGI LLCHPGRIAVAQSW 
LTE 


13718 


27619 


A 


13884 


197 


363 


DLGWWNTFMLKFKCLNVSRGHVQWLI 
I PVI P ILWKABAGRSLKPRSLR PAWAT 


13719 


27620 


A 


13885 


101 


236 


NIEYANFPYFGLACPEYIVRALQKKYTQ 
THTHTHTHTKTRTTHSL 


13720 


27621 I 


A 


13887 


17 


140 


TVHLKMVKIGLHMVAHACNPSTLGGRGG 
PIMRSGALGNFCL 


13721 


27622 


A 


13888 


135 


1 


VLHAGLKLLGSSTS PTSASKGAGTVGIH 
YHTQLAFQFFLFCFLIR 


13722 


27623 


A 


13889 


411 


276 


TLYFKHGEMGFHHVGQAGLELLASSDPP 
ALEFETSLHNKVRPRL 


13723 


27624 


A 


13890 


216 


54 


EFKLCYTLFDHSYFPLEELVSGQVWRLR 
PAIPSFWEAKEGGSLEPRSSKHEIV 


13724 


27625 


A 


13891 


3 


149 


SSDPRPSSSWLAWVGLWSMVWRVPPFLL 
PILFLASHVGKSPLSPSPHF 
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M 
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od 
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nucleotide 

location 

correspond 

ing to first 

amino afirl 

residue of 

Deotide 

sequence 


Predict- 
ed end 
nude- 
otide 
location 
correspon 

dincj tn 

U 1 1 1 Zf M\f 

last amino 
acid 

residue of 

npntiiif* 

UwuLIUV 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, LHLeucine, M=Methionine, 
N=Asparagine, P=Proline, 
0=("Jhitnmmp R=Aruininp S=Serine 
T=Threonine, V=VaIine, 
W=Trvntonhan Y=Tvrosine 
X=Unknown, *=Stop codon,/=possible 

nucleotide* rlplptinn \=nossihlp 

nucleotide insertion 


13725 


27626 


A 


13892 


166 


1 


L FS KI FWWLL P VI PALRE AKVGELLE PG 
SSRPAWATWKNPVYTKNTKISQAWQCT 


13726 


27627 


A 


13893 


128 


2 


FSTTYPNP I KLPHPYLPLLTLFLDSAHL 
HPSEINSLVAQKKK 


13727 


27628 


A 


13894 


225 


391 


HLSIHILTFPFSLPEQWWLVPIIPVLWE 
VKMGGLLEPRSSRPAWATQQDPQVSRK 


13728 


27629 


A 


13895 


192 


1 


TLVCVS SS VPKNYNN I CLACW FMP 1 1 PA 
LWEVESGSLLELKSSPPAWPTWWVPLKS 
KNTKISLA 


13729 


27630 


A 


13896 


152 


245 


DQS IADVLMYYTHTHTHTHTHTHTLQNQ 
VPG 


13730 


27631 


A 


13897 


270 


406 


I STR FFLYRDRASLCKPGWSAGVQS WLP 
GSSNSQAQAILLPQPPE 


13731 


27632 


A 


13898 


232 


71 


EAEGPRDVTSSTFHWRLYDQTTNYHECR 
MWANSRLRLTPEGLLKVI TPWCHAH 


13732 


27633 


A 


13899 


107 


1 


LGQAQWLMP I I PALW EAKAGG S LE PRT S 
RPAWA 


1 


77A14 








774 


PTRP I LAHCNLCVLGSDDP PASASQSAG 
VTGMSHCAWSNL I IHLPSRSTAI CTL - 


1 'in'lA 
lj / jf 


77£1 S 
J. 1 Oj d 




JJ7U1 




1Q4 


KROVKSHSEVKTTHVOWPGIDPGS PAWF 
AR ILPLNHQRLSGTTP WR I TEKSNHKDL 
ETVLSGFFKCRLKANKD I QTKCVF IGN F 
Y 


13735 


27636 


A 


13902 


300 


404 


AAPGRAPQKKKKKKKKKKKKKKKKKKAP 
FI5CRGR 


13736 


27637 


A 


13903 


131 


29 . 


ALGSLQPLS PRFKRFSCLSLPSSWNSTA 
LTVTTM 


13737 


27638 


A 


13904 


267 


74 


VWVLV PPFTS YGVLDKP I L I VNN I YVHV 
CVCLCMCVCVC VCVCCT V I RRHCWHNMI 
IIISQHDQ 


13738 


£■ /UJ7 


A 


13905 


193 


64 


IRSHQIHKLHFKKFRILGWTWWLTPVIP 
APWEAEEGRS PEPSS 


13739 


27640 


A 


13906 


309 


424 


QSTDFLENRRVGWAWWLTPVI PALWEAE 
VGGLPEIKNS 


13740 


27641 


A 


13907 


404 


251 


SCL I SLRSWD YRHVPPQPANFVFFFFEM 
ESCSVAQSGVQWQEEKSISKINK 


13741 


27642 


A 


13908 


471 


59 


GPPQAKKGKGFPTPGPPQKKGGGQPKSG 
VLGFSPPPLGPCPGLGPPKGWGFWGGSF 
FFFFFFRKWEHKNELFPFSKTEKPRRAG 
EARKGSQSTKYAGLSRGERQAS PTLR PG 
HLAKASAGGWLQYTRWQGKLPEPGN 


13742 


27643 


A 


13909 


479 


231 


QEYCMLI FCEHFGDFDLNI VFCLEKKMV 
GVIADFYLLS S FTGRPQWLMPVIPTLWE 
AEVGGSLEAKSFTPAWATWQDPISMF 


13743 


27644 


A 


13910 


167 


19 


CIVLGIMLDSLWEPRQWVPSWVRWFTPV 
IPALWEAEAGGSPEVRSSNSS 


11744 


7764^ 


A 


11Q1 T 


328 


446 


GSFSKI TDLSQTWWYVPVVliAJjQESEVG 
RSLEPRSSRLQ 


1174 S 

1 J ItD 


77£4A 




1^Q17 


14^ 


46ft 


NAFMSTGCEVTjSYSELRORNLVWWFLPV 
IPATPQAEAGRLLGPRSSRPPWATKGVR 
ALSHKQTNTKTNKLSHSTGVSFVFTHHI 
HS PG I LWRLSQWLLAMNIiATW 


13746 


27647 


A 


13913 


319 


210 


KLGFFKGPPLFFFFFFFFFYFFFFFGFL 
TVSCNFTL 


13747 


27648 


A 


13914 


372 


10 


PNTTCTPITYGKKCCFREISYYLCHEFI 
RIRILTFFDGESCYIAQAGLKLLGSRSP 
PTSASGVAGNTDVCHHTQP I LFYILETG 
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nucleotide 
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correspond 

ing to first 

iiminn nrirl 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
Inst amino 
acid 

residue of 

npntide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Giycine, H=Histidine, ]=IsoIeucine, 
K=Lysine, L=OLeucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R— Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














LALLPRLILNSWSQAILLPPQPLRSYFN 
FWLFFCLYN 


13748 


27649 


A 


13915 


594 


471 


PSYIAYMCICMCVCVYVYIWICVCIYTH 
VHTPYNPVISKN 


13749 


27650 


A 


13916 


235 


3 


EANNLIRKTYTSGWFQRTSKRPPAVRRM 
KVLFFETESCSVAPVGVEWHDHSLLQPR 
PPGLNRS SHLSFPSS SDYRWA 


1 J l Jv 


27651 


A 


13917 


272 


184 


KTTAWAQWLTPVI STLWEAKVGGSLEAG 
S 


i J I J J 


776S? 

Z. 1 VJ I* 


A 


IJylO 


*tu 


ji i 


SRVQGTGGHRGRGC IQAGVGEQEAKRET 
EGPPWRWLTPVI S KLWEAKAGGSIiEPVS 
SRRATWNLLKRIYYTDEREYLIPLALNT 
SQGSTFSF 


13752 


27653 


A 


13919 


387 


484 


FIEI IGWARWLMPVIPTLWEAEAGGSPE 
VRSS 


13753 


27654 


A 


13920 


142 


321 


FLFS IHFHLS CWGLFSNVWQS LSIFSYTj 
KVRL YGQAW WFT P I ILALWEDKAGGSLD 
PRSL 


13754 


27655 


A 


13921 


270 


412 


I ED KK I Y S I LHT E TLERGQARW LTPV I P 
AFWEVKAGGSFEPRSLRPV 


13755 


27656 


A 


13922 


304 


396 


NFGWAQWLTPVIPALWESEAGGSLEPRG 
LRS 


JO / JO 










JO f 


VWWLTPVI PALVJEAEAGGS PEPRS SRP 


13757 


27658 


A 


13924 


103 


3 


NKS P E LGWAWWLAP V I FALWEAE AGG S P 
EVRSS 


l j / JO 


976SQ 
£. tojy 


A 




i yj 




CLLI LSGLVAS C I I ERHGKGQAW WLM P V 
I PTLWEAKVGGS LE LRNLRS AWGTWRNS 
I S I KYTEQLAQG 


13759 


27660 


A 


13926 


294 


489 


KWAKMQTLHKGR YACVC I C VCVCVC VYT 
YTD IYIFYIH I WPMNTKCSMLL VIGEMK 
ILSRMWSNQ 


13760 


27661 


A 


13927 


363 


2 


NWGPPGFPPPPFLKTGPVFFI FGAPKKK 
FFLSTPRALKFVLLKGGPLFFFFFFLWV 
FLVKTGSHCVAQPDLELLS S SNPVLVSQ 
GAGITGMSPQVQPLDLFFFFSFFETGSR 
SYAWADAW 


1 J/OJ 




A 


13928 


j 


218 


LRFKKRKTNKPKMW S VHTME YH S ALKR K 
E IRTQTTRWINLEDI VLSE I RVAVTKGR 
ILYDSTQMRNLEESNL 


13762 


27663 


A 


13929 


219 


408 


TFCHFIAFLEVYKMFWFFFSFIMFFFP 
LNCNKAFERKKKKNKKNKKKKKKRKKIS 
LSPPAPP 


13763 


27664 


A 


13930 


161 


22 


WHPPAS ASHVAGTTGACKHERLRQEDH F 
SLRVQGCSEPRLCHHTPP 


13764 


27665 


A 


13931 


116 


1 


I NTGWARWLTP VI PAFWEAAVGGSLE PR 
STSQAWATWK 


13765 


27666 


A 


13932 


181 


38 


PWPQRATQPKI S PLPPAI SALWEAEAGG 
ALEPRSSRPAWAKSDAWVD 


13766 


27667 


A 


13933 


267 


397 


TVFDRTLFFI KKI IHSWAPWLTPWPAI. 
WEAEVCGS PEVRS SG 


13767 


27668 


A 


13934 


129 


12 


NIGVGWALWLTS VI PALWBAEAGGPPKV 
RSSRTARPMS 


13768 


27669 


A 


13935 


179 


1 


KTYYGG I PPPGPPKKGEKKP PFFFKKFF 
LFFISFFIFFFFFFLRQSLAKMQSASVW 
KVS 


13769 


27670 


A 


13936 


245 


616 


DDKKKKEAAQKKATEQKI KVPEQ I KPS V 
SQPQPANSNNGTSTATSTNNNAKRATAN 
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peptide 
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Predict- 
ed end 
nucle- 
otide 
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correspon 
ding to 
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residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspnrttc Acid, 
E-Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, V=possib!e 
nucleotide insertion 














NQQPQQQQQQQQPQQQQPQQQPQPQPQQ 
QQPQQQPQALPRYPREVPPRFRHQEHKQ 
LLKRGQHFPVEF 


13770 


27671 


A 


.13937 


121 


292 


KTDLDLALAL I VRVLTS DDNFKNWG WV Q 
WLMP VFPAP WEAE VGG S S E ATNLRPAWA 
T 


13771 


27672 


A 


13938 


421 


202 


GPREAPIiFPSTQGISRPLFFFFFFETES 
RTVTRAAAQWHNLGSPQPPLCSLDILKT 
SQSVCMPEAAILVSQIT 


13772 


27673 


A 


13939 


205 


543 


ILESHRRLGWSEALPENQLLCRASSWES 
LG P PDPPCLKPP S P AFWGAAGGVJRAKST 
SVSEPVPVGTLLVALFLEMAVGLLLGLP 
FCFHSLGAPHLLCEPDGPSVLPAEARQS 
R 


13773 


27674 


A 


13940 


140 


2 


LSQHSEEYTLPPPLYLEKESSRPPPPQH 
TPHPPPPWPWHPDIADIQ 


13774 


27675 


A 


13941 


47 


303 


GLI FLI I WTCIVKTSTDFPRMEDCSQCI 
HQVTEESNKRMGFLSYYANPHHGSSRLL 
WPQHAAPWDDGRRGKPVFSLGFVSFPFP 
Q 


13775 


27676 


A 


13942 


1488 


1719 


PLVSFSKNTYPCLGNVKQNQKTTFCSRW 
KLHVLSSNLS S PAEVTWAS INLWSEQ 
SFQNSQLPAMLCLIVHLQMS 


13776 


27677 


A 


13943 


127 


402 


QPVTHP PNRTMCS SMI FLFLYLCFLS I*S 
PFPSHIiSLYAPPKKKKKKKKKKKKKKKK 
KKKKKKKKKKKKKKKKKKKKAGGGGPFK 
KKIHFPPGGGRIIFFY 


13777 


27678 


A 


13944 


852 


209 


EQT VYCTRI DLQQKLVFLRLEAFCLLS Q 
GDPGSPMMCQLQQFDLWVLRGVLNFGGE 
TCPGLFL YTKVEDYS KW I TS KAERAGPP 
LSSLHHWEKLI S FSHHGPNATMTQKTYS 
DSELGHVGSYLQGQRRTITHSRLGNSSR 
DSLDVREKDVKESGRS PEAS VQPLY YDY 
YGGEVGEGRI FAGQNRLYQPEE I ILGSL 
RACFLFAAVSSPGATPPN 


13778 


27679 


A 


13945 


68 


260 


INIFWHCVLLYVILYAISQFLLFPCVKN 
IASWLGSVAHACNPSTLGGQAGESLEPR 
NLRPAWAT 


13779 


27680 


A 


13946 


420 


156 


FLPPGVKYGSFKRAPPFFFFFFFFLVET 
LSCYIAHTGLELPDSSNYPTSASQSAVI 
TGMSHHTWPLEGACLAI PQALR I LLVLV 
PLH 


13780 


27681 


A 


13947 


317 


17 


AGHGGS TCNPSQP I VTAVP INLWGRDLL 
QQWGAQAF I PEQLYS PQSQHTMQEMGYV 
PGMGLEKNLKGIiKKPLQAEGQNSHQGLG 
YNFSWQPLLSLQNLYL 


13781 


27682 


A 


13948 


55 


224 


NFRLTSSFSYLKMSFWLGMWLTPVIPAL 
WEAKTGGLFEARSLRLVWWQSKKAGSHL 


13782 


27683 


A 


13949 


1464 


805 


RATS VRGAGRERS CGAAWS PAS IGPFLR 
RSVLPNAHFRSSREGGMAASTDMAGLEE 
S FRKFAI HGDPKASGQEMNGKNWAKLCK 
DCKVADGKSVTGTDVDIVFSKVKGKSAR 
VINYEEFKKALEEIiATKRFKGKSKEEAF 
DAI CQLVAGKEPANVG VTKAKTGGAVDR 
LTDTSRYTGSHKERFDESGKGKGIAGRQ 
D I LDDS G YVS AY KNAGTYDAKVKK 


13783 


27684 


A 


13950 


356 


98 


NAGPG I YFWGP I KKI LPCPPAGVKLGSL 
KRAPLFFF F VRMGFHHA'AQ AGLKLLS SR 
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OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I-isoleucine, 
K=Lysine, \j= Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop cod on, /= possible 
nucleotide deletion, V=possible 
nucleotide insertion 














DSPIPVSQSIRITSTCEPPCLAKIKSSL 
RL 


13784 


27685 


A 


13951 


194 


48 


NFLEGWPGQRLTPVI PALWKAEAGGSLE 
PKTFRPVWAMWRDRLRKPQH 


13785 


27686 


A 


13952 


341 


418 


DQPGQHGETPSILKIQKLAGRGSTCL 


13786 


27687 


A 


13953 


158 


-323 


TWLKKYFQSTNQLPPAHLLNVQPLLPFG 
LPLLPLKKKKKKKKKKKKKKKKKKIKK 


13787 


2768S 


A 


13954 


231 


355 


VFLPQKQKQNQCPLTGKWVNWSWNIHTL 
EYYSAVXGMKQLI 


13788 


27689 , 


A 


13955 


493 


39 


PAAFSSWGWAPGSSLLKKTTPASWPPGL 
LP PAP VS SMAAPDLDS F P L P LCWTFLRV 
LCRVFS S STLAFLSCVSTS FGALTPAAS 
SNI FS ISLPHWAS PPLPTARLTSAVSPE 
PLPPLPSRLQRQPRGAPSHPGMSMGPIL 
DATSNS TTRSR 


13789 


27690 


A 


13956 


497 


372 


NTKISWASWYAPVI PATWEVSNSRPQAI 
HSHQPPKRIGLQA 


13790 


27691 


A 


13957 


438 


301 


GTTYLDIDMTCDYVSVCLCI YI CTRQQS 
Y I YTYVYTHTHTHTHMY I 


13791 


27692 


A 


13958 


84 


466 


QPLGRSGKVPQLHLWEMQRLPAAFFRSF 
AAQGLGESVHINTPTSLSLRGRPFISSS 
GPGARARVTCAPCLHAARI LCCFWRGLH 
RLKCSNVISAHCNLRFSGSSDSPASASQ 
VAGITGRSSLEQLLE 


13792 


27693 


A 


13959 


216 


28 


KYVKNYVLSS I I FWMQEI FHNFKKSSRT 
QWLTPVI PALW E AEAGGS P E PR SSR P A S 
SIQLENS 


13793 


27694 


A 


33960 


320 


943 


VLSFSLSDRAQRCNCRMGKQNSKLAPEV 
MEDLVKSTEFNEHELKQWYKGFLKDCPS 
GRLNLEEFQQLYVKFFPYGDAS KFAQHA 
FRTFDKNGDGTIDFREFICALSITSRGS 
FEQKLNWAFNMYDLDGDGKI TRVEMLE I 
I EA I Y KMVGT V I MMKMNE CGLTP EQRVD 
KIFSKMDKNKDDQITLDEFKEAAKSDPS 
IVLLLQCDIQK 


13794 


27695 


A 


13961 


191 


8 


GLFRFRRLSEEVKHCCCCCCCCCCCCCC 
CCCCCCCCCCCYLRLAPQS PAAPAPELH 
TPLPGSRRNR 


13795 


27696 


A 


13962 


203 


28 


VWGNQHFCFVPVFRKKVFCFFPLNKLLR 
GGFFFFFFFFFLVFIDNSWVFLGEGDLA 
GS 


13796 


27697 


A 


13963 


18 


171 


GIRHEERERERERERERERERERERERH 
PARE IDMCVSKRDTRGAL CALF 


13797 


27698 


A 


13964 


25 


299 


HDFCTRRERERERERERERERERERERE 
RERERERERERERERERERERERERERE 
RERERERERERERERETHTLSLYIYMCV 
FLYIPLYFSFLYIHTQREAPPPLSATLS 
LSIL 


13798 


27699 


A 


13965 


246 


410 


AYNGQGSRGVTPVI PSLWEAKTGGSLEP 
RS WRSAWATWHTP P VPGPS KE I SLY I 


13799 


27700 


A 


13966 


91 


26 


FCCCGCCCCCCFYCKKEKTLV 


13800 


27701 


A 


13967 


158 


24 


PFCHIYIYFLESCFVAMLECSGKILAHC 
YLRLLGSSNSLVPSLA 


13801 


27702 


A 


13968 


129 


7 


SWAQWLMP 1 1 PTLWEHEAGRSLEAMSSR 
PAWATQQDPVSKK 


13802 


27703 


A 


13969 

i 


445 


268 


KIQLTKLKNASESLPSRINQAEEKISGW 
AWCLMPVI PVLWEAKAGGSLEPRSLRRG 
QVT 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E^Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leu cine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W^Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


13803 


27704 


A- 


13970 


1628 


1309 


HRRKKCAGYSSLSSALLRERERERERER 
ERERESVCVCVLGTITCFRRQYCVIQGI 
SGVRQNCVQI PALSDPDLFLFLSLS VLH 
SNTW WRP TLRVTERNKGAM CV 


13804 


27705 


A 


13971 


314 


140 


QCHNCY ILTQLVKIKGDQAWWLMPI I PA 
LGAKAGGSLEAGSLRSAWATWRAPMSTK 
IF 


13805 


27706 


A 


13972 


292 


482 


YKSVRTQDSCAHGVDRLVGKIRYVNLFV 
Y ICQHSDGRTWWLTPVI PPLWETKGGGS 
SEVGRSG 


13806 


27707 


A 


13973 


502 


303 


STAIGPSFSPFLCTHGSFHGFLFFFFCF 
CFFLVEMGSH YVAPAGLKLLGS SD P PAS 
ASPSHLGLQV 


13807 


27708 


A 


13974 


493 


289 


I PPTALGTS FS P FLCTEGS FHGFLFFFF 
CFCFFLVEMGSHYVAPAGLKLLGSSDPP 
ASAS PSHLGLQV 


13808 


27709 


A 


13975 


405 


121 


IREAAQSESLFVRAAPHTGLISGPCITE 
IYAVKDTRCMQRRILVRELECAAHGLTE 
CLS IDTHTHTHTHTHTHTHTHTHTHTHT 
HGTRWEAGQSLKGES 


13809 


27710 


A 


13976 


135 


254 


TVTCTNSSWSLTFTSFLFLMPGWIQKGR 
RVCVCVCVCVC 


13810 


27711 


A 


13977 


179 


58 


VPLS INSPQRQ C I FCCCCCCCCCFLVLQ 
KIISERSWAEIG 


13811 


277 12 


A 


13978 


97 


186 


KI I FWLGWAHACNPSTLGGQGGRI MRG 
RD 


13812 


27713 


A 


13979 


289 


420 


ELGLKIHGFPSQAPWFMPI I PALWEAEA 
GGLLESRSSRIAWAMW 


13813 


27714 


A 


13980 


421 


106 


SGGRNFFFFLGGGFYKKKFPPPPFLRGK 
KKKPPLPKKKNPTPIFFPPGGGPPFSPP ' 
PKKGGGKIFFFFGGKNFIMQNFGGFPPL 
WGKKKKFLPKKKNPLFFFFFF 


13814 


27715 


A 


13981 


309 


135 


KHIYIYLSIYLSIYLSIYLSIYLPAYLP 
TYLHAHPLIHANAYKNIHIDYPFKKGFE 
Y 


13815 


27716 


A 


13982 


231 


21 


QHMYCLKSIWHKKLSRKPYCYIISEYSS 
MVKLRVS ISQVQWLMPVI PALWEAKAGG 
LLKARSSRPMWAI 


13816 


27717 


A 


13983 


209 


359 


HLHTHTHTHTPNGVKQIMTQI PILPLTG 
PLILEKFLNFSKPLSHPGKRDP 


13817 


27718 


A 


13984 


391 


249 


IWPILPPQKIFFFFFETESCSVTQARVQ 
WCNLGSLQPPNPVFPSLAK 


13818 


27719 


A 


13985 


181 


403 


YLVLDSPESTLGILSQPKDYITKDQTLT 
FRPVWAHACNPSTLGSQGKWITSVQEV 
ETSIiCNTARPCLYQKYKN 


13819 


27720 


A 


13986 


441 


358 


ETGSRCVILAGAQWRDLSSVQPLPPRFK 


13820 


27721 


A 


13987 


7 


233 


AS I PC I YQKTKI PNTKQTKNKNTKI TPA 
WWCTP I APATQKAEI PPLHSSLGDFLRL 
SPKKKKGGPNWAPTPSKFF 


13821 


27722 


A 


13988 


152 


44 


PKAPLFFFFFFFFFFYETRSHSVAQAKV 
QRHDHGSL 


13822 


27723 


A 


13989 


188 


379 


NVS CHAAETRVWWE KER PL P VP FGAYLL 
DGMPLVWLLSAS S I LSI WLS I YLS IYLS 
IYLSIYF 


13823 


27724 


A 


13991 


296 


53 


SSI LFHASKYHFSGRLFSLLSCKRRNIL 
QLV PHMYS P Y LYMY I HTKKHI YEYMYVH 
IYIYMNLRNCRIYLHSCTPKENQAR 


13824 


27725 


A 


13992 


37 


290 


RLGSPASRIIQQIQCLIRACLKDGTLGV7V 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Tsoleucine, 
K= Lysine, L^Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine> 
X=Un known, *=Stop codon, /=possib!e 
nucleotide deletion, \=possible 
nucleotide insertion 














WWLI PVI P ALWEAKVGGSLE PRS S TPAW 
ATQGDL ILKKDGTLLQCPHVAKGHSSHS 


13825 


27726 


A 


13993 


189 


370 


SRNAI FKKRWESSCQSSFNSLGVFPQAA 
TKGWVRWLTLVIPALWEAKAGGSLEPRS 
SRPA 


13826 


27727 


A 


13994 


179 


25 


SKHNKKNIQICLLCSKRSMFVHVCIHIH 
ICVYVYRYAYIYIGMCIHIYIHS 


13827 


27728 


A 


13995 


159 


2 


EDPLSPGVQDQHGHHRKSLSGRGGKISL 
GQEFKSSMAAMYFGRSGGCKLERR 


13828 


27729 


A 


13996 


307 


99 


F I WEHFVN YTMFFT YKKPTS QAWWLMP V 
VPALWETEAGGSPEVRSPRPPGQHDDVR 
PLGPSLHVYLQMD 


13829 


27730 


A 


13997 


218 


21 


FQHFGRPRRVDHLSLGVQDQPGQHGETP 
SPLASSILDLIPPPFPIDEPVDLAFQSV 
TLGVQDTTS 


13830 


27731 


A 


13998 


227 


404 


GYSVTLLLGLTQLTQRIAHGTRKGERER 
ERERERERERVCSCLMHELASHLIiFFFL 
FRE 


13831 


27732 


A 


13999 


400 


668 


P VGRRWE LQGMGLGVRTWQS S LPCQGPT 
SSSVLSLGRGTPGISNTCLPPRSATELG 
LPAQVPFPNVQSQDQDLSFSLGCFPRTY 
NRQTL 


13832 


27733 


A 


14000 


225 


414 


NRVLLCHTGWSTVAI LAYCS FKLLGSRD 
S P TLAS RAART I VKCRNVWL I KKKKKKK 
KGGRRL 


13833 


27734 


A 


14001 


397 


118 


LFSGPFFFDKSECFQEPTVPFCLLIiYNQ 
MQPYLLRSAFVDSYYMYDSFLRDRVSFC 
QSGWSTVAPSRLTAALNSLAQVT LLPQP 
LNVLCGISR 


13834 


27735 


A 


14002 


3 


173 


YVFYRQKQMVCFYKIRIKNMFNRNRLNL 
GQWCVPIVPATQVAEAGGSPEPTSLRPI 


13835 


27736 


A 


14003 


440 


290 


YTHTLY I Y I Y I HTHTHTHT Y I YLCTYWG 
QFDLLVLDNKLYKE I Y YLS VF I 


13836 


27737 


A 


14004 


2 


116 


ARLVAMP FKYEE I KDFLLTARRKDAKS V 
KIKKNKSAAV 


13837 


27738 


A 


14005 


279 


422 


SAKAPLSCLRTNSLLLKNGLRGWAWWIT 
PVIPTLWEAEVGGSPDVTSP 


13838 


27739 


A 


14006 


3 


497 


GGIGDSRCGSTRASSSPQLAGRSSSVLP 
AAAQPCTPTMDVFKKGFSIAKEGWGAV 
EKTKQGVTEAAEKTKEGVMYVGAKTKEN 
WQSVTS VAE KTKEQANAVS EAWS SVN 
TVATKTVEEAENIAVTSGWRKEDLRPS 
APQQEGEASKEKEEVAEEAQSGGD 


13839 


27740 


A 


14007 


1 


292 


S LGGGGCNDPRS YHCTPAWATEQDTVSL 
SL.SIYMSVRVCIYMYMCVCVCIRIYTYI 
HTHIKIRVYIYTIPYIWGIYIYIYVDLV 
IYMGWCGCPPPSI 


13840 


27741 


A 


14008 


57 


250 


K I FFFLMNHTWGGVP WLATWEAEVGGS 
liE PRSLGLQETMMTS LLSRLPRQQGE IL 
SLILKRNY 


13841 


27742 


A 


14009 


184 


2 


AVPGRWPCQQVPGLLPS IQDTLKQGRTG 
QWLTPVTPMPWEAEAGGSLDARSLRPAW 
ATGK 


13842 


27743 


A 


14010 


123 


3 


KEPSRPGRWLTSI I PACWEAEAGGPPEA 
RSSTREFRTAK 


13843 


27744 


A 


14011 


403 


168 


GTGLRASRCISQPVMSGLLSPSEFPFPE 
IESHLHVSQRETAARLRDRERERARERE 

RERERGKKAGSGKPPGSASCVP 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
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26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine 7 L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V= Valine, 
W^Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possib!e 
nucleotide deletion, \=possible 
nucleotide insertion 


"T3844 *" 


27745 


A 


14012 


207 


422 


EIRRGSLiAGPLSLNFILSVMGSHSKYLR 
KVIGRAQWLMPWPALWEAEVGKLHELR 
RLGHTWAAW 




'inn AC. 




14013 


200 


405 


WLCVLGWS LLALS PRLECYGP I SAHCNL 
HLLRSSDSPASASQALQEAWYQHLLLMM 
PQAAS TQVGRKK 


13846 


27747 


A 


14014 


319 

1 


176 


KPSTITFLFLHLFISSRESRPVTQTEVK 
WLNHSS PQPPTPRFSLPSS 


13847 


27748 


A 


14015 


63 


40 1 


EQENKNTLNS PGWDGVH I HVLRFLNTRM 
PLGHLSFS EG VSGSGSSGRRCQAGAGGR 
SATAAARPCQCLGLKGLPPAVPSCGLEG 
KAPSRAGYSASHPPHGCETGNGV7VLYVN 




27749 


A 


14016 


416 


63 


VSKVYPKINHTQRTVRHETFKIAMPKYY 
SNLCSILRFNKDRLAFMRHYTKQCSRTS 
ITEYIAMFFFVCLFVLKIGAHCHPGWNE 
WLVCSQVTAASTSRAQAILPPQPPKIL 
GMQQA 


13849 


27750 


A 


14017 


196 


425 


S LMFRPPLFS FCCFCI RYPFI FVFFSLL ' 

CFAFSWCSLLLFFFHYLKLFLLLFLLFF 

LPVLLCLDSPLVHDFFLLSS 




27751 


A 


14018 


3 


233 


SSRMGRGTKTI AFDMRWSCEVDRARHRD 
REREREREREREREKGAEPQEFPVISPW 
VDRPCLAGILHLKCGCGHRL 


13851 


27752 


A 


14019 


163 


1 


AERWHDQICVEREI WGWVQWLMPVI PVL " 
WEAKAKNLLE P KN LKLQRNRRPGN S R 




27753 


A 


14020 


203 


83 


LKKKKNKSQAQWLTSWLASQSAEAGGS 
LEPRNSRPAWAT 


13853 


27754 


A 


14021 


179 


3 


VTWQRELRLQMELRLLICRAWWLTPVI P 
VPWEAKAGGSPEPRSSRFAWETRNRRPG 
NS 




Z / / J J 




14022 


358 


431 


WHKKKNQNWLGTGAHACNPSTLGG — 


"13855 


27756 


A 


14023 


98 


3 


GRPAGWLTPVAPTLWSAEMGGSLEPRSL 
GHE 


13856 


27757 


A 


14024 


254 


3S5 


RENWAEPSSHSLLNIFLQLGTVARACKP 
STLGGRGGWITRSGD 


13857 


27758 


A 


14025 


140 


1 


KGVLDGPRGLCVFCFCFFFLCVCVCVCV 
CVCVWRRSLTLSPRLQCI 


1385S 


27759 


A 


14026 


579 


852 


QREWVGWAGKEGEGVJVSHVPASAQLPAN 
GQRGQPH PS PLGGTGS TWAQEGA YCCLS 
SCSHCCCCSSCCCCCPPGFCLFPSLGAI 
CHLYHL 


13859 


27760 


A 


14027 


30 


663 


LRIRALRELPASHIPGSLTICCVPRPPL 
PCSSTKPDAKGYKPIiAQQKIiGSQRTGPC 
LGHAPCYS PLWELRGHGCS P YGPS PSGS 
VSLWQ3AMRLPKNTPEEKDRRTAALQEG 
LRRAVSVPLTLAETVASLWPALQELARC 
GNLACRSDLQVAAKALEMGVFGAYFNVL 
INLRD I TDEAFKDQIHHRVS SIiLQEAKT 
QAALVLDCLETRQE 


13860 


27761 


A 


14028 


I 


267 


AVGVHHAFHLPHCFFASLLES PVS PRLA 
MD PNCS CAAGVS CTCAGS CKCXECKCTS 
CKKSCCSCCPVGCSKCAQGCVCKGASEK 
CSCCD 


13861 


27762 


A 


14029 




I /4t> ! 


PAAGAATMEFRQEEFRKLAGRALGKLHR 
IiLEKRQEGAETLELSADGRPVTTQTRDP 
PWDCTCFGLPRRYI IAIMSGLGFCISF 
GIRCKLGVAIVSMVNNSTTHRGGHWVQ 
KAQFS WDPETVGLIHGSFFWGYI VTQ I P 
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beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A~Alanine 
C=Cysteine, D-Aspartic Acid, . 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, J=Isoleucine, 
K=Lysine, L=Leucine, iyi=Methionine 5 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unkno\vn, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














GGFI CQKFAANRVFGFAI VATSTLNMLI 
PSAARVHYGCVI FVRILQGLVEGVTYPA 
CHGIWSKWAPPLERSRLATTAFCGSYAG 
AVVAMPLAGVLVQYSGttSSVFYVYGSFG 
IFWYLFWLLVSYESPALHPSISEEERKY 
IEDAIGESAKLMNPLTKFSTPWRRFFTS 
MPVYAIIVANFCRSWTFYLLLISQPAYF 
EEVFGFEISKVGLVSALPHLVMTIIVPI 
GGQIADFLRSRRIMSTTNVRKLMNCGGF 
GMEATLLLWG YSHSKGVAI S FLVLAVG 
FSGFAISGFOTNHLDIAPRYASILMGI S 
NGVGTLSGMVCPI I VGAMTKHKTREEWQ 
YVFLIASLVHYGGVIFYGVFASGEKQPW 
AEPEEMSEEKCGFVGHDQLAGSDDSEME 
DEAE P PGAPPAPPPS YGATHS TFQP PRP 
PPPVRDY 


13862 


27763 


A 


14030 


240 


63 


VFTAQSNGGVELNRFNEGPLSAVLKGRV 
AWAQWLTPVIPTLWEVKAGGLLEARSLS 
PA 


13863 


27764 


A 


14031 


289 


414 


HAEMGRVQWLTPAIPALCKAEAGGPPEP 
RSLRPAWATY 


13864 


27765 


A 


14032 


386 


1765 


LGDARAPEKMSAIQAAWPSGTECIAKYN 
FHGTAEQDLPFCKGDVLTIVAVTKDPNW 
YKAKNKVGREG 1 1 PANYVQKREGVKAGT 
KLSLMPWFHGKITRSQAERLLYPPETGL 
FLVRESTNYPGDYTLCVSCDGKVEHYRI 
MYHAS KLS I DEEVYFENLMQLVEHYTS D 
ADGLCTRLI KP KVMEGTVAAQDE FYRSG 
WALNMKELKLLQTIGKGEFGDVMLGDYR 
GNKVAVKCI KNDATAQAFLAEASVMTQIi 
RHSNLVQLLGVIVEEKGGLYIVTEYMAK 
GSLVDYLRSRGRSVLGGDCLLKFSLDVC 
EAKEYLEGNNFVHRDLAARNVLVSEDNV 
AKVS D FGLTKE AS S TQDTG KL P VKWTAP 
EALREKXFSTKSDVWSFGILLWEIYSFG 
RVP YPR I PLKDWPRVEKGYKMDAPDGC 
PPAVYEVMKKCWHLDAAMRPSFLQLREQ 
LEH I KTHELHL 


13865 


27766 


A 


34033 


619 


414 


SQARCLEQHACTS PRQPRPQLQ CS ELKG 
HSLGMEHQGHVP CLHPCS PCPP VKAS PA 
CWSSNPAPLCPH 


13866 


27767 


A 


14034 


2 


614 


LESRPDGRPSSTHPASPSAFSAPGKPHP 
PEAKMS S KRAKAKTTKKRPQRATSNV FA 
MFDQSQIQEFKEAFNMIDQNRDGFIDKE 
DLHDMLASLGKNPTDEYLEGMMSEAPGP 
INFTMFLTMFGEKLNGTDPEDVI RNAFA 
CFDEEASGFIHEDHLRELLTTMGDRFTD 
EE VDEM YREA P I D KKGNFNYVE FTR ILK 
HGAKDKDD 


13867 


27768 


A 


14035 


1 


254 


AVE FGPAGPGS LGRAAAM 1 1 PVR CFTCG 
KIVGNKWEAYIX5LLQAEYTEGDALDAL.G 
LKRYCCRRMLLAHVDLIEKLLNYAPLEK 


13868 


27769 


A 


14036 


494 


148 


FRFLSDCGVFAEGHIELQVESGVPLGFS 
TMAEDMETKI KNYKTAPFDSRFPNQNQT 
RNCWQKYLDFKRCQKAMTAKGGDISVCE 
WYQRVYQSLCPTSWVTDWDEQRAEGTFP 
GKI 


13869 


27770 


A 


14037 


130 


3969 


IMGDVKNFLYAWCGKRKMTPSYEIRAVG 
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sequence 


M 
eth 
od 


SEQBD 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A^AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Iso!eucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X^Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 










* 




NKNRQ KFMCEV QVEG YNYTGMGNS TNKK 
DAQSNAARDFVNYLVRINEI KSEEVPAF 
GVASPPPLTDTPDTTANAEGDLPTTOGG 
PLPPHLALKAENNSEVGASGYGVPGPTW 
DRGANLKDYYS RKE EQEVQATLES EEVD 
LNAGLHGNWTLENAKARLNQYFQKEKIQ 
GEYKYTQVGPDHNRSFIAEMTIYIKQLG 
RRI FAREHGSNKKLAAQSCALSLVRQLY 
HLGWEAYSGLTKKKEGETVEPYKVNLS 
QDLEHQLQNIIQELNLEILPPPEDPSVP 
VALNIGKLAQFEPSQRQNQVGWPWSPP 
QSNWNPWTSSKIDEGPIAFATPEQISMD 
LKNELMYQLEQDHDLQAILQERELLPVK 
KFESE I LEAI S QNS WI IRGATGCGKTT 
QVPQFI1.DDFIQNDRAAECNIVVTQPRR 
I SAVS VAERVAFERGEEPGKS CGYSVRF 
E S VL PR PHAS I MFCTVGVLLRKLE AG I R 
G I SHVI VDE I HERD I N TS FLL WLRDW 
QAYPEVRIVFMSATIDTSMFCEYFFNCP 
SLKLWRTYPVQEYFLEDCIQMTHFVPPP 
XDKKKKDKDDDGGEDDDANCNLICGDEY 
G PETRLSMS QLNE KET P FEL I EALLKY I 
ETLNVPGAVLVFLPGWNLIYTMQKHLEM 
NPHFGSHRYQILPLHSQIPREEQRKVFD 
PVPVGVTKVILSTN IAETSITINDWYV 
IDSCKQKVKLFTAHNNMTNYSTVWASKT 
NLEQRKGRAGRSTAG FCFHLCS RARFE R 
LETHMTPEMFRTPLHE I ALS I KLLRLGG 
IGQ FLAKAI EP P PLDAVIEAEHTLRELD 
ALDANDELTPLGRILAKLPIEPRFGKMM 
IM6CI FYVGDAICTIAAATCFPEPFVNE 
GKQLGYIHRNFAGNRFSDHVALLSVFQA 
WDDARMGGEEAE I R FCEHKRLNMATLRM 
TWEAKVQLKEILINSGFPEDCLLTQVFT 
NTGPDNNLDWISLLAFGVYPNVCYHKE 
KRKILTTEGRNALIHKSSVNCPFSSQDM 
NYPSPFFVFGEKIRTRAISAKGMTLVPP 
LQLLL FAS KKVQSDGQ I VLVDD W I KLQ I 
SHEAAACITGLRAAMEALWEVTKQPAI 
ISQLDPVNERMLNMIRQISEPSAAGINL 
MIGSTRYGDGPRPPKMARYDNGSGYRRG 
GSSYSGGGYGGGYSSGGYGSGGYGGSAT 
PSGRI CAGVGGGYRGVSRGGFRGNSGGD 
YRGPSGGYRGSGGFQRGGGRGAYGTGYL 
DIEEEVAAIKLGYVSSVCRQ 


13870 


27771 


A 


14038 


431 


542 


EGITPGWARWLTPVI PGLWEEEAGGSPG 
REIDIGLAN 


13871 


27772 


A 


14040 


238 


362 


RRCTFLSFRMLAPRVYS WGKRAFS PSV 
CVRAHGKCDYSYS 


13872 


27773 


A 


14041 


6290 


3514 


FRAAGSSSTNSGRICPLPSGALLYQSEG 
LLARPHGKGSFQVGRQQHQAVRVGSAHA 
SSAGYVCVSAFPFVFLRFCVGGGPGI SR 
VYALF YGE CNPTREWAVS SELS PS FQEQ 
NKMNKVEQKSQESVSFKDVTVGFTQEEW 
QHLDPSQRALYRDVMLENYSNLVSVGYC 
VHKPEVIFRLQQGEEPWKQEEEFPSQSF 
PEVWTADHLKERSQENQSKHLWEWFIN 
NEMLTKEQGDVIGIPFNVDVSSFPSRKM 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first » 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
corrcspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A^AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














FCQCDSCGMSFNTVSELVISKINYLGKK 
SDEFNACGKLLLNI KHDETHTREKNEVL 
KNRNTLSHRENTLQHEKIQTLDHNFEYS 
ICQETLLEKAVFNTRKRENAEENNCDYN 
EFGRTFCDSSSLLFHQISPSRDNHYEFS 
DCEKFLCVKSTLSKPHGVSMKHYDCGES 
GNNFRRKLCLSHLQKGDKGEKHFECNEC 
GKAFWEKSHLTRHQRVHTGQKPFQCNEC 
EKAFV7DKSNLTKHQRSHTGEKPFECNBC 
GKAFSHKSALTLHQRTHTGEKPYQCNAC 
GETFYQKSDLTKHQRTHTGQKPYECYEC 
GKSFCMNSHLTVHQRTHTGEKPFECLEC 
GKSFCQKSHLTQHQRTHIGDKPYECNAC 
GKTFYHKSVLTRHQI IHTGLKPYECYEC 
GKTFCLKSDLTIHQRTHTGEKPFACPEC 
GKFFSHKSTLSQHYRTHTGEKPYECHEC 
GKI FYNKSYLTKHNRTHTGEKPYECNEC 
GKTFCQKSQLTQHQRIHIGEKPYECNEC 
GKAFCHKS AL IVHQRTHTQEKPYKCNEC 
GKSFCVKSGL I FHERKHTGEKP YECNEC 
GKF FRHKS SLTVHHRAHTGE KS CQ CNEC 
GKIFYRKSELAQHQRSHTGEKPYECNTC 
RKTFSQKSNLIVHQRRHIGENLMNEMDI 
RNFQPQVSLHNASEYSHCGES PDDI LNV 
Q 


13873 


27774 


A 


14043 


187 


422 


LTIPKLHSAQEPAGPVSPPPPPSPTFQI 
GGRSRAEPRNQYCPQVAPVPALRGCLPL 
SPGEPPHDTSSPRHLTCGGIVF 


13874 


27775 


A 


14044 


340 


442 


NKFLSWVRLLMPVIPTFWEAKVGGWLEA 
RSLRMQ 


13875 


27776 


A 


14045 


3 


1240 


LVEGAAGQGVSDGARLRKCGTRS F PGS E 
EVLSSMARGSAFPAAALWLWS ILLCLLA 
LRAEAGPPQEESLYLWIDAHQARVLIGF 
EEDILIVSEGKMAPFTHDFRKAQQRMPA 
ISVNIHSMNFTWQAAGQAEYFYEFLSLR 
SLDKGIMADPTVNVPLLGTVPHKASWQ 
VGFPCLGKQDGVAAFEVDVIVMNSEGNT 
ILQTPQNAIFFKTCQQAECPGGCRNGGF 
CNERRICECPDGFHGPHCEKALCTPRCM 
NGGLCVTPGFCI CPPGFYGVNCDKANCS 
TTCFNGGTCFYPGKCI CPPGLEGEQCEI 
S KCPQ PCRNGGKCIGKS KCKCS KG YQGD 
LCS KP VCE PGCGAHGTCHEPNKCQCQEG 
WHGRHCNKRYEASLIHALRPAGAQLRQH 
TPSLKKAEERRDPPESNYIW 


13876 


27777 


A 


14046 


18 


274 


YSQEVLCEMNRSGWSNIKRGWAWWQLT 
TTI P ALWEAEVGGLLE PRSLRLAS PQPP 
QHEDYSMS YCAQPRFSLCIRGFI FYSLW 
F 


13877 


27778 


A 


14047 


177 


441 


GGSHYAWPNITFIFTKLIAQFFCFFEMQ 
SCSVARLECRGAISAHCMjHLPGSSNSP 
VMAQ I LKNQRKSNFDEDATLLGLS YTLL 
GEI 


13878 


27779 


A 


14048 


136 


319 


DVFLLLFHSNYFLFFETGSHSVSQAGVK 
WCDLSSLQPPPLGSSDPPTSASRVPALF 
FFLEF 


13879 


27780 


A 


14049 


388 


128 


ELKLPSYKGQSPQLSLRRYFADLIAIVS 
NRFTLCPSARHLAVYLLDLFMDRYDISI 
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SEQID 


SEQ ID 


M 


SEQID 


Predicted 


Predict- 


Amino acid sequence (A=Alanine 


NO: of 


NO: of 


etb 


NO: in 


beginning 


ed end 


C=Cysteine, D=Aspartic Acid, 


nucleotide 


peptide 


od 


USSN 


niiclpntiftp 




E=C*lutanrnc Acid F^Phenvlalanine 


sequence 


sequence 




09/515,1 


location 


otide 


G=Glycine, H=Histidine, I=Isoleucine, 








26 


correspond 
ing to first 
amino acid 
residue of 


location 
correspon 
ding to 
last amino 


K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q^Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 










peptide 


acid 


W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 










sequence 


residue of 

peptide 

sequence 














QQLHLVALSCLLLASKYESDLHDWKFLM 
FI 


13880 


27781 


A 


14050 


262 


3 


V I YNY I S VRPF PN I YHTF YTL S LFL YLS 
THTHTHTHTHTHTQRETGHVCVCIYTYM 
YMCMYAKWWYKFTFLISGKARIHTHLHP 
CI 


13881 


27782 


A 


14051 


149 


423 


YPHLCAFLLLPSISSFRAGVFFFSYLSL 
TAY PLTW KP PE P PATPQPT PN P P PGHPA 
TRGPKSALLDHSYACASAPPLGTEGTAP 
SPSFALI 


13882 


27783 


A 


14052 


2 


1440 


FVAVTAMAAPCLLRQGRAGALKTMLQEA 
QVFRGLASTVSLSAESGKSEKGQPQNSK 
KQSPPKNWEPKERGKLLATQTAAELSK 
NLSSPSSYPPAWKGRKVASPSPSGSVL 
FTDEGVPKFLSRKTLVEFPQKVLSPFRK 
QGSDSEARQVGRKVTSPSSSSSSSSSDS 
SSDDEADVSEVTPRWSKGRGGLRKPEA 
SHSFENRAFRVTVSAKEKTLLQKPHVDI 
TDPEKPHQPKKKGSPAKPSEGRENARPK 
TTMPRSQVDEEFLKQSLKEKQLQKTFRL 
NEIDKESQKPFEVKGPLPVHTKSGLSAP 
PKGS PAPAVLAEEARAEGQLQAS PPGAA 
EGHLEKPVPEPQRKAAPPLPRKETSGTQ 
GIEGHLKGGQAIVEDQIPPSNLETVPVE 
NNHG FHE KTAAL KLEAEGEAM EDAAAPG 
NDRGGTQEPAPVPAEPFDNTTYKHLQHH 
DYSTYTFLDLNLELSKFRMPQPSSGRES 
PRH 


13883 


27784 


A 


14053 


177 


1253 


EKKRTLPVRSVTGELQGKSLSDIiAAGTM 
D3EKKRFTEEATKYFRERVS PVHLQILL 
TNNEAWKRFVTAAELPRDEADALYEALK 
KLRTYAAIEDEYVQQKDEQFREWFLKEF 
PQVKRKIQESIEKLRALANGIEEVHRGC 
TISNWSSSTGAASGIMSLAGLVLAPFT 
AGTSLALTAAGVGLGAASAVTG I TTS IV 
EHSYTSSAEAEASRLTATS IDRLKVFKE 
VMRD I TPNLLSLLNWY YEATQTI GSE I R 
AI RQARARARLPVTTWRISAGS GGQAER 
TI AGTTRAVSRGAR I LSATTSGI FLALD 
WNLVYE S KHLHEGAKSASAEELRRQAQ 
E LEENLMELTQ I YQRLNPCHTH 


13884 


27785 


A 


14054 


1585 


2099 


I CVKTFPPLALQVRMAAAEHHHSSGLPY 
WPYLTAETLKNRMGHQPPPPTQQHSITD 
NSLSLKTPSERLLYPLPPSAPPSADDNL 
KTPPECLLTPLPPSALPSADDNLKTPAE 
CLLYPLPPSADDNLKTPPECLFTPLPPS 
APPSVDDNLKTPPECVCSLPFHPQRMI I 
SRN 


13885 


27786 


A 


14055 


2 


2865 


ALPDGGASVASDRAEGRPAKPSKTAARE 
KTEGAVAAVGGGPS S FRCCYGCCHEARL 
GRTSLPRGVI MLTEAS IS I WGWGS LG IV 
LFLITFGPFVIFYLTFYILCFVGGGLW 
TLLFGKTNSEKYLEQCEHSFLPPTSPGV 
PKCLEEMKREARTIKIDRRLTGANI IDE 
PLQQVI QFSLRDYVQYWYYTLSDDESFL 
LEIRQTLQNALIQFATRSKEIDWQPYFT 
TR I VDDFGTHLRVFRKAQQK I TEKDDQV 
KGTAEDLVDTFFEVEVEMEKEVCRDLVC 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=G!utamic Acid, F=Phenylalanine, 
G=Glycine. H=Histidine, l=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutaniine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, -=Stop codon,/=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














TSPKDEEGFLRDLCEVLLYLLLPPGDFQ 
NKIMRYFVREILARGILLPLINQLSDPD 
YINQYVXWKIRDSNCNYEAFMNIIKLSD 
NIGELEAVRDKAAEELQYLRSLDTAGDD 
INTIKNQINSLLFVKKVCDSRIQRLQSG 
KEINTVKLAANFGKLCTVPLDSILVDNV 
ALQFFMDYMQQTGGQAHLFFWMTVEGYR 
VTAQQQLEVLLSRQRDGKHQTNQTKGLL 
RAAAVG I YEQYLSEKAS PRVTVDDYLVA 
KLADTLNHEDPTPEI FDDIQRKVYELML 
RDERFYPSFRQNALYVRMLAELDMLKDP 
SFRGSDDGDGES FNGSPTGS INLSLDDL 
SNVSSDDSVQLHAYISDTVYADYDPYAV 
AG VCNDHGKT YALY AI T VHRRNLNS EEM 
WKTYRRYSDFHDFHMRITEQFESLSSIL 
KLPEKKTFI^MDRDFLEKRKKDL^YLQ 
LLLAPEMMKAS PALAHYVYDFLENKAYS 
KGKGDFARKMDTFVNPLRNSMRNVSNAV 
KSLPDSLAEGMTKMSDNMGKI4SERLGQD 
IKQSFFKVPFLIPKTDSDPEHRRVSAQL 
DDNVDDNIPLRVMLLLMDEVFDLKERNQ 
WLRRN I XNLLQQL I RAT YGDTINRKI VD 
HVDWMTS PEQVADSVKRFRDAFWPNGIL 
AEAVPCRDKS IRMRTRVAGKTKLLAIMP 
GE 


13886 


27787 


A 


14057 


311 


150 


FLCFKYRRGFATLHSGLKLLGSSDPPSS 
TSQI PGITGTSHCVQPTHLFFLALS 


13887 


27788 


A 


14058 


53 


211 


RQHITCLDIFFIHMNYKYKYINIYLSIY 
LSIYLS I YLIYLSISHSSYITERA 


13888 


27789 


A 


14059 


1 


1642 


RDGR KMATAT I ALQVNGQQGGGSE PAAA 
AAWAAGDKWKPPQGTDSIKMENGQSTA 
AKLGLPPLTPEQQEALQKAKKYAMEQSI 
KSVLVKQTIAKQQQQLTNLQMAAQRQRA 
LAIMCRVYVGS I Y YELGEDTI RQAFAPF 
GPIKS IDMSWDS VTMKHKGFAFVEYEVP 
EAAQLALEQMNSVMLGGRN I KVGRPSN I 
GQAQ P 1 1 DQLAE EARAFRR I YVASVHQD 
LSDDDIKSVFEAFGKIKSCTLARDPTTG 
KHKGYGFI EYEKAQSSQDAVSSMNLFDL 
GGQYLRVGKAVTPPMPLLTPATPGGLPP 
AAAVAAAAATAK I TAQEAVAGAAVLGTL 
GTPGLVS PALTIiAQPLGTL PQAVMAAQA 
PGVITGVTPARPPIPVTIPSVGWNPIL 
ASPPTLGLLEPKKEKSEEELFPESERPE 
MLSEQEHMSISGSSARHMVMQKLLRKQE 
STVMVLRNMVDPKDIDDDLEGEVTEECG 
KFGAVNRVI I YQEKQGEEEDAE I IVKI F 
VEFS IASETHKAIQALNGRWFAGRKWA 
EVYDQERFDNSDLSA 


13889 


27790 


A 


14060 


1238 


1474 


VLALQEGRP WRREPAS IDACRLNFQRLR 
RGKFSNVLFPGLAQEALYSGGYHLKFAD 
ELMGGNLKKSTADASGSRGHQL 


13890 


27791 


A 


14061 


266 


400 


GQWARPVIPALWEAKAGGSLEPSMLRPA 
GQHIKTPSVLITSKKT 


13891 


27792 


A 


14062 


2178 


1881 


VLQAPS I LLDAPRTDGGDMGRAMVARLG 
L WLLLLALLLPTQ I YS SETTTGTSSNS S 
QSTSNTGLAPNPTNATTKAAGGALQSTA 
SLFWSLSLLHLYS 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

np nf irlf* 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=0Lysine, lHLeucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R— Arginine, S=Serine, 
T=Threonine, V=Valine, 
W^Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 

niirleofiHp Hplpfiftti \=i>fts^ihlp 

nucleotide insertion 


13892 


27793 


A 


14063 


261 


1 


NSVLTLALLFSLHVHLPPPPSGVTDTAG 
ALSTGGPPSPTSVTGRGPAHSHASQLPP 
APGE FAPLNES CRGWAGEAFLERFPDAW 
VDP 


13893 


27794 


A 


14064 


2384 


1115 


QHFSRRGLCWEQRSSVTSSWTSGAWSP 
PCPPSNASCNTLHTRDWAS PDPGGQGSL 
GESPGPAPPGQLHTL1DTDLHSI1AQIGGK 
SPVAGVGNGGSLV7PRESPGTANGHSPEH 
TPPGPGPPGPCPTKRRLLPAGEAPDVSS 
EEEG PAPRRRRGS LGHPTAANS SDAKAT 
PFWSHLLPGPKBPVLDPTDCGPMGRRLK 
GARRLKLSPLRSLRKGPGLLSPPRASPV 
PTPAVSRTLLGNFEESLLRGRFAPSGHI 
EGFTAEIGASGSYCPQHVTLPVTVTFFD 
V^TCON AP A P FT JC\ T VDT .NPT .dP Kd Y9VP K 
VGTVQVTLFNPNQTWKMFLVTFDFSDM 
PAAHMTFLRHRLFLVPVGEEGNANPTHR 
LLCYLLHLRFRSSRSGRLSLHGDIRLLF 
SRRSLELDTGLPYELQAVTEAPHNPRYS 
PLP 


J J07t 


97795 


A 


ItUUJ 


OX) 


HJO 


GWAVQLG IGTEGKEGTSSS ERQREGRGD 
GNRKTGRRREGRRSTKTDAPS P YPAHRP 
RSKLI 


13895 


27796 


A 


14067 


238 


45 


I QTCG SHS FCI PS I WKAKAGGLLE PRS L 
SPACPTQKDPISHLGSELPQSFRLLIRL 
VPLFLSAC 


13896 


27797 


A 


14068 


3 


1705 


S CES KATPWRAVSASQELQH PQGGQRS P 
LPGDLTPARPNPAYPLTVECQRSCSRPV 
PAPLPPHTHPPGSSCFYSSFSFITKATA 
PGAQRRAVTQAERGRMGFLGTGTW I LVL 
VLPIQAFPXPGGSQDKSLHNRELSAERP 
liNEQ IAEAEEDKI KKT YPPENKPGQSNY 
S FVDNLNLLKAI TE KE KIEKERQS I RSS 
PLDNKLNVEDVDSTKNRKLIDDYDSTKS 
GLDHKFQDDPDGLHQLDGTPLTAEDIVK 
KIAARIYEENDRAVFDKIVSKLLNLGLI 
TESQAHTLEDBVAEVLQKLISKEANNYE 
EDPNKPTSWTENQAGKIPEKVTPMAAIQ 
DGLAKGENDETVSNTLTLTNGLERRTKT 
YSEDNFEELQYFPNFYALLKS IDSEKEA 
KEKETLITIMKTLIDFVKMMVKYGTISP 
EEGVS YLENLDEM I AIjOTKNKLEKNATD 
NISKLFPAPSEKSHEETDSTKEEAAKME 
KEYGSLKDS T KDDNS N PGGKTD E P KG KT 
EAYLEAIRKNIEWLKKHDKKGNKEDYDL 
S KMRDFINKQADAYVEKG I LDKEEAEAI 
KRIYSSL 


13897 


27798 


A 


14069 


1 


140 


NTSVAIQTIEYYSAFKRKETLTHVPLWM 
NLKDTMLRE I SQSQKDTV 


13898 


27799 


A 


14071 


212 


104 


HTNHQCQTVLPAGQALATPQGLAPSPVF 
PQSCLRMV 


13899 


27800 


A 


14072 


358 


112 


FPLTVMLNVFLWWQNLKHFCWFIQEQLM 
FFFFFFLRTGSHYWQAGLELLGSTNIiP 
ASVSRVAGTTGIHNCTQFNPLHTLHL 


13900 


27801 


A 


14073 


3 


151 


Y MGFHHVG Y AGLEliLTS SDLP P W P P KC W 
DYRHEPSCLAMFFYFALIAPE 


13901 


27802 


A 


14074 


8 


1493 


VTIHHLFVGQAVRADTLKKKSPSCLLCI 
VFIPVPRIDLTNIDOVAVIFKHHFPVGR 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/535,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenylaIanine, 
G=GJycine, H=Histidine, 1=1 so leu cine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














GDAVLKTWAPAQCLCSRMGPAWLWLLGT 
G I LiAS YHCQPLLAHGDKSLQG PQP PREQ 
LSEPAPAYHRITPTITNFALRLYKELAA 
DAPGN I FFS P VS I S TTLALLS LGAQANT 
SAL I LEGLGFNLTETPEAD I HQG FRSLL 
HTLALPS PKLELKVGNSLFLDKRLKPRQ 
HYLDSIKELYGAFAFSANFTDSVTTGRQ 
INDYLRRQTYGQWDCLPEFSQDTFMVL 
ANY I FFKAKWKHP FSRYQTQKQES FFVD 
ERTSLQVPMMHQKEMHRFLYDQDLACTV 
LQ I E YRGNALALLVL PDPGKMKQVEAAL 
QPQTLRKWGQLLLPSLLDLKLPRFSISG 
T YNLED I LPQ IGLTN I LNLEAD FSGVTG 
QLNKTISKVSHKAMVDMSEKGTEAGAAS 
GLLSQPPSLNTMSDPHAHFNRPFLLLLW 
EVTTQSLLFLGKWWPVAG 
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WHAT IS CLAIMED IS: 

1 . An isolated polynucleotide comprising a nucleotide sequence selected from the group 
consisting of SEQIDNO: 1-13901 , a mature protein coding portion of SEQ ID NO: 1-13901 , 
an active domain of SEQ ID NO: 1-13901 , and complementary sequences thereof 

5 

2. An isolated polynucleotide encoding a polypeptide with biological activity, wherein said 
polynucleotide hybridizes to the polynucleotide of claim 1 under stringent hybridization 
conditions. 

10 3. An isolated polynucleotide encoding a polypeptide with biological activity, wherein said 
polynucleotide has greater than about 90% sequence identity with the polynucleotide of claim 1. 

4. The polynucleotide of claim 1 wherein said polynucleotide is DNA. 

15 5. An isolated polynucleotide of claim 1 wherein said polynucleotide comprises the 
complementary sequences. 

6. A vector comprising the polynucleotide of claim 1 . 
20 7. An expression vector comprising the polynucleotide of claim 1 . 

8. A host cell genetically engineered to comprise the polynucleotide of claim 1 . 

9. A host cell genetically engineered to comprise the polynucleotide of claim 1 operatively 
25 associated with a regulatory sequence that modulates expression of the polynucleotide in the host 

cell. 

1 0. An isolated polypeptide, wherein the polypeptide is selected from the group consisting of: 

(a) a polypeptide encoded by any one of the polynucleotides of claim 1 ; and 
30 (b) a polypeptide encoded by a polynucleotide hybridizing under stringent 

conditionswithanyoneofSEQIDNOS: 1-13901. 

11. A composition comprising the polypeptide of claim 1 0 and a carrier. 
35 12. An antibody directed against the polypeptide of claim 1 0. 
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13. A method for detecting the polynucleotide of claim 1 in a sample, comprising: 

a) contacting the sample with a compound that binds to and forms a complex 
with the polynucleotide of claim 1 fur a period sufficient to form the complex; and 
5 b) detecting the complex, so that if a complex is detected, the polynucleotide 

of claim 1 is detected. 

14. A method for detecting the polynucleotide of claim 1 in a sample, comprising: 

a) contacting the sample under stringent hybridization conditions with 
1 0 nucleic acid primers that anneal to the polynucleotide of claim 1 under such conditions; 

b) amplifying a product comprising at least a portion of the polynucleotide of 

claim 1; and 

c) detecting said product and thereby the polynucleotide of claim 1 in the 

15 sample. 

15. The method of claim 14, wherein the polynucleotide is an RNA molecule and the method 
further comprises reverse transcribing an annealed RNA molecule into a cDNA polynucleotide. 

20 16. A method for detecting the polypeptide of claim 1 0 in a sample, comprising: 

a) contacting the sample with a compound that binds to and forms a complex 
with the polypeptide under conditions and for a period sufficient to form the complex; and 

b) detecting formation of the complex, so that if a complex formation is 
detected, the polypeptide of claim 10 is detected. 

25 

17. A method for identifying a compound that binds to the polypeptide of claim 1 0, 
comprising: 

a) contacting the compound with the polypeptide of claim 1 0 under 
conditions sufficient to form a polypeptide/compound complex; and 
30 b) detecting the complex, so that if the polypeptide/compound complex is 

detected, a compound that binds to the polypeptide of claim 1 0 is identified. 

18. A method for identifying a compound that binds to the polypeptide of claim 1 0, 
comprising: 
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a) contacting the compound with the polypeptide of claim 10, in a cell, under 
conditions sufficient to form a polypeptide/compound complex, wherein the complex drives 
expression of a reporter gene sequence in the cell; and 

b) detecting the complex by detecting reporter gene sequence expression, so 
that if the polypeptide/compound complex is detected, a compound that binds to the polypeptide 
of claim 1 0 is identified, 

19. A method of producing the polypeptide of claim 1 0, comprising, 

a) culturing a host cell comprising a polynucleotide sequence selected from 
the group consisting of a polynucleotide sequence of SEQ ID NO: 1-13901 , a mature protein 
coding portion of SEQ ID NO: 1-13901 , an active domain of SEQ ID NO: 1-13901 , 
complementary sequences thereof and a polynucleotide sequence hybridizing under stringent 
conditions to SEQ ID NO: 1-1 3901 , under conditions sufficient to express the polypeptide in 
said cell; and 

b) isolating the polypeptide from the cell culture or cells of step (a). 

20. An isolated polypeptide comprising an amino acid sequence selected from the 
group consisting of SEQ ID NOS: 13902-27802, the mature protein portion 
thereof, or the active domain thereof 

2L The polypeptide of claim 20 wherein the polypeptide is provided on a polypeptide 
array. 

22. A collection of polynucleotides, wherein the collection comprises the 
sequence information of at least one of SEQ ID NOS: 1 -1 3901 . 

23. The collection of claim 22, wherein the collection is provided on a nucleic 
acid array. 

24. The collection of claim 23, wherein the array detects full-matches to any 
one of the polynucleotides in the collection. 

25. The collection of claim 23, wherein the array detects mismatches to any 
one of the polynucleotides in the collection. 
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26. The collection of claim 22, wherein the collection is provided in a 
computer-readable format. 

27. A method of treatment comprising administering to a mammalian subject in need 

thereof a therapeutic amount of a composition comprising a polypeptide of claim 
1 0 or 20 and a pharmaceutical^ acceptable carrier. 

28. A method of treatment comprising administering to a mammalian subject in need 

thereof a therapeutic amount of a composition comprising an antibody that 
specifically binds to a polypeptide of claim 10 or 20 and a pharmaceutically 
acceptable carrier. 



1399 



This Page is Inserted by IFW Indexing and Scanning 
Operations and is not part of the Official Record 

BEST AVAILABLE IMAGES 

Defective images within this document are accurate representations of the original 
documents submitted by the applicant. 

Defects in the images include but are not limited to the items checked: 

□ BLACK BORDERS 

□ IMAGE CUT OFF AT TOP, BOTTOM OR SIDES 
&FADED TEXT OR DRAWING 
CSfBLURRED OR ILLEGIBLE TEXT OR DRAWING 

□ SKEWED/SLANTED IMAGES 

□ COLOR OR BLACK AND WHITE PHOTOGRAPHS 

□ GRAY SCALE DOCUMENTS 

□ LINES OR MARKS ON ORIGINAL DOCUMENT 

C2(rEFERENCE(S) OR EXHD3IT(S) SUBMITTED ARE POOR QUALITY 

□ OTHER: 

IMAGES ARE BEST AVAILABLE COPY. 
As rescanning these documents will not correct the image 
problems checked, please do not report these problems to 
the IFW Image Problem Mailbox. 



